Zach Furman
Neural networks appear to implement not monolithic black-box computations, but something more like circuits.
This is reminiscent of the picture we started with.
Solomonoff induction frames learning as a search for simple programs that explain data.
It is a theoretical ideal, provably optimal in a certain sense, but hopelessly intractable.
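To make the "search for simple programs" idea concrete, here is a toy sketch in the same spirit. It is not Solomonoff induction itself, which runs over a universal Turing machine and is uncomputable; instead it enumerates programs in a small hypothetical DSL by increasing length, so the first program that reproduces the data is the simplest available explanation.

```python
from itertools import product

# Hypothetical toy DSL: a "program" is a sequence of these primitive ops.
OPS = {
    "inc": lambda x: x + 1,   # increment
    "dbl": lambda x: 2 * x,   # double
    "sq":  lambda x: x * x,   # square
}

def run(program, x):
    """Apply each op of the program to x in order."""
    for op in program:
        x = OPS[op](x)
    return x

def shortest_program(pairs, max_len=4):
    """Enumerate programs by increasing length and return the first
    (hence shortest) one consistent with all (input, output) pairs.
    Trying short programs first is the discrete analogue of
    Solomonoff's 2^-length prior favoring simple hypotheses."""
    for length in range(1, max_len + 1):
        for program in product(OPS, repeat=length):
            if all(run(program, x) == y for x, y in pairs):
                return program
    return None

# Data generated by f(x) = 2x + 1:
print(shortest_program([(1, 3), (2, 5), (3, 7)]))  # → ('dbl', 'inc')
```

Even in this tiny DSL, the search cost grows exponentially with program length, which is a miniature version of why the full construction is hopelessly intractable.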
The connection between Solomonoff induction and deep learning has mostly been viewed as purely conceptual: a nice way to think about what learning should do, with no implications for what neural networks actually do.
But the evidence from mechanistic interpretability suggests a different possibility.
What if deep learning is doing something functionally similar to program synthesis?
Not through the same mechanism: gradient descent on continuous parameters is nothing like enumerative search over discrete programs.
But perhaps targeting the same kind of object.
Mechanistic solutions, built from parts, that capture structure in the data generating process.
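The contrast in mechanism is easy to see side by side with the discrete search above. The following sketch fits a hypothetical one-parameter model y = w·x by gradient descent on squared error: instead of enumerating candidates, it nudges a continuous parameter downhill in small steps.

```python
def grad_step(w, pairs, lr=0.01):
    """One gradient descent step on sum of squared errors.
    d/dw of sum((w*x - y)^2) is sum(2*(w*x - y)*x)."""
    g = sum(2 * (w * x - y) * x for x, y in pairs)
    return w - lr * g

# Data generated by y = 2x; the loss is minimized at w = 2.
pairs = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]
w = 0.0
for _ in range(200):
    w = grad_step(w, pairs)
print(round(w, 3))  # → 2.0
```

Every intermediate value of w is a valid (if poor) model, and small parameter changes yield small behavior changes; discrete program search has no such smooth path between hypotheses. That difference is exactly why the two procedures look nothing alike, even if they end up targeting similar objects.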
To be clear, this is a hypothesis.
The evidence shows that neural networks can learn compositional solutions, and that such solutions have appeared alongside generalization in specific, interpretable cases.
It doesn't show that this is what's always happening, or that there's a consistent bias toward simplicity, or that we understand why gradient descent would find such solutions efficiently.
But if the hypothesis is right, it would reframe what deep learning is doing.
The success of neural networks would not be a mystery to be accepted, but an instance of something we already understand in principle.
The power of searching for compact, mechanistic models to explain your observations.
The puzzle would shift from "why does deep learning work at all?" to "how does gradient descent implement this search so efficiently?"
That second question is hard.
Solomonoff induction is intractable precisely because the space of programs is vast and discrete.