Zach Furman
Instead, we write a program: we break the problem down hierarchically into a sequence of simple, reusable steps.
Each step, like a logic gate in a circuit, is a tiny lookup table, and we achieve immense expressive power by composing them.
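To make this concrete, here is a minimal sketch (not from the original text) of that idea in Python: a NAND gate represented literally as a four-entry lookup table, composed with itself to compute XOR, a function none of the individual tables can express.

```python
# A logic gate is a tiny lookup table: two input bits map to one output bit.
NAND = {(0, 0): 1, (0, 1): 1, (1, 0): 1, (1, 1): 0}

def xor(a, b):
    # XOR built purely by composing NAND lookups (the standard
    # four-gate construction) -- no arithmetic, just table lookups.
    c = NAND[(a, b)]
    return NAND[(NAND[(a, c)], NAND[(b, c)])]

for a in (0, 1):
    for b in (0, 1):
        print(f"xor({a}, {b}) = {xor(a, b)}")
```

Each lookup table is trivially simple; all of the expressive power comes from the wiring between them.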
This matches what we see empirically in some deep neural networks via mechanistic interpretability.
They appear to solve complex tasks by learning a compositional hierarchy of features.
A vision model learns to detect edges, which are composed into shapes, which are composed into object parts such as wheels and windows, which are finally composed into an object detector for a car.
The network is not learning a single, monolithic function.
It is learning a program that breaks the problem down.
This parallel with classical computation offers an alternative perspective on the approximation question.
While the UAT considers the case of arbitrary functions, a different set of results examines how well neural networks can represent functions that have this compositional, programmatic structure.
One of the most relevant results comes from considering Boolean circuits, which are a canonical example of programmatic composition.
It is known that feedforward neural networks can represent any program implementable by a polynomial-sized Boolean circuit using only a polynomial number of neurons.
This provides a different kind of guarantee than the UAT.
It suggests that if a problem has an efficient programmatic solution, then an efficient neural network representation of that solution also exists.
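A small illustrative sketch (my own, not from the original text) of how that simulation works: each Boolean gate becomes a single threshold neuron, so a circuit with polynomially many gates yields a network with polynomially many neurons. Here a two-layer threshold network computes XOR by wiring up OR, NAND, and AND neurons.

```python
import numpy as np

def step(x):
    # Heaviside threshold activation, the classic choice for
    # simulating Boolean gates with single neurons.
    return (x > 0).astype(float)

def layer(x, W, b):
    # One neuron per gate: the weights and bias are chosen so the
    # threshold reproduces that gate's truth table on {0, 1} inputs.
    return step(W @ x + b)

def xor_net(a, b):
    x = np.array([a, b], dtype=float)
    # Hidden layer: first neuron computes OR(a, b), second NAND(a, b)
    h = layer(x, np.array([[1.0, 1.0], [-1.0, -1.0]]), np.array([-0.5, 1.5]))
    # Output neuron: AND of the two hidden units, which equals XOR(a, b)
    y = layer(h, np.array([[1.0, 1.0]]), np.array([-1.5]))
    return int(y[0])

for a in (0, 1):
    for b in (0, 1):
        print(f"xor_net({a}, {b}) = {xor_net(a, b)}")
```

The gate-per-neuron translation is the heart of the polynomial-size guarantee: the network's size tracks the circuit's size, not the dimension of the input space.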
This offers an explanation for how neural networks might evade the curse of dimensionality.
Their effectiveness would stem not from an ability to represent any high-dimensional function, but from their suitability for representing the tiny, structured subset of functions that have efficient programs.
The problems seen in practice, from image recognition to language translation, appear to belong to this special class.
There's a details box here titled "Why compositionality, specifically? Evidence from depth separation results."
The box contents are omitted from this narration.