This is also where the post will shift register.
The remaining sections sketch the structure of these problems and gesture at why certain mathematical frameworks (singular learning theory, algebraic geometry, and so on) might become relevant.
I won't develop these fully here; that requires machinery far beyond the scope of a single blog post. But I want to show why you'd need to leave shore at all, and what you might find out in open water.
The Representation Problem
The program synthesis hypothesis posits a relationship between two fundamentally different kinds of mathematical objects.
On one hand, we have programs.
A program is a discrete and symbolic object.
Its identity is defined by its compositional structure, a graph of distinct operations.
A small change to this structure, like flipping a comparison or replacing an addition with a subtraction, can create a completely different program with discontinuous, global changes in behavior.
The space of programs is discrete.
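To make this discreteness concrete, here is a toy sketch (the functions and names are my own illustration, not drawn from any real system): two programs that differ by a single flipped comparison, yet compute globally different functions.

```python
def clamp_below(x, threshold=0.0):
    # Return x unless it falls below the threshold.
    return x if x > threshold else threshold

def clamp_above(x, threshold=0.0):
    # Structurally identical, except the comparison is flipped.
    return x if x < threshold else threshold

print(clamp_below(5.0))  # 5.0
print(clamp_above(5.0))  # 0.0: one flipped token, qualitatively different behavior
```

There is no path of intermediate programs between these two; the change is all-or-nothing.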
On the other hand, we have neural networks.
A neural network is defined by its parameter space, a continuous vector space of real valued weights.
The function a network computes is a smooth, or at least piecewise smooth, function of these parameters.
This smoothness is the essential property that allows for learning via gradient descent, a process of infinitesimal steps along a continuous loss landscape.
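By contrast, here is a minimal sketch of the continuous side (a made-up two-layer network, purely illustrative): nudging a single weight changes the output by an amount that shrinks with the size of the nudge.

```python
import numpy as np

def tiny_net(x, W1, W2):
    # One hidden layer with a smooth activation, one linear output.
    return W2 @ np.tanh(W1 @ x)

rng = np.random.default_rng(0)
x = rng.normal(size=3)
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(1, 4))

for eps in (1e-1, 1e-2, 1e-3):
    W1_nudged = W1.copy()
    W1_nudged[0, 0] += eps  # perturb a single parameter
    delta = tiny_net(x, W1_nudged, W2) - tiny_net(x, W1, W2)
    print(eps, abs(delta[0]))  # the output change shrinks with the perturbation
```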
This presents an apparent type mismatch.
How can a continuous process in a continuous parameter space give rise to a discrete, structured program?
The problem is deeper than it first appears.
To see why, we must first be precise about what we mean when we say a network has learned a program.
It cannot simply be about the input-output function the network computes.
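One way to see why: two programs can compute exactly the same input-output function while having completely different internal structure. The pair below is a hypothetical example, but the point is general.

```python
def fib_recursive(n):
    # Fibonacci by direct recursion on the defining relation.
    return n if n < 2 else fib_recursive(n - 1) + fib_recursive(n - 2)

def fib_iterative(n):
    # The same function, computed by iterating a pair of accumulators.
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

# Identical input-output behavior, very different programs.
assert all(fib_recursive(n) == fib_iterative(n) for n in range(15))
```

If learning a program meant only matching the input-output function, these two would be indistinguishable.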