Gradient descent navigates a continuous parameter space using only local information.
If both processes are somehow arriving at similar destinations (compositional solutions to learning problems), then something interesting is happening in how neural network loss landscapes are structured, something we do not yet understand.
We will return to this issue at the end of the post.
So the hypothesis raises as many questions as it answers.
But it offers something valuable: a frame.
If deep learning is doing a form of program synthesis, that gives us a way to connect disparate observations (about generalization, about convergence of representations, about why scaling works) into a coherent picture.
Whether this picture can make sense of more than just these particular examples is what we'll explore next.
(A details box titled "Clarifying the hypothesis" appears here; its contents are omitted.)
Why this isn't enough
The preceding case studies provide a strong existence proof: deep neural networks are capable of learning and implementing non-trivial, compositional algorithms.
The evidence that Inception V1 solves image classification by composing circuits, or that a transformer solves modular addition by discovering a Fourier-based algorithm, is quite hard to argue with.
And, of course, there are more examples like these that we have not discussed.
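To make "Fourier-based algorithm" a little more concrete, here is a minimal numerical sketch of the kind of mechanism that work describes: represent the two inputs as sine and cosine waves at a few frequencies, combine them with the angle-addition identity, and score each candidate answer by how constructively the waves interfere. The modulus and frequencies below are illustrative choices, not values taken from any trained network.

```python
import numpy as np

# Illustrative sketch (not any paper's actual code) of the "Fourier" algorithm
# for computing (a + b) mod p. Each candidate answer c is scored by
# sum_k cos(2*pi*k*(a + b - c) / p), which peaks exactly when c = (a + b) mod p.

p = 113                  # prime modulus, an illustrative choice
freqs = [1, 5, 17, 42]   # a handful of frequencies, also illustrative

def logits(a, b):
    c = np.arange(p)
    score = np.zeros(p)
    for k in freqs:
        wa = 2 * np.pi * k * a / p
        wb = 2 * np.pi * k * b / p
        wc = 2 * np.pi * k * c / p
        # Build cos/sin of (wa + wb) from products of cos/sin of a and b
        # (angle-addition identity), then compare against each candidate c.
        cos_ab = np.cos(wa) * np.cos(wb) - np.sin(wa) * np.sin(wb)
        sin_ab = np.sin(wa) * np.cos(wb) + np.cos(wa) * np.sin(wb)
        score += cos_ab * np.cos(wc) + sin_ab * np.sin(wc)
    return score

a, b = 37, 95
assert logits(a, b).argmax() == (a + b) % p  # constructive interference at the correct answer
```

Each frequency contributes a cosine of the difference between the combined angle for a plus b and the angle for the candidate answer; those cosines all equal one only when the candidate is the true sum modulo p, so the correct answer wins by constructive interference.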
Still, the question remains.
Is this the exception or the rule?
It would be completely consistent with the evidence presented so far for this type of behavior to be just a strange edge case.
Unfortunately, mechanistic interpretability is not yet enough to settle the question.