Zach Furman
One way to account for this is to hypothesize that the models are not navigating some undifferentiated space of arbitrary functions, but are instead homing in on a sparse set of highly effective programs that solve the task.
If, following the physical Church-Turing thesis, we view the natural world as having a true, computable structure, then an effective learning process can be seen as a search for an algorithm that approximates that structure.
In this light, convergence is not an accident, but a sign that different search processes are discovering similar objectively good solutions, much as different engineering traditions might independently arrive at the arch as an efficient solution for bridging a gap.
This hypothesis, that learning is a search for an optimal, objective program, carries with it a strong implication.
The search process must be a general-purpose one, capable of finding such programs without them being explicitly encoded in its architecture.
As it happens, an independent, large-scale trend in the field provides a great deal of data on this very point.
Rich Sutton's Bitter Lesson describes the consistent empirical finding that long-term progress comes from scaling general learning methods rather than from encoding specific human domain knowledge.
The old paradigm, particularly in fields like computer vision, speech recognition, and game playing, involved painstakingly hand-crafting systems with significant prior knowledge.
For years, the state of the art relied on complex, hand-designed feature extractors like SIFT and HOG, which were built on human intuitions about what aspects of an image are important.
The role of learning was confined to a relatively simple classifier that operated on these predigested features.
The underlying assumption was that the search space was too difficult to navigate without strong human guidance.
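To make the old paradigm concrete, here is a toy sketch of the kind of hand-designed feature extraction described above: a histogram of gradient orientations in the spirit of HOG. This is an illustrative simplification, not real HOG, which adds cells, overlapping blocks, and contrast normalization; every human design decision here (gradient operator, number of bins, unsigned orientations) is exactly the kind of encoded prior knowledge the text refers to.

```python
import math

def orientation_histogram(image, n_bins=9):
    """Toy HOG-style feature: a histogram of gradient orientations,
    weighted by gradient magnitude. The learning system downstream
    would see only this fixed, human-designed summary of the image."""
    h, w = len(image), len(image[0])
    hist = [0.0] * n_bins
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Central-difference gradients (a hand-picked design choice).
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            mag = math.hypot(gx, gy)
            # Unsigned orientation in [0, 180), another design choice.
            ang = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(ang / 180.0 * n_bins) % n_bins] += mag
    return hist

# A vertical edge: dark columns on the left, bright on the right.
img = [[0, 0, 10, 10]] * 4
feat = orientation_histogram(img)
```

All of the gradient mass lands in the horizontal-orientation bin, as the designer intended; the "learning" part of such a system would be only a simple classifier consuming vectors like `feat`.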
The modern paradigm of deep learning has shown this assumption to be incorrect.
Progress has come from abandoning these handcrafted constraints in favor of training general, end-to-end architectures with the brute force of data and compute.
This consistent triumph of general learning over encoded human knowledge is a powerful indicator that the search process we are using is, in fact, general purpose.
It suggests that the learning algorithm itself, when given a sufficiently flexible substrate and enough resources, is a more effective mechanism for discovering relevant features and structure than human ingenuity.
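By contrast, the end-to-end approach hands the raw input directly to a learned model. As a minimal sketch (modern systems stack many learned layers; a single linear layer keeps the contrast with the hand-crafted pipeline as simple as possible), here is a logistic-regression model trained by gradient descent directly on raw pixels, with no feature extractor in between. The toy task and all names are illustrative assumptions.

```python
import math

def train_logreg(samples, labels, lr=0.5, epochs=200):
    """End-to-end sketch: a linear model learns directly from raw
    pixels via SGD on the log loss, with no hand-designed features."""
    n = len(samples[0])
    w = [0.0] * n
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            p = 1.0 / (1.0 + math.exp(-z))
            g = p - y  # gradient of the log loss w.r.t. z
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return w, b

def predict(w, b, x):
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if z > 0 else 0

# Toy task: classify 2x2 images, flattened to raw pixel vectors.
data = [[0, 0, 0, 1], [1, 1, 1, 0], [0, 1, 0, 0], [1, 0, 1, 1]]
labels = [0, 1, 0, 1]
w, b = train_logreg(data, labels)
```

The point of the contrast is where the structure comes from: here the weights that decide which pixels matter are discovered by the optimizer, not specified by a human designer.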
This perspective helps connect these phenomena, but it also invites us to refine our initial picture.
First, the notion of a single optimal program may be too rigid.
It is possible that what we are observing is not convergence to a single point, but to a narrow subset of similarly structured, highly efficient programs.
The models may be learning different but algorithmically related solutions, all belonging to the same family of effective strategies.
Second, it is unclear whether this convergence is purely a property of the problem's solution space, or whether it is also a consequence of the inductive biases of our search algorithm.