Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Jacob Hilton

๐Ÿ‘ค Speaker
204 total appearances

Appearances Over Time

Podcast Appearances

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

It has 10 parameters and almost perfect 100% accuracy with an error rate of roughly 1 in 13,000.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This means that the difference between the model's logits complex formula omitted from the narration is almost always negative when complex formula omitted from the narration and positive when complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We'd like to mechanistically explain why the model has this property.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

To do this, note first that because the model uses ReLU activations and there are no biases, delta is a piecewise linear function of x subscript 0 and x subscript 1 in which the pieces are bounded by rays through the origin in the x subscript 0-x the subscript 1 plane.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Now, we can standardize the model to obtain an exactly equivalent model for which the entries of complex formula omitted from the narration lie in complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

by rescaling the neurons of the hidden state once we do this we see that complex formula omitted from the narration from these observations we can prove that on each linear piece of delta complex formula omitted from the narration with complex formula omitted from the narration and moreover the pieces of delta are arranged in the x subscript 0 the dash x subscript 1 plane according to the following diagram there's an image here

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Here, a double arrow indicates that a boundary lies somewhere between its neighboring axis and the dashed line.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

But we don't need to worry about exactly where it lies within this range.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Looking at the coefficients of each linear piece, we observe that

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two blue regions, we have underscore or underscore zero.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two green regions, we have complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two yellow regions, we have complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

To within around one part in, complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This implies that complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the blue and green regions above the line, complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Complex formula omitted from the narration In the blue and green regions below the line Complex formula omitted from the narration Complex formula omitted from the narration Is approximately proportional to Complex formula omitted from the narration In the two yellow regions Together, these imply that the model has almost 100% accuracy

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

More precisely, the error rate is the fraction of the unit disc lying between the model's decision boundary and the line, complex formula omitted from the narration, which is around 1 in, complex formula omitted from the narration.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This is very close to the model's empirically measured error rate.

LessWrong (Curated & Popular)
"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Mean squared error versus compute.