Jacob Hilton

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

It has 10 parameters and almost perfect 100% accuracy with an error rate of roughly 1 in 13,000.

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This means that the difference between the model's logits complex formula omitted from the narration is almost always negative when complex formula omitted from the narration and positive when complex formula omitted from the narration.

531.914 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We'd like to mechanistically explain why the model has this property.

546.438 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

To do this, note first that because the model uses ReLU activations and there are no biases, delta is a piecewise linear function of x subscript 0 and x subscript 1 in which the pieces are bounded by rays through the origin in the x subscript 0-x the subscript 1 plane.

550.325 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Now, we can standardize the model to obtain an exactly equivalent model for which the entries of complex formula omitted from the narration lie in complex formula omitted from the narration.

567.523 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

by rescaling the neurons of the hidden state once we do this we see that complex formula omitted from the narration from these observations we can prove that on each linear piece of delta complex formula omitted from the narration with complex formula omitted from the narration and moreover the pieces of delta are arranged in the x subscript 0 the dash x subscript 1 plane according to the following diagram there's an image here

578.455 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Here, a double arrow indicates that a boundary lies somewhere between its neighboring axis and the dashed line.

614.777 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Complex formula omitted from the narration.

620.447 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

But we don't need to worry about exactly where it lies within this range.

622.891 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Looking at the coefficients of each linear piece, we observe that

627.499 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two blue regions, we have underscore or underscore zero.

632.168 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two green regions, we have complex formula omitted from the narration.

636.352 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the two yellow regions, we have complex formula omitted from the narration.

641.758 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

To within around one part in, complex formula omitted from the narration.

646.583 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This implies that complex formula omitted from the narration.

651.488 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

In the blue and green regions above the line, complex formula omitted from the narration.

656.574 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Complex formula omitted from the narration In the blue and green regions below the line Complex formula omitted from the narration Complex formula omitted from the narration Is approximately proportional to Complex formula omitted from the narration In the two yellow regions Together, these imply that the model has almost 100% accuracy

662.138 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

More precisely, the error rate is the fraction of the unit disc lying between the model's decision boundary and the line, complex formula omitted from the narration, which is around 1 in, complex formula omitted from the narration.

684.262 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This is very close to the model's empirically measured error rate.

696.856 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Mean squared error versus compute.

700.225 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment