Jacob Hilton

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Using only a handful of computational operations, we were able to mechanistically estimate the model's accuracy to within under one part in 13,000, which would have taken tens of thousands of samples.

703.69 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

So our mechanistic estimate was much more computationally efficient than random sampling.

715.469 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Moreover, we could have easily produced a much more precise estimate, exact to within floating point error, by simply computing how close the 8 subscript 0 and 8 subscript 1 were in the two yellow regions.

720.577 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Surprise Accounting As explained here, the total surprise decomposes into the surprise of the explanation plus the surprise given the explanation.

733.476 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

The surprise given the explanation is close to 0 bits, since the calculation was essentially exact.

742.829 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

For the surprise of the explanation, we can walk through the steps we took.

749.112 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

There's a list of bullet points here.

753.599 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We standardized the model, which simply replaced the model with an exactly equivalent one.

755.762 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This did not depend on the model's parameters at all, and so doesn't incur any surprise.

761.15 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We checked the signs of all 10 of the model's parameters and whether or not each of the 4 entries of complex formula omitted from the narration was greater than or less than 1 in magnitude, incurring 14 bits of surprise.

767.118 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We deduced from this the form of the piecewise linear function delta.

781.322 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This was another step that didn't depend on the model's parameters and so doesn't incur any surprise.

785.57 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We checked which of the two linear coefficients was larger in magnitude in each of the four blue and green regions incurring four bits of surprise.

791.738 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

We checked that the two linear coefficients were equal in magnitude in each of the two yellow regions to within one part in.

802.388 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Complex formula omitted from the narration.

808.614 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Incurring around 22 bits of surprise.

811.136 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

That's the end of the list.

815 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

Adding this up, the total surprise is around 40 bits.

816.962 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

This plausibly matches the number of bits of optimization used to select the model, since it was probably necessary to optimize the linear coefficients in the yellow regions to be almost equal.

820.915 View full episode →

LessWrong (Curated & Popular)

"AlgZoo: uninterpreted models with fewer than 1,500 parameters" by Jacob_Hilton

So we can be relatively comfortable in saying that we have achieved a full understanding.

831.39 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment