Anjney Midha

Speaker
787 total appearances

Podcast Appearances

Odd Lots
Anjney Midha's Plan to Radically Lower the Price of Compute

You then do mid-training, which is to say, in a particular domain that you really care about, you inject more capabilities.

So if you want this model to reason about science or math or physics, then you give it science or math or physics data.

And then you get a pretty good model that's specialized in that domain.

And then you deploy it to the real world where you have people using it.

And...

The context feedback, which is when the model is able to do a task well or not and you can verify whether that task was done correctly, gives the model the data it needs to keep improving on that task, on that distribution.

...give me another output, or, like, would you do it again the same way? And they often say yes, or they give a very similar answer. They don't seem to be responding in real time.

Correct. So when I say feedback, I mean a very specific kind of feedback, which I call verifiable feedback. When you say "that wasn't right" or "that was wrong," that's an opinion. Verifiable feedback is when you can have as close to factual verification as possible. The reason...

That's a great question.

So let's reason by example in two or three cases.

In the case of software engineering, the way software engineers actually code is you write a piece of code and then you submit it to the main code base.

And then you usually have a peer on your team review the code and approve it or reject it.

And if it gets approved, that's the first step.

That's called a PR, a pull request.

And if another human on your team that you trust approved it, that's one kind of verification of quality.

And then two...

Before that piece of code usually gets deployed to a production system, you have unit tests.

And those are quite objective tests of, is this code performing the function we need it to?

And if it passes both those tests, it's a verifiable piece of code that accomplished the goal.
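
As a rough illustration of the two checks just described (peer approval and unit tests), here is a minimal Python sketch of how they might be combined into a single pass/fail signal. Nothing in it comes from the episode: the names `Submission`, `run_unit_tests`, `verifiable_reward`, and `ADD_CASES` are hypothetical, and a real lab's pipeline is certainly more involved.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical types and names, for illustration only.
TestCase = Tuple[Tuple[int, int], int]  # ((arg1, arg2), expected_result)


@dataclass
class Submission:
    """A candidate change, standing in for a pull request."""
    func: Callable[[int, int], int]  # the code under review
    peer_approved: bool              # outcome of the human review


def run_unit_tests(func: Callable[[int, int], int], cases: List[TestCase]) -> bool:
    """Return True only if func gives the expected result on every case."""
    for args, expected in cases:
        try:
            if func(*args) != expected:
                return False
        except Exception:
            # A crash counts as a failed test.
            return False
    return True


def verifiable_reward(sub: Submission, cases: List[TestCase]) -> float:
    """1.0 if the change passed both review and unit tests, otherwise 0.0."""
    passed = sub.peer_approved and run_unit_tests(sub.func, cases)
    return 1.0 if passed else 0.0


# Assumed test cases for a function that should add two integers.
ADD_CASES: List[TestCase] = [((1, 2), 3), ((0, 0), 0), ((-1, 1), 0)]

good = Submission(func=lambda a, b: a + b, peer_approved=True)
buggy = Submission(func=lambda a, b: a - b, peer_approved=True)
unreviewed = Submission(func=lambda a, b: a + b, peer_approved=False)

for name, sub in [("good", good), ("buggy", buggy), ("unreviewed", unreviewed)]:
    print(name, verifiable_reward(sub, ADD_CASES))
```

The signal here is deliberately binary: the unit tests are the objective part, while the approval flag is a recorded human decision, so together they approximate the "as close to factual verification as possible" standard rather than an opinion about the code.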

So in software engineering, the reason we've seen such a dramatic improvement in capabilities is that a lot of these labs are using feedback from that verification loop.

In the case of another lab I incubated called Periodic Labs, which we started a year ago, and you should come by sometime.