Andy Halliday

Speaker
8321 total appearances

Podcast Appearances

The Daily AI Show
Spotify Engineers Stopped Writing Code

Right.

Right.

That would need to have, you know, no latency, zero latency.

You have to be right on top of the conversation.

Yeah, I think what Greg brings into the conversation here is the idea of really low latency, vastly accelerated inference, being on board an embodied AI in a robot.

So you want that to happen.

You don't want the robot to be there frozen for a while while it's trying to think about how you're talking.

Let me tie this back to the question that we raised.

I think, Beth, you raised it earlier.

Does it really matter to us that these frontier model developments are getting to the point where they're maxing out all the benchmarks we can throw at them?

And I think it does because when you imbue a deep neural network with that level of reasoning capability, you can then distill it.

So the people who are investing hugely in the training runs that develop these reasoning capabilities are basically conferring on us ultimately much smaller, lighter-weight, faster, low-latency models that we can use inexpensively.

So one of the things, by the way, that Google 3.0 DeepThink does is dramatically reduce the cost of computation per problem, by 80% compared with its prior efforts.

So it's not only more capable, it's more efficient also.

Energy-wise.

And now you distill that model: take DeepThink and create a distillation of it into a much smaller model.

And it retains a good percentage of the capabilities of the larger model, but with even better efficiency and lower latency because it's so much smaller.
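
[Editor's sketch] The distillation idea described here can be illustrated with the classic soft-target formulation: a small "student" model is trained to match the temperature-softened output distribution of a large "teacher". This is only a minimal illustration under stated assumptions, not a description of how any particular lab distills its models. The function names, the logits, and the temperature value are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    A higher temperature flattens both distributions, so the student
    also learns how the teacher ranks the less likely answers.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


# Hypothetical logits for a three-way choice.
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # prefers a different answer

print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

A student that agrees with the teacher gets a loss near zero, and one that disagrees gets a larger loss. In training, that loss would be minimized over the student's parameters, usually alongside an ordinary task loss.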

And that, I think, is the path that we're on now, leading up to a time when there will be edge devices and embodied AIs that work in and around us that have that level of reasoning capability, not superhuman yet, but near-human capabilities and in real time.

You can shame the models into better performances.