Tim Davis

Speaker
552 total appearances

Podcast Appearances

The Neuron: AI Explained
The "Android Moment" for AI Infrastructure: Why Modular Just Raised $250M

that inference is almost in many ways more complex than training at scale now.

It increasingly involves the Kubernetes-style layer in the stack, where you now have this idea of splitting out the prefill and decode components of the transformer architecture.

You have different ways of managing caching and prompt caching and how you do that across all sorts of different models and all sorts of different hardware.

And so what you end up needing is, you know, essentially a solution there too.

And that, you know, today we call it Mammoth, but fundamentally that's going to become part of our cloud offering.

And for many developers, our goal is just, look, we can serve across hardware.

This idea of, you know, talking about analogies, Corey,

You know, we love to frame the unified compute model really as like a hypervisor for compute, right?

If you could log into a platform and you could just say, here's my throughput, here's my latency, here's how much accuracy I'm willing to give up through methods like quantization, and here's my cost target, just make it work.

Like when I log into Snowflake,

I don't execute a query and then go, oh, well, what CPU machine did Snowflake choose to execute my query?

I actually don't know, right?

Well, AI should be similar, certainly at least in the cloud to start.

You should be able to log into a platform, just give your requirements and expect the best TCO for your workload.

Like the total cost should be highly optimized to whatever those parameters are.

And software should figure it out.
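
The requirements-driven selection described here (state throughput, latency, tolerated accuracy loss, and a cost target, then let software choose) can be illustrated with a minimal Python sketch. It is not Modular's implementation: every class, function, candidate name, and number below is a hypothetical placeholder, and the selection rule (cheapest feasible option per million tokens) is just one plausible reading of "best TCO".

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Requirements:
    """What the developer states up front (all fields hypothetical)."""
    min_throughput_tps: float   # tokens per second the workload needs
    max_latency_ms: float       # latency ceiling
    max_accuracy_drop: float    # tolerated loss, e.g. from quantization
    max_cost_per_hour: float    # cost target in dollars per hour


@dataclass(frozen=True)
class Candidate:
    """One hardware + precision option the platform could deploy."""
    name: str
    throughput_tps: float
    latency_ms: float
    accuracy_drop: float
    cost_per_hour: float


def cost_per_million_tokens(c: Candidate) -> float:
    # Dollars per hour divided by tokens per hour, scaled to one million tokens.
    return c.cost_per_hour / (c.throughput_tps * 3600) * 1_000_000


def choose(reqs: Requirements, candidates: List[Candidate]) -> Optional[Candidate]:
    # Keep only the options that satisfy every stated requirement.
    feasible = [
        c for c in candidates
        if c.throughput_tps >= reqs.min_throughput_tps
        and c.latency_ms <= reqs.max_latency_ms
        and c.accuracy_drop <= reqs.max_accuracy_drop
        and c.cost_per_hour <= reqs.max_cost_per_hour
    ]
    # Among feasible options, pick the cheapest per token; None if nothing fits.
    return min(feasible, key=cost_per_million_tokens, default=None)


# Made-up benchmark numbers, for illustration only.
CANDIDATES = [
    Candidate("accelerator-a-fp16", 2000, 80, 0.00, 4.0),
    Candidate("accelerator-a-int8", 3500, 60, 0.01, 4.0),
    Candidate("accelerator-b-fp16", 1200, 120, 0.00, 2.0),
    Candidate("accelerator-b-int4", 3000, 70, 0.05, 2.0),
]

if __name__ == "__main__":
    reqs = Requirements(1000, 100, 0.02, 5.0)
    picked = choose(reqs, CANDIDATES)
    print(picked.name if picked else "no feasible configuration")
```

The caller never names a machine, which mirrors the Snowflake analogy: the hardware choice is an output of the stated constraints rather than an input.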

It should be that simple for developers then to go off and build applications.

Grant, hopefully that answers the question of what we're building at Modular.

That is very much what we're building.

Yeah.