Tim Davis
that inference is, in many ways, almost more complex than training at scale now.
Increasingly, at the Kubernetes-style layer of the stack, you have this idea of splitting out the prefill and decode components of the transformer architecture.
You have different ways of managing caching and prompt caching, and of doing that across all sorts of different models and hardware.
And so what you end up needing is essentially a solution there too.
Today we call it Mammoth, but fundamentally that's going to become part of our cloud offering.
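To make the prefill/decode split concrete, here is a minimal, purely illustrative Python sketch of disaggregated serving: a compute-bound prefill pool builds the KV cache once per prompt, and a separate, bandwidth-bound decode pool reuses it to generate tokens. The class and function names here are hypothetical assumptions for illustration, not Mammoth's actual API.

```python
from dataclasses import dataclass

# Hypothetical sketch of disaggregated serving: prefill and decode run as
# separate worker pools, and the KV cache produced by prefill is handed off
# to decode. Names (PrefillWorker, DecodeWorker, KVCache) are illustrative.

@dataclass
class KVCache:
    prompt: str
    # In a real system this would hold per-layer key/value tensors;
    # here a token count stands in for the cached state.
    tokens: int

class PrefillWorker:
    """Compute-bound: ingests the whole prompt once and builds the KV cache."""
    def prefill(self, prompt: str) -> KVCache:
        return KVCache(prompt=prompt, tokens=len(prompt.split()))

class DecodeWorker:
    """Memory-bandwidth-bound: generates tokens one at a time from the cache."""
    def decode(self, cache: KVCache, max_new_tokens: int) -> str:
        # Placeholder generation loop; a real decoder would run the model
        # autoregressively, reusing and extending the KV cache each step.
        return " ".join(f"<tok{i}>" for i in range(max_new_tokens))

def serve(prompt: str, max_new_tokens: int = 8) -> str:
    # Because the two phases are split, a router can scale the prefill and
    # decode pools independently and place them on different hardware.
    cache = PrefillWorker().prefill(prompt)
    return DecodeWorker().decode(cache, max_new_tokens)

if __name__ == "__main__":
    print(serve("Explain disaggregated prefill and decode serving."))
```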
And for many developers, our goal is just, look, we can serve across hardware.
Talking about analogies, Corey, we love to frame the unified compute model as a hypervisor for compute, right?
If you could log into a platform and just say: here's my throughput, here's my latency, here's how much accuracy I'm willing to give up through methods like quantization, and here's my cost target, just make it work.
Like when I log into Snowflake,
I don't execute a query and then go, oh, well, what CPU machine did Snowflake choose to execute my query?
I actually don't know, right?
Well, AI should be similar, certainly at least in the cloud to start.
You should be able to log into a platform, just give your requirements and expect the best TCO for your workload.
Like the total cost should be highly optimized to whatever those parameters are.
And software should figure it out.
It should be that simple for developers then to go off and build applications.
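As a rough illustration of that "state your requirements and let software figure it out" idea, here is a small hypothetical Python sketch: the developer declares throughput, latency, accuracy-loss tolerance, and a cost ceiling, and a scheduler picks the cheapest hardware/quantization configuration that satisfies all of them. The ServingRequirements and Candidate types and the sample numbers are assumptions made up for illustration, not part of any Modular product.

```python
from dataclasses import dataclass

# Hypothetical sketch of the "hypervisor for compute" idea: the developer
# states requirements and the platform picks the configuration with the best
# TCO. All names and figures below are illustrative assumptions.

@dataclass
class ServingRequirements:
    min_throughput_tps: float      # tokens per second
    max_latency_ms: float          # per-token latency budget
    max_accuracy_drop_pct: float   # tolerance for quantization-induced loss
    max_cost_per_hour: float       # dollar budget

@dataclass
class Candidate:
    name: str
    throughput_tps: float
    latency_ms: float
    accuracy_drop_pct: float
    cost_per_hour: float

def pick_best(req: ServingRequirements, candidates: list[Candidate]) -> Candidate | None:
    """Keep only configurations that meet every requirement, then minimize cost."""
    feasible = [
        c for c in candidates
        if c.throughput_tps >= req.min_throughput_tps
        and c.latency_ms <= req.max_latency_ms
        and c.accuracy_drop_pct <= req.max_accuracy_drop_pct
        and c.cost_per_hour <= req.max_cost_per_hour
    ]
    return min(feasible, key=lambda c: c.cost_per_hour, default=None)

if __name__ == "__main__":
    req = ServingRequirements(5000, 50, 1.0, 20.0)
    options = [
        Candidate("gpu-a-bf16", 6000, 35, 0.0, 30.0),
        Candidate("gpu-b-int8", 5200, 48, 0.8, 14.0),
        Candidate("gpu-c-fp8", 5800, 40, 0.5, 16.0),
    ]
    print(pick_best(req, options))  # cheapest option that meets every target
```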
Grant, hopefully that answers the question of what we're building at Modular.
That is very much what we're building.
Yeah.