Dario Amodei
๐ค SpeakerAppearances Over Time
Podcast Appearances
Whenever you want to do some kind of like, you know, difficult analysis, like if I, you know, I want to write code, for instance, or, you know, I want to brainstorm ideas or I want to do creative writing, I want the really powerful model. But then there's a lot of practical applications in a business sense where it's like, I'm interacting with a website.
I'm doing my taxes or I'm talking to a legal advisor and I want to analyze a contract. Or we have plenty of companies that are just like, I want to do autocomplete on my IDE or something. And for all of those things, you want to act fast and you want to use the model very broadly. So we wanted to serve... that whole spectrum of needs.
I'm doing my taxes or I'm talking to a legal advisor and I want to analyze a contract. Or we have plenty of companies that are just like, I want to do autocomplete on my IDE or something. And for all of those things, you want to act fast and you want to use the model very broadly. So we wanted to serve... that whole spectrum of needs.
I'm doing my taxes or I'm talking to a legal advisor and I want to analyze a contract. Or we have plenty of companies that are just like, I want to do autocomplete on my IDE or something. And for all of those things, you want to act fast and you want to use the model very broadly. So we wanted to serve... that whole spectrum of needs.
Um, so we ended up with this, uh, you know, this kind of poetry theme. And so what's a really short poem. It's a haiku. And so haiku is the small, fast, cheap model that is, you know, was at the time was really surprisingly, surprisingly, uh, intelligent for how fast and cheap it was. Uh, Sonnet is a medium-sized poem, right? A couple paragraphs. And so sonnet was the middle model.
Um, so we ended up with this, uh, you know, this kind of poetry theme. And so what's a really short poem. It's a haiku. And so haiku is the small, fast, cheap model that is, you know, was at the time was really surprisingly, surprisingly, uh, intelligent for how fast and cheap it was. Uh, Sonnet is a medium-sized poem, right? A couple paragraphs. And so sonnet was the middle model.
Um, so we ended up with this, uh, you know, this kind of poetry theme. And so what's a really short poem. It's a haiku. And so haiku is the small, fast, cheap model that is, you know, was at the time was really surprisingly, surprisingly, uh, intelligent for how fast and cheap it was. Uh, Sonnet is a medium-sized poem, right? A couple paragraphs. And so sonnet was the middle model.
It is smarter, but also a little bit slower, a little bit more expensive. And opus, like a magnum opus is a large work, opus was the largest, smartest model at the time. So that was the original kind of thinking behind it. Yeah. And our thinking then was, well, each new generation of models should shift that trade-off curve.
It is smarter, but also a little bit slower, a little bit more expensive. And opus, like a magnum opus is a large work, opus was the largest, smartest model at the time. So that was the original kind of thinking behind it. Yeah. And our thinking then was, well, each new generation of models should shift that trade-off curve.
It is smarter, but also a little bit slower, a little bit more expensive. And opus, like a magnum opus is a large work, opus was the largest, smartest model at the time. So that was the original kind of thinking behind it. Yeah. And our thinking then was, well, each new generation of models should shift that trade-off curve.
So when we released Sonnet 3.5, it has the same, roughly the same, you know, cost and speed as the Sonnet 3 model. Uh, but, uh, it, it increased its intelligence to the point where it was smarter than the original Opus 3 model, uh, especially for code, but, but also just in general.
So when we released Sonnet 3.5, it has the same, roughly the same, you know, cost and speed as the Sonnet 3 model. Uh, but, uh, it, it increased its intelligence to the point where it was smarter than the original Opus 3 model, uh, especially for code, but, but also just in general.
So when we released Sonnet 3.5, it has the same, roughly the same, you know, cost and speed as the Sonnet 3 model. Uh, but, uh, it, it increased its intelligence to the point where it was smarter than the original Opus 3 model, uh, especially for code, but, but also just in general.
And so now, you know, we've shown results for a Haiku 3.5 and I believe Haiku 3.5, the smallest new model is about as good as Opus 3, the largest old model. So basically, the aim here is to shift the curve, and then at some point, there's going to be an Opus 3.5. Now, every new generation of models has its own thing. They use new data.
And so now, you know, we've shown results for a Haiku 3.5 and I believe Haiku 3.5, the smallest new model is about as good as Opus 3, the largest old model. So basically, the aim here is to shift the curve, and then at some point, there's going to be an Opus 3.5. Now, every new generation of models has its own thing. They use new data.
And so now, you know, we've shown results for a Haiku 3.5 and I believe Haiku 3.5, the smallest new model is about as good as Opus 3, the largest old model. So basically, the aim here is to shift the curve, and then at some point, there's going to be an Opus 3.5. Now, every new generation of models has its own thing. They use new data.
Their personality changes in ways that we kind of try to steer but are not fully able to steer. And so there's never quite that exact equivalence where the only thing you're changing is intelligence. We always try and improve other things, and some things change without us knowing or measuring. So it's very much an inexact science.
Their personality changes in ways that we kind of try to steer but are not fully able to steer. And so there's never quite that exact equivalence where the only thing you're changing is intelligence. We always try and improve other things, and some things change without us knowing or measuring. So it's very much an inexact science.
Their personality changes in ways that we kind of try to steer but are not fully able to steer. And so there's never quite that exact equivalence where the only thing you're changing is intelligence. We always try and improve other things, and some things change without us knowing or measuring. So it's very much an inexact science.
In many ways, the manner and personality of these models is more an art than it is a science.