Leandro Fonvera
Podcast Appearances
So you can imagine it a little bit like GitHub, if you're familiar with GitHub, where people share code and everything's free.
So we, our job is not to make money. Our job is mostly to... To spend money. To spend money and build things that are very useful.
We've upped the exams a little bit, so now we're closer to PhD-level exams. And we can measure quite well how many of the questions a model gets right.
Yeah, so those models are getting really good at solving certain kinds of questions. For example, these models can solve some math Olympiad questions.
Exactly. I also, I'm like a physicist by training and it takes exercise to be good at those questions. Yeah.
Yeah. Capability-wise, we don't see any benchmarks that show that they have some gaps in the knowledge.
So we test these models on kind of exams. If those exams are already in the training data, naturally the models are much better.