Kelsey Piper
๐ค SpeakerAppearances Over Time
Podcast Appearances
It's more interesting in cases where something is actually disputed or in cases where the answer you would find on the first page of Google searches isn't the right answer. I don't think Grok tends to perform well on those. But again, all of these AI models are way better at what they do than they were a year ago.
So whenever we have these conversations, I do try and look a little forward and ask myself, if we have this conversation same time next year, what will we be talking about? And getting better at answering questions accurately is something that we've seen and I think we'll continue to see.
So whenever we have these conversations, I do try and look a little forward and ask myself, if we have this conversation same time next year, what will we be talking about? And getting better at answering questions accurately is something that we've seen and I think we'll continue to see.
So whenever we have these conversations, I do try and look a little forward and ask myself, if we have this conversation same time next year, what will we be talking about? And getting better at answering questions accurately is something that we've seen and I think we'll continue to see.
So I would bet pretty confidently that Grok in a year will have a better batting average, unless it is deliberately manipulated by Elon to lie in favor of his biases.
So I would bet pretty confidently that Grok in a year will have a better batting average, unless it is deliberately manipulated by Elon to lie in favor of his biases.
So I would bet pretty confidently that Grok in a year will have a better batting average, unless it is deliberately manipulated by Elon to lie in favor of his biases.
One thing I do sort of out of morbid curiosity is I invite all the AIs to look at my document of notes and then write the future perfect newsletter for me. Of course, I would never, I never publish that version, but I'm curious, like, are they capable of it? You know, am I soon to be obviated? And they are not capable of it. But Gemini comes the closest.
One thing I do sort of out of morbid curiosity is I invite all the AIs to look at my document of notes and then write the future perfect newsletter for me. Of course, I would never, I never publish that version, but I'm curious, like, are they capable of it? You know, am I soon to be obviated? And they are not capable of it. But Gemini comes the closest.
One thing I do sort of out of morbid curiosity is I invite all the AIs to look at my document of notes and then write the future perfect newsletter for me. Of course, I would never, I never publish that version, but I'm curious, like, are they capable of it? You know, am I soon to be obviated? And they are not capable of it. But Gemini comes the closest.
But almost nobody uses Gemini in the AI studio chat window. Most people see Google's AI either in Google search results, which is a cheaper to run model, or they see integrations being offered to them in eight different products where they don't necessarily want an integration. I'm perfectly happy to write my own emails.
But almost nobody uses Gemini in the AI studio chat window. Most people see Google's AI either in Google search results, which is a cheaper to run model, or they see integrations being offered to them in eight different products where they don't necessarily want an integration. I'm perfectly happy to write my own emails.
But almost nobody uses Gemini in the AI studio chat window. Most people see Google's AI either in Google search results, which is a cheaper to run model, or they see integrations being offered to them in eight different products where they don't necessarily want an integration. I'm perfectly happy to write my own emails.
Yes. They were the first to launch a language model in the form of a chatbot that you could talk with. And they have the largest share of users. And a lot of the recent, like, very cool AI functionality people have seen, like the ability to turn all your family pictures into cartoons, that has come out of OpenAI and out of ChatGPT.
Yes. They were the first to launch a language model in the form of a chatbot that you could talk with. And they have the largest share of users. And a lot of the recent, like, very cool AI functionality people have seen, like the ability to turn all your family pictures into cartoons, that has come out of OpenAI and out of ChatGPT.
Yes. They were the first to launch a language model in the form of a chatbot that you could talk with. And they have the largest share of users. And a lot of the recent, like, very cool AI functionality people have seen, like the ability to turn all your family pictures into cartoons, that has come out of OpenAI and out of ChatGPT.
You can still, if you work really hard at it, find some crazy behavior from OpenAI's models. The way I would say it is how much work you have to put in to get it to say something horrible is much higher for OpenAI than Grok. For Grok, it's pretty easy to lead Grok into saying something horrible, even when they haven't tampered it to talk about South Africa exclusively.
You can still, if you work really hard at it, find some crazy behavior from OpenAI's models. The way I would say it is how much work you have to put in to get it to say something horrible is much higher for OpenAI than Grok. For Grok, it's pretty easy to lead Grok into saying something horrible, even when they haven't tampered it to talk about South Africa exclusively.
You can still, if you work really hard at it, find some crazy behavior from OpenAI's models. The way I would say it is how much work you have to put in to get it to say something horrible is much higher for OpenAI than Grok. For Grok, it's pretty easy to lead Grok into saying something horrible, even when they haven't tampered it to talk about South Africa exclusively.
Great question. Yeah, I asked Claude once, and Claude was like, as an AI language model, I don't have a gender identity.