Dylan Patel
๐ค SpeakerAppearances Over Time
Podcast Appearances
Or it could be clean this data, right?
Here's this table, ton of dirty data in there.
Oh, there's like colons and stuff.
There's an address in one column.
How do I separate out the column?
So the address is like street address, city, zip code.
and I'll try a bunch of stuff, but like, hey, maybe it can't do that yet.
You give it an iterative, like, here's addresses, here's different formats, and you slowly iteratively teach it.
So another example is like, you're in a game, whether it's a tic-tac-toe or Call of Duty or a math puzzle, whatever the game is.
And that's what a lot of these environments initially have been is like, math puzzles, do this math puzzle.
Oh, well, I can't do this one because it's too hard.
Here's an easier one.
Okay, I can spin on this one.
Okay, I'm better enough.
Okay, now I can learn this one.
iteratively step through those to where from q4 of last year to q2 of this year these things hill climbed up math puzzles like crazy and a lot of that was not hey i just know the math a lot of that was here's how i use python to write something that does the math for me and now these things are actually quite good at math the environments can be super varied and it doesn't need to be something that's like clear cut and dry it can be here's here's a medical case
What's wrong with it?
And then you have another model say, well, here's the instructions on how you would grade the result of a case.
Well, it looks like they didn't even try this or didn't even look up this.
Okay, you did that wrong.