
John Schulman

Speaker
528 total appearances


Podcast Appearances

Dwarkesh Podcast
John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

So I think if you publish methods that are really hard to implement or really finicky, they'll tend to get forgotten.

And as a result, people actually try to open source their work a lot.

I guess there are also various unfavorable incentives.

Yeah, people are incentivized to make the baseline methods, the methods they're comparing to, worse.

There are other mild pathologies, like trying to make your method seem sophisticated mathematically.

But I would say overall, I feel like the field makes progress.

I would probably like to see a bit more science and trying to understand things, rather than hill climbing on benchmarks and trying to propose new methods.

And there's been a decent amount of that recently, but yeah, I think we could use more of that.

And I think that's a good thing for academics to work on.

Oh yeah, on the social sciences, on a slightly different note: I'd be really excited to see more research using base models to do simulated social science, because these models have a probabilistic model of the whole world. You can set up a simulated questionnaire or a conversation, and you can look at how anything is correlated: any traits that you might imagine, you can see how they might be correlated with other traits.

So it'd be pretty cool to see if people could replicate some of the more notable results in the social sciences, like moral foundations and that sort of thing, by just prompting base models in different ways and seeing what's correlated.
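
That kind of experiment is straightforward to sketch. The snippet below is a minimal, hypothetical version of the idea, not anything described in the conversation: it assumes a Hugging Face base model (gpt2 as a stand-in) plus invented personas and survey items, scores each item by the model's relative log-probability of answering "agree" versus "disagree", and then correlates trait scores across the simulated respondents.

```python
# Minimal sketch of "simulated social science" with a base language model.
# The model name, personas, and survey items are placeholders.
import numpy as np
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in; any base (non-instruction-tuned) LM works
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

def agreement_score(persona: str, statement: str) -> float:
    """Return log P(" agree") - log P(" disagree") for the next token."""
    prompt = (
        f"{persona}\n"
        f'Survey item: "{statement}"\n'
        "Response (agree or disagree):"
    )
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]        # next-token distribution
    logprobs = torch.log_softmax(logits, dim=-1)
    agree_id = tok(" agree", add_special_tokens=False).input_ids[0]
    disagree_id = tok(" disagree", add_special_tokens=False).input_ids[0]
    return (logprobs[agree_id] - logprobs[disagree_id]).item()

personas = [
    "The respondent is a 25-year-old urban software engineer.",
    "The respondent is a 60-year-old rural farmer.",
    "The respondent is a 40-year-old schoolteacher.",
]
trait_a = "It is important to respect the decisions made by authorities."
trait_b = "People should be free to make their own lifestyle choices."

scores_a = [agreement_score(p, trait_a) for p in personas]
scores_b = [agreement_score(p, trait_b) for p in personas]

# With many more personas and items, this correlation is the quantity you
# would compare against published survey results.
print(np.corrcoef(scores_a, scores_b)[0, 1])
```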

What is that Stanford experiment?

Yeah, well, definitely there's always progress in improving the efficiency.

Whenever you have a 1D performance metric, you're going to find that different improvements can kind of substitute for each other.

So you might find that post-training and pre-training both improve the metrics, but they'll have slightly different profiles of which metrics they improve.

But if at the end of the day you have a single number, they're going to substitute for each other somewhat.
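
To make the substitution point concrete, here is a toy illustration with invented numbers: if the headline benchmark is a fixed weighted average of sub-scores, a pre-training gain and a post-training gain that help different sub-scores can still move the headline number by roughly the same amount.

```python
# Toy example (invented numbers): two improvements that help different
# sub-metrics move a single aggregate score by similar amounts, so they
# partially substitute for each other.
weights = {"reasoning": 0.4, "coding": 0.3, "instruction_following": 0.3}

baseline = {"reasoning": 60.0, "coding": 55.0, "instruction_following": 70.0}
pretraining_gain = {"reasoning": 4.0, "coding": 3.0, "instruction_following": 0.5}
posttraining_gain = {"reasoning": 0.5, "coding": 1.0, "instruction_following": 8.0}

def headline(scores):
    return sum(weights[k] * scores[k] for k in weights)

def apply(scores, gain):
    return {k: scores[k] + gain[k] for k in scores}

print(headline(baseline))                           # 61.5
print(headline(apply(baseline, pretraining_gain)))  # 64.15
print(headline(apply(baseline, posttraining_gain))) # 64.4
```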