Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Oly Sourbut

πŸ‘€ Speaker
832 total appearances

Appearances Over Time

Podcast Appearances

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

And a lot of this we might call, these are kind of epistemic qualities or virtues.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

So in fact, one of the pieces of our kind of human reasoning puzzle is, can we ensure that the AI components we're putting into these tools for human reasoning

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

in particular LMs, like particularly salient ELMs, can we ensure that they are virtuous in these ways that we think are necessary, not only as kind of conversation partners and as agents, but as building blocks for tools that we might want to build downstream of them.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

So let's go back to this kind of scenario planning, deep research-esque, but geared towards scenario planning.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

Let's go back to that example.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

If you're...

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

system has systematic blind spots, or even if it's scheming and it wants to hide certain parts of the mechanics of the world or of the specifics of the situation, which is kind of a much more pernicious thing.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

then you might expect that it could perhaps surreptitiously or even inadvertently kind of surface a biased summary of the situation.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

And that could lead you to kind of systematically biased decisions.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

I guess I'm trying to think kind of concretely.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

If there was some kind of political shenanigans going on and there were particular parties that were behaving shadily, you could imagine a sufficiently...

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

either blind spotted or sufficiently kind of scheming model that was part of a system like that.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

You can imagine it just kind of discarding such references before they kind of bubble up into the ecosystem.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

And that can give you these blind spots.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

So yeah, one way we can potentially counteract that is building in these guarantees.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

And that's a strong word.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

I perhaps need to soften that.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

But building in some level of incentives for the developers and also benchmarking and testing and so on for these epistemic, like thoroughness, legibility, and these kind of qualities.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

One way that that gets done in contemporary AI, very often the way that you make progress is by having benchmarks, by having testing suites and this kind of thing.

Future of Life Institute Podcast
How AI Can Help Humanity Reason Better (with Oly Sourbut)

So one way we're hoping to kind of incentivize that whole area is to enable people to build really great environments for testing these epistemically virtuous properties like thoroughness, legibility, skepticism, this kind of thing.