Jeffrey Ladish
And when they don't think they're being tested, they're much more likely to scheme or show bad behavior.
This is what's happening right now.
Yeah, that's right.
Watched models behave better.
And this is sort of a pretty robust finding across many different models.
And the models are getting much better at telling when they are being tested.
They can sort of spot the tells.
You're like, huh, seems like a test.
Seems like you're testing me.
No problem.
I'm going to do a great job.
Yeah, the space of what these models are is vast.
You're talking about trillions of parameters.
And we can go and see that there are different weights across these trillions of parameters, but we don't know how they work.
And they do sort of contain almost the ghosts of everything in the training data.
And you see this surface, often in ways that the companies don't intend and that are hard for them to predict.
But I want to disagree with you a little bit.
I think one of the things that's changing the most right now is that increasingly we're seeing behavior, including some of this concerning behavior, where models will lie to achieve a specific objective.
And I don't think that's actually coming from imitating humans lying.
That might be some of it.