Boris
And so, yeah, in this case, it renamed the receipts.
So I'm just going to open this up to double check.
Yeah, cool.
And so the receipts were renamed and a little better organized.
And so maybe what I can try next is, let's put this in a spreadsheet.
Yeah, that's right.
We put so much work into safety and making sure that as this happens, you don't accidentally shoot yourself in the foot and delete files or whatever.
There's just a huge amount of work that went into this.
It starts at the model side, where for Anthropic, from the very beginning, we were the AI safety lab, and that's the reason that we exist.
And so there's a lot of work going into alignment and mechanistic interpretability, and kind of all these ideas to make sure that the model does what you want in a way that's safe, at the model layer.
And this literally means studying the neurons, kind of the same way that you would study neurons in the human brain.
And so you can identify structures and you can kind of study in a very scientific way as a black box also to make sure that it's safe.
So this is called mechanistic interpretability.
And then we do a whole bunch of other stuff.
So there's actually a whole virtual machine running under the hood.
And this is just to make sure that any actions taken are safe and don't affect your broader system.
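To make the idea concrete: the virtual machine described here isolates the agent's actions from your broader system. A real sandbox isolates far more, but a minimal sketch of the same idea, assuming a hypothetical `run_sandboxed` helper, is to run commands inside a throwaway working directory with a stripped-down environment:

```python
import subprocess
import tempfile

def run_sandboxed(command: list[str]) -> str:
    """Run a command inside a throwaway working directory with a
    minimal environment. This only scopes file writes and env vars;
    a real VM-based sandbox isolates the filesystem, network, and
    processes far more thoroughly."""
    with tempfile.TemporaryDirectory() as workdir:
        result = subprocess.run(
            command,
            cwd=workdir,                     # writes land in the temp dir
            env={"PATH": "/usr/bin:/bin"},   # strip inherited env vars
            capture_output=True,
            text=True,
            timeout=30,
        )
        return result.stdout

# The command sees the temp dir as its working directory,
# so relative-path writes never touch your real files.
print(run_sandboxed(["pwd"]).strip())
```

The temp directory is deleted when the `with` block exits, so anything the command wrote is cleaned up automatically.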
And then as of last week, there's also deletion protection.
So if you accidentally delete something, then you're going to get prompted first.
So the model can kind of make sure that that's actually a thing that you want to do.
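The deletion-protection behavior described above can be sketched roughly like this. This is an illustration, not the actual implementation; the `guarded_delete` helper and its prompt wording are hypothetical:

```python
from pathlib import Path

def guarded_delete(path: str, confirm=input) -> bool:
    """Hypothetical sketch of deletion protection: before removing
    a file, prompt the user and only proceed on an explicit 'y'.
    Returns True if the file was actually deleted."""
    target = Path(path)
    if not target.exists():
        return False
    answer = confirm(f"Delete {target}? [y/N] ")
    if answer.strip().lower() == "y":
        target.unlink()
        return True
    # Anything other than an explicit yes is treated as "keep the file".
    return False
```

Passing `confirm` as a parameter (defaulting to `input`) keeps the prompt testable: a test can inject `lambda msg: "y"` instead of waiting on a real user.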
Obviously, also, as we start interacting with the Internet, something like prompt injection is quite scary.