Mike Stonebraker

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

We've been trying to make it work on real-world databases.

2558.034 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

especially real-world data warehouses.

2566.685 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

So we've been trying the technology on four different production databases, warehouses, where we've gotten the workload, the actual workload that's run, and from the actual users using the system,

2570.531 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And we've gotten them to reverse engineer the text that corresponds to that sequel.

2591.525 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

So we have text and sequel for, we have four benchmarks.

2600.094 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Tell me all the professors at MIT who won the Turing Award.

2618.052 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And so an LLM is supposedly good at that.

2622.699 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And so the text to SQL benchmarks, there's one called Spider, another one called Bird.

2628.548 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And the best LLM systems are pretty good at those benchmarks, like 80% accuracy or better.

2636.92 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Not superhuman, but they're pretty good.

2646.74 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Like, you would consider using them.

2649.004 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And, you know, like the current leaderboard is something like 85% accuracy, which, I mean, it's getting there.

2652.289 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

You say maybe it's not quite ready for prime time, but it certainly looks pretty good.

2659.62 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Well, on our benchmarks, large language models get 0%.

2668.12 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And if you enhance them with RAG and all the tricks, it goes to 10%.

2674.386 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

And if you give as a prompt the from clause, in other words, all the actual tables that need to be accessed, and all the actual join clauses that need to be joined, then accuracy goes to about 35%.

2681.633 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

So the definition of this stuff is not ready for prime time and not going to be for a while, if ever.

2700.031 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

So what's the difference?

2710.524 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Number one, LLMs are trained on the pile.

2713.167 View full episode →

The Peterman Pod

Turing Award Winner: Postgres, Disagreeing with Google, Future Problems | Mike Stonebraker

Data warehouse data is not in the pile.

2720.115 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment