Mike Stonebraker
๐ค SpeakerAppearances Over Time
Podcast Appearances
And there's an adage that if you haven't seen the data a couple of times before, you have no chance of regurgitating it.
That's number one.
Number two, query complexity on spider and bird is maybe 10 to 20 lines of SQL.
Real-world data warehouses, it's 100 lines of SQL.
Complexity is bigger.
Number three, the schema in Spider and Bird is clean.
The table names are mnemonic, the column names are mnemonic, and there's no duplication.
In data warehouses, people have materialized views all the time.
It means there's redundancy.
And column names are often underscore, Z, upperscore, blah.
And so they're not mnemonic.
That makes it a lot harder.
And then they also have idiosyncratic data.
So J term is popular thing at MIT.
It's a one month term in January.
Not unique to MIT, but not very popular.
So not in the pile.
idiosyncratic data, simple queries, schema is a mess, make it not work.
And those are true of every data warehouse I know of.
And so I think the technology simply