Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Mark Williams-Cook

👤 Person
624 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
AI vs Google Search....behind the scenes

When people talk about AI, LLMs and the training data that goes in, and this is related as well to a lot of people saying, oh, you need, you should use structured data.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

You should use schema.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

So there's structured markup that traditionally we put on web pages for search engines.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

to explicitly label connections between things.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

So you would say, this is this website.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

This website is part of this company.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

This is the author.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

He or she works for this company.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

And you map out those connections, right?

The Neuron: AI Explained
AI vs Google Search....behind the scenes

The idea is it's explicit and it removes ambiguity.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

Now, the way the large language models are obviously working with their training is, you know, they get given all this data or they get given, they take whatever they like, it seems.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

But when that goes through the process of tokenization and such, the actual training data

The Neuron: AI Explained
AI vs Google Search....behind the scenes

is not like saved within the model right that you know the text isn't there all that happens is they you know it's broken down into these tokens these components and the model is the relationship between those you know all the different combinations of tokens so how it produces text so

The Neuron: AI Explained
AI vs Google Search....behind the scenes

The schema side of things, the structured data can't survive that process.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

It can kind of generate structured data because it sees those patterns together.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

But the fact that it once saw that the organization was candor for this website, because that's statistically such a drop, not even drop in the bucket, that's forever gone.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

So it doesn't, that's not encapsulated in, I guess what you could call like the language graph

The Neuron: AI Explained
AI vs Google Search....behind the scenes

which is the way they're storing knowledge.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

So that's one of the reasons why you get those issues.

The Neuron: AI Explained
AI vs Google Search....behind the scenes

And I've had this with clients as well, where it thought that two different websites that were named similar were the same entity, if you wanna call it that.