Keith Coleman
π€ SpeakerAppearances Over Time
Podcast Appearances
reposts will drop by about 50% or 2x after a note's applied.
And this is, you know, this is really big in the scale of social media, like, you know, 1 or 5% when would be, you know, pretty big in the scale of typical A-B tests.
So...
One thing that I think is really heartening about this is that we know from this and some other studies that actually people are not just entrancing their beliefs.
When a note is applied to a post, they'll actually agree with the core claims in the post less.
And I think that's really cool.
And I guess there's a little bit of a mixed blessing here, though, because actually post authors will also be more likely to delete their post after they get noted.
So in that way, the best notes actually get seen very infrequently.
So I'm torn about that, because just for me personally, I think not everyone agrees on this, but for me personally, I'd rather see a post and a note than neither at all, just because that's probably not the only time in the world where you're ever going to see that particular wrong claim, so maybe you'll see it off X somewhere in another post.
And just for me, seeing a lot of notes has kind of increased the skepticism that I have when reading things.
Yeah, manipulation's a real thing.
I mean, people are always trying to game social media algorithms, and community notes is no exception.
So I think one thing to call out is that surprising agreement mechanism does provide a bit of a defense against a more naive attack than the one you described.
There's a lot of people with the same view all piling on, trying to get an incorrect note showing.
You know, that's not going to work.
But for a more sophisticated attack like the one you described,
We do have a lot of defences in place, so just to name a few, we do things like requiring a verified phone number from a trusted carrier, just to increase the probability that we're dealing with real humans.
We look for raters who have rated things really similarly in the past, and actually we might treat them as the same person, just to limit the influence of really similar behaviour.
Another thing is we can look at random samples of raters, and if they're rating things very differently than self-selected, possibly malicious raters, then that's a very important signal.
And we have other things too, like there's rater reputation to deal with low-quality people,