Keith Coleman
π€ SpeakerAppearances Over Time
Podcast Appearances
So the way we treat this now that's been working well is we still have that human layer where humans rate the notes in the same way as any other human-authored note.
And what we're working towards now is a way for AI and humans to collaborate more effectively to co-write better notes faster.
Yeah, there's this thing that we sometimes call reinforcement learning from community feedback, as opposed to just reinforcement learning from human feedback, which maybe would use potentially a smaller bias set of non-representative people.
And basically, in the case of community notes, what it would look like is directly training the model to be writing notes that would be maximally likely to be found helpful by a simulated set of raters who typically disagreed in the past.
Yeah, that's a really good point.
I think just in the same way that community notes spread less, even though there's no... Community notes caused posts to spread less, even though there's no downranking in the algorithm, I think you'll probably see something in Algus here where there's just a positive second-order effect from making that common ground, common knowledge.