Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Robert M

👤 Speaker
195 total appearances

Appearances Over Time

Podcast Appearances

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Twitter thread.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

I have some complaints about both the paper and the accompanying blog post.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Subheading.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

TLDR.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

The paper's abstract says that in several settings, larger, more capable models are more incoherent than smaller models, but in most settings they are more coherent.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

This emphasis is even more exaggerated in the blog post and Twitter thread.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

I think this is pretty misleading.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

The paper's technical definition of dinkoherence is uninteresting and the framing of the paper, blog post, and Twitter thread equivocate with the more normal English language definition of the term, which is extremely misleading.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Section 5 of the paper, and to a larger extent the blog post and Twitter, attempt to draw conclusions about future alignment difficulties that are unjustified by the experiment results and would be unjustified even if the experiment results pointed in the other direction.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

The blog post is substantially LLM written.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

I think this contributed to many of its overstatements.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

I have no explanation for the Twitter thread, except that maybe it was written by someone who only read the blog post.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Heading.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Paper.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

The paper's abstract says, Quote.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Incoherence changes with model scale in a way that is experiment dependent.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

However, in several settings, larger, more capable models are more incoherent than smaller models.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

Consequently, scale alone seems unlikely to eliminate incoherence.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

End quote.

LessWrong (Curated & Popular)
"Anthropic’s “Hot Mess” paper overstates its case (and the blog post is worse)" by RobertM

This is an extremely selective reading of the results, where in almost every experiment, model coherence increased with size.