Andy Halliday
๐ค SpeakerAppearances Over Time
Podcast Appearances
But what's interesting here is this is agentic reasoning.
And I think that Codex is a model that is more focused on agentic reasoning, but focused on coding as well.
And I just want to point out that
GPT-5.2 codex high only gets 57 on that agenda index.
So this is inconsistent with the analysis and commentary that I've been seeing about how superb and superior codex is.
But in terms of agentic coding,
That may be true just in the coding area.
But when it comes to overall agentic reasoning, it's far behind the leader, which is Claude Opus 4.6 Max.
It's 11 points behind that on the artificial analysis index.
So this is all conspired to give me little reason to leave Claude.
I mean, for all the buzz about Codex, I just can't find my way away from Claude Code and Claude Cowork because they really package things up into a desktop application for me.
That basically solves everything that I want it to do, with the exception of I also turn to GenSpark for things that are not so much in my sort of professional pursuits, but rather, oh, here's a complex problem that I'm trying to solve in various ways.
And GenSpark is really satisfying that way with an enormous palette of different features and services that are available to you as a GenSpark.ai user.
But what about clod-mem, which I haven't added as a plug-in to my clod-cowork and clod-code configuration.
But that, I understand, will capture the full context of all of your conversations with clod.
And it is an outboard memory.
Wow, so you're saying perplexity is falling short in your estimation because of their inability to capture or allow you to simply one-click capture the full conversation.