Nathaniel Whittemore
π€ SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
Now, if you have done either our ClawCamp or AgentOS program, you might be thinking to yourself, this system looks a lot like building the memory.md file for an agent or even something like the personal context project we ran through the AIDB operators community.
And indeed, one could be forgiven for thinking that memories is just about creating and maintaining a markdown file that stores crucial information about the user.
The key difference is that ChatGPT is now running this process automatically through the backend.
requiring far less of the average user to take full advantage.
OpenAI also noted that the huge efficiency gains from their new setup will allow them to provide dreaming to free users for the first time.
Last summer, when paid subscribers gained access to dreaming-style memory, free users had only access to basic saved memories.
OpenAI says that they've been able to cut down the compute requirements for dreaming by 5x, meaning it's now practical to serve at scale.
Mark Kretschmann argues that this is a bigger deal than it sounds, writing, The more ChatGPT becomes an actual work partner, the less sense it makes to restart from zero every time.
Projects, preferences, constraints, tools, writing styles, code-based details, all of this should carry forward.
Sounds small, but it changes the product.
A chatbot with real memory becomes much closer to a persistent agent.
I also think it's interesting in our context of how companies are adapting to the token scarcity era.
You remember Arvind Jain, the CEO of Glean, wrote a piece about the token economy that he called your token spend is an AI architecture problem.
And in it, he discussed a lot of similar themes around token efficiency.
In short, all the time that you spend getting a model to remember all the relevant context each and every time are wasted turns and wasted tokens that could be fixed theoretically through better memory systems.
So again, in some ways, a small update, but one with potentially bigger implications.
Now, speaking of dealing with the token scarcity era, TSMC has warned that there's only so much they can do to alleviate the chip shortage that is expected to last all of this decade.
In candid commentary at their annual shareholders meeting on Thursday, CEO Cici Wei said, Customer demand is so high and we can only support so much.
We are already working very hard.
We're doing our best to ensure TSMC does not become a bottleneck.