Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Eno Reyes

๐Ÿ‘ค Speaker
513 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Our view is that we're definitely not opposed to building models.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

I think that the challenge is you want to build, you basically want to fine tune or build models as basically as late as possible before it becomes important to, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

That's sort of our view.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

And what I mean by that is,

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

You know, if you had fine-tuned GLM2 or some model that's earlier, you know, it gets blown away by the model providers at the next level, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Which is three weeks away, usually.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Exactly, exactly.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

You probably have three weeks before there's a better model out, and you have a month to two months before it's completely forgotten about, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Yeah.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

So if your bet is we're going to beat the people with $50 billion data centers, I think that you have to have a very clear strategy.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

And I think that there are reasons to do this, by the way, cost, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

If you know that you can execute on a specific task at the max that a human cares about, and you just need to bring down cost, that's a great reason to fine tune a model, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Yeah.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Yeah.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

but at the end of the day, uh, I think that we've already seen this, a bunch of, you know, software development organizations, uh, or, you know, software development, AI tooling, uh, they've all trained their own models and, you know, within two weeks to Gemini flash came out and it's like better, uh, or within three weeks later, 4.5 Opus is now like, not just better, but it's like two orders of magnitude better.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Uh, and so I think that it's going to be a really challenging battle to fine tune or build your own models, uh, in the near future, but we're definitely not ruling it out.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Um,

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

On the earlier point about being an agent research lab and sort of like how we tune the agent's behaviors, and in particular for seeking verification or validation, there's a lot that you can do at the harness level to enable this, right?

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

Some of it might be like context engineering.

The Neuron: AI Explained
This AI Agent Builds Better Code Than Most Developers (Factory AI)

So adjusting not just like the base system prompt, but reminders that you can insert automatically or, you know, prebuilt environmental reactions to tool calls.