Eno Reyes
And you still have to think a little bit about, you know, what if it gives the right answer but it's not parsed correctly?
So there's a little bit of work required to make sure this is robust, but it's a generally reliable way to determine whether the LLM actually knows, at this point in time, what happened.
And so, when we evaluated our compaction method and compared it to OpenAI's compression strategy as well as Claude Code's, we found that ours was much stronger across all of these dimensions: instruction following, continuity, completeness, but most importantly, accuracy and context awareness, right?
Yeah.
Where it was just able to recall all of the critical pieces of information quite well.
Well, not just faster.
I think actually speed was relatively similar across the board.
But the two things that really matter are the quality of the compression and how much it actually compresses, right?
Basically, the token reduction efficiency.
And we do have the worst token reduction efficiency.
You know, OpenAI's is 99.3%, Claude Code's was 98.7%, and ours was 98.6%. So 0.1 off. Maybe that's within the error bars, right? But the overall quality, right, you can basically take all of these characteristics and build a sort of quality score that just says, you know, across all these dimensions, which one is stronger.
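The two metrics discussed here can be sketched in a few lines. Token reduction efficiency is just the fraction of tokens removed; the aggregate quality score is one plausible reading of "take all of these characteristics and build a quality score". The dimension names follow the conversation, but the scores and the equal weighting are hypothetical:

```python
def token_reduction(original_tokens: int, compressed_tokens: int) -> float:
    """Fraction of tokens removed by compaction (e.g. 0.987 == 98.7%)."""
    return 1.0 - compressed_tokens / original_tokens

def quality_score(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted mean of per-dimension scores in [0, 1]. One possible
    aggregation; the actual scoring method isn't specified in the talk."""
    total = sum(weights.values())
    return sum(scores[d] * weights[d] for d in scores) / total

# Illustrative numbers only, not the real evaluation results.
scores = {"instruction_following": 0.90, "continuity": 0.85,
          "completeness": 0.88, "accuracy": 0.95, "context_awareness": 0.92}
weights = {d: 1.0 for d in scores}  # equal weights as a default assumption

print(round(token_reduction(100_000, 1_300), 3))  # 0.987
print(round(quality_score(scores, weights), 3))   # 0.9
```

A weighted mean makes the trade-off explicit: a method with slightly worse token reduction can still win once accuracy and context awareness are weighted in.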
I think that like probably the most important thing we learned was just how much structure matters.
Right.
So I think that probably the biggest failure case is generic summarization.
And I think the worst-performing techniques in our evaluation were the ones that basically treat all content as equally compressible.
It's just one big summary and let the LLM figure it out.
A file path could be very low entropy information, but it's probably the most important piece of information an agent needs.
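The file-path point suggests a simple structure-aware pattern: extract identifiers that must survive verbatim before summarizing, then re-attach any the summary dropped. This is a sketch of the idea, not the speakers' implementation; the path regex and the `summarize` callable (standing in for an LLM call) are assumptions:

```python
import re

# Rough matcher for slash-separated file paths (an assumption; real
# systems would track paths structurally, e.g. from tool-call arguments).
PATH_RE = re.compile(r"(?:[\w.-]+/)+[\w.-]+")

def compact(message: str, summarize) -> str:
    """Structure-aware compaction sketch: summarize the prose, but make
    sure low-entropy, high-importance tokens like file paths survive."""
    paths = PATH_RE.findall(message)
    summary = summarize(message)  # stand-in for an LLM summarization call
    preserved = [p for p in paths if p not in summary]
    if preserved:
        summary += "\nPaths: " + ", ".join(preserved)
    return summary

# Toy summarizer that would otherwise lose the path entirely.
result = compact("We edited src/main.py to fix the bug in the parser loop.",
                 lambda m: "Fixed a parser bug.")
print(result)
```

The point is the asymmetry: a path compresses to almost nothing entropy-wise, yet dropping it can leave the agent unable to act, so it is treated as non-compressible regardless of what the summarizer does.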