Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing
Podcast Image

The AI Daily Brief: Artificial Intelligence News and Analysis

How People Actually Use AI Agents

19 Feb 2026

Transcription

Transcript generated automatically by AI and may contain errors.

Chapter 1: What is the main topic discussed in this episode?

0.942 - 23.013 Nathaniel Whittemore

Today on the AI Daily Brief, a new study about agent autonomy in practice from Anthropic. And before that in the headlines, Google Gemini now allows you to create music. The AI Daily Brief is a daily podcast and video about the most important news and discussions in AI. All right, friends, quick announcements before we dive in.

0

23.934 - 35.219 Nathaniel Whittemore

First of all, thank you to today's sponsors, KPMG, Assembly, Robots and Pencils, and Blitzy. To get an ad-free version of the show, go to patreon.com slash aiDailyBrief, or you can subscribe on Apple Podcasts.

0

Chapter 2: How does Google's Gemini enhance music generation capabilities?

35.72 - 55.33 Nathaniel Whittemore

To learn about sponsoring the show, send us a note at sponsors at aiDailyBrief.ai. Lastly, a reminder once again about our latest ecosystem projects. ClawCamp, the free self-directed program where you can learn how to build agents and agent teams using OpenClaw, is kicking off its first sprint right now. So if you want to learn to be an agent boss with nearly 3,000 friends, come join us.

0

55.39 - 74.015 Nathaniel Whittemore

You can find that at campclaw.ai or from the AI Daily Brief website. If you are a company trying to figure out OpenClaw and other agent strategies, check out enterpriseclaw.ai. And more broadly, if you are just interested in keeping track of these types of educational programs we're doing, free and premium, you can find information about that at aidbtraining.com.

0

74.556 - 91.497 Nathaniel Whittemore

One thing happening there, we actually have a premium program survey as we are trying to figure out exactly which premium programs to launch first. If you are an enterprise or premium buyer who is interested in that, again, you can check it out at aidbtraining.com. Now with that barrage of URLs out of the way, let's talk about Gemini and music.

0

93.265 - 110.937 Nathaniel Whittemore

Today we kick off with Google's continuing quest to have AI products in every single multimodal category. The latest news is that the company has launched an AI music generator called Lyria 3. It's the latest version of DeepMind's music generation model and allows users to generate music clips based on text, images, or video inputs.

0

110.917 - 123.43 Nathaniel Whittemore

which is pretty unique compared to something like Suno, which is of course just text-based input. Lyrics can be generated in eight different languages, including German, French, Spanish, and Hindi. The feature can be accessed directly in the Gemini app by switching to a musical output.

123.771 - 139.708 Nathaniel Whittemore

It's also being added to YouTube's Dream Track tool to allow creators to quickly generate soundtracks for YouTube Shorts. Each track is accompanied by custom cover art generated by Nanobanana. Now, previous versions of Lyria have only been available through Google's Clouds Vertex program, so this is a big expansion in access.

139.688 - 150.142 Nathaniel Whittemore

However, there is a pretty significant limitation, which is that these are 30-second clips. The model itself isn't really capable of building on top of the initial generation, so this feature won't be useful to generate entire songs.

150.582 - 164.16 Nathaniel Whittemore

However, and it's pretty clear that this is the use case they're imagining initially, this could be extremely useful for generating background music for YouTube Shorts or fun, interactive, personal types of song messages. Indeed, this appears to be what Google had in mind, with them writing...

164.14 - 175.312 Nathaniel Whittemore

The goal of these tracks isn't to create a musical masterpiece, but rather to give you a fun, unique way to express yourself. And really, while it would be tempting to compare this to Suno, this is actually more of a social feature than anything else.

Chapter 3: What changes did Anthropic make to their terms of service?

1383.892 - 1399.953 Nathaniel Whittemore

Which is interesting, because that's not really a knock on its own autonomy, in the sense that it doesn't necessarily need that information to proceed, as it could theoretically just make the decision for itself, but a way to better align with humans on the upfront. Now, the one other really interesting chart is the chart of which domains agents are deployed in.

0

1400.433 - 1413.772 Nathaniel Whittemore

As you might expect, especially given that this is anchored by cloud code, software engineering represents around half of the tool calls overall. And although the other categories are all below 10%, they kind of read like a map of where agentic automation is likely to come next.

0

1414.193 - 1422.985 Nathaniel Whittemore

Back office automation is at number two at 9.1%, followed by marketing and copywriting at 4.4%, sales and CRM at 4.3%, finance and accounting at 4.0%.

0

1422.965 - 1436.201 Nathaniel Whittemore

It is notable that even at this early stage, with coding and engineering tasks being the clear breakout, you're still already seeing more than 50% of tool calls, in other words, more than 50% of agentic use cases being outside of that software engineering domain.

0

1436.242 - 1452.182 Nathaniel Whittemore

This is a pretty simple study overall, but a really valuable complement in my estimation to the meter study as it moves away from the realm of the theoretical and into the realm of what people are actually using agents for and how they're actually interacting with them. There are a few interesting implications that people picked up on.

1452.602 - 1468.463 Nathaniel Whittemore

David Hendrickson wrote, What's most surprising from the paper is that real-world AI agents are currently given much less autonomy than they could technically handle. In other words, we had to go to the 99.9th percentile to really see what Claude could do, despite the fact that the average turn is just 45 seconds.

1468.443 - 1488.156 Nathaniel Whittemore

We've talked a lot on this show about a capability overhang, and it looks like this is another example of that in practice, even with some of the most advanced tools in the space. Another interesting takeaway is about a shift in our thinking of autonomy, from purely based on model capability to this more complex view of model capability plus human interactive state. Yong-Ri Su writes...

1488.136 - 1497.928 Nathaniel Whittemore

Autonomy is not just steps taken, it is permission, scope, and ability to change state. The other thing that people are exploring is, based on all this, what they actually want the interactive mode to look like in the future.

1498.489 - 1512.086 Nathaniel Whittemore

Richie on X, for example, writes, Need a Claude code mode that isn't exactly dangerously skip permissions, but can skip pointless do you want to proceed questions, and at the same time doesn't nuke my entire database and family tree. Lorenzo responds, what you want is competent autonomy.

Comments

There are no comments yet.

Please log in to write the first comment.