Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing
Podcast Image

Lex Fridman Podcast

#496 – FFmpeg: The Incredible Technology Behind Video on the Internet

06 May 2026

Transcription

Chapter 1: What is FFmpeg and why is it considered essential for video on the internet?

0.031 - 32.691 Lex Fridman

The following is a conversation all about FFmpeg and VLC with Jean-Baptiste Kempf and Kieran Cunha. FFmpeg is an open source software system that is the invisible backbone behind YouTube, Netflix, Chrome, VLC, Discord, and basically every platform that touches video or audio on the internet. It can decode, encode, transcode, stream, and play almost any video or audio format ever created.

0

32.711 - 54.043 Lex Fridman

To me, it is one of the most incredible software systems ever developed. And it's all done by volunteers. VLC is also a legendary piece of software. It is an open-source media player that plays basically anything you throw at it, any format, any platform, no ads, no tracking.

0

54.584 - 73.83 Lex Fridman

It has been downloaded over 6 billion times, and again, for me, it has been one of my favorite pieces of software ever, with the most legendary logo, which I, of course, had to honor in this conversation by wearing the... VLC traffic cone hat the whole time.

0

74.813 - 100.542 Lex Fridman

So again, above all else, thank you to the incredible volunteer engineers who put their heart and soul into this code that has been used and loved by billions of people. Thank you. and about the two great engineers and human beings I'm talking to in this episode. Jean-Baptiste is the president of Videoland and is a key figure behind VLC and FFmpeg.

0

101.083 - 122.454 Lex Fridman

Kieran is a longtime Kodak engineer, FFmpeg contributor, and the man behind the now infamous FFmpeg account on Twitter, X, that I recommend watching. everybody follow for the memes and for the unapologetic celebration of open source and great low-level software engineering.

123.495 - 146.876 Lex Fridman

Let me also say that it's inspiring and humbling that so much of modern civilization rests on software built by people who are not chasing fame or money, but are obsessed with the craft of engineering. We live in a world where billions of people consume video every day without ever thinking about the invisible machinery underneath it. But that machinery matters.

147.638 - 176.966 Lex Fridman

Open source infrastructure matters. It is one of the great examples of human beings quietly collaborating across borders to build something useful, durable, and elegant for the rest of us. And so this conversation is not just about Codex and media pipelines. It is also about the deeper spirit of engineering and generosity that makes projects like FFmpeg possible. Again, I can never say it enough.

177.928 - 191.062 Lex Fridman

Thank you. And now, a quick few second mention of a sponsor. Check them out in the description or at lexfriedman.com slash sponsors. It is, in fact, the best way to support this podcast.

191.082 - 211.529 Lex Fridman

We've got Lairdon for understanding how AI is used in your business, Blitzy for code generation and large code bases, BetterHelp for mental health, Finn for customer service AI agents, Element for electrolytes, and Bookplexity for curiosity-driven knowledge exploration. Choose wisely, my friends. And now onto the full batteries.

Chapter 2: What are the unique features of VLC as a media player?

278.317 - 303.666 Lex Fridman

How is AI being used to increase the productivity of the individual contributors and teams seen as a whole? That's what Laredon does. If AI is part of your organization, now is the moment to get control of it. Head to laredon.com to book a demo and to start maximizing impact from AI. This episode is also brought to you by Blitzy, an AI-powered autonomous software development platform.

0

303.73 - 324.017 Lex Fridman

built for large, complex code bases. Huge number of cooperative agents, really optimized for huge code bases. Optimized for scaling speed when you're talking about a very large number of agents working together. And that's the big interesting question.

0

323.997 - 347.119 Lex Fridman

When you have a huge number of agents, a huge company, huge code base, how do you then, seen at the big picture code base level, have the growth and development, the evolution of that code base where a very large percentage of that code base is continuously worked on autonomously? The question is when you have a large code base,

0

347.099 - 363.758 Lex Fridman

that already delivers value, that already sells stuff, that already has a huge number of customers, how do you then use agentic engineering to continue adding features, continue improving, continue in the usual kind of development with the testing, with the security, all that kind of stuff?

0

363.738 - 388.141 Lex Fridman

How do you do that without messing stuff up, without filling up your code base with AI slop, and nevertheless doing it so, for the most part, autonomously? Not fully autonomously, semi-autonomously, but majority of the code is written autonomously. That's what Blitzy specializes in. The future of autonomous software development is here. Learn more or speak to a member of their team at blitzy.com.

389.041 - 422.089 Lex Fridman

That's blitzy.com. This episode is also brought to you by BetterHelp, spelled H-E-L-P, help. Moving away from AI to the human. The human mind is still, to this day, out of reach of our understanding from the perspective of creating intelligence. There are so many intricate psychological complexities to the mind, which I think attributes to what makes humans incredibly special.

422.65 - 444.096 Lex Fridman

But those complexities get all tangled up in ways that are counterproductive. And so they need to be untangled in a I'm a big fan of talk therapy as a set of tools as a methodology for untangling the complexities of the human mind. The easy, discreet, affordable way of doing that is BetterHelp. That's why I keep recommending it.

444.678 - 467.647 Lex Fridman

It's a really good way to take your first steps if you haven't done talk therapy. Get a licensed professional therapist in under 48 hours. BetterHelp makes it super easy. Check them out at betterhelp.com slash Lex and save in your first month. That's betterhelp.com slash Lex. This episode is also brought to you by FINN, the number one AI agent for customer service.

468.408 - 492.285 Lex Fridman

As I mentioned already about humans, humans are complicated. And customer service is ultimately about looking at each individual human and really listening. That's what I try to do with the podcast, to truly listen to each individual person, whether we're talking about a super technical topic or we're talking about the big issue. questions of the human condition.

Chapter 3: How does open source contribute to the development of VLC and FFmpeg?

708.93 - 733.641 Kieran Kunhya

And then, funnily, at one Videoland conference, we made a competition to make the weirdest and most horrible file ever and see if VLC could play it. What did it end up being? What's the file? It was an MKV file made by Derek, which each of the frame was changing resolution, aspect ratio, rotation, and it was like... Did it work? Yes.

0

734.262 - 749.526 Kieran Kunhya

And there was another one where the whole video was actually animated subtitles, right? SSA, right? So each frame was a black frame, but on top of that, there was a subtitle that was animated for each frame.

0

749.746 - 753.312 Jean-Baptiste Kempf

There was a file that's a valid zip and a valid MP3 at the same time or something like that.

0

753.392 - 769.109 Lex Fridman

So yeah, we'd made a competition of stupid files. And it worked. It opened all of the stupid files. Yes. By the way... For people who are not familiar, I am wearing a hat. Would it be fair to say this is the best, worst logo of all time, the cone?

0

769.77 - 792.549 Kieran Kunhya

Yeah, by far, right? The logo of VLC is so iconic, right? Like, we are a team with a small number of people, and the icon is known everywhere. I go to middle of nowhere in India or in China, people know the cone, right? Yeah. 25% of the website traffic that comes to our main website is ConePlayer, right? So many people don't know VLC, right? They know the ConePlayer.

792.569 - 806.895 Kieran Kunhya

That's the thing that Google for is ConePlayer. Yeah, they go on Google and they put ConePlayer and they download VLC, right? So that's iconic. And once we tried to change it as a joke, right? We said it was going to be a type of caterpillar game.

806.875 - 829.71 Kieran Kunhya

construction and we said that during april 1st and we had around 10 000 emails saying no don't change the logo and so on right so it's so iconic right it's so distinctive right if you want to do a video player you're going to put a play button on the tv right and that's a youtube youtube logo right it's an original this one is orange right very bright uh and and it's

829.69 - 849.118 Lex Fridman

weird and it's ridiculous and it's absurd and it's hilarious it becomes meme and meme becomes culture and you keep it and you know about it and you know that in 20 years like you still have going to have the cones and remember oh yeah that was a video player yeah and we'll talk about you know the the mission of ffmpeg being a kind of the archival aspect of it.

849.138 - 867.915 Lex Fridman

So you can think about a thousand years from now, we'll have all these videos that only VLC can open. Human civilization has already destroyed itself multiple times. And the only thing that will remain is this like, you know, the cockroaches will be crawling around and it'll be the VLC logo with some of the archival footage that VLC can open.

Chapter 4: What contributions can young developers make to open source projects?

5389.208 - 5412.247 Jean-Baptiste Kempf

This is a 16-year-old, some of his first contributions to FFmpeg. Actually doing and putting some of these quote-unquote security researchers to shame by actually finding issues and fixing them and being 16. There's no barriers. There's no barriers to you have to study at college under this person and understand these. You can learn C, and let's be honest, it's from the K&R book.

0

5412.468 - 5419.761 Jean-Baptiste Kempf

Learn C. You can learn assembly. We'll talk about that maybe a bit later. You can contribute to world-class technologies.

0

5419.808 - 5426.845 Kieran Kunhya

In VLC, one of the oldest contributors is called Felix. He's the one doing everything on Mac and iOS.

0

Chapter 5: How does FFmpeg handle security issues compared to traditional security researchers?

5426.865 - 5452.621 Kieran Kunhya

He's starting working on VLC. He was 16. We had a guy called Edward Wong, who used to be a Google Summer of Code student who stayed... for three years around video LAN, he was 14, right? And part of Google Summer of Code and Google Code-In, which were programs where basically we have students or high school, we wrote a ton of assembly for x264 and for VLC and for FFmpeg, right?

0

5452.641 - 5454.304 Kieran Kunhya

So everyone can contribute.

0

5454.453 - 5477.589 Jean-Baptiste Kempf

And he also did a good job because he didn't play the alarmist CVE heist, create a CVE, which is like a public exposure of security and do these big, scary, red 7.5 priority. He just fixed an issue in Git after three days and just fixed it. He didn't need to go and play a big security drama about it. And I think... I posted, you know, the kids are all right.

0

5477.689 - 5500.238 Jean-Baptiste Kempf

Whereas there's, you know, I'm not saying all security people do this, but there is a portion of the security community, as Alex said, that likes to hype themselves up by creating drama. They would have happily raised, this is a high priority CVE 8.0 or whatever on an issue that actually was in Git. It wasn't even in a release. It was in development and three days later was fixed.

0

5500.859 - 5521.707 Lex Fridman

Well, I just want to put a little bit of love out there, even to the bigger community. Much love and respect to Google engineers. Like you said, they're some of the best software engineers in the world, and they do contribute a lot even on the security front. And also, you know, I'm a big fan of Theo. Much love to Theo.

Chapter 6: What motivates developers in the open source community?

5521.727 - 5526.934 Lex Fridman

He was part of this debacle and drama a little bit. I think when you just zoom out,

0

5526.914 - 5551.993 Lex Fridman

on the grand arc of human history the drama contributed positively to everybody involved donations went up it brought more attention to the topic allowed uh everybody to bicker in a way that ultimately uh got them to figure out whatever fanbag is all about so the way the way we looked at this is like it's a rap battle at the end of the day no but it is we say stuff we say stuff

0

5551.973 - 5568.717 Jean-Baptiste Kempf

Yeah. But we can, we can leave it on the X is a perfect place for, you know, international rap battle. You say stuff, I say stuff about your mama, but it doesn't mean, you know, I'm going to have an actual personal issue with her. Yeah. And that's what it looks like. The Theo situation, you know, JB can maybe expand when a little bit too far and there was a little bit.

0

5569.034 - 5589.061 Jean-Baptiste Kempf

You know, it's just a bit of fun. It's just a bit of rap battle. It's WWE. You know, everyone's having a bit of fun on X. It doesn't need to be taken seriously. You know, the teenagers thing. So that guy was a Google employee saying, hey, you know, there are other ways to run an open source business. You know, just have a bit of fun, you know. That's what the point of this account is.

0

5589.261 - 5604.842 Jean-Baptiste Kempf

And furthermore, if you can teach people about the ways of open source projects, assembly, et cetera, by doing that, I think there's a lot to be offered here. It's not... dunking on people for dunking sake. It's showing actually the story that I think X learned is these are not big corporate open source projects.

5604.862 - 5621.583 Jean-Baptiste Kempf

This is not Kubernetes where there's, you know, hundreds, maybe thousands of people paid to develop this stuff. These are just people in their basements in their spare time. And if you can address that topic in a fun and entertaining way, I think that that's the good thing. And that's the value of X and the reach we have.

5622.004 - 5648.88 Kieran Kunhya

And to be honest, right, like even at Google, Google is one entity, but so many different people, right? And there is a ton of Google engineers we work with all the time. And even like Google from YouTube to Chrome to Chrome Media to the rest of Google, those are very different type of entities. But what we do is... efficient. And for example, for Theo, right, it went a bit too far.

5648.96 - 5671.266 Kieran Kunhya

I had him, like, I call everyone down, I had him on the phone, we say, okay, like, this goes too far, and so on. But in the end, yeah, it's a hard battle, but it's positive for the project. It's like the awareness we have on open source, and I mean, true open source from communities, right, not is increased dramatically in the last two years. And this is useful.

5672.107 - 5682.482 Lex Fridman

What do you think motivates all the incredible contributors that we've been talking about? What's the engine? It's so interesting to see. Like you said, they're sitting in the basement. What's the driver? What's the engine there?

Chapter 7: What are the challenges of encoding and decoding video?

10764.343 - 10780.088 Lex Fridman

So for people who are just listening, we're looking at a bunch of humans running along a river. You have the reflection, a lot of really high information textures everywhere, the leaves and the lighting playing with the leaves and all of this.

0

10780.355 - 10801.215 Jean-Baptiste Kempf

You could show clearly that encoders with high PSNR will blur everything. And you could see, actually, I could turn on psychovisual stuff, I could turn off adaptive quantization, and it would just look so much better. But your metrics, and these metrics are, at the time, were considered so holy. These are the holy metrics that are untouchable. PSNR is the most important thing.

0

10801.6 - 10812.908 Lex Fridman

Can you speak to how do you measure psychovisual stuff? Like, how do you turn how pleasing a compression is for a human eye into a number? Is that even possible?

0

10812.949 - 10817.5 Jean-Baptiste Kempf

That's what Netflix have been trying to do with VMAF. They said they've used a machine learning model.

0

10817.48 - 10840.315 Jean-Baptiste Kempf

that's a more recent thing but back in when x86 was being developed that's by eye it was by eye it was developers on their laptops so it's not like even with big companies with professional screens or anything it's and that was actually one of the goals which was i don't the developers at the time lauren merit in particular is i don't want to test this on a thirty thousand dollar screen it's i want this to look good on someone's laptop at home

10840.531 - 10869.988 Kieran Kunhya

Yeah, brilliant. And there is another sample, which is a sample that is Planet Earth's Kila sample that I absolutely love. And you are going to see why. It's a ton of birds flying. And the more it goes, the more there are birds. And at the end, it's almost like... You have millions of birds. It's the most complex thing ever to encode, right?

10870.048 - 10894.86 Kieran Kunhya

And well, you're watching it on YouTube and you see how bad the YouTube encoding is actually, right? And this is like phenomenal to optimize and get perfect quality in a constant bitrate. There was a lot of optimization, mostly by Loren also, on anime, right? For a long time, anime was very badly encoded because there was a ton of bending, right?

10894.9 - 10911.595 Kieran Kunhya

And you see those issues, and there was a ton of things. So x264 is like, and today it's still the reference to any encoder, new encoder, AV1, AV2, VVC, HVC, everyone compares to x264.

10911.575 - 10930.585 Jean-Baptiste Kempf

One of my favorite films, Cinema Paradiso, I know the engineer who created the Blu-ray and he showed me the comparisons of X264 versus others and It's completely different. And I think a bunch of guys in the Blu-ray world started using x264. I think the big one was Chris Henderson from Warner Brothers. He did the whole French box set.

Chapter 8: How does the future of codecs look with AV2?

11046.641 - 11059.605 Kieran Kunhya

One of the most amazing open source projects for subtitles is called AEG Sub. And it's a subtitle. It's done for anime, for South Asian and Japanese languages.

0

11059.889 - 11070.602 Jean-Baptiste Kempf

There are weird textures in anime that I don't think you get in real life content. I think that was a key one, which was optimizing these weird textures that you get because anime is not done in a normal fashion.

0

11070.762 - 11094.138 Kieran Kunhya

Yeah, the way you produce it is not, you mostly produce it like on screens, right? Since a bit of time and you have all those gradients, right? In colors, because they are very easy to produce digitally, very complex to produce in real life. And the subtitles also are very complex because you need to have often the Japanese and then you need to have the diacritics, right?

0

11094.498 - 11114.948 Kieran Kunhya

What we call the oribi, right? Which is the hiragana and the katakana for the kanji. And then, because of course, so that you have the official kanji, subtitling, but you also need the English subtitles or the French subtitles because you want to learn that, right? And there were so many things crazy on subtitles. And we had like crazy samples on subtitles that we've seen all around.

0

11115.129 - 11124.54 Kieran Kunhya

So this is an important part of the culture, but also because there was no official offering. There was no way of doing that.

11124.52 - 11140.654 Lex Fridman

Can you speak to the difference between H.264 and AV1 and then X.264 and David? This is this big step. Can you help people understand are some of the streaming sites moving more towards that direction of AV1? Let's be honest.

11141.256 - 11177.997 Kieran Kunhya

All of those codecs, since AV1, MPEG-2 video are the same concepts. The same concept about inverse transform, about intra-prediction, motion composite, entropy coding, all of them. However, each generation gives you a bump between 25% and 50% more compression for the same quality. And so you had the MPEG-2, you had the DivX area, you have H.264, which was changing. H.264 improved so much.

11178.017 - 11202.102 Kieran Kunhya

And then you had more. You had HEVC, you had VP9 at the same time of HEVC. VP9 is a bit similar to HEVC in terms of quality compression. But it's royalty free because in multimedia, there is a ton of patents and the licensing after H.264 became out of hand and could cost hundreds of millions of dollars per year. So it made no sense.

11202.122 - 11226.913 Kieran Kunhya

So Google did this VP9 and the Alliance for Open Media did this new codec called AV1. So you can imagine that AV1 saves between 40 and 60% less bandwidth than H.264 for the same quality, visual quality. At a given bitrate. At a given bitrate, right? So that's really like you increase the quality.

Comments

There are no comments yet.

Please log in to write the first comment.