AI Breakdown

Technology Science Education

Episodes


arxiv Preprint - Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

29 Sep 2023

Contributed by Lukas

In this episode we discuss Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition by Yu Yu, Chao-Han Huc...

arxiv Preprint - DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models

28 Sep 2023

Contributed by Lukas

In this episode we discuss DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models by Sam Ade Ja...

arxiv Preprint - VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

27 Sep 2023

Contributed by Lukas

In this episode we discuss VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning by Han Lin, Abhay Zala, Jaemin Cho, M...

arxiv Preprint - PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

26 Sep 2023

Contributed by Lukas

In this episode we discuss PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training by Dawei Zhu, Nan Yang, Liang Wang, ...

arxiv Preprint - Summarization is (Almost) Dead

25 Sep 2023

Contributed by Lukas

In this episode we discuss Summarization is (Almost) Dead by Xiao Pu, Mingqi Gao, Xiaojun Wan. The paper investigates the capabilities of large la...

arxiv Preprint - LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

23 Sep 2023

Contributed by Lukas

In this episode we discuss LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent by Jianing Yang, Xuweiyi Chen, ...

NeurIPS 2023 Spotlight - Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems

22 Sep 2023

Contributed by Lukas

In this episode we discuss Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems by Benjamin Coleman, Wang-Cheng Kang,...

arxiv Preprint - Chain-of-Verification Reduces Hallucination in Large Language Models

21 Sep 2023

Contributed by Lukas

In this episode we discuss Chain-of-Verification Reduces Hallucination in Large Language Models by Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, ...

arxiv Preprint - Language Modeling Is Compression

20 Sep 2023

Contributed by Lukas

In this episode we discuss Language Modeling Is Compression by Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein...

arxiv Preprint - From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

19 Sep 2023

Contributed by Lukas

In this episode we discuss From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting by Griffin Adams, Alexander Fabbri, Faisal La...

ICCV 2023 - Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing

18 Sep 2023

Contributed by Lukas

In this episode we discuss Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing by Alberto Baldrati, David...

arxiv Preprint - GPT Can Solve Mathematical Problems Without a Calculator

17 Sep 2023

Contributed by Lukas

In this episode we discuss GPT Can Solve Mathematical Problems Without a Calculator by Zhen Yang, Ming Ding, Qingsong Lv, Zhihuan Jiang, Zehai He,...
