AI Breakdown
Episodes
arxiv Preprint - Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
29 Sep 2023
Contributed by Lukas
In this episode we discuss Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition by Yu Yu, Chao-Han Huc...
arxiv Preprint - DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
28 Sep 2023
Contributed by Lukas
In this episode we discuss DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models by Sam Ade Ja...
arxiv Preprint - VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
27 Sep 2023
Contributed by Lukas
In this episode we discuss VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning by Han Lin, Abhay Zala, Jaemin Cho, M...
arxiv Preprint - PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
26 Sep 2023
Contributed by Lukas
In this episode we discuss PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training by Dawei Zhu, Nan Yang, Liang Wang, ...
arxiv Preprint - Summarization is (Almost) Dead
25 Sep 2023
Contributed by Lukas
In this episode we discuss Summarization is (Almost) Dead by Xiao Pu, Mingqi Gao, Xiaojun Wan. The paper investigates the capabilities of large la...
arxiv Preprint - LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
23 Sep 2023
Contributed by Lukas
In this episode we discuss LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent by Jianing Yang, Xuweiyi Chen, ...
Neurips 2023 spotlight - Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems
22 Sep 2023
Contributed by Lukas
In this episode we discuss Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems by Benjamin Coleman, Wang-Cheng Kang,...
arxiv Preprint - Chain-of-Verification Reduces Hallucination in Large Language Models
21 Sep 2023
Contributed by Lukas
In this episode we discuss Chain-of-Verification Reduces Hallucination in Large Language Models by Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, ...
arxiv Preprint - Language Modeling Is Compression
20 Sep 2023
Contributed by Lukas
In this episode we discuss Language Modeling Is Compression by Grégoire Delétang, Anian Ruoss, Paul-Ambroise Duquenne, Elliot Catt, Tim Genewein...
arxiv Preprint - From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
19 Sep 2023
Contributed by Lukas
In this episode we discuss From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting by Griffin Adams, Alexander Fabbri, Faisal La...
ICCV 2023 - Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
18 Sep 2023
Contributed by Lukas
In this episode we discuss Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing by Alberto Baldrati, David...
arxiv Preprint - GPT Can Solve Mathematical Problems Without a Calculator
17 Sep 2023
Contributed by Lukas
In this episode we discuss GPT Can Solve Mathematical Problems Without a Calculator by Zhen Yang, Ming Ding, Qingsong Lv, Zhihuan Jiang, Zehai He,...