The Daily ML
Activity Overview
Episode publication activity over the past year
Episodes
Ep49. Artificial Intelligence, Scientific Discovery, and Product Innovation
18 Nov 2024
Contributed by Lukas
This research paper examines the impact of an artificial intelligence tool for materials discovery on the productivity and performance of scientists w...
Ep48. Large Language Models Can Self-Improve in Long-context Reasoning
16 Nov 2024
Contributed by Lukas
This research paper investigates how large language models (LLMs) can improve their ability to reason over long contexts. The authors propose a self-i...
Ep47. Personalization of Large Language Models: A Survey
16 Nov 2024
Contributed by Lukas
This paper is a survey of personalized large language models (LLMs), outlining different ways to adapt these models for user-specific needs. It analyz...
Ep46. Number Cookbook: Number Understanding of Language Models and How to Improve It
14 Nov 2024
Contributed by Lukas
This research paper investigates the numerical understanding and processing abilities (NUPA) of large language models (LLMs). The authors introduce a ...
Ep45. Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models
12 Nov 2024
Contributed by Lukas
This paper describes a novel method called Multi-expert Prompting that aims to improve the reliability, safety, and usefulness of large language model...
Ep44. Mixtures of In-Context Learners
11 Nov 2024
Contributed by Lukas
The provided text describes a novel approach to in-context learning (ICL) called Mixtures of In-Context Learners (MOICL) that addresses key limitation...
Ep43. Project Sid: Many-agent simulations toward AI civilization
10 Nov 2024
Contributed by Lukas
This technical report describes "Project Sid," an experiment that aims to create and study AI civilizations within a Minecraft environment. The resear...
Ep42. The Geometry of Concepts: Sparse Autoencoder Feature Structure
09 Nov 2024
Contributed by Lukas
This research paper investigates the structure of the concept universe represented by large language models (LLMs), specifically focusing on how spars...
Ep41. Distinguishing Ignorance from Error in LLM Hallucinations
08 Nov 2024
Contributed by Lukas
This research paper investigates the phenomenon of hallucinations in large language models (LLMs), focusing on distinguishing between two types: hallu...
Ep40. A Comprehensive Survey of Small Language Models in the Era of Large Language Models
07 Nov 2024
Contributed by Lukas
This paper provides a comprehensive survey of small language models (SLMs) in the context of large language models (LLMs). The authors discuss the ben...