AI: post transformers

Multi Query Attention: PaLM: Scaling Language Modeling with Pathways

08 Aug 2025

Description

67 authors were involved in this research! This source is an academic paper titled "PaLM: Scaling Language Modeling with Pathways," authored by Aakanksha Chowdhery and numerous collaborators. It details the development and capabilities of PaLM (Pathways Language Model), a 540-billion parameter Transformer language model trained on 6144 TPU v4 chips using a new ML system called Pathways. The paper highlights PaLM's state-of-the-art performance in few-shot learning across various natural language tasks, including multilingual tasks and source code generation. Additionally, the authors provide analysis on bias, toxicity, and training data memorization, alongside a discussion of ethical considerations related to large language models. The document is hosted on arXiv, an open-access repository for scholarly articles, and includes submission history, full-text links, and citation tools.

https://arxiv.org/abs/2204.02311

