Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

Data Driven

Arjun Patel on Vector Databases and the Future of Semantic Search

21 Jan 2025

Description

Today, we delve into the intriguing world of vector databases, retrieval augmented generation, and a surprising twist—origami.Our special guest, Arjun Patel, a developer advocate at Pinecone, will be walking us through his mission to make vector databases and semantic search more accessible. Alongside his impressive technical expertise, Arjun is also a self-taught origami artist with a background in statistics from the University of Chicago. Together with co-host Frank La Vigne, we explore Arjun’s unique journey from making speech coaching accessible with AI at Speeko to detecting AI-generated content at Appen.In this episode, get ready to unravel the mysteries of natural language processing, understand the impact of the attention mechanism in transformers, and discover how AI can even assist in the art of paper folding. From discussing the nuances of RAG systems to sharing personal insights on learning and technology, we promise a session that’s both enlightening and entertaining. So sit back, relax, and get ready to fold your way into the fascinating layers of AI with Arjun Patel on Data Driven.Show Notes00:00 Arjun Patel: Bridging AI & Education04:39 Traditional NLP and Geometric Models08:40 Co-occurrence and Meaning in Text13:14 Masked Language Modeling Success16:50 Understanding Tokenization in AI Models18:12 "Understanding Large Language Models"22:43 Instruction-Following vs Few-Shot Learning26:43 "Rel AI: Open Source Data Tool"31:14 "Retrieval-Augmented Generation Explained"33:58 "Pinecone: Efficient Vector Database"37:31 "AI Found Me: Intern to Innovator"41:10 "Impact of Code Generation Models"45:25 Personalized Learning Path Technology46:57 Mathematical Complexity in Origami Design50:32 "Data, AI, and Origami Insights"

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.