
AI: post transformers

SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence

06 Sep 2025

Description

This September 2025 paper describes SAIR, the Structurally Augmented IC50 Repository, a groundbreaking open-source dataset developed by SandboxAQ in collaboration with NVIDIA. SAIR is the largest publicly available collection of AI-generated 3D protein-ligand structures, with over 5 million entries, each linked to experimentally measured drug potency data (IC₅₀ values). The dataset aims to bridge a critical data gap in AI-powered drug discovery by providing comprehensive structural intelligence, enabling researchers to accelerate R&D, explore novel drug targets, and improve the accuracy of AI models that predict drug properties. Creating SAIR required extensive high-performance computing, consuming more than 130,000 GPU hours, and its structures were rigorously validated with industry-standard tools, achieving a 97% pass rate. By offering this resource free for commercial and non-commercial use on platforms like Hugging Face, SAIR seeks to revolutionize how pharmaceutical, biotech, and tech-bio leaders approach drug design and optimization.

Sources:
https://go.sandboxaq.com/rs/175-UKR-711/images/sair_paper.pdf
https://huggingface.co/datasets/SandboxAQ/SAIR
https://huggingface.co/blog/SandboxAQ/sair-data-accelerating-drug-discovery-with-ai

