Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

AI可可AI生活

AI前沿:从数学推理到模型优化

29 Jun 2025

Description

[CL] OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling[Shanghai Jiao Tong University]https://arxiv.org/abs/2506.20512---[LG] Overtuning in Hyperparameter Optimization[LMU Munich]https://arxiv.org/abs/2506.19540---[LG] Distilling Normalizing Flows[University of Oregon & HSE University & Picsart AI Research]https://arxiv.org/abs/2506.21003---[LG] Gaussian Invariant Markov Chain Monte Carlo[Google DeepMind & UCL]https://arxiv.org/abs/2506.21511

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.