IA Sob Controle - Inteligência Artificial cover image

225: Claude Code Meetup em SP, sucesso de IAs programadoras, ByteDance Seedance 2.0

IA Sob Controle - Inteligência Artificial

00:00

Estudo: RL via Self‑Distillation (SDPO)

Apresentam paper do ETH sobre SDPO, técnica que melhora RL com feedback rico e acelera aprendizado em benchmarks.

Play episode from 01:37:40
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app