Expect technical depth, live demos, and practical lessons learned from real-world teams.
Are general-purpose LLMs falling short of your company’s highly specialized practical requirements? While Supervised Fine-Tuning (SFT) is an option, what happens when you simply don’t have enough data?
“On-the-job” Reinforcement Learning (RL) is emerging as the key to filling this gap, enabling models to acquire advanced reasoning and align with highly specific business intents. However, the barrier to entry is notoriously high. Manually comparing GPU providers, building deployment scripts, and configuring infrastructure can delay RL training jobs by hours or even days.
Join us for a deep dive into Serverless RL, a new backend powered by Weights & Biases and CoreWeave designed to abstract away infrastructure headaches.
In this talk, we will cover:
Whether you’re looking to build hyper-fast voice agents or specialized internal experts, this session will show you how to empower your software teams to train specialized open-source models on demand, without forcing them to become hardware managers.

