Want to learn how to align a Vision Language Model (VLM) for reasoning using GRPO and TRL? 🌋 🧑‍🍳 We've got you covered!! NEW multimodal post training recipe to align a VLM using TRL in @huggingface's Cookbook
10,23K