Sergio Paniego PRO
AI & ML interests
Recent Activity
Organizations
Posts 87
> evaluate your agents using OpenEnv
> learn how rewards work via rubrics
> connect agents via MCP
> many moreeeee!
anything you think it's missing?
https://meta-pytorch.org/OpenEnv/tutorials/index.html
Articles 16
Welcome Gemma 4: Frontier multimodal intelligence on device
- Runtime errorRL
CARLA Environment Server
🚗Control a Carla driving simulation with custom actions
- Runtime errorRL
CARLA Environment Server
🚗Control a CARLA driving simulator with custom actions
- SleepingAgents
Carla Grpo Trolley
🚀Visualize your program’s I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B • Updated • 7
- Running3.83k
The Ultra-Scale Playbook
🌌3.83kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.15k
The Smol Training Playbook
📚3.15kThe secrets to building world-class LLMs
- Running315
Evaluation Guidebook
📝315Explore LLM benchmark trends over time
- Running224
FineVision: Open Data is All You Need
📝224A new open-source dataset for training VLMs
- Runtime errorRL
CARLA Environment Server
🚗Control a Carla driving simulation with custom actions
- Runtime errorRL
CARLA Environment Server
🚗Control a CARLA driving simulator with custom actions
- SleepingAgents
Carla Grpo Trolley
🚀Visualize your program’s I/O activity in real time
-
sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B • Updated • 7
- Running3.83k
The Ultra-Scale Playbook
🌌3.83kThe ultimate guide to training LLM on large GPU Clusters
- Running on CPU UpgradeFeatured3.15k
The Smol Training Playbook
📚3.15kThe secrets to building world-class LLMs
- Running315
Evaluation Guidebook
📝315Explore LLM benchmark trends over time
- Running224
FineVision: Open Data is All You Need
📝224A new open-source dataset for training VLMs
spaces 136
VLM Object Understanding
Explore object detection, visual grounding, keypoint Detecti
Qwen2-VL-7B
Ask questions about charts in images
SmolVLM-trl-dpo-rlaif-v
Generate text from an image and question
SmolVLM-trl-sft-ChartQA
Ask questions about charts in images
REPL Environment Server
Run agentic Python tasks with LLM guidance interactively
Reasoning Gym Environment Server
Interact with a reasoning gym environment via text steps