Quentin Gallouédec's picture

Hiring 💼

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

hf-doc-build/doc-build-dev

reacted to sergiopaniego's post with 🚀 2 days ago

What happens when you make an LLM drive a car where physics are real and actions can't be undone? I ported CARLA, the autonomous driving simulator, to OpenEnv and added training support via TRL + Hugging Face Spaces. The model interacts with the simulator through tool calls (observe, brake, change lane) and learns from a reward signal. In 50 training steps, Qwen 0.6B learns to swerve and brake to avoid pedestrians in emergency situations. The project supports text and vision (VLMs can see through a camera sensor), open-world driving with traffic, and multiple driving scenarios. This builds on the carla-env project by sinatras, which originally placed LLMs inside CARLA for evaluation. We extended it with vision, new scenarios, rubric-based rewards, and made it trainable end-to-end. Blog: https://huggingface.co/blog/sergiopaniego/bringing-carla-to-openenv-trl/ CARLA env in OpenEnv: https://github.com/meta-pytorch/OpenEnv/tree/main/envs/carla_env Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla.py

updated a Space 3 days ago

qgallouedec/trackio-dev

View all activity

Organizations

qgallouedec 's models 789

qgallouedec/tiny-aya-global-SFT

Updated 10 days ago

qgallouedec/tiny-aya-global-tool-calling-SFT

Updated 10 days ago

qgallouedec/my-other-awesome-model

Text Generation • 0.5B • Updated 15 days ago • 9

qgallouedec/my-awesome-model

Text Generation • 0.5B • Updated 15 days ago • 19

qgallouedec/trainer_output

Text Generation • 0.5B • Updated 15 days ago • 16

qgallouedec/test_push_output_4

Text Classification • 87.5k • Updated 15 days ago • 15

qgallouedec/qwen2-0.5b-deepmath-grpo

qgallouedec/my-finetuned-model

0.8B • Updated Jan 2 • 6

qgallouedec/Qwen3-0.6B-SFT-20251113165959

Text Generation • 0.6B • Updated Nov 13, 2025 • 5

qgallouedec/Qwen3-0.6B-SFT-20251113163732

Updated Nov 13, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112173255

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112165832

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171926

Updated Nov 12, 2025

qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171823

Updated Nov 12, 2025

qgallouedec/gold-model

Updated Oct 30, 2025

qgallouedec/custom-resnet50d

Feature Extraction • 25.6M • Updated Oct 1, 2025 • 2

qgallouedec/Qwen3-1.7B-parsing

Text Generation • 2B • Updated Sep 27, 2025 • 2

qgallouedec/Qwen2.5-0.5B-SFT

Text Generation • 0.5B • Updated Sep 14, 2025 • 2

qgallouedec/Qwen2-0.5B-Reward

Token Classification • 0.5B • Updated Sep 14, 2025 • 1

qgallouedec/Qwen3-0.6B-SFT-20250911031144

Text Generation • 0.6B • Updated Sep 11, 2025 • 2

qgallouedec/Qwen3-0.6B-SFT-20250911023224

Text Generation • 0.6B • Updated Sep 11, 2025 • 3

qgallouedec/Qwen3-0.6B-Base-SFT-20250911020040

Text Generation • 0.6B • Updated Sep 11, 2025 • 1

qgallouedec/Qwen3-0.6B-SFT-20250911021538

Text Generation • 0.6B • Updated Sep 11, 2025 • 3

qgallouedec/Qwen3-0.6B-Base-SFT-20250911021314

Text Generation • 0.6B • Updated Sep 11, 2025 • 1

qgallouedec/Qwen3-0.6B-Base-SFT-20250911014759

Text Generation • 0.6B • Updated Sep 11, 2025 • 1

qgallouedec/Qwen3-0.6B-Base-SFT-20250911011255

Text Generation • 0.6B • Updated Sep 11, 2025 • 1

qgallouedec/after

Text Generation • 0.5B • Updated Sep 11, 2025 • 5

qgallouedec/before

Text Generation • 0.5B • Updated Sep 11, 2025 • 2

qgallouedec/Qwen3-1.7B-SFT-20250910184326

Text Generation • 2B • Updated Sep 10, 2025 • 2

qgallouedec/Qwen3-4B-SFT-20250910180651

Text Generation • 4B • Updated Sep 10, 2025 • 1