Quentin Gallouédec
PRO
qgallouedec
593 followers · 341 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
Updated a dataset 2 days ago: hf-doc-build/doc-build-dev
Reacted to sergiopaniego's post with 🚀 2 days ago:
What happens when you make an LLM drive a car where physics are real and actions can't be undone? I ported CARLA, the autonomous driving simulator, to OpenEnv and added training support via TRL + Hugging Face Spaces.
The model interacts with the simulator through tool calls (observe, brake, change lane) and learns from a reward signal. In 50 training steps, Qwen 0.6B learns to swerve and brake to avoid pedestrians in emergency situations.
The project supports text and vision (VLMs can see through a camera sensor), open-world driving with traffic, and multiple driving scenarios.
This builds on the carla-env project by sinatras, which originally placed LLMs inside CARLA for evaluation. We extended it with vision, new scenarios, rubric-based rewards, and made it trainable end-to-end.
Blog: https://huggingface.co/blog/sergiopaniego/bringing-carla-to-openenv-trl/
CARLA env in OpenEnv: https://github.com/meta-pytorch/OpenEnv/tree/main/envs/carla_env
Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla.py
Updated a Space 3 days ago: qgallouedec/trackio-dev
qgallouedec's models (789)
qgallouedec/tiny-aya-global-SFT • Updated 10 days ago
qgallouedec/tiny-aya-global-tool-calling-SFT • Updated 10 days ago
qgallouedec/my-other-awesome-model • Text Generation • 0.5B • Updated 15 days ago • 9
qgallouedec/my-awesome-model • Text Generation • 0.5B • Updated 15 days ago • 19
qgallouedec/trainer_output • Text Generation • 0.5B • Updated 15 days ago • 16
qgallouedec/test_push_output_4 • Text Classification • 87.5k • Updated 15 days ago • 15
qgallouedec/qwen2-0.5b-deepmath-grpo • Updated Jan 13
qgallouedec/my-finetuned-model • 0.8B • Updated Jan 2 • 6
qgallouedec/Qwen3-0.6B-SFT-20251113165959 • Text Generation • 0.6B • Updated Nov 13, 2025 • 5
qgallouedec/Qwen3-0.6B-SFT-20251113163732 • Updated Nov 13, 2025
qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112173255 • Updated Nov 12, 2025
qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112165832 • Updated Nov 12, 2025
qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171926 • Updated Nov 12, 2025
qgallouedec/Meta-Llama-3-8B-Instruct-SFT-20251112171823 • Updated Nov 12, 2025
qgallouedec/gold-model • Updated Oct 30, 2025
qgallouedec/custom-resnet50d • Feature Extraction • 25.6M • Updated Oct 1, 2025 • 2
qgallouedec/Qwen3-1.7B-parsing • Text Generation • 2B • Updated Sep 27, 2025 • 2
qgallouedec/Qwen2.5-0.5B-SFT • Text Generation • 0.5B • Updated Sep 14, 2025 • 2
qgallouedec/Qwen2-0.5B-Reward • Token Classification • 0.5B • Updated Sep 14, 2025 • 1
qgallouedec/Qwen3-0.6B-SFT-20250911031144 • Text Generation • 0.6B • Updated Sep 11, 2025 • 2
qgallouedec/Qwen3-0.6B-SFT-20250911023224 • Text Generation • 0.6B • Updated Sep 11, 2025 • 3
qgallouedec/Qwen3-0.6B-Base-SFT-20250911020040 • Text Generation • 0.6B • Updated Sep 11, 2025 • 1
qgallouedec/Qwen3-0.6B-SFT-20250911021538 • Text Generation • 0.6B • Updated Sep 11, 2025 • 3
qgallouedec/Qwen3-0.6B-Base-SFT-20250911021314 • Text Generation • 0.6B • Updated Sep 11, 2025 • 1
qgallouedec/Qwen3-0.6B-Base-SFT-20250911014759 • Text Generation • 0.6B • Updated Sep 11, 2025 • 1
qgallouedec/Qwen3-0.6B-Base-SFT-20250911011255 • Text Generation • 0.6B • Updated Sep 11, 2025 • 1
qgallouedec/after • Text Generation • 0.5B • Updated Sep 11, 2025 • 5
qgallouedec/before • Text Generation • 0.5B • Updated Sep 11, 2025 • 2
qgallouedec/Qwen3-1.7B-SFT-20250910184326 • Text Generation • 2B • Updated Sep 10, 2025 • 2
qgallouedec/Qwen3-4B-SFT-20250910180651 • Text Generation • 4B • Updated Sep 10, 2025 • 1