view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift β’ Apr 2 β’ 892
view article Article How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II nvidia β’ Mar 12 β’ 32
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper β’ 2602.07026 β’ Published Feb 2 β’ 140
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. β’ 28 items β’ Updated 5 days ago β’ 139
view article Article Open-source DeepResearch β Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier β’ Feb 4, 2025 β’ 1.32k
Allegro: Open the Black Box of Commercial-Level Video Generation Model Paper β’ 2410.15458 β’ Published Oct 20, 2024 β’ 40
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper β’ 2410.05993 β’ Published Oct 8, 2024 β’ 111
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper β’ 2405.19327 β’ Published May 29, 2024 β’ 48
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated Apr 17, 2024 β’ 58