Naman Anand

naman5a

AI & ML interests

RAG, LLMs

Recent Activity

upvoted an article 10 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 894

upvoted an article 2 months ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c • Feb 4 • 89

upvoted an article 4 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero • Feb 23, 2024 • 208

upvoted an article 5 months ago

Article

The Optimal Architecture for Small Language Models

codelion • Dec 26, 2025 • 121

upvoted a paper 5 months ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 119

upvoted 2 articles 5 months ago

Article

Automatic Prompt Optimization with DSPy and Cross Encoders

dleemiller • Aug 2, 2025 • 5

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate • Dec 4, 2025 • 624

upvoted 3 articles 6 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato • Nov 25, 2025 • 385

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101

Article

20x Faster TRL Fine-tuning with RapidFire AI

kbigdelysh, arunkk09, qgallouedec • Nov 21, 2025 • 27

upvoted a collection 9 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 109

upvoted a paper 9 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

upvoted 4 articles 11 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

julien-c • Feb 14, 2020 • 61

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131 • Apr 16, 2025 • 42

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

danaaubakirova, Molbap, mshukor, cadene • Feb 4, 2025 • 192

Article

Finally, a Replacement for BERT: Introducing ModernBERT

bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo • Dec 19, 2024 • 740

upvoted a paper 12 months ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 96

upvoted a collection about 1 year ago

GLM-4-0414

Collection

GLM-4-0414 series model • 6 items • Updated Mar 2 • 135

upvoted a paper about 1 year ago

AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference

Paper • 2504.10326 • Published Apr 14, 2025 • 25

upvoted an article about 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr • Feb 7, 2025 • 293
