Space: The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.72k
Space: The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLMs on large GPU clusters • 3.6k
Article: KV Caching Explained: Optimizing Transformer Inference Efficiency • Jan 30 • 202
Paper: Training Dynamics Impact Post-Training Quantization Robustness • 2510.06213 • Published Oct 7 • 3
Article: Prefill and Decode for Concurrent Requests - Optimizing LLM Performance • Apr 16 • 56
Collection: SmolLM3 🧠 • Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9 • 89
Article: Unlocking Longer Generation with Key-Value Cache Quantization • May 16, 2024 • 54