Ben Kelly PRO

YellowjacketGames

manacasterben

AI & ML interests

None yet

Recent Activity

updated a collection about 3 hours ago

[papers] Gameplay Optimization

replied to danielhanchen's post about 13 hours ago

You can now run Kimi K2.5 locally! 🔥 We shrank the 1T model to 240GB (-60%) via Dynamic 1-bit. Get >40 tok/s on 242GB or 622GB VRAM/RAM for near full precision. GGUF: https://huggingface.co/unsloth/Kimi-K2.5-GGUF Guide: https://unsloth.ai/docs/models/kimi-k2.5

replied to danielhanchen's post about 14 hours ago

YellowjacketGames's collections (15)

[mixed] Chess x AI

Research directly related to Chess technology.

Lichess/standard-chess-games

Viewer • Updated Oct 16, 2025 • 7.14B • 4.16k • 63
Lichess/tournament-chess-games

Viewer • Updated Dec 9, 2025 • 931k • 678 • 6
Lichess/three-check-chess-games

Viewer • Updated Oct 16, 2025 • 9.41M • 124 • 2

[models] RTX a6000 48gb

Models that run well on a *standalone* RTX a6000's 48gb of VRAM.

unsloth/Qwen-Image-Edit-2511-GGUF

Image-to-Image • 20B • Updated 22 days ago • 139k • 327
unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF

24B • Updated Dec 15, 2025 • 157k • 90
zai-org/GLM-Image

Text-to-Image • Updated 16 days ago • 14.4k • 1.01k
nvidia/personaplex-7b-v1

Audio-to-Audio • Updated 2 days ago • 54.5k • 1.51k

[models] CPU-Offload &/|| A6000x2

TPS can be as low as 1.0, seriously. It's SLOW.

unsloth/GLM-4.7-GGUF

Text Generation • 358B • Updated Dec 27, 2025 • 113k • 190
unsloth/DeepSeek-R1-0528-GGUF

Text Generation • 671B • Updated Jun 15, 2025 • 3.64k • 193
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF

Image-to-Text • 401B • Updated Jun 18, 2025 • 4.98k • 43
unsloth/MiniMax-M2.1-GGUF

Text Generation • 229B • Updated Dec 26, 2025 • 129k • 165

[models] iGPU-Capable < 512mb

Hey, you gotta try. Any precision is acceptable here; QA-check the actual result quality. At your own risk, fool.

unsloth/LFM2.5-1.2B-Thinking-GGUF

Text Generation • 1B • Updated 10 days ago • 1.7k • 2

[mixed] ORCAssist "Work's Done!"

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated Dec 2, 2025 • 18.4k • 538
allenai/SERA-8B-GA

8B • Updated about 18 hours ago • 9 • 9
GAIR/daVinci-Dev-72B

Text Generation • 73B • Updated 4 days ago • 105 • 5
A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification

Paper • 2601.13288 • Published 11 days ago • 12

[papers] Distillation

Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment

Paper • 2601.14249 • Published 10 days ago • 9
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

Paper • 2402.07033 • Published Feb 10, 2024 • 19
MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences

Paper • 2601.07251 • Published 19 days ago • 11
GameTalk: Training LLMs for Strategic Conversation

Paper • 2601.16276 • Published 8 days ago • 12

[papers] Sports Tech

VideoMaMa: Mask-Guided Video Matting via Generative Prior

Paper • 2601.14255 • Published 10 days ago • 13
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published 10 days ago • 9
Think3D: Thinking with Space for Spatial Reasoning

Paper • 2601.13029 • Published 12 days ago • 46

[data] What a Dump!

Seriously, look at the size of these dumps! They're a huge pile of data, dumped on your doorstep!

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 60
Lichess/standard-chess-games

Viewer • Updated Oct 16, 2025 • 7.14B • 4.16k • 63

[papers] Gameplay Optimization

Research papers that may contribute to a broader approach to teaching machines how to play complex strategy games beyond just Chess.

OptiMind: Teaching LLMs to Think Like Optimization Experts

Paper • 2509.22979 • Published Sep 26, 2025 • 3
LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 51
Zero-Overhead Introspection for Adaptive Test-Time Compute

Paper • 2512.01457 • Published Dec 1, 2025 • 1
Confidence Estimation for LLMs in Multi-turn Interactions

Paper • 2601.02179 • Published 25 days ago • 16

[models] GTX 1660 Super 6gb

The best little card under 100 euros. Full Precision vs Quants not benchmarked. This card is so much better at running inference than you realize.

MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF

Text Generation • 8B • Updated Dec 6, 2025 • 71.2k • 4
unsloth/Qwen3-4B-Instruct-2507-GGUF

4B • Updated Aug 20, 2025 • 59k • 136
unsloth/SmolLM3-3B-128K-GGUF

3B • Updated Jul 8, 2025 • 2.82k • 37
unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF

Text Generation • 8B • Updated Jun 16, 2025 • 50.4k • 367

[models] Sub-1gb for Edge

Toaster Tier but not iGPU

lmstudio-community/LFM2.5-1.2B-Thinking-GGUF

1B • Updated 10 days ago • 1.02k • 2
unsloth/SmolLM3-3B-128K-GGUF

3B • Updated Jul 8, 2025 • 2.82k • 37

[models] non-EN specialists

Emphasis on Portuguese, Spanish, Vietnamese, Hebrew

lucifrrrrrrrrrr/vn-geography-8b

8B • Updated 5 days ago • 58 • 2

[mixed] Image Generation Stack

The stuff we actually use, pruned on an ongoing basis.

black-forest-labs/FLUX.2-dev

Image-to-Image • Updated Nov 27, 2025 • 133k • 1.3k
unsloth/Qwen-Image-Edit-2511-GGUF

Image-to-Image • 20B • Updated 22 days ago • 139k • 327
zai-org/GLM-Image

Text-to-Image • Updated 16 days ago • 14.4k • 1.01k
fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA

Image-to-Image • Updated 23 days ago • 80.3k • 890

[papers] RAG$ to Riche$

UltraRAG: A Modular and Automated Toolkit for Adaptive Retrieval-Augmented Generation

Paper • 2504.08761 • Published Mar 31, 2025 • 7
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published Dec 30, 2025 • 111
DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal

Paper • 2601.18081 • Published 5 days ago • 7

[papers] Film & Cinema

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Paper • 2601.17737 • Published 6 days ago • 53
Advancing Open-source World Models

Paper • 2601.20540 • Published 3 days ago • 86
