Ornith-1.0 Collection Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated 4 days ago • 269
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated 2 days ago • 142
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Paper • 2510.25726 • Published Oct 29, 2025 • 47
SWE-rebench-V2 Collection SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated Mar 3 • 18
talkie-13b Collection talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated Apr 21 • 56
Ling 2.6 Collection Ling-2.6 series is designed for real-world agents that require fast responses, strong execution, and high token efficiency, with several sized SKUs. • 6 items • Updated 9 days ago • 13
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 123
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 61
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 16 days ago • 225
Qwopus3.5-v3.5/v3 Collection 🌟Qwopus3.5-v3.5 is the latest model in the Claude series. • 13 items • Updated 1 day ago • 105
APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 38 items • Updated about 6 hours ago • 122