12 58

HumanistAtypik

HumanistAtypik

AI & ML interests

None yet

Recent Activity

liked a Space 21 days ago

OpenEvals/every-leaderboards

liked a model 24 days ago

mistralai/Mistral-Small-4-119B-2603

liked a dataset about 1 month ago

nebius/SWE-rebench-V2

View all activity

Organizations

None yet

liked a Space 21 days ago

Official Benchmarks Leaderboard 2026

🏆

Explore and compare AI model scores across official benchmarks

liked a model 24 days ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated 15 days ago • 73.2k • 349

liked a dataset about 1 month ago

nebius/SWE-rebench-V2

Viewer • Updated 20 days ago • 32.1k • 6.29k • 34

upvoted a collection about 1 month ago

SWE-rebench-V2

Collection

SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated Mar 3 • 9

liked a Space about 1 month ago

Nanbeige 4.1 3B

🔮

Chat with Nanbeige AI locally in your browser

liked a model about 2 months ago

mistralai/Voxtral-Mini-4B-Realtime-2602

Automatic Speech Recognition • 4B • Updated 29 days ago • 883k • 806

upvoted a changelog about 2 months ago

Hugging Face Changelog

Community Evals and Benchmark Repositories

Feb 5

• 76

liked 2 models about 2 months ago

Lightricks/LTX-2

Image-to-Video • Updated Mar 2 • 919k • • 1.67k

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 16 days ago • 434k • • 1.03k

liked a model 4 months ago

mistralai/Mistral-Large-3-675B-Instruct-2512

Updated Dec 19, 2025 • 625 • 223

upvoted a collection 4 months ago

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 162

liked a Space 4 months ago

Image Arena Leaderboard

📊

587

Image Generation and Image Editing Arena & Leaderboard

liked a model 5 months ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 56.8k • • 1.49k

liked a model 6 months ago

DragonLLM/Dragon-3B-Base-alpha

4B • Updated Dec 12, 2025 • 11 • 9

liked a Space 6 months ago

LLM Performance Leaderboard

🐨

451

View the latest LLM performance leaderboard online

liked 2 datasets 6 months ago

theResearchNinja/violentutf_cybersecurityBehavior

Viewer • Updated Jun 12, 2024 • 10k • 28 • 3

CounterBench/CounterBench

Preview • Updated Aug 4, 2025 • 41 • 1

liked 3 Spaces 6 months ago

Open VLM Video Leaderboard

🌎

131

VLMEvalKit Eval Results in video understanding benchmark

WebWalkerQALeaderboard

🥇

Display leaderboard for AI models

LVBench Leaderboard

🐨

Submit and view model evaluations