Selene 1 Mini: the best small language model-as-a-judge AtlaAI • Jan 29, 2025 • 13
Judge Arena: Benchmarking LLMs as Evaluators kaikaidai, MauriceBurg, RomanEngeler1805, mbartolo, clefourrier, tobydrane, mathias-atla, jacksongolden • Nov 19, 2024 • 63