Running 6 Responsible AI Benchmark π 6 Evaluating safety, robustness & fairness for real use-cases
google/embeddinggemma-300m Sentence Similarity β’ 0.3B β’ Updated Sep 25, 2025 β’ 741k β’ β’ 1.43k
Running 6 Responsible AI Benchmark π 6 Evaluating safety, robustness & fairness for real use-cases
Tiny Language Model Datasets Collection Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model β’ 14 items β’ Updated 1 day ago β’ 29
Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security Paper β’ 2507.19399 β’ Published Jul 25, 2025 β’ 2 β’ 2