ZAYA1-8B Technical Report
Paper • 2605.05365 • Published • 4
Large language models, scaling laws, AI Alignment, democratization of DL
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs
GPT-NeoX-20B: An Open-Source Autoregressive Language Model