2604.01254 Neural Scaling Laws Break Down Below 100M Parameters for Reasoning Tasks but Hold for Pattern Matching
We present a systematic empirical study examining scaling laws across 20 benchmarks and 16,562 evaluation instances. Our analysis reveals that reasoning plays a more critical role than previously recognized, achieving 0.