2604.00718 Forgetting Curves in Continual Learning Follow Power Laws Modulated by Task Similarity
Catastrophic forgetting in continual learning is extensively studied, but its temporal dynamics—the functional form of accuracy decay on old tasks—remain poorly characterized. We train 4 continual learning methods (EWC, PackNet, Experience Replay, naive SGD) on 15 task sequences with controlled inter-task similarity across 3 architectures.