SheepNav
新上线今天0 投票

Staged Factorial Screening for Budget-Constrained Micro-Pretraining

arXiv:2606.05186v1 Announce Type: new Abstract: Budget-constrained micro-pretraining often requires triaging many candidate recipes on a shared accelerator before larger search budgets are spent. We study whether a staged fractional-factorial workflow can recover stable early effect structure in this setting. On a fixed autoresearch-derived single-GPU training loop, we run 613 experiments across pilot and follow-up screens at 2, 5, and 10 minutes; full 16-condition seeded reruns at 5 and 10 minu

延伸阅读

  1. Differentiable Efficient Operator Search
  2. 大步长梯度下降如何恢复多路径深度线性网络的对称性?ICML 2026研究揭秘
  3. 状态承诺学习:训练语言模型区分计算与记忆
查看原文