SheepNav
新上线今天0 投票

Differentiable Efficient Operator Search

arXiv:2606.05232v1 Announce Type: new Abstract: Efficient multimodal foundation models often rely on manually designed token-reduction operators, such as pruning, merging, pooling, and adaptive reweighting. Although these operators appear different, we show that they can be interpreted as distinct regimes of a shared operator space. Based on this view, we introduce Efficient Operator Search, a differentiable framework that jointly searches where to reduce tokens, how many tokens to retain, and h

延伸阅读

  1. 大步长梯度下降如何恢复多路径深度线性网络的对称性?ICML 2026研究揭秘
  2. 状态承诺学习:训练语言模型区分计算与记忆
  3. 大语言模型中的时间偏好概念及其功能
查看原文