Sparse Activation Papers - 专知

Sparse Activation

Plug-and-Play Homeostatic Spark: Zero-Cost Acceleration for SNN Training Across Paradigms

Arxiv

0+ reads · December 4, 2025

COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection

Arxiv

0+ reads · October 27, 2025

MoE-Prism: Disentangling Monolithic Experts for Elastic MoE Services via Model-System Co-Designs

Arxiv

0+ reads · October 22, 2025

Sparse Activation Editing for Reliable Instruction Following in Narratives

Arxiv

0+ reads · May 22, 2025

AMMSM: Adaptive Motion Magnification and Sparse Mamba for Micro-Expression Recognition

Arxiv

0+ reads · March 31, 2025

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Arxiv

0+ reads · February 27, 2025

fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving

Arxiv

0+ reads · February 7, 2025

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

Arxiv

0+ reads · February 18, 2025

Memory Layers at Scale

Arxiv

1+ reads · December 20, 2024

Memory Layers at Scale

Arxiv

1+ reads · December 12, 2024

Exploring the Benefit of Activation Sparsity in Pre-training

Arxiv

0+ reads · October 4, 2024

STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

Arxiv

0+ reads · September 10, 2024

Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor

Arxiv

0+ reads · September 12, 2024

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Arxiv

0+ reads · July 24, 2024

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Arxiv

0+ reads · July 25, 2024
