← Back to Model Beat
4Research·Jun 12

Select to Think: Unlocking SLM Potential with Local Sufficiency

arXiv:2604.26940v2 Announce Type: replace Abstract: Small language models (SLMs) offer efficient deployment, yet they often lag behind their larger counterparts (LLMs) in reasoning. Existing remedies either invoke an LLM at points of reasoning divergence, incurring substantial latency and cost, or rely on standard distillation, which is limited by the SLM's capacity to accurately mimic the LLM's complex generative distribution. We address this dilemma by identifying local sufficiency: at divergence points, the LLM's preferred token often resides within the SLM's top-K next-token predictions, even when failing to emerge as the SLM top-1 choice. We therefore propose Select to Think (S2T), which reframes the LLM's role from open-ended generation to selection among the SLM's proposals, simplifying the supervision signal to discrete candidate rankings. Leveraging this, we introduce S2T-Local, which distills the selection logic into the SLM, empowering it to perform autonomous re-ranking without inference-time LLM dependency. Empirically, a 1.5B SLM's top-8 candidates…

Covered by 1 source

  • AarXiv CS.AIWenxuan Ye, Yangyang Zhang, Xueli An, Georg Carle, Yunpu MaJun 12

Related stories

ResearchInvesting in multi-agent AI safety researchJun 10 · 2 sourcesResearchEconomic Research - AnthropicJun 8ResearchMicrosoft Research's Mirage gives video generation a persistent spatial memory that doesn't forget what's around the cornerJun 14ResearchAnthropic study shows AI needs hours, not weeks, to build exploits from security patchesJun 10