4Research·5h ago
SUNTA: Hierarchical Video Prediction with Surprise-based Chunking
Researchers have introduced SUNTA, a hierarchical state-space model that improves long-form video prediction by dynamically segmenting video sequences based on visual surprise. By identifying key moments of change rather than using fixed intervals, this approach allows the model to better manage complex temporal dependencies in extended video generation.
Covered by 1 source
- AarXiv CS.AI↗Tomoshi Iiyama, Masahiro Suzuki, Yutaka Matsuo5h ago