← Back to Model Beat
4Research·5h ago

ICDepth: Taming Video Diffusion Models for Video Depth Estimation via In-Context Conditioning

Researchers have introduced ICDepth, a new method that utilizes video diffusion models to improve depth estimation in single-camera footage. By incorporating in-context conditioning, the technique helps maintain temporal consistency and geometric precision across frames, addressing a common limitation in existing video processing models.

Covered by 1 source

  • AarXiv CS.AIXuanhua He, Jiaxin Xie, Mingzhe Zheng, Qifeng Chen5h ago

Related stories

ResearchOn Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMsJun 29 · 13 sourcesResearchAnti-Causal Domain Generalization: Leveraging Unlabeled DataJul 1 · 2 sourcesResearchLearning Unmasking Policies for Diffusion Language ModelsJun 29 · 6 sourcesResearchRedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttentionJun 29