Published January 21, 2026 | Version Video-Instance-Segmentation
Software Open

zerotne/SPVIS: SPVIS: Enhancing Video Instance Segmentation through Stabilized Feature Propagation

Authors/Creators

Description

Video instance segmentation (VIS) extends instance-level understanding from static images to continuous video, necessitating accurate pixel-level masks and consistent identity association across frames. Feature propagation approaches, while computationally efficient, are often hindered by error accumulation and feature degradation over time. We introduce SPVIS, a VIS framework based on feature propagation, addressing these challenges through in-memory object-query propagation. SPVIS comprises a Progressive Tracker (PGT) for cross-clip association with error correction and joint feature-preserving modeling, including the Refinement Compensator (RCP) and Spatial Interaction Module (SIM), to maintain high-quality object queries. Across standard benchmarks, SPVIS achieves competitive accuracy-efficiency trade-offs, delivering 69.5, 64.6, 51.9, and 54.3 AP on YouTube-VIS 2019, 2021, 2022, and OVIS, respectively. Our framework provides a lightweight solution for long-sequence association, including scenarios with low frame rates and occlusions.

Files

zerotne/SPVIS-Video-Instance-Segmentation.zip

Files (19.8 MB)

Name Size Download all
md5:32f973029480c8a1be3d0df187c77027
19.8 MB Preview Download

Additional details

Related works

Software