zerotne/SPVIS: SPVIS: Enhancing Video Instance Segmentation through Stabilized Feature Propagation
Authors/Creators
Description
Video instance segmentation (VIS) extends instance-level understanding from static images to continuous video, necessitating accurate pixel-level masks and consistent identity association across frames. Feature propagation approaches, while computationally efficient, are often hindered by error accumulation and feature degradation over time. We introduce SPVIS, a VIS framework based on feature propagation, addressing these challenges through in-memory object-query propagation. SPVIS comprises a Progressive Tracker (PGT) for cross-clip association with error correction and joint feature-preserving modeling, including the Refinement Compensator (RCP) and Spatial Interaction Module (SIM), to maintain high-quality object queries. Across standard benchmarks, SPVIS achieves competitive accuracy-efficiency trade-offs, delivering 69.5, 64.6, 51.9, and 54.3 AP on YouTube-VIS 2019, 2021, 2022, and OVIS, respectively. Our framework provides a lightweight solution for long-sequence association, including scenarios with low frame rates and occlusions.
Files
zerotne/SPVIS-Video-Instance-Segmentation.zip
Files
(19.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:32f973029480c8a1be3d0df187c77027
|
19.8 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/zerotne/SPVIS/tree/Video-Instance-Segmentation (URL)
Software
- Repository URL
- https://github.com/zerotne/SPVIS