- OnlineAnySeg propose an efficient data structure for organizing sequential 2D masks, which can incrementally maintain the spatial associations between all the masks in real-time.
- OnlineAnySeg designs a zero-shot online mask merging strategy. By leveraging spatial overlap and multimodal similarity through collaborative filtering, this approach eliminates the dependency on training data, enabling it to maintain good performance even in incomplete scanned scenes.
- OnlineAnySeg performs comparably with existing offline methods and gains notable improvements over the SOTA online method on the publicly available benchmark, running at ~15 FPS.