Published July 11, 2025 | Version v1
Presentation Open

POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images - EXAKI training

Description

EXAKI training number 1.

In this session, we present a joint work produced in collaboration between CIIRC at CTU in Prague and valeo.ai within the EXA4MIND project.

This work focuses on leveraging multimodal learning, particularly language-vision integration, to enable open-vocabulary 3D scene understanding. This is based on our work called “POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images”, which was presented at NeurIPS.

Files

1 Session POP-3D_ ... SLIDES (2).pdf

Files (10.7 MB)

Name Size Download all
md5:64a15561cad1f56020bb8a3a6f972037
10.7 MB Preview Download

Additional details

Funding

European Commission
EXA4MIND - EXtreme Analytics for MINing Data spaces 101092944

Dates

Accepted
2025-07-11