E-M-S: Event-Monocular-Spacecraft dataset for 6D pose estimation of non-cooperative targets.

Wang, Yishi; Michele, Maestrini; Mauro, Massarib; Pierluigi, Di Lizia; Zhang, Zexu

doi:10.5281/zenodo.14774652

Published February 10, 2025 | Version 1.0

Dataset Open

E-M-S: Event-Monocular-Spacecraft dataset for 6D pose estimation of non-cooperative targets.

1. Harbin Institute of Technology
2. Politecnico di Milano

This dataset was used in the paper "Cross-Modal Fusion of Monocular images and Neuromorphic streams for 6D Pose Estimation of Non-Cooperative Targets", for studying 6D pose estimation of non-cooperative spacecrafts.

Research Background: Robust pose estimation for non-cooperative space targets is a critical technology for on-orbit servicing and active debris removal. Learning-based monocular vision methods have been widely adopted for spacecraft pose estimation tasks, achieving superior performance compared to traditional approaches. However, extreme illumination conditions found in space pose challenges for these approaches. Unlike traditional cameras, event cameras capture asynchronous brightness changes at the pixel level. These cameras exhibit remarkable attributes, including high dynamic range, low latency, and resistance to motion blur, making them well-suited for high dynamic range scenes and high-speed motion. These features make event cameras an ideal complement to standard RGB cameras for object pose estimation. Combining these two sensor types to leverage their complementary strengths is a promising research endeavor. Currently, there are only a few datasets of event images for spacecraft, and it is important to generate and make public new spacecraft event datasets.

In our paper, we describe the methodology used to generate the dataset. For more information, please refer to our paper:“Cross-Modal Fusion of Monocular images and Neuromorphic streams for 6D Pose Estimation of Non-Cooperative Targets”

This dataset contains a total of 57200 event.csv (57,200 event frames), corresponding 57,200 RGB images, 572 RGB videos and ground truth 6D pose labels. The image resolution is 640×480, including different lighting conditions such as normal and low light. The images in the RGB/img folder have the same pose label as the corresponding numbered csv event stream in the Even/event_stream folder. The data format is as follows:
E-M-S.zip
--training
--RGB
--img{*.jpg}
--pose {*.txt: Pose format <r1, r2, r3, r4, r5, r6, r7, r8, r9, tx, ty, tz>, the first 9 values represent the rotation matrix, and the last three values represent the translation vector}
--Event
--event_stream{*.csv, event stream with the format<timestamp,x,y,polarity>}
--pose {*.txt: Pose format <r1, r2, r3, r4, r5, r6, r7, r8, r9, tx, ty, tz>, the first 9 values represent the rotation matrix, and the last three values represent the translation vector}

--validation (Same data structure as training)
--RGB
--Event

--testing (Same data structure as training)
--RGB
--Event

Notes

Please note: Our event data is generated by upsampling RGB videos with an FPS of 5. The timestamp resolution in milliseconds. In the future, we plan to release `event.csv` files with microsecond-level timestamp resolution to better approximate real-world scenarios. Stay tuned.

When using the data in an academic context, please cite the following paper:
Wang Yishi, Michele Maestrini,Zhang Zexu,Mauro Massari,Pierluigi Di Lizia. Cross-Modal Fusion of Monocular images and Neuromorphic streams for 6D Pose Estimation of Non-Cooperative Targets.

Files

E-M-S.zip

Files (6.5 GB)

Name	Size	Download all
E-M-S.zip md5:81d05bbb4b3d3be3e1acf9a33fdb8805	6.5 GB	Preview Download

Additional details

DOI: 10.5281/zenodo.14774652

Submitted: 2025-02-10

	All versions	This version
Views	349	349
Downloads	69	69
Data volume	1.8 TB	1.8 TB

E-M-S.zip

Files (6.5 GB)

Identifiers

Dates

E-M-S: Event-Monocular-Spacecraft dataset for 6D pose estimation of non-cooperative targets.

Authors/Creators

Description

Notes

Files

E-M-S.zip

Files (6.5 GB)

Additional details

Identifiers

Dates