PromptFusionNet_dataset
Authors/Creators
Description
Self-made Tri-Modal Dataset. This dataset provides visual, tactile, and textual recordings collected from a robotic grasping platform to support research on multi-modal perception, robot grasping, and task-guided manipulation. It integrates RGB-D visual data, high-resolution tactile force data, and natural language instructions, enabling studies in perception-driven manipulation, cross-modal learning, and robust object recognition under manufacturing uncertainties.
The dataset was collected to validate the proposed tri-modal fusion method described in our paper. It is publicly available to promote reproducibility and facilitate further research in robotic perception and control.
1. Data Acquisition Platform
The data collection platform consists of:
• Robot: UR10 robotic arm (position control mode, 125 Hz)
• Hand: Barrett BH8-282 three-fingered hand with capacitive tactile sensors embedded in fingertips and palm
• Camera: Intel RealSense RGB-D camera mounted on the end effector
The UR10 performs smooth, high-precision grasping trajectories covering approach, contact, and lifting phases. The Barrett hand provides real-time tactile force distribution, while the RealSense camera captures RGB-D images from multiple viewpoints.
2. Object Set
A total of 15 object categories were selected, covering a wide range of materials, surface characteristics, and shapes:
• Materials: plastic, metal, sponge, wood, fabric, ceramic
• Textures: smooth, rough, soft, rigid
• Shapes: regular geometric objects (e.g., cubes, cylinders) and irregular items (e.g., deformable fabric, curved ceramic pieces)
Each object is grasped under multiple angles and varying illumination conditions to simulate real-world sensing degradation.
3. Visual Data
• Sensor: Intel RealSense RGB-D camera
• Resolution: 1920 × 1080 (RGB)
• Operational Range: 0.1 – 10 m
• Preprocessing: RGB images are resized to 224×224×3 for network input
• Calibration:Camera calibration via chessboard pattern. Hand–eye calibration aligns the camera coordinate frame with the UR10 base frame. Ensures spatial consistency between visual and tactile data
4. Tactile Data
• Sensor Type: Capacitive tactile array embedded in fingertips and palm
• Resolution: Each finger has 8×3 sensor cells, signals from all fingers are combined into a global 8×9 tactile map
• Accuracy: ±0.01 N per cell
• Calibration: All tactile sensors were zero-calibrated under no-load conditions to eliminate baseline drift and improve measurement repeatability
• Data format: Raw readings are normalized to the range 0–255 and rearranged according to sensor layout
5. Textual Data
• Source: Natural language instructions provided by operators
• Content: Task-relevant attributes such as material, shape, or functional cues (e.g., “pick up the soft sponge”, “grasp the metal cylinder”)
• Quantity: 100 unique instructions per object category, totaling 1500 distinct text prompts
• Purpose: Serve as semantic guidance to modulate visual and tactile feature extraction, enabling task-aware perception
For details about the dataset or if you wish to use it in your research, please contact: 230248515@seu.edu.cn (English/Chinese)
Files
cloth box.zip
Files
(48.5 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:4989a2f4a0c6a1c177918a41925959aa
|
3.0 GB | Preview Download |
|
md5:d6b82009ec450c8bd0f81b13df6e86ad
|
3.3 GB | Preview Download |
|
md5:6bf14af83073b26e83507ae6c1ff252c
|
3.3 GB | Preview Download |
|
md5:0f92029160040424404be2f2a8a18177
|
3.0 GB | Preview Download |
|
md5:23a269ff608bac3a3164394cdbdd5c18
|
3.2 GB | Preview Download |
|
md5:1fa2eee44d93e391f4cb2b09986edce8
|
3.3 GB | Preview Download |
|
md5:101e67808a79f5dcedd4710ea693341b
|
3.4 GB | Preview Download |
|
md5:3a3b328f0801f7c11af9387214db28a3
|
3.3 GB | Preview Download |
|
md5:3e7adbf91e21f8be2bc82053a49e92c6
|
3.2 GB | Preview Download |
|
md5:e349dffb6db7310c6f2d7602241e5a7e
|
3.2 GB | Preview Download |
|
md5:f5b7c48662a773eb2c73bb763f8fd302
|
3.3 GB | Preview Download |
|
md5:84adc2545adc8afd8fa806f039d2e4fe
|
3.1 GB | Preview Download |
|
md5:b621877745aba129efca42f12a64079d
|
3.1 GB | Preview Download |
|
md5:9b1c5586f81d5b9801f59333dc2580b4
|
3.4 GB | Preview Download |
|
md5:fe4df95b276e30f394fe833d087d6ce3
|
3.3 GB | Preview Download |