AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio

Huang, Xiaoyang; Wang, Yanjun; Liu, Yang; Ni, Bingbing; Zhang, Wenjun; Liu, Jinxian; Li, Teng

doi:10.5281/zenodo.7581758

Published January 30, 2023 | Version 1.0

Dataset Open

1. Shanghai Jiao Tong University

Abstract

We introduce AudioEar3D, a high-quality 3D ear dataset consisting of 112 point cloud ear scans with RGB images, to benchmark the ear reconstruction task. We further collect a 2D ear dataset, named AudioEar2D, composed of 2,000 images, each manually annotated with occlusion and 55 landmarks. To our knowledge, both are the largest and highest-quality publicly available datasets of their kind.

Usage

The code is publicly available at https://github.com/seanywang0408/AudioEar.

The file organization of AudioEar3D is as follows:

AudioEar3D
├── 001
    # left ear data
    ├── left.jpg # processed RGB image of left ear
    ├── left.ply # processed point cloud of left ear in canonical pose (frontal view is negative-X and upper view is positive-Z)
    ├── left.json # 56 landmark annotations of image
    ├── mask_left.jpg # mask generated by the outer landmarks
    ├── masked_left.jpg # exclude background in left.jpg using mask_left.jpg
    ├── masked_left.png # exclude background in left.jpg using mask_left.jpg, but with four channels of RGB-A
    # right ear data
    ├── right.jpg
    ...
    ├── masked_right.png
├── 002
...
├── 056
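The layout above can be turned into a small path helper. The following is a minimal sketch rather than part of the released code: the helper names `ear_files` and `missing_files` are hypothetical, and it assumes that subject folders are zero-padded to three digits (001 to 056) and that the elided right-ear files mirror the left-ear names.

```python
from pathlib import Path

# File name patterns copied from the listing above; "{side}" is "left" or
# "right". Assumption: the right-ear files mirror the left-ear names.
PER_EAR_FILES = (
    "{side}.jpg",         # processed RGB image
    "{side}.ply",         # processed point cloud in canonical pose
    "{side}.json",        # landmark annotations of the image
    "mask_{side}.jpg",    # mask generated by the outer landmarks
    "masked_{side}.jpg",  # image with the background excluded
    "masked_{side}.png",  # same, stored with four RGB-A channels
)


def ear_files(root, subject, side):
    """Return {file name: path} for one ear of one subject.

    Assumes subject folders are zero-padded to three digits (001 ... 056).
    """
    if side not in ("left", "right"):
        raise ValueError("side must be 'left' or 'right'")
    folder = Path(root) / f"{subject:03d}"
    names = [pattern.format(side=side) for pattern in PER_EAR_FILES]
    return {name: folder / name for name in names}


def missing_files(root, subjects=range(1, 57)):
    """List expected files that are absent under root (hypothetical helper)."""
    missing = []
    for subject in subjects:
        for side in ("left", "right"):
            for path in ear_files(root, subject, side).values():
                if not path.is_file():
                    missing.append(path)
    return missing
```

Calling `missing_files("AudioEar3D")` after unpacking the archive gives a quick completeness check; an empty list means every expected file was found under these assumptions.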

The file organization of AudioEar2D is as follows:

AudioEar2D
├── 00000.png # processed ear image
├── 00000.json # landmark annotations
...
├── 69985.png # the index is aligned with the data source FFHQ.
├── 69985.json
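Because each image shares its five-digit index with its annotation file, the two can be paired by file stem. The sketch below is illustrative only: `iter_samples` is a hypothetical helper, and the annotation is returned as parsed JSON without interpreting its fields, since the schema is not described here.

```python
import json
from pathlib import Path


def iter_samples(root):
    """Yield (ffhq_index, image_path, annotation) for each annotated image.

    Assumes every NNNNN.png has a sibling NNNNN.json; images without a JSON
    file are skipped. The annotation is returned as parsed JSON, unmodified.
    """
    pattern = "[0-9][0-9][0-9][0-9][0-9].png"
    for image_path in sorted(Path(root).glob(pattern)):
        json_path = image_path.with_suffix(".json")
        if not json_path.is_file():
            continue
        with json_path.open(encoding="utf-8") as f:
            annotation = json.load(f)
        # The file stem is the index of the source image in FFHQ.
        yield int(image_path.stem), image_path, annotation
```

Returning the FFHQ index as an integer makes it straightforward to look up the corresponding source image in a local copy of FFHQ, if one is available.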

Citation

If you find this project useful, please cite the paper as:

Xiaoyang Huang, Yanjun Wang, Yang Liu, Bingbing Ni, Wenjun Zhang, Jinxian Liu, Teng Li. "AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio". arXiv preprint arXiv:2301.12613, 2023.

or using bibtex:

@article{huang2023audioear,
  title={AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio},
  author={Huang, Xiaoyang and Wang, Yanjun and Liu, Yang and Ni, Bingbing and Zhang, Wenjun and Liu, Jinxian and Li, Teng},
  journal={arXiv preprint arXiv:2301.12613},
  year={2023}
}

License

The dataset is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0).

The code is under Apache-2.0 License.

Mirror Link

We recommend downloading the data from the official Zenodo link. If you run into download problems, you can also use this mirror link from Google Drive.

Changelog

v1.0: Initial repository of AudioEar3D and AudioEar2D.

Files

Files (1.3 GB)

Name	MD5	Size
AudioEar2D.zip	bda8c4b384c75a0a9609eeb12fae1b74	323.4 MB
AudioEar3D.zip	b75a03cd90f82e40d50712e1f2d24d5a	998.7 MB
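The MD5 checksums listed for the two archives can be used to confirm that a download (from Zenodo or from the mirror) completed intact. A minimal sketch using only the Python standard library follows; the function names `md5_of` and `verify` are illustrative, and the local file paths are whatever you saved the archives as.

```python
import hashlib

# Checksums copied from the file listing above.
EXPECTED_MD5 = {
    "AudioEar2D.zip": "bda8c4b384c75a0a9609eeb12fae1b74",
    "AudioEar3D.zip": "b75a03cd90f82e40d50712e1f2d24d5a",
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file at path has the expected MD5 digest."""
    return md5_of(path) == expected.lower()
```

For example, `verify("AudioEar3D.zip", EXPECTED_MD5["AudioEar3D.zip"])` should return `True` for an intact download. Reading in chunks keeps memory use flat for the larger archive.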

Additional details

Is described by: Preprint: https://arxiv.org/abs/2301.12613 (URL)
