Published January 10, 2020 | Version 1.1
Other Open

Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network - dataset & code

  • 1. Department of Computer Science, Trier University of Applied Sciences, Schneidershof, 54293 Trier, Germany
  • 2. Department of Otorhinolaryngology and Head and Neck Surgery, University of Munich, Campus Grosshadern, Marchioninistr. 13, 81366 Munich, Germany
  • 3. Department of Otorhinolaryngology, Saarland University Hospital, Kirrbergerstr. 100, 66424 Homburg/Saar, Germany

Description

This repository is associated with the manuscript "Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network", where we used a deep Convolutional Neural Network (CNN) approach for the first time to fully automatically segment not only the time-varying glottal area but also the vocal fold tissue directly from laryngeal HS video. The approach was developed and intensely evaluated on a dataset comprising 130 HS-sequences (13,000 HS video frames in total) obtained from healthy as well as pathologic subjects.

Here, we provide the used dataset, the code along with the best performing Neural Network, and scripts to evaluate the segmentation performance.

This work was supported by the German Research Foundation (DFG), LO-1413/2-2. Computational resources were provided by the High Performance Compute Cluster 'Elwetritsch' at the University of Kaiserslautern, which is part of the 'Alliance of High Performance Computing Rheinland-Pfalz' (AHRP). We kindly acknowledge the support.

Notes

version 1.1: updated image data due to some corrupted files

Files

version_1-1.zip

Files (891.4 MB)

Name Size Download all
md5:da656e0509dfc6f5e98c2745708b2ca3
891.4 MB Preview Download

Additional details

Related works

Is cited by
Journal article: 10.1371/journal.pone.0227791 (DOI)