There is a newer version of the record available.

Published March 4, 2024 | Version 1.0.0
Dataset Restricted

NeuroVoz: a Castillian Spanish corpus of parkinsonian speech

Description

The NeuroVoz dataset is a resource for computational linguistics and biomedical research, designed to support the diagnosis and understanding of Parkinson's Disease (PD) through speech analysis. It is the first dataset of its kind to be made publicly available in Castilian Spanish, addressing a gap in the linguistic and dialectal diversity of PD research.

Compiled from a cohort of 108 participants, including 53 individuals diagnosed with PD and 55 healthy controls, the NeuroVoz dataset offers a rich compilation of speech recordings. All PD participants were recorded under medication (ON state), so that the speech samples were collected under consistent conditions. The dataset includes a variety of speech tasks, ranging from sustained vowel phonations and diadochokinetic (DDK) tests to 16 structured listen-and-repeat utterances and spontaneous monologues. The listen-and-repeat tasks are manually transcribed, and the monologues are transcribed automatically with Whisper.

Encompassing 2,903 audio files, the NeuroVoz dataset provides an extensive repository, averaging 26.88 ± 3.35 recordings per participant, making it an invaluable asset for researchers seeking to explore the nuances of PD-affected speech. The dataset's structure and composition facilitate a multifaceted analysis of speech impairments associated with PD, offering insights into phonatory, articulatory, and prosodic changes.
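
As a quick consistency check, the reported mean follows directly from the totals quoted in this description. The sketch below uses only those figures; the variable names are illustrative, and the standard deviation (3.35) cannot be reproduced without the per-participant file counts, which are not listed here.

```python
# Figures quoted in the dataset description.
n_pd = 53              # participants diagnosed with PD
n_controls = 55        # healthy controls
n_participants = 108   # full cohort
n_audio_files = 2903   # total number of audio files

# The two groups should add up to the full cohort.
cohort_matches = (n_pd + n_controls == n_participants)

# Mean number of recordings per participant (reported as 26.88).
mean_recordings = n_audio_files / n_participants

print(cohort_matches)
print(round(mean_recordings, 2))
```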

In contributing to the body of knowledge with the NeuroVoz dataset, we invite the scientific community to engage with this dataset, explore the specific speech characteristics of PD in Castilian Spanish speakers, and advance the field of PD diagnosis through innovative speech analysis techniques.

 

If you use this dataset, please cite both this Zenodo record and the arXiv preprint:

  • arXiv preprint: J. Mendes-Laureano, J. A. Gómez-García, A. Guerrero-López, E. Luque-Buzo, J. D. Arias-Londoño, F. J. Grandas-Pérez, and J. I. Godino-Llorente, “Neurovoz: a castillian spanish corpus of parkinsonian speech,” arXiv preprint arXiv:2403.02371 (2024).
    • Link: https://arxiv.org/abs/2403.02371
  • Zenodo dataset: Mendes-Laureano, J., Gómez-García, J. A., Guerrero-López, A., Luque-Buzo, E., Arias-Londoño, J. D., Grandas-Pérez, F. J., & Godino Llorente, J. I. (2024). NeuroVoz: a Castillian Spanish corpus of parkinsonian speech (1.0.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10777657

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

PLEASE, COPY AND PASTE THE FOLLOWING TEXT FILLED WITH YOUR PERSONAL DATA IN YOUR REQUEST TO ACCESS THE DATASET. 

--------------------------------------------------------------------------------------------------------------------------------------------------

 

This document establishes the conditions under which Mr./Mrs. …….…......................……..….........……...................………….

with ID # ……………..........…. Address ...................................................................................................................................

on behalf of (hereinafter, the “Downloader”): ………………………...…….................................……….....................………......…..

requests the transfer of the Neurovoz speech database recorded by Universidad Politécnica de Madrid and Hospital Gregorio Marañón de Madrid.

The Downloader and the owner of the materials (“User”) accept the following agreement (“Agreement”) on the use of the materials (“Materials”) to be downloaded.

I. Acceptance of this Agreement

By downloading or otherwise accessing the Materials, Downloader represents his/her acceptance of the terms of this Agreement.

II. Modification of this Agreement

User may modify the terms of this Agreement at any time. However, any modifications to this Agreement will only be effective for downloads subsequent to such modification. No modifications will supersede any previous terms that were in effect at the time of the Downloader’s download.

III. Use of the Materials

Use of the Materials includes, but is not limited to, viewing parts or the whole of the content included in the Materials; comparing data or content from the Materials with data or content in other Materials; verifying research results with the content included in the Materials; and extracting and/or appropriating any part of the content included in the Materials for use in other projects, publications, research, or other related work products.

Representations

In Use of the Materials, Downloader represents that:

1.       Downloader is not bound by any pre-existing legal obligations or other applicable laws that prevent Downloader from downloading or using the Materials;

2.       Downloader will not use the Materials in any way prohibited by applicable laws;

3.       Downloader has no knowledge of and will therefore not be responsible for any restrictions regarding the use of Materials beyond what is described in this Agreement, and

4.       Downloader has no knowledge of and will therefore not be responsible for any inaccuracies and any other such problems with regards to the content of the Materials and the accompanying citation information.

Restrictions in his/her Use of the Materials

The download and/or use of the Materials will not involve financial remuneration for the User.

Downloaders cannot:

1.       Obtain information from the Materials that results in Downloader or any third party(ies) directly or indirectly identifying any research subjects with the aid of other information acquired elsewhere;

2.       Produce connections or links among the information included in User’s datasets (including information in the Materials), or between the information included in User’s datasets (including information in the Materials) and other third-party information that could be used to identify any individuals or organizations, not limited to research subjects; and

3.       Extract information from the Materials that could aid Downloader in gaining knowledge about or obtaining any means of contacting any subjects already known to Downloader.

4.       Copy, transfer or distribute the Materials to third parties in any way, with or without modifications, without the written consent of the User.

5.       Use the Materials for commercial purposes in their current form or derived from them without the written consent of the User.

Downloaders must:

1.       Recognize the User as the owner of the database.

2.       The Downloader must reference the corpus in Zenodo:

Mendes-Laureano, J., Gómez-García, J. A., Guerrero-López, A., Luque-Buzo, E., Arias-Londoño, J. D., Grandas-Pérez, F. J., & Godino Llorente, J. I. (2024). NeuroVoz: a Castillian Spanish corpus of parkinsonian speech [Data set]. Zenodo. https://doi.org/10.5281/zenodo.12517368

3.       The Downloader must reference the work that provides a description of the Materials downloaded:

Mendes-Laureano, J., Gómez-García, J. A., Guerrero-López, A., Luque-Buzo, E., Arias-Londoño, J. D., Grandas-Pérez, F. J., & Godino-Llorente, J. I. (2024). NeuroVoz: a Castillian Spanish corpus of parkinsonian speech. arXiv preprint arXiv:2403.02371.

IV. Representations and Warranties

USER REPRESENTS THAT USER HAS ALL RIGHTS REQUIRED TO MAKE AVAILABLE AND DISTRIBUTE THE MATERIALS. EXCEPT FOR SUCH REPRESENTATION, THE MATERIALS ARE PROVIDED “AS IS” AND “AS AVAILABLE” AND WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, NON-INFRINGEMENT, MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE, AND ANY WARRANTIES IMPLIED BY ANY COURSE OF PERFORMANCE OR USAGE OF TRADE, ALL OF WHICH ARE EXPRESSLY DISCLAIMED.

WITHOUT LIMITING THE FOREGOING, USER DOES NOT WARRANT THAT: (A) THE MATERIALS ARE ACCURATE, COMPLETE, RELIABLE OR CORRECT; (B) THE MATERIALS FILES WILL BE SECURE; (C) THE MATERIALS WILL BE AVAILABLE AT ANY PARTICULAR TIME OR LOCATION; (D) ANY DEFECTS OR ERRORS WILL BE CORRECTED; (E) THE MATERIALS AND ACCOMPANYING FILES ARE FREE OF VIRUSES OR OTHER HARMFUL COMPONENTS; OR (F) THE RESULTS OF USING THE MATERIALS WILL MEET DOWNLOADER’S REQUIREMENTS. DOWNLOADER’S USE OF THE MATERIALS IS SOLELY AT DOWNLOADER’S OWN RISK.

V. Limitation of Liability

IN NO EVENT SHALL USER BE LIABLE UNDER CONTRACT, TORT, STRICT LIABILITY, NEGLIGENCE OR ANY OTHER LEGAL THEORY WITH RESPECT TO THE MATERIALS (I) FOR ANY DIRECT DAMAGES, OR (II) FOR ANY LOST PROFITS OR SPECIAL, INDIRECT, INCIDENTAL, PUNITIVE, OR CONSEQUENTIAL DAMAGES OF ANY KIND WHATSOEVER.

VI. Indemnification

Downloader will indemnify and hold User harmless from and against any and all loss, cost, expense, liability, or damage, including, without limitation, all reasonable attorneys’ fees and court costs, arising from (i) Downloader’s misuse of the Materials; (ii) Downloader’s violation of the terms of this Agreement; or (iii) infringement by Downloader or any third party of any intellectual property or other right of any person or entity contained in the Materials. Such losses, costs, expenses, damages, or liabilities shall include, without limitation, all actual, general, special, and consequential damages.

VII. Dispute Resolution

Downloader and User agree that any cause of action arising out of or related to the download or use of the Materials must commence within one (1) year after the cause of action arose; otherwise, such cause of action is permanently barred.

This Agreement shall be governed by and interpreted in accordance with the laws of Spain (excluding the conflict of laws rules thereof). All disputes under this Agreement will be resolved in the applicable courts of Madrid, Spain. Downloader consents to the jurisdiction of such courts and waives any jurisdictional or venue defenses otherwise available.

VIII. Integration and Severability

This Agreement represents the entire agreement between Downloader and User with respect to the downloading and use of the Materials, and supersedes all prior or contemporaneous communications and proposals (whether oral, written or electronic) between Downloader and User with respect to downloading or using the Materials. If any provision of this Agreement is found to be unenforceable or invalid, that provision will be limited or eliminated to the minimum extent necessary so that the Agreement will otherwise remain in full force and effect and enforceable.

IX. Miscellaneous

User may assign, transfer or delegate any of its rights and obligations hereunder without consent. No agency, partnership, joint venture, or employment relationship is created as a result of the Agreement and neither party has any authority of any kind to bind the other in any respect outside of the terms described within this Agreement. In any action or proceeding to enforce rights under the Agreement, the prevailing party will be entitled to recover costs and attorneys’ fees.                   

 

 

For the record and for the appropriate purposes this agreement is signed.

On behalf of the Downloader:                                                                                            Place and Date:

………………………………………..                                                                                          …….……………………………….

Additional details

Funding

Ministry of Economy and Competitiveness of Spain PID2021-128469OB-I00
Agencia Estatal de Investigación
Ministry of Economy and Competitiveness of Spain TED2021-131688B-I00
Agencia Estatal de Investigación
Maria Zambrano 2021
Universidad Politécnica de Madrid
Ministry of Economy and Competitiveness of Spain DPI2017-83405-R1
Agencia Estatal de Investigación