Dataset Restricted Access
McCool, Chris; Marcel, Sébastien; Hadid, Abdenour; Pietikäinen, Matti; Matějka, Pavel; Černocký, Jan; Poh, Norman; Kittler, Josef; Larcher, Anthony; Lévy, Christophe; Matrouf, Driss; Bonastre, Jean-François; Tresadern, Phil; Cootes, Timothy
MOBIO is a dataset for mobile face and speaker recognition. The dataset consists of bi-modal (audio and video) data taken from 152 people. The dataset has a female-male ratio of nearly 1:2 (100 males and 52 females) and was collected from August 2008 until July 2010 in six different sites from five different countries. This led to a diverse bi-modal dataset with both native and non-native English speakers.
In total 12 sessions were captured for each client: 6 sessions for Phase I and 6 sessions for Phase II. The Phase I data consists of 21 questions with the question types ranging from: Short Response Questions, Short Response Free Speech, Set Speech, and Free Speech. The Phase II data consists of 11 questions with the question types ranging from: Short Response Questions, Set Speech, and Free Speech. A more detailed description of the questions asked of the clients is provided below.
The database was recorded using two mobile devices: a mobile phone and a laptop computer. The mobile phone used to capture the database was a NOKIA N93i mobile while the laptop computer was a standard 2008 MacBook. The laptop was only used to capture part of the first session, this first session consists of data captured on both the laptop and the mobile phone.
Detailed Description of Questions
Please note that the answers to the Short Response Free Speech and Free Speech questions DO NOT necessarily relate to the question as the sole purpose is to have the subject speaking free speech, therefore, the answers to ALL of these questions are assumed to be false.
1. Short Response Questions
The short response questions consisted of five pre-defined questions, which were:
2. Short Response Free Speech
There were five random questions taken form a list of 30-40 questions. The user had to answer these questions by speaking for approximately 5 seconds of recording (sometimes more and sometimes less).
3. Set Speech
The users were asked to read pre-defined text out aloud. This text was designed to take longer than 10 seconds to utter and the participants were allowed to correct themselves while reading these paragraphs.
The text that was read was:
I have signed the MOBIO consent form and I understand that my biometric data is being captured for a database that might be made publicly available for research purposes.
I understand that I am solely responsible for the content of my statements and my behaviour.
I will ensure that when answering a question I do not provide any personal information in response to any question.
4. Free Speech
The free speech session consisted of 10 random questions from a list of approximately 30 questions. The answers to each of these questions took approximately 10 seconds (sometimes less and sometimes more).
Elie Khoury, Laurent El-Shafey, Christopher McCool, Manuel Günther, Sébastien Marcel, “Bi-modal biometric authentication on mobile phones in challenging conditions”, Image and Vision Computing Volume 32, Issue 12, 2014.
You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.
Access to the dataset is based on an End-User License Agreement. The use of the dataset is strictly restricted to non-commercial research.
Please provide us the following information about the authorized signatory (MUST hold a permanent position):
The requester must use their personal valid email address from the same organization as the signatory to contact us.