Video/Audio Open Access

The Fharvard corpus

Aubanel, Vincent; Bayard, Clémence; Strauss, Antje; Schwartz, Jean-Luc

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.1462854", 
  "language": "fra", 
  "title": "The Fharvard corpus", 
  "issued": {
    "date-parts": [
  "abstract": "<p>The Fharvard corpus is a collection of 700 sentences in French, phonetically balanced into&nbsp;70&nbsp;lists of 10 sentences each. Each sentence contains 5 keywords for scoring.</p>\n\n<p>The list of&nbsp;sentences is contained in the file <strong>The Fharvard corpus.pdf</strong>&nbsp;with keywords in bold.</p>\n\n<p>The phonetic transcription is provided in <strong>The Fharvard corpus - phonetic.txt</strong>. The <em>ortho</em>&nbsp;column contains the orthographic representation of the sentence with&nbsp;keywords in capital letters. The <em>phono</em>&nbsp;column contains the phonetic representation in SAMPA coding, with words separated by two successive&nbsp;space characters.&nbsp;Note that the phonetic representation is provided on an&nbsp;individual word basis,&nbsp;that is, discarding word-to-word liaisons. This is to provide an unambiguous&nbsp;basis for phonetic balancing at the keyword level, as the realisation of some&nbsp;liaisons can vary from talker to talker.</p>\n\n<p>Audio recordings of the Fharvard sentences spoken by a female and a male talker are contained in the .zip archive files, and available&nbsp;with a 44.1 kHz and 16 kHz sampling rate.</p>\n\n<p>A sample sentence for the female and the male talker is also attached.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>", 
  "author": [
    {
      "family": "Aubanel, Vincent"
    }, 
    {
      "family": "Bayard, Cl\u00e9mence"
    }, 
    {
      "family": "Strauss, Antje"
    }, 
    {
      "family": "Schwartz, Jean-Luc"
    }
  ], 
  "type": "motion_picture", 
  "id": "1462854"
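
The abstract describes two columns in the phonetic transcription file: ortho, where keywords are written in capital letters, and phono, where words are separated by two successive space characters. A minimal Python sketch of how one row might be split is given here. Several things are assumptions and not taken from the record: the function names, the idea that symbols inside a word are separated by single spaces, and the sample sentence with its rough SAMPA transcription, which is made up for illustration and is not a Fharvard sentence. How the two columns are delimited in the file is not stated in the record, so the sketch takes the two fields as already separated strings.

```python
import re


def split_phono(phono: str) -> list[list[str]]:
    """Split a phono field into words, then each word into symbols.

    Words are separated by two (or more) spaces, as the abstract states.
    Splitting each word on single spaces is an assumption about the file.
    """
    words = re.split(r" {2,}", phono.strip())
    return [word.split() for word in words if word]


def find_keywords(ortho: str) -> list[str]:
    """Return the tokens written entirely in capital letters."""
    return [token for token in ortho.split() if token.isupper()]


# Hypothetical row, NOT taken from the corpus.
ortho = "le CHAT NOIR DORT sur le GRAND LIT"
phono = "l @  S a  n w a R  d O R  s y R  l @  g R a~  l i"

words = split_phono(phono)
keywords = find_keywords(ortho)

print(len(words), "words;", len(keywords), "keywords:", keywords)
```

Because the transcription is given word by word, without liaisons, the number of phono words should match the number of ortho tokens whenever tokens are whitespace-delimited; elided forms such as an article joined to a noun by an apostrophe would need extra handling that this sketch does not attempt.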
                  All versions   This version
Views             729            729
Downloads         202            202
Data volume       5.5 GB         5.5 GB
Unique views      674            674
Unique downloads  112            112

