Dataset Open Access

Polynesian Segmented Data

Walworth, Mary


JSON-LD (schema.org) Export

{
  "@context": "https://schema.org/", 
  "@id": "https://doi.org/10.5281/zenodo.1689909", 
  "@type": "Dataset", 
  "creator": [
    {
      "@type": "Person", 
      "affiliation": "Max Planck Institute for the Science of Human History", 
      "name": "Walworth, Mary"
    }
  ], 
  "datePublished": "2018-11-29", 
  "description": "<p>Presented here are segmented data from 210 basic vocabulary concepts in 31 Polynesian languages. This data was extracted from the Austronesian Basic Vocabulary Database (Greenhill et al 2008) and then verified and edited for accuracy, relying on both my knowledge of Polynesian languages and existing source material from published dictionaries and grammars (see references below). Once the data was verified, the numerous transcription conventions used in the original data were converted into consistent IPA forms using existing phonetic and phonological descriptions of these languages. Once the data were in a consistent and reliable format, I went through each of the 210 concepts and segmented the lexical items into morphemes using EDICTOR (http://edictor.digling.org). The purpose of the segmentation task was to isolate base forms by indicating affixes, reduplication, and compounded words, in order to optimise cognate detection. This process also resulted in the addition of partial cognates; resulting in a richer, more comprehensive data set.</p>\n\n<p>Language codes in the dataset refer to the following languages:</p>\n\n<p>Anuta_253&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Anuta</p>\n\n<p>Austral_1213&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Reo Ra&rsquo;ivavae</p>\n\n<p>Austral_128&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Rurutuan</p>\n\n<p>EastFutuna_210&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; East_Futunan</p>\n\n<p>Emae_1030&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Emae</p>\n\n<p>FutunaAniwa_156&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Futuna-Aniwa</p>\n\n<p>Hawaiian_52&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Hawaiian</p>\n\n<p>Kapingamarangi_217&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Kapingamarangi</p>\n\n<p>Luangiua_238&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Luangiua</p>\n\n<p>Mangareva_239&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Reo Mangareva</p>\n\n<p>Maori_85&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Maori</p>\n\n<p>Mele-Fila_1163&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Mele_Fila</p>\n\n<p>Niuean_247&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Niuean</p>\n\n<p>NorthMarquesan_38&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Marquesan</p>\n\n<p>Nukuria_1212&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Nukeria</p>\n\n<p>Penrhyn_235&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Penrhyn</p>\n\n<p>Polynesian_658&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Proto-Polynesian</p>\n\n<p>Pukapuka_152&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Pukapuka</p>\n\n<p>RakahangaManihiki_589 Rakahanga-Manihiki</p>\n\n<p>Rapanui_264&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Rapanui</p>\n\n<p>Rarotongan_58&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Rarotongan</p>\n\n<p>RennellBellona_206&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Rennell-Bellona</p>\n\n<p>Samoan_118&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Samoan</p>\n\n<p>Sikaiana_243&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Sikaiana</p>\n\n<p>Tahitian_173&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Tahitian</p>\n\n<p>Tikopia_155&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Tikopia</p>\n\n<p>TongaTongaIslands_136 Tongan</p>\n\n<p>Tuamotuan_246&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Tuamotuan</p>\n\n<p>Tuvalu_753&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Tuvalu</p>\n\n<p>VaeakauTaumako_375&nbsp;&nbsp;&nbsp; Vaeakau Taumako</p>\n\n<p>Wallisian_258&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Wallisian</p>\n\n<p>&nbsp;</p>\n\n<p>References:</p>\n\n<p>(2017). Dictionary of Cook Island Languages. http://cookislandsdictionary.com/</p>\n\n<p>(2017). Fare V\u0101na&rsquo;a Dictionary. http://www.farevanaa.pf/dictionnaire.php</p>\n\n<p>(2017). n\u0101 puke wehewehe \u02bb\u014dlelo Hawai&lsquo;i. http://wehewehe.org/</p>\n\n<p>Buse, J. (1996). Cook Islands Maori dictionary with English-Cook Islands Maori finderlist. Canberra, Pacific Linguistics.</p>\n\n<p>Capell, A. (1962). The Polynesian Language of Mae (Emwae), New Hebrides. Auckland, Linguistic Society of New Zealand.</p>\n\n<p>Churchill, W. (1912). Easter Island: the Rapanui speech and the peopling of southeast Polynesia. Washington, DC, The Carnegie Institution of Washington.</p>\n\n<p>Clark, R. (1998). A dictionary of the Mele language (Atara Imere), Vanuatu. Canberra, Pacific Linguistics.</p>\n\n<p>Donner, W. (1987) Sikaiana Vocabulary: Na male ma na talatala o Sikaiana. Honiara, Solomon Island</p>\n\n<p>Dordillon, M. R. I. (1931-32). Grammaire et Dictionnaire de la Langue des Iles Marquises. Paris, Institut d&#39;Ethnologie.</p>\n\n<p>Dougherty, J. W. D. (1983). West Futuna-Aniwa: An Introduction to a Polynesian Outlier Language. Berkeley, University of California Press.</p>\n\n<p>Elbert, S. H. (1975-81). Dictionary of the Language of Rennell and Bellona. Copenhagen, The National Museum of Denmark.</p>\n\n<p>Feinberg, R. (1977). The Anutan language reconsidered: Lexicon and grammar of a Polynesian Outlier. New Haven, Human Relations Area Files Press.</p>\n\n<p>Firth, R. (1985). Tikopia-English Dictionary/Taranga Fakatikopia ma Taranga Fakainglisi. Auckland, Auckland University Press</p>\n\n<p>Greenhill S.J. &amp; R. Clark. (2011).&nbsp;<a href=\"http://simon.net.nz/files//2012/12/Greenhill_and_Clark2011.pdf\">POLLEX-Online: The Polynesian Lexicon Project Online.&nbsp;</a><a href=\"http://muse.jhu.edu/login?auth=0&amp;type=summary&amp;url=/journals/oceanic_linguistics/v050/50.2.greenhill.html\">Oceanic Linguistics</a>, 50(2), 551-559.</p>\n\n<p>Greenhill, S.J., Blust. R, &amp; Gray, R.D. (2008).&nbsp;<a href=\"https://abvd.shh.mpg.de/publications/index.php?pub=Greenhill_et_al2008\">The Austronesian Basic Vocabulary Database: From Bioinformatics to Lexomics</a>. Evolutionary Bioinformatics, 4:271-283.</p>\n\n<p>Hollyman, K. J. (1987). De Muna Fagauvea I: Dictionnaire fagauvea-fran&ccedil;ais. Auckland, Linguistic Society of New Zealand.</p>\n\n<p>Jackson, G. (2001). Tuvaluan Dictionary. Oceania Printers. Suva, Fiji.</p>\n\n<p>Janeau, V.F. (1908). Essai de grammaire de la langue des &icirc;les Gambier ou Mangar&eacute;va. Paris, Chadenat.</p>\n\n<p>Kieviet, P. (2017): A Grammar of Rapa Nui. Berlin, Language Science Press.</p>\n\n<p>Lieber, M. D. and K. H. Dikepa (1974). Kapingamarangi Lexicon. Honolulu, University Press of Hawaii.</p>\n\n<p>Milner, G. B. (1966). Samoan Dictionary. London, Oxford University Press.</p>\n\n<p>Moorfield, J.C. (2017). Te Aka Online M\u0101ori Dictionary, https://maoridictionary.co.nz.</p>\n\n<p>Moyse-Faurie, C. (1993). Dictionnaire futunien-fran&ccedil;ais. Paris, Peeters.</p>\n\n<p>N&aelig;ss, A. &amp; E. Hovdhaugen. (2011). A Grammar of Vaeakau-Taumako. Berlin, de Gruyter Mouton.</p>\n\n<p>Pukui, M. K. and S. H. Elbert (1986). Hawaiian Dictionary. Honolulu, University of Hawaii Press</p>\n\n<p>Salisbury, K. (2017). Personal communication on transcription.</p>\n\n<p>Sperlich,W. (1997). Tohi Vagahai Niue. Honolulu, University of Hawai&lsquo;i Press.</p>\n\n<p>Stimson, J. F. (1964). A Dictionary of Some Tuamotuan Dialects of the Polynesian Language. The Hague, Martinus Nijhoff.</p>\n\n<p>Tregear, E. (1899). A Dictionary of Mangareva (or Gambier Islands). Wellington, Government Printer.</p>\n\n<p>Walworth, M. Unpublished fieldnotes &nbsp;on Reo Mangareva, 2015-2017</p>\n\n<p>Walworth, M. Unpublished fieldnotes on Rurutuan, 2010.</p>\n\n<p>Walworth, M. Unpublished fieldnotes on Reo Ra&#39;ivavae,&nbsp;2010-2012.</p>\n\n<p>Weber, R.L. &amp; N. Weber. (1995). Rapanui, in Darrell T.Tryon (ed), Comparative Austronesian Dictionary. Berlin, Mouton de Gruyter.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>", 
  "distribution": [
    {
      "@type": "DataDownload", 
      "contentUrl": "https://zenodo.org/api/files/792bb21b-70a0-4baa-986e-9f45c39bce55/polynesian-aligned_22112018.tsv", 
      "fileFormat": "tsv"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.1689909", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "name": "Polynesian Segmented Data", 
  "url": "https://zenodo.org/record/1689909", 
  "version": "1"
}
351
29
views
downloads
All versions This version
Views 351351
Downloads 2929
Data volume 59.4 MB59.4 MB
Unique views 298298
Unique downloads 2727

Share

Cite as