Published November 28, 2011 | Version v1
Journal article Open

XML schemas and mark-up practices of taxonomic literature

  • 1. Institute of Biodiversity & Ecosystem Research, Bulgarian Academy of Sciences and Pensoft Publishers, Sofia, Bulgaria
  • 2. Natural History Museum, London, , United Kingdom
  • 3. Smithsonian Institution, Washington, United States of America
  • 4. The Open University, Milton Keynes, United Kingdom
  • 5. IPD Böhm, Karlsruhe Institute of Technology, Karlsruhe, Germany
  • 6. Pensoft Publishers, Sofia, Bulgaria
  • 7. University of Massachusetts, Boston, ,
  • 8. Columbia University, New York, United States of America
  • 9. www.plazi.org, Bern, Switzerland

Description

We review the three most widely used XML schemas used to mark-up taxonomic texts, TaxonX, TaxPub and taXMLit. These are described from the viewpoint of their development history, current status, implementation, and use cases. The concept of “taxon treatment” from the viewpoint of taxonomy mark-up into XML is discussed. TaxonX and taXMLit are primarily designed for legacy literature, the former being more lightweight and with a focus on recovery of taxon treatments, the latter providing a much more detailed set of tags to facilitate data extraction and analysis. TaxPub is an extension of the National Library of Medicine Document Type Definition (NLM DTD) for taxonomy focussed on layout and recovery and, as such, is best suited for mark-up of new publications and their archiving in PubMedCentral. All three schemas have their advantages and shortcomings and can be used for different purposes.

Files

ZK_article_3038.pdf

Files (2.5 MB)

Name Size Download all
md5:ca13fdaf338a597817f37fac606f5512
2.4 MB Preview Download
md5:7ea9c9f50ae8b8f5d18855c6ede71c8d
136.0 kB Preview Download

Linked records