Orthography

Validate against: http://json-schema.org/schema#

Schema ID: http://cdn.digitallinguistics.io/schemas/Orthography-1.0.0.json

Description

An orthography is a way of representing a particular language variety in writing. An orthography can include a set of acceptable graphemes (characters), punctuation, and word separators.

Type: object

Required Properties

  • abbreviation
  • graphemes
  • name

Properties

  • Abbreviation: "abbreviation"

    Description

    An abbreviation for this Orthography.

    Must be an instance of the Abbreviation schema.

  • Graphemes: "graphemes"

    Description

    A collection of graphemes, i.e. units of writing within a writing system, for this Orthography. Each grapheme is an attribute, and the value of that attribute is an object containing information about it. Punctuation characters should not be included in this collection (see the punctuation attribute instead).

    Type: array

    Unique items: true

    Items

    Grapheme

    Description

    An object representing a grapheme.

    Type: object

    Required Properties

    • form

    Properties

    • Allographs: "allographs"

      Description

      A list of all the allographs (graphical / writing variants) of the current grapheme. For example, t in an English orthography would have the allograph T.

      Type: array

      Unique items: true

      Items

      Allograph

      Type: string

    • Grapheme Form: "form"

      Description

      The written form of this grapheme. Graphemes should include their diacritics. For example, c and should be considered separate graphemes, unless they are allographs. Do not include graphemes for characters that only occur as part of a digraph (or trigraph, etc.). For example, standard Swahili orthography has a grapheme ch, but no grapheme c by itself, so c would not be listed as a grapheme, only ch.

      Type: string

    • Grapheme Name: "name"

      Description

      The name of this grapheme, if any. The grapheme name can be represented in one or more orthographies (for example, the grapheme a in English could be represented as A in the English orthography itself, or as in an IPA Orthography.)

      Must be an instance of the MultiLangString schema.

    • Pronunciations: "pronunciations"

      Description

      A list of the possible pronunciations of this grapheme. It may be difficult or impossible to fill in this field for all orthographies (like English).

      Type: array

      Unique items: true

      Items

      Pronunciation

      Type: string

  • Orthography Name: "name"

    Description

    The name of this orthography, optionally in multiple languages/orthographies.

    Must be an instance of the MultiLangString schema.

  • Notes: "notes"

    Description

    A collection of notes about this Orthography.

    Type: array

    Unique items: true

    Items

    Note

    Must be an instance of the Note schema.

  • Punctuation: "punctuation"

    Description

    A list of all the punctuation characters in this orthography, including white space and non-breaking characters.

    Type: array

    Unique items: true

    Items

    Punctuation Character

    Type: string

  • Tags: "tags"

    Must be an instance of the Tags schema.