Word

Validate against: http://json-schema.org/schema

Description

A Word object represents a specific token in a text, rather than an abstract lexeme (see the Lexeme object for that).

Type: object

Required Properties

  • transcription

Additional properties: true

Properties

  • Type: "type"

    Description

    The type of object. Must be set to Word.

    Type: string

  • End Time: "endTime"

    Description

    The point in time in the text at which the word token stop being spoken, formatted as seconds and milliseconds (SS.MMM). This property is useful for phonetic data analysis.

    Type: number

    Minimum: 0.001

  • Gloss: "gloss"

    Description

    A Leipzig-style gloss for this Word. This does not need to be supplied if it can be generated automatically from the glosses at the morpheme level. May be in multiple languages.

    Must be an instance of the MultiLangString schema.

  • Key: "key"

    Description

    A key that uniquely identifies this word token within the Text. The key for a Word consists of the abbreviation of the text, a period, the number of Utterance within the text, another period, and then the number of this word within the Utterance (indexing starts at 1). For example, for the fourth word of the third Utterance of a text with the abbreviation A, the key would be A.3.4. Keys should be unique within a corpus.

    Type: string

    Regular expression pattern: ^[(a-z)|(A-Z)|(0-9)]+\.[0-9]{1,3}\.[0-9]{1,2}$

  • Morphemes / Components: "morphemes"

    Description

    A collection of the components (morphemes, lexemes) in this Word. Components may be repeated if the same morpheme appears twice in the Word.

    Type: array

    Unique items: false

    Items

    Morpheme

    Description

    An object representing this morpheme. This is not an object representing a lexeme (the abstract type). It represents the specific morpheme token (the particular instance in speech) instead.

    Must be an instance of the Morpheme schema.

  • Notes: "notes"

    Description

    A collection of notes about this token

    Type: array

    Unique items: true

    Items

    Note

    Description

    A Note about this token

    Must be an instance of the Note schema.

  • Phonemes: "phonemes"

    Description

    An array of the phones (phoneme tokens) in this word, in IPA. Each phone should appear in order, and does not have to be unique.

    Type: array

    Unique items: false

    Items

    Phone (Phoneme Token)

    Description

    An IPA representation of this phoneme. The phoneme field is required, but the allophone field is optional, for use only when you need to specify that this phoneme is being realized as a particular phoneme. Both phoneme and allophone must be in IPA rather than a practical orthography.

    Type: object

    Required Properties

    • phoneme

    Additional properties: true

    Properties

    • Allophone: "allophone"

      Description

      This property may be used to specify which allophone this particular phoneme token is. Must be written in IPA.

      Type: string

    • End Time: "endTime"

      Description

      The point in time in the text at which this phone stops being pronounced. This property is primarily useful for phonetic data analysis.

      Type: number

      Minimum: 0.001

    • Notes: "notes"

      Description

      A collection of notes about this phone

      Type: array

      Unique items: true

      Items

      Note

      Description

      A note about this phone. Useful for one-time observations of interesting allophonic variation in the data.

      Must be an instance of the Note schema.

    • Phoneme: "phoneme"

      Description

      An IPA representation of this phoneme token. Must be a representation of the abstract phoneme rather than the particular allophone (use the allophone property for that insetad).

      Type: string

    • Start Time: "startTime"

      Description

      The point in time in the text at which this phone starts being pronounced. This property is primarily useful for phonetic data analysis. The endTime property is not necessarily required if the startTime property is present (this is useful when the endTime of one phone is the same as the startTime of the next).

      Type: number

    • Tags: "tags"

      Description

      A set of tags for this phone

      Must be an instance of the Tags schema.

  • Start Time: "startTime"

    Description

    The point in time in the text at which this word token starts being uttered, formatted as seconds and milliseconds (SS.MMM). This property is primarily useful for phonetic data analysis. The endTime property is not necessarily required if the startTime property is present (this is useful when the endTime of one word token is the same as the startTime of the next).

    Type: number

  • Tags: "tags"

    Description

    A set of tags for this word token

    Must be an instance of the Tags schema.

  • Transcription: "transcription"

    Description

    A transcription of this word token, optionally in multiple orthographies. A transcription must be provided in at least one orthography.

    Must be an instance of the Transcription schema.

  • Translation: "translation"

    Description

    A translation of this word, optionally in multiple languages. Note that the translation is not the same as a gloss. Use the translation field for literal or free, natural language translations of a word; use the gloss field for Leipzig-style glosses only.

    Must be an instance of the Translation schema.