Lexeme Form

Validate against: http://json-schema.org/schema#

Description

The base forms of a lexeme are the minimal set of forms needed to determine the full set of inflectional possibilities of a lexeme. These include suppletive forms, irregular forms, or morphologically-conditioned forms. For example, the lexeme be has the base forms be, am, is, etc., while the lexeme man has the base forms man and men. A base form does not refer to a regularly-inflected, predictable form like being or cats. Principal parts of verbs in Latin are another example of base forms, since they are the minimal set of forms that someone must know to determine all the inflectional possibilities of a verb.

Type: object

Required Properties

  • transcription

Additional properties: true

Dependencies

  • If the property variantType is present, the following properties must also be present:

    • variantOf

Properties

  • Allomorphs: "allomorphs"

    Description

    A list of allomorphs (that is, phonologically-conditioned alternants) of this lexeme

    Type: array

    Unique items: true

    Items

    Allomorph

    Description

    An allomorph of this lexeme

    Type: object

    Required Properties

    • environments
    • transcription

    Additional properties: true

    Properties

    • Environments: "environments"

      Description

      A list of phonological environments in which this allomorph occurs. May be an empty array.

      Type: array

      Unique items: true

      Items

      Environment

      Description

      A formalization of a (morpho)phonologial environemnt, e.g. _k

      Type: string

      Min length: 1

    • Syllable Structure: "syllableStructure"

      Description

      An abstract representation of the syllable structure of this allomorph, e.g. CVC

      Type: string

    • Tone: "tone"

      Description

      An abstract representation of the tonal pattern of this allomorph. Examples: HLH, 313, ˦˨˦ etc.

      Type: string

    • Transcription: "transcription"

      Description

      A transcription of this allomorph, optionally in multiple orthographies. Do not include any leading or trailing tokens (e.g. hyphens, equal signs).

      Must be an instance of the Transcription schema.

  • Components: "components"

    Description

    A list of the morphemes or other lexemes contained within the current form. For example, the form gentlemen in an English lexicon might have references to the lexeme gentle, and the form men within the lexeme for man. Components may reference either an entire lexeme or a specific form. Components do not have to be unique (useful when the same morpheme appears twice in a word).

    Type: array

    Unique items: false

    Items

    Lexeme / Lexeme Form (Database Reference)

    Description

    A database reference to a lexeme or lexeme form

    Must be an instance of the DatabaseReference schema.

  • Examples: "examples"

    Description

    A collection of examples illustrating this lexeme form in use. Each example is an Utterance from a Text. The Utterance number should be indicated in the index field of the Database Reference object. If using a full Utterance object rather than a Database Reference object, the key field should be included. For precision's sake, it is recommended that examples be given for individual forms rather than the entire lexeme when possible.

    Type: array

    Unique items: true

    Items

    Example Utterance (Database Reference)

    Description

    A database reference to an Utterance object

    Must be an instance of the DatabaseReference schema.

  • Features: "features"

    Description

    A set of inflectional features for this lexeme form (used primarily with grammatical morphemes). Each property should be the name of a feature type (e.g. case, person, number, gender, nounClass, etc.), and its value should be the value for that feature, as a string (e.g. nominative, 1, singular, masculine, etc.). Features may be written more than once, in different languages. For example, a morpheme may have the feature case: accusative (English) as well as caso: acusativo (Spanish).

    Type: object

    Validates Against (allOf)

    This schema must validate against all of the following schemas:

    • Tags

      Description

      The Features object must be a Tags object

      Must be an instance of the schema.

  • Inflectional Class: "inflectionClass"

    Description

    If this lexeme is a root or stem, this field indicates the inflectional class that the sense takes. If this lexeme is an inflectional morpheme, this field indicates the inflectional class that the morpheme belongs to. If this lexeme is a derivational morpheme, this field indicates the inflectional class of the derived form. May be written in multiple languages.

    Must be an instance of the MultiLangString schema.

  • Link: "link"

    Description

    A URL where a presentational format for this lexeme form may be viewed

    Type: string

    Format: uri

  • Media: "media"

    Description

    Media items associated with this lexeme form, such as recordings of this form being pronounced. When a media item pertains to a specific lexeme form, it should be placed in that form's media field rather than the lexeme's.

    Type: array

    Unique items: true

    Items

    Media Item (Database Reference)

    Description

    A database reference to a media item associated with this lexeme form

    Must be an instance of the DatabaseReference schema.

  • Morpheme Type: "morphemeType"

    Description

    The type of morpheme or complex construction that this lexeme is, optionally in multiple languages. Examples: root, stem, bipartite stem, enclitic, prefix, inflected word, phrase, circumfix, compound, complex, ideophonic. Typically, all the forms of a lexeme will have the same morpheme type, but occasionally they differ (e.g. independent vs. cliticized forms of a word, such as are vs. ='re in English).

    Must be an instance of the MultiLangString schema.

  • Notes: "notes"

    Description

    A collection of notes about this lexeme form. Each Note object must have its noteType property specified. Notes with a note type of private are not intended for publication in dictionaries, while other types of notes are. For precision's sake, it is recommended that notes be attached to specific forms rather than the lexeme whenever possible.

    Type: array

    Unique items: true

    Items

    Validates Against (allOf)

    This schema must validate against all of the following schemas:

    • Note

      Description

      A note about this lexeme form

      Must be an instance of the schema.

    • Required Properties

      • noteType

      Properties

      • Note Type

        Description

        The type of note about this lexeme form

        Type: string

        Allowed Values (enum)

        • private
        • general
        • anthropology
        • discourse
        • encyclopedic
        • grammar
        • phonology
        • semantics
        • sociocultural
  • Bibliographic References: "references"

    Description

    A collection of bibliographic references relating to this lexeme form. For example, a particular form may have been discussed in detail in a published article.

    Type: array

    Unique items: true

    Items

    Reference

    Description

    A bibliographic Reference about this lexeme form

    Must be an instance of the BibliographicReference schema.

  • Sources: "sources"

    Description

    A list of attested sources for this lexeme form, such as a citation to a published text where it appears, the key of an Utterance in the database, or the initials of the speaker who provided it. For precision's sake, sources should be listed for specific forms of a lexeme rather than the lexeme whenever possible.

    Type: array

    Unique items: true

    Items

    Source

    Description

    An attested source for this lexeme form. This will often be the initials of a speaker, but could also be the abbreviation of the story the sense was found it, or a citation to a published text in which the form appears.

    Type: string

    Min length: 1

  • Syllable Structure: "syllableStructure"

    Description

    An abstract representation of the syllable structure of this form, e.g. CVC

    Type: string

  • Tags: "tags"

    Description

    A set of tags for this lexeme form

    Must be an instance of the Tags schema.

  • Tone: "tone"

    Description

    An abstract representation of the tonal pattern of this lexeme form. Examples: HLH, 313, ˦˨˦ etc.

    Type: string

  • Transcription: "transcription"

    Description

    A transcription of this lexeme form, optionally in multiple orthographies

    Must be an instance of the Transcription schema.

  • Variant Of: "variantOf"

    Description

    If this lexeme form is a variant of another form, a reference to the other form should go here. For example, some speakers of English have hanged as the past tense of hang, while others have hung.

    Must be an instance of the DatabaseReference schema.

  • Variants: "variants"

    Description

    A list of variants of this this form. This field should be used for dialectal and idiolectal variants, rapid and careful speech variants, register-based variants, spelling variants, etc. It should not be used for phonologically-conditioned variants (use the allomorphs field of a specific form instead). Each variant should have its variantType property specified.

    Type: array

    Unique items: true

    Items

    Validates Against (allOf)

    This schema must validate against all of the following schemas:

    • Variant (Database Reference)

      Description

      A database reference to a variant of this form. Note: The Database Reference object must have a variantType property, indicating the type of variant.

      Must be an instance of the schema.

    • Required Properties

      • variantType

      Properties

      • Variant Type

        Description

        This field is be used to specify the type of variant. Possible values might be a person’s name (representing an idiolectal variant), or simply idiolectal, or dialectal (or the name of the dialect), or rapid speech, etc. May be in multiple languages.

        Must be an instance of the schema.

  • Variant Type: "variantType"

    Description

    If this form is a variant of another lexeme form, this field can be used to specify the type of variant. Possible values might be a person’s name (representing an idiolectal variant), or simply idiolectal, or dialectal (or the name of the dialect), or rapid speech, etc. Optionally in multiple languages.

    Must be an instance of the MultiLangString schema.