This is an updated version of the csv file for this repository containing BPC's and metadata from SAAo, dated to February 13, 2024. Description of file "BPC_saa_01_05_10_13_15_16_17_18_19_21_13022024.csv": Lists BPC's found in SAA letter corpus along with metadata taken from Oracc. Note (2/13/2024): Manual annotations now available for all letter volumes: SAA 01,05,10,13,15,16,17,18,19,21. Certain errors removed from previous versions, including small numbers of bad analyses. Overall search results more accurate since theoretically all letter volumes now have gold standard annotations instead of some volumes having only automated ones. Some manually analyses of BPC metaphors from previous versions not continued in this version Metadata fields for sender, receiver, dialect, and a few others standardized in the sense that SAA volumes which did not originally specify values for these categories (e.g. SAA 10) now have '(unknown)' instead of blank cell. Note that for SAA 10 and SAA 13 the receiver is always the king, even as the field for receiver is still set to (unknown) since the this field is not specified in the Oracc metadata, or the exact formula in the text referring to the king is not known (due to breakage in tablet). Explanation of various header fields in the CSV file: The fields assume a BPC with components [Verb], [Base Preposition], [Body Part Term], and optionally [Direct Object] and [Oblique Object]. These components are labelled in the headers as follows: Verb - tword Base Preposition - rword Body Part Term - sword Direct Object - doword Oblique Object - ooword The lemmas for each of these component words are labelled with fields like those above but with 'lemma' substituted for 'word'. E.g. tlemma, rlemma, slemma, etc. Other explanations of fields: date - Estimated date of composition of letter according to editors of the relevant SAA volume dialect - Dialect of Akkadian the letter is written in script - Script of Akkadian the letter is written in designation - Volume and letter number the attestation appears in ruler - Assyrian king under which letter was written, if known etype - Syntactic dependency type (Indirect object or Oblique object) relating compound preposition to verb pcom - url of the letter that the attestation appears in PP_type - Whether the compound preposition expresses a required, external argument of the verb (EA) or is an optional adjunct (AJ) Translation - optional translation of BPC in context Speaker - Identity of speaker of sentence containing the BPC, which may on occasion be different from the letter's author due to quoted speech. Note that this field and the one below were checked only for a portion of the BPC's. Addressee - Identity of addressee of sentence containing the BPC Description of files of the form "akk-mcong-ud*": These are the conllu and ttl files associated with the SAA volumes. Description of "sparql_Cxn_PRP_Bodypart_X.txt": Contains the sparql query used to find BCP's in the ttl files. Known to work on TriplyDB. Note (2/13/2024): Additional code added to search query to list explicitly direct object and oblique object of BPC. Description of "ak_norm_conllu.zip": Contains conllu files used to train the spaCy language model, including training, development, and test data Description of "ak_norm_spacy.zip": Contains files used to train the language model as in "ak_norm_conllu.zip", converted to spaCy's required binary format Description of "ak_AkkParser_Norm_1_2_5_8_9_10_13_15_16_17_18_19_21_anzu_barutu_rinap4_tcmaassur-0.0.0.tar.gz": Contains files defining the spaCy language model used in this project.