{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2015-06-25T15:04:02+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "qdap-qdapVersion2.1.0.zip": { "checksum": "md5:24b1ec10aa2a54b5dcf91f82315af9a9", "ext": "zip", "id": "8aa8fd88-b988-4662-8ae2-9c30934c90f1", "key": "qdap-qdapVersion2.1.0.zip", "metadata": null, "mimetype": "application/zip", "size": 9808555 } }, "order": [], "total_bytes": 9808555 }, "id": "11124", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/11124/access", "access_links": "https://zenodo.org/api/records/11124/access/links", "access_request": "https://zenodo.org/api/records/11124/access/request", "access_users": "https://zenodo.org/api/records/11124/access/users", "archive": "https://zenodo.org/api/records/11124/files-archive", "archive_media": "https://zenodo.org/api/records/11124/media-files-archive", "communities": "https://zenodo.org/api/records/11124/communities", "communities-suggestions": "https://zenodo.org/api/records/11124/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.11124", "draft": "https://zenodo.org/api/records/11124/draft", "files": "https://zenodo.org/api/records/11124/files", "latest": "https://zenodo.org/api/records/11124/versions/latest", "latest_html": "https://zenodo.org/records/11124/latest", "media_files": "https://zenodo.org/api/records/11124/media-files", "parent": "https://zenodo.org/api/records/592474", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.592474", "parent_html": "https://zenodo.org/records/592474", "requests": "https://zenodo.org/api/records/11124/requests", "reserve_doi": "https://zenodo.org/api/records/11124/draft/pids/doi", "self": "https://zenodo.org/api/records/11124", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.11124", "self_html": 
"https://zenodo.org/records/11124", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:11124/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:11124/sequence/default", "versions": "https://zenodo.org/api/records/11124/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "affiliations": [ { "name": "University at Buffalo" } ], "person_or_org": { "family_name": "Tyler Rinker", "name": "Tyler Rinker", "type": "personal" } }, { "affiliations": [ { "name": "Newcastle University" } ], "person_or_org": { "family_name": "Colin Gillespie", "name": "Colin Gillespie", "type": "personal" } }, { "affiliations": [ { "name": "Google" } ], "person_or_org": { "family_name": "Craig Citro", "name": "Craig Citro", "type": "personal" } } ], "description": "\nCHANGES IN qdap VERSION 2.1.1\n\n
BUG FIXES
\n\nsyllable_count returned the sentence (recycled) in the words column of the\noutput. This behavior has been fixed. See GitHub issue #188 for details.
syn returned antonyms for some words. This was caused by the dictionary:\nqdapDictionaries::key.syn contained antonyms and elements that were error \nmessages (character). This has been fixed. Reference issue #190. (Jingjing Zou)
The pres_debates2012 data set contained three errors in speech attribution.\nThese have been corrected, along with the turn of talk (tot).
word_stats would throw an error if no poly-syllable words existed. This has \nbeen corrected (reported by Nicolas Turenne).
NEW FEATURES
\n\nqdap_df and %&% added to mimic some of the functionality of dplyr's \ntbl_df and chaining pipe in a more specific, less flexible, qdap oriented \nway.
Text added to view and change the text.var attribute of a data.frame of \nthe class qdap_df.
cumulative generic method added to view cumulative scores over time.
formality picks up a cumulative method.
polarity picks up a cumulative method.
end_mark picks up a class (end_mark), plot method, and a cumulative \nmethod.
syllable_sum, polysyllable_sum, and combo_syllable_sum pick up a \nclass, plot method, and a cumulative method.
wfm becomes a generic method currently applied to a text.var that is:\ncharacter, factor (coerced to character), or wfdf.
unbag added as a complement to bag_o_words and friends for undoing string \nsplitting. A convenience wrapper for paste(collapse = " ").
as.Corpus.TermDocumentMatrix, as.Corpus.DocumentTermMatrix, and \nas.Corpus.wfm added to convert a matrix format to a tm::Corpus.
exclude becomes a generic method for various classes. Functionality is the \nsame but with improved code readability.
check_spelling_interactive, check_spelling, which_misspelled, and \ncorrect allow the user to identify potentially misspelled words and \noptionally suggest replacements.
random_data & random_sent added to generate random sentence data sets and \nvectors.
comma_spacer added to ensure strings with commas contain a space after them.
check_text added to identify potential problems in text.
replace_ordinal added to convert ordinal representations of 1 through 100 to\nstrictly ordinal text (e.g., "1st" becomes "first").
A vignette: Cleaning Text & Debugging was added to assist users with \ncleaning and debugging problems in qdap.
pronoun_type, subject_pronoun_type, and object_pronoun_type added to \nexamine usage of subject/object pronouns by grouping variable.
MINOR FEATURES
\n\nIMPROVEMENTS
\n\nwfm gains a speedup through generic classes and tm package integration \n(strip is no longer used in wfm).
as.tdm.character and as.dtm.character gain a speed boost with a tm \npackage integration.
Added message to as.data.frame.Corpus for missing end-marks suggesting the \nuse of: sent.split = FALSE.
as.Corpus family of functions didn't necessarily respect document names and\nsometimes used a numeric sequence instead. The introduction of a reader via\ntm::readTabular has fixed this.
sentSplit now gives warnings for text that may contain anomalies such as:\nnon-ASCII characters, factors, missing punctuation, empty cells, and no \nalphabetic characters found.
read.transcript now gives a warning when reading from a .docx file and the \nseparator (sep) used is still found in the text, as this may indicate the \ndata did not split correctly.
dispersion_plot now takes a named list of vectors of terms as the argument to \nmatch.terms. The vectors are combined as a unified theme named with the \nnames of the list supplied to match.terms.
CHANGES
\n\nas.data.frame.Corpus's default value for sent.split is now FALSE.
The state column in the qdap::DATA2 data-set is now character (previously \nfactor).