{ "access": { "embargo": { "active": false, "reason": null }, "files": "public", "record": "public", "status": "open" }, "created": "2019-01-30T22:04:59.721031+00:00", "custom_fields": {}, "deletion_status": { "is_deleted": false, "status": "P" }, "files": { "count": 1, "enabled": true, "entries": { "quanteda/quanteda-v1.4.zip": { "checksum": "md5:d73bcfb636cb589134cbec939f27495f", "ext": "zip", "id": "d95c8b5e-71f6-4363-9b03-3b2ea0a28b1c", "key": "quanteda/quanteda-v1.4.zip", "metadata": null, "mimetype": "application/zip", "size": 32689937 } }, "order": [], "total_bytes": 32689937 }, "id": "2553641", "is_draft": false, "is_published": true, "links": { "access": "https://zenodo.org/api/records/2553641/access", "access_links": "https://zenodo.org/api/records/2553641/access/links", "access_request": "https://zenodo.org/api/records/2553641/access/request", "access_users": "https://zenodo.org/api/records/2553641/access/users", "archive": "https://zenodo.org/api/records/2553641/files-archive", "archive_media": "https://zenodo.org/api/records/2553641/media-files-archive", "communities": "https://zenodo.org/api/records/2553641/communities", "communities-suggestions": "https://zenodo.org/api/records/2553641/communities-suggestions", "doi": "https://doi.org/10.5281/zenodo.2553641", "draft": "https://zenodo.org/api/records/2553641/draft", "files": "https://zenodo.org/api/records/2553641/files", "latest": "https://zenodo.org/api/records/2553641/versions/latest", "latest_html": "https://zenodo.org/records/2553641/latest", "media_files": "https://zenodo.org/api/records/2553641/media-files", "parent": "https://zenodo.org/api/records/596731", "parent_doi": "https://zenodo.org/doi/10.5281/zenodo.596731", "parent_html": "https://zenodo.org/records/596731", "requests": "https://zenodo.org/api/records/2553641/requests", "reserve_doi": "https://zenodo.org/api/records/2553641/draft/pids/doi", "self": "https://zenodo.org/api/records/2553641", "self_doi": "https://zenodo.org/doi/10.5281/zenodo.2553641", "self_html": "https://zenodo.org/records/2553641", "self_iiif_manifest": "https://zenodo.org/api/iiif/record:2553641/manifest", "self_iiif_sequence": "https://zenodo.org/api/iiif/record:2553641/sequence/default", "versions": "https://zenodo.org/api/records/2553641/versions" }, "media_files": { "count": 0, "enabled": false, "entries": {}, "order": [], "total_bytes": 0 }, "metadata": { "creators": [ { "affiliations": [ { "name": "London School of Economics and Political Science" } ], "person_or_org": { "family_name": "Kenneth Benoit", "name": "Kenneth Benoit", "type": "personal" } }, { "affiliations": [ { "name": "Waseda University" } ], "person_or_org": { "family_name": "Kohei Watanabe", "name": "Kohei Watanabe", "type": "personal" } }, { "affiliations": [ { "name": "LSE" } ], "person_or_org": { "family_name": "Haiyan Wang", "name": "Haiyan Wang", "type": "personal" } }, { "affiliations": [ { "name": "University of Cambridge" } ], "person_or_org": { "family_name": "Paul Nulty", "name": "Paul Nulty", "type": "personal" } }, { "affiliations": [ { "name": "Columbia University, London School of Economics" } ], "person_or_org": { "family_name": "Adam Obeng", "name": "Adam Obeng", "type": "personal" } }, { "affiliations": [ { "name": "University of Zurich" } ], "person_or_org": { "family_name": "Stefan M\u00fcller", "name": "Stefan M\u00fcller", "type": "personal" } }, { "person_or_org": { "family_name": "Jiong Wei Lua", "name": "Jiong Wei Lua", "type": "personal" } }, { "affiliations": [ { "name": "Department of Methodology, London School of Economics" } ], "person_or_org": { "family_name": "Aki Matsuo", "name": "Aki Matsuo", "type": "personal" } }, { "affiliations": [ { "name": "London School of Economics and Political Science" } ], "person_or_org": { "family_name": "Christian Mueller", "name": "Christian Mueller", "type": "personal" } }, { "affiliations": [ { "name": "Princeton University" } ], "person_or_org": { "family_name": "Will Lowe", "name": "Will Lowe", "type": "personal" } }, { "affiliations": [ { "name": "London School of Economics" } ], "person_or_org": { "family_name": "Pablo Barber\u00e1", "name": "Pablo Barber\u00e1", "type": "personal" } }, { "affiliations": [ { "name": "Campus Labs" } ], "person_or_org": { "family_name": "Tyler Rinker", "name": "Tyler Rinker", "type": "personal" } }, { "affiliations": [ { "name": "@ATFutures" } ], "person_or_org": { "family_name": "mark padgham", "name": "mark padgham", "type": "personal" } }, { "affiliations": [ { "name": "@zalando" } ], "person_or_org": { "family_name": "Christopher Gandrud", "name": "Christopher Gandrud", "type": "personal" } }, { "affiliations": [ { "name": "London School of Economics and Political Science" } ], "person_or_org": { "family_name": "Tom Paskhalis", "name": "Tom Paskhalis", "type": "personal" } }, { "person_or_org": { "family_name": "nicmer", "name": "nicmer", "type": "personal" } }, { "person_or_org": { "family_name": "lindbrook", "name": "lindbrook", "type": "personal" } }, { "person_or_org": { "family_name": "hofaichan", "name": "hofaichan", "type": "personal" } }, { "person_or_org": { "family_name": "etienne-s", "name": "etienne-s", "type": "personal" } }, { "person_or_org": { "family_name": "hotzeplotz", "name": "hotzeplotz", "type": "personal" } }, { "person_or_org": { "family_name": "Thomas J. Leeper", "name": "Thomas J. Leeper", "type": "personal" } }, { "affiliations": [ { "name": "Soil Cryology Lab" } ], "person_or_org": { "family_name": "Stas Malavin", "name": "Stas Malavin", "type": "personal" } }, { "affiliations": [ { "name": "@MUDSA" } ], "person_or_org": { "family_name": "Michael W. Kearney", "name": "Michael W. Kearney", "type": "personal" } }, { "affiliations": [ { "name": "@myteksi" } ], "person_or_org": { "family_name": "Michael Chirico", "name": "Michael Chirico", "type": "personal" } }, { "affiliations": [ { "name": "@TIBHannover" } ], "person_or_org": { "family_name": "Katrin Leinweber", "name": "Katrin Leinweber", "type": "personal" } } ], "description": "Bug fixes and stability enhancements\n
dfm_compress()
and dfm_group()
that changed or deleted docvars attributes of dfm objects (#1506).textplot_xray()
that caused incorrect facet labels when a pattern contained multiple list elements or values (#1514).kwic()
now correctly returns the pattern associated with each match as the \"keywords\"
attribute, for all pattern
types (#1515)textstat_simil()
and textstat_dist()
.textstat_lexdiv()
now works on tokens objects, not just dfm objects. New methods of lexical diversity now include MATTR (the Moving-Average Type-Token Ratio, Covington & McFall 2010) and MSTTR (Mean Segmental Type-Token Ratio).tokens_split()
allows splitting single into multiple tokens based on a pattern match. (#1500)tokens_chunk()
allows splitting tokens into new documents of equally-sized \"chunks\". (#1520)textstat_entropy()
now computes entropy for a dfm across feature or document margins.textstat_readability()
is vastly improved, now providing detailing all formulas and providing full references.dfm_match()
allows a user to specify the features in a dfm according to a fixed vector of feature names, including those of another dfm. Replaces dfm_select(x, pattern)
where pattern
was a dfm.vertex_labelsize
added to textplot_network()
to allow more precise control of label sizes, either globally or individually.tokens.tokens(x, remove_hyphens = TRUE)
where x
was generated with remove_hyphens = FALSE
now behaves similarly to how the same tokens would be handled had this option been called on character input as tokens.character(x, remove_hyphens = TRUE)
. (#1498)