There is a newer version of this record available.

Software Open Access

quanteda/quanteda: CRAN v1.5.0

Kenneth Benoit; Kohei Watanabe; Haiyan Wang; Paul Nulty; Adam Obeng; Stefan Müller; Jiong Wei Lua; Aki Matsuo; Christian Mueller; Will Lowe; Pablo Barberá; Tyler Rinker; mark padgham; Christopher Gandrud; José Tomás Atria; Tom Paskhalis; nicmer; lindbrook; hofaichan; etienne-s; hotzeplotz; Thomas J. Leeper; Stas Malavin; Michael W. Kearney; Michael Chirico; Katrin Leinweber; Johannes Gruber


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/1518ca6f-b3c2-43cc-b736-07ee05bc1f43/quanteda/quanteda-v1.5.0.zip"
      }, 
      "checksum": "md5:5b7294057e21e230fdb349076e556c1e", 
      "bucket": "1518ca6f-b3c2-43cc-b736-07ee05bc1f43", 
      "key": "quanteda/quanteda-v1.5.0.zip", 
      "type": "zip", 
      "size": 37034944
    }
  ], 
  "owners": [
    25430
  ], 
  "doi": "10.5281/zenodo.3268686", 
  "stats": {
    "version_unique_downloads": 46.0, 
    "unique_views": 9.0, 
    "views": 9.0, 
    "downloads": 1.0, 
    "unique_downloads": 1.0, 
    "version_unique_views": 577.0, 
    "volume": 37034944.0, 
    "version_downloads": 129.0, 
    "version_views": 621.0, 
    "version_volume": 3449611139.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.3268686", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.596731", 
    "bucket": "https://zenodo.org/api/files/1518ca6f-b3c2-43cc-b736-07ee05bc1f43", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.596731.svg", 
    "html": "https://zenodo.org/record/3268686", 
    "latest_html": "https://zenodo.org/record/3355387", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.3268686.svg", 
    "latest": "https://zenodo.org/api/records/3355387"
  }, 
  "conceptdoi": "10.5281/zenodo.596731", 
  "created": "2019-07-04T13:34:51.319007+00:00", 
  "updated": "2019-07-30T12:33:56.143661+00:00", 
  "conceptrecid": "596731", 
  "revision": 4, 
  "id": 3268686, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.3268686", 
    "description": "New features\n<ul>\n<li>Add <code>flatten</code> and <code>levels</code> arguments to <code>as.list.dictionary2()</code> to enable more flexible conversion of dictionary objects. (#1661)</li>\n<li>In <code>corpus_sample()</code>, the <code>size</code> now works with the <code>by</code> argument, to control the size of units sampled from each group.</li>\n<li>Improvements to <code>textstat_dist()</code> and <code>textstat_simil()</code>, see below.</li>\n<li>Long tokens are not discarded automatically in the call to <code>tokens()</code>. (#1713)</li>\n</ul>\nBehaviour changes\n<ul>\n<li><code>textstat_dist()</code> and <code>textstat_simil()</code> now return sparse symmetric matrix objects using classes from the <strong>Matrix</strong> package.  This replaces the former structure based on the <code>dist</code> class.  Computation of these classes is now also based on the fast implementation in the <strong>proxyC</strong> package.  When computing similarities, the new <code>min_simil</code> argument allows a user to ignore certain values below a specified similarity threshold.  A new coercion method <code>as.data.frame.textstat_simildist()</code> now exists for converting these returns into a data.frame of pairwise comparisons.  Existing methods such as <code>as.matrix()</code>, <code>as.dist()</code>, and <code>as.list()</code> work as they did before.</li>\n<li>We have removed the \"faith\", \"chi-squared\", and \"kullback\" methods from <code>textstat_dist()</code> and <code>textstat_simil()</code> because these were either not symmetric or not invariant to document or feature ordering. Finally, the <code>selection</code> argument has been deprecated in favour of a new <code>y</code> argument.  </li>\n<li><code>textstat_readability()</code> now defaults to <code>measure = \"Flesch\"</code> if no measure is supplied.  This makes it consistent with <code>textstat_lexdiv()</code> that also takes a default measure (\"TTR\") if none is supplied.  (#1715)</li>\n<li>The default values for <code>max_nchar</code> and <code>min_nchar</code> in <code>tokens_select()</code> are now NULL, meaning they are not applied if the user does not supply values.  Fixes #1713.</li>\n</ul>\nBug fixes and stability enhancements\n<ul>\n<li><code>kwic.corpus()</code> and <code>kwic.tokens()</code> behaviour now aligned, meaning that dictionaries are correctly faceted by key instead of by value. (#1684)</li>\n<li>Improved formatting of <code>tokens()</code> verbose output. (#1683)</li>\n<li>Subsetting and printing of subsetted kwic objects is more robust. (#1665)</li>\n<li>The \"Bormuth\" and \"DRP\" measures are now fixed for <code>textstat_readability()</code>. (#1701)</li>\n</ul>", 
    "license": {
      "id": "other-open"
    }, 
    "title": "quanteda/quanteda: CRAN v1.5.0", 
    "relations": {
      "version": [
        {
          "count": 22, 
          "index": 20, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "596731"
          }, 
          "is_last": false, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3355387"
          }
        }
      ]
    }, 
    "version": "v1.5.0", 
    "publication_date": "2019-07-04", 
    "creators": [
      {
        "affiliation": "London School of Economics and Political Science", 
        "name": "Kenneth Benoit"
      }, 
      {
        "affiliation": "Waseda University", 
        "name": "Kohei Watanabe"
      }, 
      {
        "affiliation": "Tracr", 
        "name": "Haiyan Wang"
      }, 
      {
        "affiliation": "University College Dublin", 
        "name": "Paul Nulty"
      }, 
      {
        "affiliation": "Columbia University, London School of Economics", 
        "name": "Adam Obeng"
      }, 
      {
        "affiliation": "University of Zurich", 
        "name": "Stefan M\u00fcller"
      }, 
      {
        "affiliation": "London School of Economics", 
        "name": "Jiong Wei Lua"
      }, 
      {
        "affiliation": "Institute for Analytics and Data Science, University of Essex", 
        "name": "Aki Matsuo"
      }, 
      {
        "affiliation": "London School of Economics and Political Science", 
        "name": "Christian Mueller"
      }, 
      {
        "affiliation": "Princeton University", 
        "name": "Will Lowe"
      }, 
      {
        "affiliation": "University of Southern California", 
        "name": "Pablo Barber\u00e1"
      }, 
      {
        "affiliation": "Campus Labs", 
        "name": "Tyler Rinker"
      }, 
      {
        "affiliation": "@ATFutures", 
        "name": "mark padgham"
      }, 
      {
        "affiliation": "@zalando", 
        "name": "Christopher Gandrud"
      }, 
      {
        "name": "Jos\u00e9 Tom\u00e1s Atria"
      }, 
      {
        "affiliation": "London School of Economics and Political Science", 
        "name": "Tom Paskhalis"
      }, 
      {
        "name": "nicmer"
      }, 
      {
        "name": "lindbrook"
      }, 
      {
        "name": "hofaichan"
      }, 
      {
        "name": "etienne-s"
      }, 
      {
        "name": "hotzeplotz"
      }, 
      {
        "name": "Thomas J. Leeper"
      }, 
      {
        "affiliation": "Soil Cryology Lab", 
        "name": "Stas Malavin"
      }, 
      {
        "affiliation": "@MUDSA", 
        "name": "Michael W. Kearney"
      }, 
      {
        "affiliation": "@myteksi", 
        "name": "Michael Chirico"
      }, 
      {
        "affiliation": "@TIBHannover", 
        "name": "Katrin Leinweber"
      }, 
      {
        "affiliation": "University of Glasgow", 
        "name": "Johannes Gruber"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "software", 
      "title": "Software"
    }, 
    "related_identifiers": [
      {
        "scheme": "url", 
        "identifier": "https://github.com/quanteda/quanteda/tree/v1.5.0", 
        "relation": "isSupplementTo"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.596731", 
        "relation": "isVersionOf"
      }
    ]
  }
}
621
129
views
downloads
All versions This version
Views 6219
Downloads 1291
Data volume 3.4 GB37.0 MB
Unique views 5779
Unique downloads 461

Share

Cite as