Published May 20, 2022 | Version 2.2.2
Software
Open
huggingface/datasets: 2.2.2
Authors/Creators
- Quentin Lhoest1
- Albert Villanova del Moral1
- Patrick von Platen1
- Thomas Wolf1
- Mario Šaško1
- Yacine Jernite1
- Abhishek Thakur1
- Lewis Tunstall1
- Suraj Patil1
- Mariama Drame1
- Julien Chaumond1
- Julien Plu1
- Joe Davison1
- Simon Brandeis1
- Victor Sanh1
- Teven Le Scao1
- Kevin Canwen Xu1
- Nicolas Patry1
- Steven Liu1
- Angelina McMillan-Major1
- Philipp Schmid1
- Sylvain Gugger1
- Nathan Raw1
- Sylvain Lesage1
- Anton Lozhkov1
- Matthew Carrigan1
- Théo Matussière1
- Leandro von Werra1
- Lysandre Debut1
- Stas Bekman1
- Clément Delangue1
- 1. Hugging Face
Description
Datasets fixes
- Fix: irc_disentangle - fix checksum and bug dataset by @albertvillanova in https://github.com/huggingface/datasets/pull/4377
- Fix: CC-Aligned - fix invalid url by @juntang-zhuang in https://github.com/huggingface/datasets/pull/4231
- Fix: multi_news - don't strip proceeding hyphen by @JohnGiorgi in https://github.com/huggingface/datasets/pull/4353
- Support lists of multi-dimensional numpy arrays by @albertvillanova in https://github.com/huggingface/datasets/pull/4194
- Check if dataset features match before push in DatasetDict.push_to_hub by @mariosasko in https://github.com/huggingface/datasets/pull/4372
- Pin dill by @albertvillanova in https://github.com/huggingface/datasets/pull/4380
  - dill 0.3.5 has some issues in transformers; pinning the version to <0.3.5 for now
- Adding eval metadata for ade v2 by @sashavor in https://github.com/huggingface/datasets/pull/4319
- Adding eval metadata for AG News by @sashavor in https://github.com/huggingface/datasets/pull/4329
- Adding eval metadata to Allociné dataset by @sashavor in https://github.com/huggingface/datasets/pull/4330
- Adding eval metadata to Amazon Polarity by @sashavor in https://github.com/huggingface/datasets/pull/4331
- Adding eval metadata for arabic speech corpus by @sashavor in https://github.com/huggingface/datasets/pull/4332
- Adding eval metadata for Banking 77 by @sashavor in https://github.com/huggingface/datasets/pull/4333
- Eval metadata Batch 4: Tweet Eval, Tweets Hate Speech Detection, VCTK, Weibo NER, Wisesight Sentiment, XSum, Yahoo Answers Topics, Yelp Polarity, Yelp Review Full by @sashavor in https://github.com/huggingface/datasets/pull/4338
- Eval metadata batch 3: Reddit, Rotten Tomatoes, SemEval 2010, Sentiment 140, SMS Spam, Snips, SQuAD, SQuAD v2, Timit ASR by @sashavor in https://github.com/huggingface/datasets/pull/4337
- Eval metadata batch 1: BillSum, CoNLL2003, CoNLLPP, CUAD, Emotion, GigaWord, GLUE, Hate Speech 18, Hate Speech by @sashavor in https://github.com/huggingface/datasets/pull/4335
- Eval metadata batch 2 : Health Fact, Jigsaw Toxicity, LIAR, LJ Speech, MSRA NER, Multi News, NCBI Disease, Poem Sentiment by @sashavor in https://github.com/huggingface/datasets/pull/4336
- Add API code examples for Builder classes by @stevhliu in https://github.com/huggingface/datasets/pull/4313
- Add redirect to dataset script in the repo structure page by @lhoestq in https://github.com/huggingface/datasets/pull/4369
- Fix failing CI on Windows for sari and wiki_split metrics by @albertvillanova in https://github.com/huggingface/datasets/pull/4342
- Fix never ending GH Action to build documentation by @albertvillanova in https://github.com/huggingface/datasets/pull/4345
- Fix warning in upload_file by @albertvillanova in https://github.com/huggingface/datasets/pull/4355
- Fix warning in push_to_hub by @albertvillanova in https://github.com/huggingface/datasets/pull/4357
- Remove config names as yaml keys by @lhoestq in https://github.com/huggingface/datasets/pull/4367
- Add missing language tags for udhr dataset by @albertvillanova in https://github.com/huggingface/datasets/pull/4371
- Remove links in docs to old dataset viewer by @mariosasko in https://github.com/huggingface/datasets/pull/4373
New Contributors
- @JohnGiorgi made their first contribution in https://github.com/huggingface/datasets/pull/4353
- @juntang-zhuang made their first contribution in https://github.com/huggingface/datasets/pull/4231
Full Changelog: https://github.com/huggingface/datasets/compare/2.2.1...2.2.2
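The dill pin noted in the changelog (versions below 0.3.5) can be checked against a local environment. The following is a minimal sketch, not the check the library itself performs: the helper names `release_tuple`, `satisfies_dill_pin`, and `installed_dill_ok` are hypothetical, and the version parsing is deliberately simplified to leading numeric segments only.

```python
from importlib import metadata

# Exclusive upper bound taken from the release note ("<0.3.5").
DILL_UPPER_BOUND = (0, 3, 5)


def release_tuple(version: str) -> tuple:
    """Turn '0.3.4' into (0, 3, 4); stops at the first non-numeric segment."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def satisfies_dill_pin(version: str) -> bool:
    """Return True if the given version string is below the pinned bound."""
    return release_tuple(version) < DILL_UPPER_BOUND


def installed_dill_ok():
    """Check the installed dill, or return None if dill is not installed."""
    try:
        return satisfies_dill_pin(metadata.version("dill"))
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("installed dill satisfies pin:", installed_dill_ok())
```

For anything beyond a quick sanity check, a full version parser that handles pre-release and post-release tags would be a safer choice than this simplified tuple comparison.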
Files
| Name | Size |
|---|---|
| huggingface/datasets-2.2.2.zip (md5:d72823a554f1d9a6b1cb2c9a5240d26f) | 53.5 MB |
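The md5 digest listed for the archive can be used to verify a downloaded copy. Below is a minimal sketch using only the Python standard library; the function names `md5_of_file` and `matches_record` and the local file path in the usage line are assumptions for illustration, while the expected digest is the one shown in this record.

```python
import hashlib

# Digest copied from the file listing in this record.
EXPECTED_MD5 = "d72823a554f1d9a6b1cb2c9a5240d26f"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Return True if the file's md5 digest equals the expected digest."""
    return md5_of_file(path) == expected.lower()
```

Assuming the archive was saved locally as `datasets-2.2.2.zip` (a hypothetical path), `matches_record("datasets-2.2.2.zip")` would report whether the download matches the listed digest. Reading in chunks keeps memory use flat for a 53.5 MB file.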
Additional details
Related works
- Is supplement to
- https://github.com/huggingface/datasets/tree/2.2.2 (URL)