Published March 7, 2022 | Version v0.2.0
Software
Open
EleutherAI/lm-evaluation-harness: v0.2.0
Authors/Creators
- 1. Booz Allen Hamilton
- 2. @DivaHQ
- 3. PKU
- 4. Ivy Natal
Description
Major changes since 0.1.0:
- added blimp (#237)
- added qasper (#264)
- added asdiv (#244)
- added truthfulqa (#219)
- added gsm (#260)
- implemented description dict and deprecated provide_description (#226)
- new `--check_integrity` flag to run integrity unit tests at eval time (#290)
- positional arguments to `evaluate` and `simple_evaluate` are now deprecated
- `_CITATION` attribute on task modules (#292)
- lots of bug fixes and task fixes (always remember to report task versions for comparability!)
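For readers unfamiliar with the pattern, the deprecation of positional arguments to `evaluate` and `simple_evaluate` can be illustrated with a small decorator that warns whenever a function receives positional arguments. This is only a minimal sketch and not the harness's actual code: `warn_on_positional` and `run_eval` are hypothetical names invented for the example.

```python
import functools
import warnings


def warn_on_positional(fn):
    """Hypothetical decorator: warn when `fn` is called with positional arguments."""

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        if args:
            warnings.warn(
                f"calling {fn.__name__} with positional arguments is deprecated; "
                "pass keyword arguments instead",
                DeprecationWarning,
                stacklevel=2,
            )
        # The call still goes through unchanged; only a warning is emitted.
        return fn(*args, **kwargs)

    return wrapper


@warn_on_positional
def run_eval(model=None, tasks=None, num_fewshot=0):
    # Hypothetical stand-in for an evaluation entry point; it only echoes its arguments.
    return {"model": model, "tasks": tasks, "num_fewshot": num_fewshot}
```

With this sketch, `run_eval(model="gpt2", tasks=["blimp"])` runs silently, while `run_eval("gpt2", tasks=["blimp"])` returns the same result but emits a `DeprecationWarning`. Warning rather than raising keeps existing callers working while nudging them toward keyword arguments.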
Files
| Name | Size | Checksum |
|---|---|---|
| EleutherAI/lm-evaluation-harness-v0.2.0.zip | 731.0 kB | md5:903ef491489d5b3e98ac87af7ac3886d |
Additional details
Related works
- Is supplement to
- https://github.com/EleutherAI/lm-evaluation-harness/tree/v0.2.0 (URL)