There is a newer version of this record available.

Software Open Access

Transformers: State-of-the-Art Natural Language Processing

Wolf, Thomas; Debut, Lysandre; Sanh, Victor; Chaumond, Julien; Delangue, Clement; Moi, Anthony; Cistac, Pierric; Ma, Clara; Jernite, Yacine; Plu, Julien; Xu, Canwen; Le Scao, Teven; Gugger, Sylvain; Drame, Mariama; Lhoest, Quentin; Rush, Alexander M.

  • Fix gradient_checkpointing backward compatibility (#14408)
  • [Wav2Vec2] Make sure that gradient checkpointing is only run if needed (#14407)
  • Experimenting with adding proper get_config() and from_config() methods (#14361)
  • enhance rewrite state_dict missing _metadata (#14348)
  • Support for TF >= 2.7 (#14345)
  • improve rewrite state_dict missing _metadata (#14276)
  • Fix of issue #13327: Wrong weight initialization for TF t5 model (#14241)
If you use this software, please cite it using these metadata.
Files (12.3 MB)
                   All versions   This version
Views              33,922         149
Downloads          1,194          6
Data volume        8.8 GB         73.7 MB
Unique views       28,439         119
Unique downloads   588            6


Cite as