Published May 9, 2022 | Version 1
Dataset | Open Access

Trained Models from "General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings"

Affiliations

  1. Max Planck Institute for Psycholinguistics
  2. University of Ulm

Description

Trained models from the paper:

Lukas Galke, Isabell Cuber, Christoph Meyer, Henrik Ferdinand Noelscher, Angelina Sonderecker, and Ansgar Scherp: General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings, in: International Joint Conference on Neural Networks (IJCNN), 2022.

  • File seq2mat_hybrid_bidirectional_sbertlike-100p-bsz512 contains the pretrained model.
  • File ws2020_transformer_final_models contains the fine-tuned models, one per task of the GLUE benchmark (a loading sketch follows this list).
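
How the checkpoints inside the archives are serialized is not documented on this page. The sketch below therefore only extracts the pretraining archive and lists its contents; the actual loading step is left as a hedged comment, since standard PyTorch checkpoint files are an assumption, not something this record confirms:

    import zipfile

    # Extract the pretraining archive and inspect its layout before loading.
    # The directory structure inside the archive is not documented on this record.
    ARCHIVE = "seq2mat_hybrid_bidirectional_sbertlike-100p-bsz512.zip"

    with zipfile.ZipFile(ARCHIVE) as zf:
        print("\n".join(zf.namelist()[:20]))  # peek at the first entries
        zf.extractall("seq2mat_pretrained")

    # If the extracted files turn out to be standard PyTorch checkpoints
    # (an assumption), they could then be loaded along these lines:
    #   import torch
    #   state = torch.load("seq2mat_pretrained/<checkpoint-file>", map_location="cpu")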

Files (26.3 GB total)

  • seq2mat_hybrid_bidirectional_sbertlike-100p-bsz512.zip (974.6 MB, md5:0a543e81efafb7b688f15d613b76b0ea)
  • ws2020_transformer_final_models.zip (25.3 GB, md5:e3b8125b6f3dde41790a032c7f09d5ca)
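
To check that the large downloads are intact, the published checksums can be verified locally. A minimal sketch in Python, assuming the archives sit in the current directory and that the name-to-checksum pairing in the list above (including the assumed .zip extension of the second archive) is correct:

    import hashlib
    from pathlib import Path

    # Checksums as published on this record; the pairing of names to hashes
    # follows the file list above, and the .zip extension of the second
    # archive is an assumption.
    EXPECTED = {
        "seq2mat_hybrid_bidirectional_sbertlike-100p-bsz512.zip": "0a543e81efafb7b688f15d613b76b0ea",
        "ws2020_transformer_final_models.zip": "e3b8125b6f3dde41790a032c7f09d5ca",
    }

    def md5sum(path: Path, chunk_size: int = 1 << 20) -> str:
        """Stream the file through MD5 so multi-GB archives fit in memory."""
        digest = hashlib.md5()
        with path.open("rb") as f:
            for chunk in iter(lambda: f.read(chunk_size), b""):
                digest.update(chunk)
        return digest.hexdigest()

    for name, expected in EXPECTED.items():
        path = Path(name)
        if not path.exists():
            print(f"{name}: not downloaded")
        elif md5sum(path) == expected:
            print(f"{name}: OK")
        else:
            print(f"{name}: CHECKSUM MISMATCH")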