Published November 12, 2023
| Version v1
Conference proceeding
Open
OpenGPT-X: Novel Architecture Exploration
Description
The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open- source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Due to the substantial memory and compute resources required for efficiently training large language models, high-performance computing systems such as JUWELS Booster1 are essential. This paper presents the results of the exploration of novel hardware architecture conducted within the scope of the project.
Files
ChelseaMariaJohn+AndreasHerten.pdf
Files
(239.5 kB)
Name | Size | Download all |
---|---|---|
md5:51d948e2cf4a7f6448ec72fe83c3662b
|
239.5 kB | Preview Download |