Published November 12, 2023 | Version v1
Conference proceeding Open

OpenGPT-X: Novel Architecture Exploration

  • 1. ROR icon Forschungszentrum Jülich

Contributors

Project member:

  • 1. ROR icon Forschungszentrum Jülich

Description

The OpenGPT-X project is a German initiative with ten collaborators to build, train, and deploy a multilingual open- source language model. Models trained within the project will be used for pilot cases by industry partners and commercialized through the Gaia-X Federation. Due to the substantial memory and compute resources required for efficiently training large language models, high-performance computing systems such as JUWELS Booster1 are essential. This paper presents the results of the exploration of novel hardware architecture conducted within the scope of the project.

Files

ChelseaMariaJohn+AndreasHerten.pdf

Files (239.5 kB)

Name Size Download all
md5:51d948e2cf4a7f6448ec72fe83c3662b
239.5 kB Preview Download