DOSMo-7B: A Large Language Model Trained Exclusively on German Text

18 Sept 2024, 16:50
1h 30m
Emmy-Noether-Saal

Speaker

Maximilian Idahl (L3S)

Description

We introduce DOSMo-7B, an open 7-billion-parameter large language model (LLM) trained on 1T tokens of exclusively German text. DOSMo-7B uses the same architecture as Mistral-7B, paired with a custom tokenizer to maximize encoding efficiency for German text. In contrast to existing approaches, which typically improve the German skills of LLMs through continued pretraining, we pretrain from scratch to explore the potential of training LLMs on German text alone. In this technical report, we describe our approach to dataset creation, training, and evaluation of DOSMo-7B. A brief sketch of how tokenizer encoding efficiency can be compared on German text follows below.
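
The following is a minimal, illustrative sketch (not taken from the report) of one common way to compare tokenizer encoding efficiency on German text: counting the average number of tokens produced per word, where fewer tokens per word indicates a more efficient encoding. The model name and sample sentence are placeholder assumptions, and the Hugging Face transformers library is assumed to be available.

# Sketch: compare tokens-per-word ("fertility") of tokenizers on German text.
# Lower values mean the tokenizer encodes German more efficiently.
from transformers import AutoTokenizer

def tokens_per_word(tokenizer, text: str) -> float:
    """Average number of tokens produced per whitespace-separated word."""
    n_words = len(text.split())
    n_tokens = len(tokenizer.encode(text, add_special_tokens=False))
    return n_tokens / max(n_words, 1)

# Placeholder German sample; in practice one would use a larger held-out corpus.
sample = ("Die Würde des Menschen ist unantastbar. Sie zu achten und zu "
          "schützen ist Verpflichtung aller staatlichen Gewalt.")

# Hypothetical comparison: substitute the tokenizers actually under study.
for name in ["mistralai/Mistral-7B-v0.1"]:
    tok = AutoTokenizer.from_pretrained(name)
    print(name, round(tokens_per_word(tok, sample), 2))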

Primary author

Maximilian Idahl (L3S)

Presentation materials