Closure

#ai

After a pause, this series comes to a conclusion, mostly because of the rapid developments in the area of large language models.

Original intention

At the beginning I intended to create a language model, that would have gotten a prompt "Geschirrabwaschgesetz" (a law about washing dishes) and write me a corresponding law text in German.

I was discouraged from training the original char RNN because of the scary amount of training time with a 110 M training data. Therefore I went with fine-tuning a German GPT-2 (and later the better one; thanks Jo!). The fine-tuning process of such a model is described here or here, for example.

(Un-)expected discovery

I happened to discover that my intended case is covered perfectly by the LLAMA 2 Chat German model (almost, because of a few grammatical errors). This is very likely because of being fine-tuned with the German legal SQuAD dataset, among others.

I do not want to withhold the result from you (produced in LM Studio):

Just look at this beauty! It even defined "Hygiene" in the last subparagraph! And hence this series is concluded.

DEV Community

Closure

Original intention

(Un-)expected discovery

Top comments (0)

Read next

ByteDance just announced X-Portrait 2! This AI brings photos to life with realistic emotions and expressions from a single video

How to Use PydanticAI for Structured Outputs with Multimodal LLMs

Qwen2.5-Coder-32B-Instruct vs. Claude 3.5 Sonnet vs. GPT-4o: Coding LLM Comparison

🚀 Introducing Chromate: Build AI Agents Easily with Ruby and the Chrome DevTools Protocol (CDP)