DEV Community

makotunes
makotunes

Posted on

[GPT-2] The problem feeling when trying to support multi-language with GPT-2

Online tool is published

Normal GPT-2 only supports English, but developed a tool that supports multiple languages.

https://mockers.io/generator

screencapture-mockers-io-generator-2020-01-18-22_40_58.png

Design

Actually it has a three-stage configuration,

  • Language identification and translation into English
  • GPT-2 main processing
  • Restore from English to the identified original language

It is a fake multilingual support.

The model is compatible with the recently published best performing 1558M, which seems to produce very natural sentences in English.
On the other hand, in Japanese, it is a very machine-translated sentence.

Problem

It will be more natural to support Japanese native as a model.

GPT-2 originally uses English-written news sites as learning data, so it is good at generating sentences that can be told there. GPT-2 basically gives a sentence first and then guesses the proper sentences to come after. For example, if you generate "President Trump" in Japanese, it will be a bit decent. Other Languages probably too.

On the other hand, giving words that are far from English-speaking cultures tends to be unnatural. In order to obtain natural Japanese, of course, we do not use translations, but it is also very important to learn Japanese-speaking sentences.

By the way, it is possible to generate sentences recursively from the generated sentences, but it is also a problem that in languages ​​other than English, the translation is repeated and it deteriorates rapidly.

But is there a need for multilingual native models?

Prospects for future AI generation

At present, sentence generation in large-scale unsupervised learning has become higher in terms of naturalness, but I feel that it is not good to adapt to something.
In my personal opinion, I have the following issues.

  • How to learn common sense and facts
  • Can I generate sentences with purpose?

If these parts are resolved, I think it will have business value.

Finally

GPT-2 related tools are highly acclaimed and under development, so if you are interested, please read this article.

https://dev.to/makotunes/by-using-gpt-2-which-is-too-dangerous-i-finetuned-a-model-with-president-trump-s-twitter-and-made-a-fake-trump-bot-2j0h

Top comments (0)