DEV Community

dinhanhx
dinhanhx

Posted on

Attention this, attention that

This post is for people who are working on or learning in deep learning with natural language processing. Minimum knowledge level: Hugging Face - Transformer or equivalent.

Who have read 3 followings papers are beneficial:

There are 3 style of attention mechanism should not be confused with namely:

Reading list recommendation:

NOTE: Please comment any attention mechanism not included in this post as well as paper, implementations.

Top comments (0)