DEV Community

Julien Simon
Julien Simon

Posted on • Originally published at julsimon.Medium on

Accelerate Transformer training with AWS Trainium

In this video, I show you how to accelerate Transformer training with AWS Trainium, a new custom chip designed by AWS.

First, I walk you through the setup of an Amazon EC2 trn1.32xlarge instance, equipped with 16 Trainium chips. Then, I run a natural language processing job where I adapt existing Transformer training code for Trainium, accelerating a BERT model to classify the Yelp restaurant review datatset. Finally, I run the job on 1, 8, and 32 Neuron cores.

Top comments (0)