Thanks for this very well written article! I am checking out the references at the end.
A minor point:
It started off with random moves and quickly became superhuman (with an ELO of about 4500) after only 3 days of training.
The number of days is probably not a good metric to judge the speed of training. It played around 5 million games against itself during those 3 days. So, it is an order of magnitude greater than even the most experienced human player.
That's a really good point. It's easy to overlook how much processing power is involved in training the network. I'm also really impressed by how DeepMind were able to break the problem down into tasks that could be massively distributed across processing units in parallel.
For further actions, you may consider blocking this person and/or reporting abuse
We're a place where coders share, stay up-to-date and grow their careers.
Thanks for this very well written article! I am checking out the references at the end.
A minor point:
The number of days is probably not a good metric to judge the speed of training. It played around 5 million games against itself during those 3 days. So, it is an order of magnitude greater than even the most experienced human player.
That's a really good point. It's easy to overlook how much processing power is involved in training the network. I'm also really impressed by how DeepMind were able to break the problem down into tasks that could be massively distributed across processing units in parallel.