I will stop arguing here, because it is meaningless.
In order to operate, each and every program should have its code base, do you understand that?
You want to tell me that, some Alpha just arrived from somewhere, installed itself on the TPU and started improving its play?
There is code guiding its actions, that is so obvious, code, written by humans.
Whether it fulfills its task on a single or multiple levels is fully irrelevant: it still does so following the instructions of the initial code base.
What do you think they are doing, when Alpha reaches an optimum and can not improve any more, they leave it to get things straight by self-learning?
Of course, they are changing the code base, trying to optimise it.
If it does not have instructions that winning is good, how could it evaluate then if a position is good or bad? Of course, it knows winning is good and that is WRITTEN in the primary code by a human.
You think it does not have instructions to learn where the pieces land? Of course it does. If it can not make distinction between different board squares, how can it then optinise its algorithms? So, it checks the squares where the pieces have landed during the game and, depending on the result, increases or decreases their values. This is still done according to the instruction that winning is good and that psqt should be increased in case of a win. That second instruction has also been written by a human.
So that, it is humans who wrote the primary code base and are constantly changing/optimising it, while the computer just follows those instructions. Even the indication that after each games colours should be reversed is written by a human. Is not that so obvious?
So, basically, Alpha just follows instructions, both during play and self-training.
Obviously the rules of the game are inputted, and the objective (winning). Yes, it starts with that. But beyond that . . . ? The neural network figures out the rest.
And yes, obviously there's a lot of computing involved to create Alpha Zero. But it's also been directed to self-learn and create its own knowledge—which is the key breakthrough.
The purpose of AlphaGoZero is really to test the proficiency of self-learning AI. The future goal of DeepMind is to apply this technology to find advances in medicine that humans haven't been able to find on our own.
Chess is just a rules-based game that provides a nice opportunity to test the AI's ability to self-learn within a closed system.
Here's an interesting article about it: https://medium.com/intuitionmachine/the-strange-loop-in-alphago-zeros-self-play-6e3274fcdd9f
World class GMs, professionals who use Stockfish every day, are absolutely in awe of the depth and beauty of Alpha Zero's games. The computer didn't just destroy Stockfish, it did it in style, rewriting some chess theory in the process!
Some of those games were spectacular!
SmyslovFan,
What do you think this portends for the near future of competitive OTB chess by humans, and the interest in competitive OTB chess by humans?
Increase, same, or decrease?