Objectively Speaking, Is Magnus a Patzer Compared to Stockfish and AlphaZero?

SeniorPatzer
SmyslovFan wrote:

World class GMs, professionals who use Stockfish every day, are absolutely in awe of the depth and beauty of Alpha Zero's games. The computer didn't just destroy Stockfish, it did it in style, rewriting some chess theory in the process! 

 

Some of those games were spectacular!

 

SmyslovFan,

 

What do you think this portends for the near future of competitive OTB chess by humans, and for interest in it?

 

Increase, same, or decrease?

HobbyPIayer
Lyudmil_Tsvetkov wrote:

I will stop arguing here, because it is meaningless.

In order to operate, each and every program should have its code base, do you understand that?

You want to tell me that, some Alpha just arrived from somewhere, installed itself on the TPU and started improving its play?

There is code guiding its actions, that is so obvious, code, written by humans.

Whether it fulfills its task on a single or multiple levels is fully irrelevant: it still does so following the instructions of the initial code base.

What do you think they are doing when Alpha reaches an optimum and cannot improve any more? Do they just leave it to get things straight by self-learning?

Of course, they are changing the code base, trying to optimise it.

 

If it does not have instructions that winning is good, how could it then evaluate whether a position is good or bad? Of course, it knows winning is good and that is WRITTEN in the primary code by a human.

You think it does not have instructions to learn where the pieces land? Of course it does. If it cannot make a distinction between different board squares, how can it then optimise its algorithms? So, it checks the squares where the pieces have landed during the game and, depending on the result, increases or decreases their values. This is still done according to the instruction that winning is good and that psqt should be increased in case of a win. That second instruction has also been written by a human.

 

So, it is humans who wrote the primary code base and are constantly changing/optimising it, while the computer just follows those instructions. Even the instruction that after each game colours should be reversed is written by a human. Is that not obvious?

So, basically, Alpha just follows instructions, both during play and self-training.

Obviously the rules of the game are inputted, and the objective (winning). Yes, it starts with that. But beyond that . . . ? The neural network figures out the rest.

And yes, obviously there's a lot of computing involved to create Alpha Zero. But it's also been directed to self-learn and create its own knowledge—which is the key breakthrough.

The purpose of AlphaGo Zero is really to test the proficiency of self-learning AI. The future goal of DeepMind is to apply this technology to find advances in medicine that humans haven't been able to find on their own.

Chess is just a rules-based game that provides a nice opportunity to test the AI's ability to self-learn within a closed system.

Here's an interesting article about it: https://medium.com/intuitionmachine/the-strange-loop-in-alphago-zeros-self-play-6e3274fcdd9f

Elroch
Lyudmil_Tsvetkov wrote:

I will stop arguing here, because it is meaningless.

It would indeed be a great idea if you were to go away and first of all learn something about the topic. Then your posts might even be correct as well as becoming meaningful.

In order to operate, each and every program should have its code base, do you understand that?

Most of the code of AlphaZero is common to all the two-player finite deterministic games of perfect information it might learn to play. In each it is necessary to add code to represent a position in the game and generate a list of legal moves for a position. Some types of AI would require tuning of hyperparameters, but as the AlphaZero paper says: "In AlphaZero we reuse the same hyper-parameters for all games without game-specific tuning. The sole exception is the noise that is added to the prior policy to ensure exploration; this is scaled in proportion to the typical number of legal moves for that game type."

You want to tell me that, some Alpha just arrived from somewhere, installed itself on the TPU and started improving its play?

The algorithms that implement neural networks are general, with nothing specific to chess. The algorithms that implement reinforcement learning are general to all finite two-player deterministic games of perfect information. Both had to be designed. The former are widely used, and the latter build on general techniques (with no relationship to chess) developed by Richard Sutton and others, specifically model-based Q-learning, I believe.

There is code guiding its actions, that is so obvious, code, written by humans.

No. Read the previous paragraph of mine.

Whether it fulfills its task on a single or multiple levels is fully irrelevant: it still does so following the instructions of the initial code base.

So, what you do is of no significance because you are merely following the instructions of your DNA?

What do you think they are doing when Alpha reaches an optimum and cannot improve any more? Do they just leave it to get things straight by self-learning?

Of course, they are changing the code base, trying to optimise it.

Time and time again, you wildly guess. Try learning more about the subject instead.

No. That is not what happened. They ran the self-learning algorithms and it got really good at chess. 

If it does not have instructions that winning is good, how could it then evaluate whether a position is good or bad? Of course, it knows winning is good and that is WRITTEN in the primary code by a human. You think it does not have instructions to learn where the pieces land?

So you are saying now that any player who has been taught the rules and the objective of the game is getting an unfair advantage?

As it happens, it would be possible to write an AI to learn the rules and objectives of chess from examples, but this is not as challenging a task as to produce the strongest chess player, so they didn't bother with it.

Of course it does. If it cannot make a distinction between different board squares, how can it then optimise its algorithms? So, it checks the squares where the pieces have landed during the game and, depending on the result, increases or decreases their values. This is still done according to the instruction that winning is good and that psqt should be increased in case of a win. That second instruction has also been written by a human.

No. Only the representation of the position, the legal moves and the different ways a game can terminate, with their scores.

You probably need a course on reinforcement learning. One gentle introduction is this 10 lecture course by David Silver of the DeepMind team. Very enjoyable and informative:

https://www.youtube.com/playlist?list=PLzuuYNsE1EZAXYR4FJ75jcJseBmo4KQ9-
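To make the point above concrete (only the position representation, the legal moves and the ways a game can terminate with their scores are game-specific), here is a minimal sketch in Python. The names and the interface are illustrative, not DeepMind's code; the generic self-play loop never needs to know which game it is playing.

```python
import random
from abc import ABC, abstractmethod

class Game(ABC):
    """Everything game-specific lives behind these four methods."""

    @abstractmethod
    def initial_position(self): ...

    @abstractmethod
    def legal_moves(self, position): ...      # list of legal moves

    @abstractmethod
    def apply(self, position, move): ...      # position after the move

    @abstractmethod
    def terminal_score(self, position): ...   # None if running, else +1/0/-1

def random_selfplay(game):
    """The generic loop: works unchanged for chess, Go or shogi."""
    pos, history = game.initial_position(), []
    while game.terminal_score(pos) is None:
        move = random.choice(game.legal_moves(pos))
        history.append((pos, move))
        pos = game.apply(pos, move)
    return history, game.terminal_score(pos)
```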

So, it is humans who wrote the primary code base and are constantly changing/optimising it,

No. The computer went from pathetic beginner to the best chess player in the world by self-learning without the slightest interference from humans. They did tell it enough to be a pathetic beginner and how to learn.

while the computer just follows those instructions. Even the instruction that after each game colours should be reversed is written by a human. Is that not obvious?

So, basically, Alpha just follows instructions, both during play and self-training.

Just like you and your DNA.

 

SeniorPatzer

Elroch:  "The computer went from pathetic beginner to the best chess player in the world by self-learning without the slightest interference from humans."

 

BoggleMeBrains (from another thread):  "I think all the quibbling over what restrictions were placed on Stockfish is beside the point.  Forget Stockfish.  The real story about AlphaZero isn't that it's better than Stockfish, it's that it represents the point at which machines truly surpassed humans at chess.  Conventional chess engines need to have their entire evaluation function provided for them by humans.  They may play better than humans, but are really nothing more than human knowledge running on fast hardware.  AlphaZero surpassed human chess knowledge entirely by itself.  It designed its own evaluation function.  The achievement is already beyond impressive even if Stockfish running on a supercomputer with opening books and tablebases might have proved stronger.  People fixating on whether the games were fair are kind of missing the true marvel of technology we've witnessed here.  Something with no human-provided knowledge can out-Karpov Karpov."

 

What does this portend for the near future of competitive OTB chess by humans?

 

Deep Blue did not kill it.  But will AlphaZero?

Elroch

IMO, tablebases would increase the strength of AlphaZero, but you would need a large database of computer chess games to make an opening book that would have any chance of competing with what AlphaZero has learnt to play. The problem is that a hundred games by weaker players could be refuted by innovations in AlphaZero's on-the-fly analysis (which is itself influenced by its own experience in a large number of self-play games, many at a very high standard).

jminkler
Pawn_Checkmate wrote:

My ears perked up because of the timing of going from reading a thread about it to now hearing it live on ChessTV. This is called the Baader-Meinhof phenomenon, a frequency illusion. It's like when you think about buying a new car and all of a sudden you see the same type of car all over the city.

 

AlphaZero was running on a supercomputer, while SF was on a computer that's worse than mine. If one SF were on a supercomputer and another SF on a regular computer, it would be 100-0.

It wasn't playing on a supercomputer lol. It was playing on what you can pick up at Best Buy, looking at a measly 80k positions per second while SF was calculating 70+ million. Smh. People read nothing...
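(For scale, taking jminkler's figures at face value: 70,000,000 ÷ 80,000 ≈ 875, so Stockfish examined nearly nine hundred positions for every one AlphaZero evaluated; AlphaZero's search compensates by being far more selective.)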

 

The engines of today are bound by their human handlers and rely on brute force alone to find moves rather than breaking out of their constraints and actually learning chess. 

 

These are some of the dumbest comments I have seen on chess.com ever.

Elroch

It used 4 of Google's tensor processing units (TPUs), which might be equivalent to about 1000 Intel cores (say 14 of the latest 72-core Xeons) for the purpose of neural network computations.

It's fair to say this is a supercomputer, although not on the scale of the really big ones used for big science.
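(Spelling out the arithmetic behind that estimate: 14 Xeons × 72 cores = 1,008 ≈ 1,000 cores, or roughly 250 core-equivalents per TPU, and only for the neural-network arithmetic.)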

Lyudmil_Tsvetkov
SmyslovFan wrote:

World class GMs, professionals who use Stockfish every day, are absolutely in awe of the depth and beauty of Alpha Zero's games. The computer didn't just destroy Stockfish, it did it in style, rewriting some chess theory in the process! 

 

Some of those games were spectacular!

The games were spectacular, but Smyslov would not quite have thought this way.

Lyudmil_Tsvetkov
Elroch wrote:
Lyudmil_Tsvetkov wrote:

I will stop arguing here, because it is meaningless.

It would indeed be a great idea if you were to go away and first of all learn something about the topic. Then your posts might even be correct as well as becoming meaningful.

In order to operate, each and every program should have its code base, do you understand that?

Most of the code of AlphaZero is common to all the two-player finite deterministic games of perfect information it might learn to play. In each it is necessary to add code to represent a position in the game and generate a list of legal moves for a position. Some types of AI would require tuning of hyperparameters, but as the AlphaZero paper says: "In AlphaZero we reuse the same hyper-parameters for all games without game-specific tuning. The sole exception is the noise that is added to the prior policy to ensure exploration; this is scaled in proportion to the typical number of legal moves for that game type."

You want to tell me that, some Alpha just arrived from somewhere, installed itself on the TPU and started improving its play?

The algorithms that implement neural networks are general, with nothing specific to chess. The algorithms that implement reinforcement learning are general to all finite two-player deterministic games of perfect information. Both had to be designed. The former are widely used, and the latter build on general techniques (with no relationship to chess) developed by Richard Sutton and others, specifically model-based Q-learning, I believe.

There is code guiding its actions, that is so obvious, code, written by humans.

No. Read the previous paragraph of mine.

Whether it fulfills its task on a single or multiple levels is fully irrelevant: it still does so following the instructions of the initial code base.

So, what you do is of no significance because you are merely following the instructions of your DNA?

What do you think they are doing when Alpha reaches an optimum and cannot improve any more? Do they just leave it to get things straight by self-learning?

Of course, they are changing the code base, trying to optimise it.

Time and time again, you wildly guess. Try learning more about the subject instead.

No. That is not what happened. They ran the self-learning algorithms and it got really good at chess. 

If it does not have instructions that winning is good, how could it then evaluate whether a position is good or bad? Of course, it knows winning is good and that is WRITTEN in the primary code by a human. You think it does not have instructions to learn where the pieces land?

So you are saying now that any player who has been taught the rules and the objective of the game is getting an unfair advantage?

As it happens, it would be possible to write an AI to learn the rules and objectives of chess from examples, but this is not as challenging a task as to produce the strongest chess player, so they didn't bother with it.

Of course it does. If it cannot make a distinction between different board squares, how can it then optimise its algorithms? So, it checks the squares where the pieces have landed during the game and, depending on the result, increases or decreases their values. This is still done according to the instruction that winning is good and that psqt should be increased in case of a win. That second instruction has also been written by a human.

No. Only the representation of the position, the legal moves and the different ways a game can terminate, with their scores.

You probably need a course on reinforcement learning. One gentle introduction is this 10 lecture course by David Silver of the DeepMind team. Very enjoyable and informative:

https://www.youtube.com/playlist?list=PLzuuYNsE1EZAXYR4FJ75jcJseBmo4KQ9-

So, it is humans who wrote the primary code base and are constantly changing/optimising it,

No. The computer went from pathetic beginner to the best chess player in the world by self-learning without the slightest interference from humans. They did tell it enough to be a pathetic beginner and how to learn.

while the computer just follows those instructions. Even the instruction that after each game colours should be reversed is written by a human. Is that not obvious?

So, basically, Alpha just follows instructions, both during play and self-training.

Just like you and your DNA.

 

"...it would be possible to write an AI..."

 

Well, you said it yourself, SOMEONE has written/is writing the AI.

AI is a non-concept. Human beings can adapt; that is the big difference. They adapt to good or bad surrounding conditions. There is no intelligence without pain and happiness. Computers don't have intelligence.

 

Lyudmil_Tsvetkov
jminkler wrote:
Pawn_Checkmate wrote:

My ears perked up because of the timing of going from reading a thread about it to now hearing it live on ChessTV. This is called the Baader-Meinhof phenomenon, a frequency illusion. It's like when you think about buying a new car and all of a sudden you see the same type of car all over the city.

 

AlphaZero was running on a supercomputer, while SF was on a computer that's worse than mine. If one SF were on a supercomputer and another SF on a regular computer, it would be 100-0.

It wasn't playing on a supercomputer lol. It was playing on what you can pick up at Best Buy, looking at a measly 80k positions per second while SF was calculating 70+ million. Smh. People read nothing...

 

The engines of today are bound by their human handlers and rely on brute force alone to find moves rather than breaking out of their constraints and actually learning chess. 

 

These are some of the dumbest comments I have seen on chess.com ever.

Those TPUs cost half a million bucks.

The number of positions calculated is completely irrelevant.

It is much slower, as it evaluates many more things on a single node, but that does not mean it does so in a great way, only that it does so. One can significantly decrease speed even by doing various kinds of unnecessary calculations. Most probably, it has been checking its Value[piece][from][to] array throughout nodes.
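For readers unfamiliar with the jargon, here is a toy sketch of the kind of hand-written Value[piece][from][to] / piece-square-table scheme Lyudmil describes, simplified to value[piece][square]. This illustrates the conventional-engine approach only; nothing in the published AlphaZero design contains such a table, which is the point Elroch argues above.

```python
# Toy piece-square-table evaluation (illustrative names and values).
PIECES, SQUARES = 6, 64
value = [[0] * SQUARES for _ in range(PIECES)]   # centipawn bonuses
value[1][27] = 20        # e.g. a knight (index 1 here) on d4 gets +20

def psqt_eval(board):
    """board: list of (piece, square) pairs for one side."""
    return sum(value[p][sq] for p, sq in board)

# Hand-tuning means a human editing these numbers and re-testing;
# that human loop is exactly what self-play training replaces.
```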

Lyudmil_Tsvetkov
Elroch wrote:

It used 4 of Google's tensor processing units (TPUs), which might be equivalent to about 1000 Intel cores (say 14 of the latest 72-core Xeons) for the purpose of neural network computations.

It's fair to say this is a supercomputer, although not on the scale of the really big ones used for big science.

That is right: this was a supercomputer versus a PC.

So the achievement in terms of AI was rather small; definitely much smaller than SF's.

SmyslovFan
Lyudmil_Tsvetkov wrote:
...

AI is a non-concept. Human beings can adapt; that is the big difference. They adapt to good or bad surrounding conditions. There is no intelligence without pain and happiness. Computers don't have intelligence.

 

This is the crux of your argument, a logical fallacy:

Artificial Intelligence is the ability to adapt.

Computers cannot have artificial intelligence, therefore they cannot adapt. 

AlphaZero has adaptive learning. 

Therefore, AlphaZero must be a fraud. 

 

Q.E.D.

Facts don't matter in such a construct.

Godeka
Elroch wrote:

It used 4 of Google's tensor processing units (TPUs), which might be equivalent to about 1000 Intel cores (say 14 of the latest 72-core Xeons) for the purpose of neural network computations.

It's fair to say this is a supercomputer, although not on the scale of the really big ones used for big science.

 

It is far from being a supercomputer by any definition. It is a single machine that, by the way, is very energy-efficient.

 

NN computations work very well on TPUs, better than on GPUs and much better than on CPUs. For this type of computation a TPU is about 15 to 30 times faster (as Google said about TPUv2), so remove the last zero from your 1000 to be more realistic. Because of recursion and branching, TPUs are useless for conventional chess engines, which run much better on CPUs. And 64 cores is not an insignificant amount either.
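(Spelling out that correction: knocking the last zero off the earlier 1,000 gives 4 TPUs ≈ 100 CPU-core-equivalents, i.e. about 25 per TPU. Both figures are order-of-magnitude guesses.)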

 

It would be possible to optimize both sides: tune the Stockfish setup, or specialize the generic approach of AlphaZero. But showing that Stockfish could be beaten was not the point of the work. It's about a completely self-learning NN that can reach top level, something that was doubted before because of the very tactical nature of chess.

 

It's nice to see that human opening theory was reconfirmed by an algorithm without any human input. Despite that, look at the games and you will see something special happened there. (Or to be more precise: DeepMind surprised the Go world one or two years before, and now they are doing it again in the chess and shogi worlds.)

 

It would be nice to see some of AlphaZero's shogi games, and some chess games played during different learning phases. We have seen before that DeepMind released a lot of data to the Go world, so it is not unlikely that we will see a bit more of AlphaZero's chess games in the future. But it will be up to the chess programmers to create something you could call StockfishZero or the like. In Go there is already a project called LeelaZero: http://zero.sjeng.org/

 

Seeing numbers like 5,000,000 games and three days of computation (for Go) is one thing, but you get a feeling for how much computation is necessary to create games and train a good NN if you try to do it yourself. And even 4 hours of computation on 5,064 TPUs is an awful lot, that's for sure.

lose-Loser

AlphaZero neither played in nor won a TCEC championship. Houdini overtook Komodo as champion, with Stockfish coming third.

HobbyPIayer
lose-Loser wrote:

Alphazero neither played in nor won a TCEC championship.

AlphaZero isn't a chess engine. It's a self-learning neural network.

You can't have AlphaZero compete fairly in the TCEC because it keeps rewriting itself, learning and becoming stronger with every game.

Chess engines don't have this ability. They're just stuck playing at their maximum strength, with no ability to improve beyond their programming.

ChastityMoon

And we ain't seen nothin' yet. Come back in 100 years. Well... aside from the fact most of us will be well into our dirtnaps... probably nothing to come back to but radioactive ruin as far as the eye could see, if there was any eye. Human eye, that is. One of AlphaZero's great-great-////-great grandkids' eyes, maybe.

admkoz
Elroch wrote:
admkoz wrote:
Elroch wrote:
admkoz wrote:

What I am curious about is whether it "figures out" things like "don't give up a free queen", or does it really just have to figure that out again every time such an option presents itself?  

 

From there its experience improves these networks and after a while it would learn that positions where there was a queen missing tended not to have as good an expected result. Well, actually it would get a general idea that more material is better[...]

I have put this crudely, but basically a big neural network learns to encapsulate concepts that can be very sophisticated[...]

So you're saying it DOES figure out that "more material is better" meaning that it can evaluate positions it has never seen before on that basis.  

 

You and I can glance at a board, see that there are no immediate threats, see that Black is up a rook, and figure Black has it in the bag, even if an actual mate is 30+ moves away. We'll be right 999,999 times out of a million. Can AlphaZero do that?

We would not be right that often.

But yes, based on my understanding of the technology, its positional evaluation network would be so good that without any explicit analysis at all it would play quite good chess. I am not sure how good it would be in this mode, but I do know it needs to do analysis to play at better than 2900 Elo (as it achieved near this level using about 1/30 of a second per move and got better as the time increased).

So what percentage of the time DO you think being up a rook in an otherwise normal position, in a game between > 1500 players, is a win? That is just a quibble.

 

OK, so AZ would do pretty well even if it was not allowed to do any further analysis.  That implies that AZ can evaluate any position, and it learned to do this solely by playing (initially) random games. 

 

I guess it may be that this is the kind of question that can't be answered in a blog post, but what I am trying to figure out is the form of that evaluation method and how it gets built.

Elroch
Lyudmil_Tsvetkov wrote:
Elroch wrote:

It used 4 of Google's tensor processing units (TPUs), which might be equivalent to about 1000 Intel cores (say 14 of the latest 72-core Xeons) for the purpose of neural network computations.

It's fair to say this is a supercomputer, although not on the scale of the really big ones used for big science.

That is right: this was a supercomputer versus a PC.

So the achievement in terms of AI was rather small; definitely much smaller than SF's.

The last slight is ridiculous for more than one reason. One of them is that Stockfish is not an AI; it is strong because its designers are experts in the design of fast, strong chess engines. The DeepMind team include no experts on chess or engines, because they didn't design one. AlphaZero did.

The computational demands of AlphaZero are entirely because the networks are large and deep. These involve sizeable matrix operations at every step of training and of application in a game. The programmers gave these networks no chess knowledge: rather this is where AlphaZero stores the knowledge about chess that it derives from its experience.
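(A rough sense of scale, with illustrative numbers: a single fully connected layer mapping 1,024 features to 1,024 features costs 1,024 × 1,024 ≈ one million multiply-adds per position evaluated, and a deep network stacks many such layers. That, rather than any chess logic, is the workload the TPUs handle.)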

Elroch
admkoz wrote:
Elroch wrote:
admkoz wrote:
Elroch wrote:
admkoz wrote:

What I am curious about is whether it "figures out" things like "don't give up a free queen", or does it really just have to figure that out again every time such an option presents itself?  

 

From there its experience improves these networks and after a while it would learn that positions where there was a queen missing tended not to have as good an expected result. Well, actually it would get a general idea that more material is better[...]

I have put this crudely, but basically a big neural network learns to encapsulate concepts that can be very sophisticated[...]

So you're saying it DOES figure out that "more material is better" meaning that it can evaluate positions it has never seen before on that basis.  

 

You and I can glance at a board, see that there are no immediate threats, see that Black is up a rook, and figure Black has it in the bag, even if an actual mate is 30+ moves away. We'll be right 999,999 times out of a million. Can AlphaZero do that?

We would not be right that often.

But yes, based on my understanding of the technology, its positional evaluation network would be so good that without any explicit analysis at all it would play quite good chess. I am not sure how good it would be in this mode, but I do know it needs to do analysis to play at better than 2900 Elo (as it achieved near this level using about 1/30 of a second per move and got better as the time increased).

So what percentage of the time DO you think being up a rook in an otherwise normal position, in a game between > 1500 players, is a win?   That is just a quibble. 

 

OK, so AZ would do pretty well even if it was not allowed to do any further analysis.  That implies that AZ can evaluate any position, and it learned to do this solely by playing (initially) random games. 

 

I guess it may be that this is the kind of question that can't be answered in a blog post, but what I am trying to figure out is the form of that evaluation method and how it gets built.

The nature of the evaluation method is quite simple. There is some representation of the position as an array of numbers which are the inputs to the neural network - the network doesn't know what they mean; it has to work this out from the way they relate to the results of games and to their values on other moves - and a large, deep neural network with thousands (not sure how many thousands) of nodes in many layers, which takes the representation of the board and outputs a number: the expected score from the position. [I hope I haven't missed some published detail.]
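For concreteness, here is a minimal sketch in Python of that kind of function: an encoded position goes in, an expected score comes out. The sizes, the 12-plane encoding and the single hidden layer are assumptions for illustration; AlphaZero's actual network is a far deeper residual net over board planes.

```python
import numpy as np

rng = np.random.default_rng(0)
IN, HID = 12 * 64, 256                     # e.g. 12 piece planes over 64 squares
W1, b1 = rng.normal(0, 0.05, (HID, IN)), np.zeros(HID)
W2, b2 = rng.normal(0, 0.05, (1, HID)), np.zeros(1)

def evaluate(x):
    """x: the position as a flat 0/1 vector of length IN. The network has
    no idea what the numbers mean; training alone gives them meaning."""
    h = np.maximum(0.0, W1 @ x + b1)       # hidden layer with ReLU
    return np.tanh(W2 @ h + b2).item()     # expected score in (-1, 1)
```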

How the evaluation method gets built comprises two parts (if my understanding is correct - I am supplementing what is published with general ideas about deep reinforcement learning). The obvious one is when a game ends: the exact value of the position is available, and that can be used to adjust the neural network to improve its evaluations of earlier positions in the direction of the right result. The second one is that when it evaluates a position, if this evaluation is a surprise compared to the evaluation of previous positions, the network is tweaked to make the evaluations of previous positions a bit more in agreement with the later evaluation.

The first form of feedback is basically making the evaluation compatible with the absolute value of clear positions. The second form of feedback is basically making the evaluation compatible with the legal continuations in a position: the reason is that the perfect evaluation of a position is the same as that of a later position reached by perfect play.
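Those two feedback signals, sketched in the same hedged spirit (the bootstrapping form is standard temporal-difference learning, supplementing what the paper states; the function names are made up):

```python
def final_result_targets(positions, result):
    """Signal 1: once a game ends, its exact value is known, so every
    earlier position is nudged toward that result (+1, 0 or -1)."""
    return [(pos, result) for pos in positions]

def bootstrap_targets(positions, evaluate):
    """Signal 2: each position is nudged toward the evaluation of the
    position that followed it, since a perfect evaluation would agree
    with the evaluation after the best continuation."""
    return [(positions[i], evaluate(positions[i + 1]))
            for i in range(len(positions) - 1)]
```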

[Technical points: (1) This adjustment takes the form of nudging every parameter in the direction that would achieve the desired change, in proportion to how strongly it does so. This is called "gradient descent". (2) It actually takes a minibatch of 4096 positions and combines the desired adjustments to make one change to the entire neural network. The reason for this is that it allows more parallel computation to speed up the process; otherwise it would be just as good to do one position at a time.]
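The bracketed points as a generic sketch (the 4096 batch size is from the post; the learning rate and names are illustrative):

```python
import numpy as np

def minibatch_sgd_step(params, grad, batch, lr=0.01):
    """One gradient-descent update: average the per-position gradients
    over the minibatch, then nudge every parameter against that average."""
    per_example = [grad(params, pos, target) for pos, target in batch]  # parallelisable
    mean_grads = [np.mean(g, axis=0) for g in zip(*per_example)]
    return [p - lr * g for p, g in zip(params, mean_grads)]
```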

[I should correct one careless thing I have said sometimes: there is actually only one neural network which takes a board as input - this provides both the expected score and an array of probabilities that each of the legal moves is best. This is better than two networks, because some parts of the network can be used for both purposes].
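And a sketch of that single two-headed network: a shared trunk, one head for the expected score and one for the move probabilities. The 8 x 8 x 73 = 4,672 move encoding is the one described in the AlphaZero preprint; the trunk size and single shared layer are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
IN, TRUNK, MOVES = 12 * 64, 256, 8 * 8 * 73    # 4,672 possible move codes
Wt = rng.normal(0, 0.05, (TRUNK, IN))          # shared trunk
Wv = rng.normal(0, 0.05, (1, TRUNK))           # value head
Wp = rng.normal(0, 0.05, (MOVES, TRUNK))       # policy head

def forward(x):
    trunk = np.maximum(0.0, Wt @ x)            # features shared by both heads
    value = np.tanh(Wv @ trunk).item()         # expected score in (-1, 1)
    logits = Wp @ trunk
    policy = np.exp(logits - logits.max())     # softmax over all move codes
    return value, policy / policy.sum()
```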

Lyudmil_Tsvetkov
Elroch wrote:
Lyudmil_Tsvetkov wrote:
Elroch wrote:

It used 4 of Google's tensor processing units (TPUs), which might be equivalent to about 1000 Intel cores (say 14 of the latest 72-core Xeons) for the purpose of neural network computations.

It's fair to say this is a supercomputer, although not on the scale of the really big ones used for big science.

That is right: this was a supercomputer versus a PC.

So the achievement in terms of AI was rather small; definitely much smaller than SF's.

The last slight is ridiculous for more than one reason. One of them is that Stockfish is not an AI; it is strong because its designers are experts in the design of fast, strong chess engines. The DeepMind team include no experts on chess or engines, because they didn't design one. AlphaZero did.

The computational demands of AlphaZero are entirely because the networks are large and deep. These involve sizeable matrix operations at every step of training and of application in a game. The programmers gave these networks no chess knowledge: rather this is where AlphaZero stores the knowledge about chess that it derives from its experience.

Here you are factually wrong.

The team includes at least 3 chess programmers. Matthew Lai, the author of Giraffe and a Talkchess member, is one of them. It is perhaps not by chance that Giraffe, following the very same approach as Alpha, is rated only around 2400 on a single core.

It is the huge hardware that made the difference and not the approach.

Self-learning, self-learning; what do you mean, self-learning and AI?

It is just tuning itself on multiple levels, while still following the primary human code, that is all.

And the primary human code included at least that winning is good and that, in case of a win, certain terms/psqt should be given bonuses, i.e. larger values.

What is working throughout the iterations is the human code, including by how much those values should be increased. Alpha is just executing the code and returning its findings.