Beyond AlphaZero, where is neural network reinforcement learning now?

drmrboss

Sep 20, 2019

Although the AlphaZero experiments ended with chess, Go, and shogi, neural networks are now being trained on many other games, such as StarCraft and Dota.

This is one such achievement, where neural networks learn and evolve the same way a human does.

https://youtu.be/kopoLzvh5jY

redghost101

Jun 17, 2020

This is reinforcement learning. The AI isn’t actually learning; it just tries different things and gets rewarded if (in the hiders’ case) it is not seen.

redghost101

Jun 17, 2020

A neural network is much more complex

KovenFan

Jun 17, 2020

redghost101 wrote:

A neural network is much more complex

If you go through the paper the video is based on, you'll see that they used deep reinforcement learning, which basically makes use of both deep learning and reinforcement learning principles. So they actually did use a neural net.

KovenFan

Jun 17, 2020

redghost101 wrote:

This is reinforcement learning. The AI isn’t actually learning; it just tries different things and gets rewarded if (in the hiders’ case) it is not seen.

But that is learning though.
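To make that concrete, here is a minimal sketch of the "try things and get rewarded" loop. It is not the setup from the paper: the three hiding spots, their chances of being seen, and the function name `train_hider` are all made up for illustration, and a plain table of averages stands in for the neural net. The point is only that the agent's choices change as rewards come in.

```python
import random

def train_hider(seen_probability, episodes=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy trial and error over hypothetical hiding spots.

    seen_probability[i] is an assumed chance of being spotted at spot i.
    Returns the learned average reward for each spot.
    """
    rng = random.Random(seed)
    n = len(seen_probability)
    value = [0.0] * n   # running average reward per spot
    count = [0] * n     # how many times each spot was tried

    for _ in range(episodes):
        if rng.random() < epsilon:
            spot = rng.randrange(n)                       # explore: random spot
        else:
            spot = max(range(n), key=lambda i: value[i])  # exploit: best so far
        # Reward is 1 when the hider is NOT seen, 0 otherwise.
        reward = 0.0 if rng.random() < seen_probability[spot] else 1.0
        count[spot] += 1
        value[spot] += (reward - value[spot]) / count[spot]
    return value

# Made-up numbers: spot 2 is the best place to hide.
values = train_hider([0.8, 0.5, 0.1])
best_spot = max(range(len(values)), key=lambda i: values[i])
print(values, best_spot)
```

Nobody tells the agent which spot is best. It starts out knowing nothing and ends up preferring the spot where it is rarely seen, which is the sense in which trial and reward counts as learning.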

Pawned_064

Jun 17, 2020

The more advanced opponents it plays, the stronger it gets.

Therefore, AlphaZero can get, uhhh... unpredictable, since the neural network depends on the opponents it plays. If it plays a 500-rated player over and over again, it learns from each of those games. The computer's logic: I just need to beat that particular human, not the world. AlphaZero can't just play weak opponents, or else it becomes weak. The more it plays against itself, the better it gets at drawing against itself.

redghost101

Jun 17, 2020

I'm no professional, but as far as I know, reinforcement learning is when they try every possible move until they get the highest reward, then continue on to the next stage. Neural networks work out how to find the next move, making them more efficient than reinforcement learning.

Pawned_064

Jun 17, 2020

AI is a bit funny.

redghost101

Jun 17, 2020

#10

The thing is, neural networks like this don’t learn from playing 500s; they learn by playing themselves, one NN as Black and one as White. Once the training phase finishes, they challenge much higher-rated people or AIs.
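A rough sketch of what self-play looks like, with heavy simplifications that are mine and not AlphaZero's: the game is a tiny Nim variant (take 1 to 3 stones, whoever takes the last stone wins) instead of chess, a lookup table stands in for the neural network, and the update is a simple Q-learning-style rule rather than AlphaZero's search-plus-network training. The names `self_play_train` and `best_move` are hypothetical. The same table picks the moves for both sides, so it only ever learns from games against itself.

```python
import random

def legal_moves(stones):
    """A player may take 1, 2, or 3 stones, but not more than remain."""
    return [m for m in (1, 2, 3) if m <= stones]

def self_play_train(max_stones=10, games=5000, alpha=0.5, epsilon=0.3, seed=0):
    """Learn move values purely from games the table plays against itself."""
    rng = random.Random(seed)
    # q[(stones, move)] = estimated outcome for the player making that move.
    q = {(s, m): 0.0 for s in range(1, max_stones + 1) for m in legal_moves(s)}

    def best_value(stones):
        return max(q[(stones, m)] for m in legal_moves(stones))

    for _ in range(games):
        stones = rng.randint(1, max_stones)   # random starting pile
        while stones > 0:
            moves = legal_moves(stones)
            if rng.random() < epsilon:
                move = rng.choice(moves)                          # try something new
            else:
                move = max(moves, key=lambda m: q[(stones, m)])   # best known move
            remaining = stones - move
            # Taking the last stone wins (+1). Otherwise the opponent, who uses
            # the same table, moves next, so our value is minus their best value.
            target = 1.0 if remaining == 0 else -best_value(remaining)
            q[(stones, move)] += alpha * (target - q[(stones, move)])
            stones = remaining   # the other side moves next
    return q

def best_move(q, stones):
    return max(legal_moves(stones), key=lambda m: q[(stones, m)])

q_table = self_play_train()
print({s: best_move(q_table, s) for s in (5, 6, 7)})
```

In this game the winning idea is to leave the opponent a multiple of 4 stones. The table is never told that and never faces an outside opponent, yet it should end up taking 1 from 5, 2 from 6, and 3 from 7.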

redghost101

Jun 17, 2020

#11

It can take anywhere from 10 hours to 10 days to train, but once it has, it’s almost unbeatable.

Luxferre

Jun 17, 2020

#12

Good point!

redghost101

Jun 17, 2020

#13

Ta very much

redghost101

Jun 17, 2020

#14

It is not learning; this is an algorithm. A neural network is not.

KovenFan

Jun 17, 2020

#15

redghost101 wrote:

It is not learning; this is an algorithm. A neural network is not.

redghost101

Jun 17, 2020

#16

Flip it

Pawned_064

Jun 17, 2020

#17

redghost101 wrote:

It can take anywhere from 10 hours to 10 days to train, but once it has, it’s almost unbeatable.

According to common logic, playing by yourself does improve your ability, i.e., if you play against others ranked lower than you, YOUR rank drops.

redghost101

Jun 17, 2020

#18

It plays itself, each time trying a new strategy. Both sides try it, letting the NN see its weaknesses and improve over time.

Pawned_064

Jun 17, 2020

#19

redghost101 wrote:

It plays itself, each time trying a new strategy. Both sides try it, letting the NN see its weaknesses and improve over time.

hmmmmmmmmmm....

Pawned_064

Jun 17, 2020

#20

That modifies its previous strategy, right?
