chess pgn database over 9 million games!

Sort:
GM_Alphazer0

I have put together a pgn database of 9,000,000+ games
for the pass 2 months!

Download 

use this week in chess

https://theweekinchess.com/twic 

to update the database

archives 920-1370 are in the database in already in the database.

use a pgn reader like scid

https://sourceforge.net/projects/scid/files/latest/download?source=files 

to use the database

helping me collect would be very nice

Thanks for reading.

TheMachinist87

Thx for this collection. 

locoturbo

Thanks, maybe I'll try to tap into it, if I ever bother making my chess engine. Curious though why ~956 bytes would be needed per game. If games averaged 60 moves this would be almost 16 bytes per move, seems like you could store each move in 3-4 bytes, depending...

TheMachinist87

@MrLoco76 You are right, the download is big. After imported PGN with Scid App on Android, the database is only 2,5 GB big. Anyway, the download is it worth. 

KevinOSh

Is is possible to put the file into a zip? It is 8.6 Gigabyte download

Wildfury

Awesome. 

Wildfury

Not sure if something is wrong. Downloaded, 9gb. only 688K games and just up to 1987

 

Ziryab

I have all of TWIC and ChessBase Mega.

Chess Informant remains vastly more useful, as does PowerBook. I want fewer, not more games. Larger databases means more junk. I don’t want my opening study guided by eight-year old sub-1200s, even if they may someday be world chaampions.

Ian_Rastall
Wildfury wrote:

Not sure if something is wrong. Downloaded, 9gb. only 688K games and just up to 1987

 

 

What did you use to open the PGN, and did you let the file finish downloading? I've grabbed this before. It's too large for anything other than ChessBase. If you don't have CB 16, just get CB Reader 2017, which you can find here: https://en.chessbase.com/post/chessbase-reader-2017

Open the PGN, then right-click and choose to convert to CBH. If you want to work with it in something else, try breaking it into groups of a million.

EscherehcsE

Yeah, I'd be concerned about how many junky games are in the database, and how many duplicates there might be. Unfortunately, the OP didn't say where these games came from.

I tried to open the pgn file with Scid vs PC, but it crashed after about 9 minutes. However, when I first converted the pgn file to scid format with the "pgnscid" utility, everything worked fine. However, there *were* 1,344 errors or warnings in the ".err" file. I didn't look at the ".err" file, but that sounds like too many warnings to me.

Ziryab

I frequently get thousands of errors in ChessBase. They are classification errors—such as endgame classification—that do not affect the integrity of the games. When I import large batches of games into a large database, these errors are common. It is one reason I never update MegaDatabase. Instead, I use TWIC for recent games. That database is still huge, but no annotations get lost if I have to start fresh. 

A small donation to Mark Crowther got a clean database and the promise of support if I needed it later.

AndreaMori

Tried a few times to download, but never completes.

OpeningMaster

Guys, remember quantity is not quality. As mentioned above you don't want sub 1200 kid teaching you how to do the opening or low rated machine, it will spoil your statistics. Try to grab the real and strong OTB chess database. This month Opening Master announced 50% Christmas discount

just sayin

Cheers

Alexander

OpeningMaster
GM_Alphazer0 wrote:

I have put together a pgn database of 9,000,000+ games
for the pass 2 months!

Download 

use this week in chess

https://theweekinchess.com/twic 

to update the database

archives 920-1370 are in the database in already in the database.

use a pgn reader like scid

https://sourceforge.net/projects/scid/files/latest/download?source=files 

to use the database

helping me collect would be very nice

 

Thanks for reading.

 

GM_Alphazero what was your source? You cannot just download somebody’s intellectual property right (database collection) and offer it free! The argument chess games are free is invalid. The chess games yes, but the collection such as 9 million games is sombodys 10 years of work. Be careful on lawsuits. 

Iťs the same like hey guys I just downloaded Chessbase 17, here is the file, please use it. 

I think you would have some German lawyers chasing you soon…

 

Chpok_org

9072714 games.

40409313 checks, 779745 promotions. 368935615 pairs of moves => 40.66 per game, on average.

369844 games were skipped, mostly because the script is very simple, and does not support embedded comments.

Cozmiccle

How do I open this?? File is too large for notepad++ or notepad

Ziryab
CozmoChess30 wrote:

How do I open this?? File is too large for notepad++ or notepad

You need database software.

ChessBase (a few Franklins)

SCID vs PC (free)

I think there are others.

I’ve been using ChessBase for twenty years. It’s a great tool for chess study.

emehri

The uploader should have compressed this file using 7z which reduces simple text files (including PGN) by 5 times.

emehri

Here I uploaded a PGN containg 442000 games of GMs. I regret that some of them are duplicated.

https://www.mediafire.com/file/gqx8b0xaixl4hwe/Games+Of+GMs.7z/file

Ziryab
emehri wrote:

Here I uploaded a PGN containg 442000 games of GMs. I regret that some of them are duplicated.

https://www.mediafire.com/file/gqx8b0xaixl4hwe/Games+Of+GMs.7z/file

Why bother when it is an easy matter to both get more games and to remove duplicates? One suspects a nefarious link.