Baseball Databank is a compilation of historical baseball data in a convenient, tidy format, distributed under Open Data terms.

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. For details see: http://creativecommons.org/licenses/by-sa/3.0/

Organisation of the files
-------------------------

There are two directories in the repository.

* 'core' contains the databank itself. If you are a user of the data, these are the files you need.
* 'upstream' contains files used to construct the databank.

Most of the data in the Databank is provided by Chadwick Baseball Bureau (http://www.chadwick-bureau.com). The data differ from the data the Bureau provides to its clients in that they contain less detail, are updated less frequently, and are provided on an as-is basis.

Other sources
-------------

The Databank is historically based in part on the Lahman Baseball Database, version 2015-01-24, which is Copyright (C) 1996-2015 by Sean Lahman.

The tables Parks.csv and HomeGames.csv are based on the game logs and park code table published by Retrosheet. This information is available free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at http://www.retrosheet.org.

Queries and suggested revisions
-------------------------------

Queries and suggested revisions to the data can be posted in the issue tracker at https://github.com/chadwickbureau/baseballdatabank/issues.

Files in 'core' are all generated by scripts. As such they are not edited manually (and therefore pull requests should not be submitted against these files).

Files in 'upstream' are manually-maintained files which contain information specific to constructing the Databank. As they are maintained manually, it is valid to submit pull requests containing corrections or additions to these files.

Data which do not originate from the 'upstream' files are maintained by Chadwick Baseball Bureau.
While enquiries regarding these data are welcomed, remember that these data are updated with some lag, and therefore may differ from data which appear elsewhere on the Internet or other sources.
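
As noted under "Organisation of the files", users of the data need only the files in 'core'. The following is a minimal sketch of reading one of those tables with Python's standard library. It assumes a local clone of the repository, that the script is run from the repository root, that Parks.csv sits directly under 'core', and that the files are UTF-8 encoded; none of these details is stated above. The helper name `read_table` is hypothetical, and no column names are assumed: the sketch simply reports whatever header the file has.

```python
import csv
from pathlib import Path


def read_table(path):
    """Return (column_names, rows) for one CSV file.

    Each row is a dict keyed by the column names found in the header line.
    The UTF-8 encoding is an assumption, not something the Databank documents.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        rows = list(reader)
        return reader.fieldnames, rows


if __name__ == "__main__":
    # Assumed location: run from the root of a local clone of the repository.
    table = Path("core") / "Parks.csv"
    if table.exists():
        columns, rows = read_table(table)
        print(f"{table}: {len(rows)} rows, columns: {columns}")
    else:
        print(f"{table} not found; adjust the path to your local clone")
```

Because the files in 'core' are generated by scripts, a reader like this should treat them as read-only input; corrections belong in the issue tracker or in 'upstream', as described above.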