Data Mining | Bestiary - lot9s/pathfinder-rpg-utils GitHub Wiki

The scripts in this folder produce a database of creatures from the Pathfinder RPG by scraping the Bestiary pages of http://www.d20pfsrd.com.

Others have made creature databases for the Pathfinder RPG available for download (see http://www.pathfindercommunity.net/). Hopefully, the databases produced by these scripts will be easier to mine for insights than previously published ones.

NOTE:

  • A value of -1 in an Ability Score field of a creature entry means that the creature does not have that Ability Score.
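
The -1 convention can be handled once, when an entry is loaded, so that later analysis does not mistake it for a real score. The following is a minimal sketch, assuming an entry has already been read into a Python dict. The field names (`Str`, `Con`, and so on) and the sample entry are hypothetical and may not match the columns the scripts actually produce.

```python
# Hypothetical field names; the real database columns may differ.
ABILITY_FIELDS = ("Str", "Dex", "Con", "Int", "Wis", "Cha")


def normalize_abilities(entry):
    """Return a copy of entry with -1 ability scores replaced by None."""
    cleaned = dict(entry)
    for field in ABILITY_FIELDS:
        if cleaned.get(field) == -1:
            cleaned[field] = None
    return cleaned


# Made-up sample entry: a creature with no Constitution score.
sample = {"name": "Example Undead", "Str": 12, "Dex": 14, "Con": -1,
          "Int": 6, "Wis": 10, "Cha": 13}
cleaned = normalize_abilities(sample)
print(cleaned["Con"])
```

Using `None` instead of -1 keeps the missing score out of averages and comparisons unless it is handled explicitly.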

Downloads

The easiest way to get a hold of the database is to download it in your favorite format from one of these links:

Usage

If you would like to generate the database yourself, make sure you have the following Python packages installed on your system:

  • lxml
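
One way to confirm the requirement is met before starting a long run is to try importing the package. This is a small sketch, not part of the repository; `missing_packages` is a hypothetical helper name.

```python
import importlib


def missing_packages(names):
    """Return the names from `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


# lxml is the only third-party package listed above.
print(missing_packages(["lxml"]))
```

An empty list means the import succeeded. If `lxml` is reported as missing, it can usually be installed with `pip install lxml`.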

Then run the following commands from within the data-mining/bestiary folder of the repository:

python indexer.py
python crawler.py

At the time of writing, this takes about 30 minutes to complete.

NOTE:

  • Passing -h as a command-line argument to crawler.py prints all of the script's optional functionality to the terminal (e.g. storing CR values as strings instead of real numbers).
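
If CR values are stored as strings, they can be converted back to numbers when needed. The following is a minimal sketch, assuming fractional CRs are written in the form `1/2`. That string format is an assumption and is not confirmed here, and `cr_to_float` is a hypothetical helper that is not part of the scripts.

```python
from fractions import Fraction


def cr_to_float(cr_text):
    """Convert a CR string such as '1/2' or '7' to a float."""
    # Fraction parses both whole numbers and 'numerator/denominator' text.
    return float(Fraction(cr_text.strip()))


print(cr_to_float("1/2"))  # 0.5
print(cr_to_float("7"))    # 7.0
```

Keeping CRs as strings preserves the notation used in the source pages, while converting to a number makes sorting and filtering by CR straightforward.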

Disclaimer

If you would like to run the scripts, there are a few things to keep in mind.

  • This project has only been tested on OS X Mavericks. On other operating systems, the code may behave unexpectedly.
  • This project was written using Python 2.7.11. Running the scripts with a different version of Python may also result in unexpected behavior.

Finally, it is important to remember that this code is made available as-is. No member of this project will be held liable for any damages that may result from the use of this software.