How to properly collect useful Pull Rate data - Pepper0ni/TTS-PTCG-Pack-Simulator GitHub Wiki

If you want to provide new data that either confirms or debunks existing pull rate data, here is how you should collect it. Be warned that doing this will probably mean watching hours of pack openings.

First, choose a set to collect data for, then check the existing spreadsheet in the repo for any hard, sourced data for that set. If some exists, you should extend it instead of making a new sheet, being careful not to duplicate data.

Then start a spreadsheet which lists every slot of interest in that set. This will usually be the rare slot, the reverse slot if there are any non-standard reverses, and maybe a random basic energy slot. Then go to YouTube or whatever other video service you think has useful pack openings, and find as many pack openings of that set as you can, being careful to avoid cases where the packs have been weighed (sorted into heavy and light packs). Openings of entire boxes are preferred, since they remove any possibility of weighed packs affecting the data.

On the spreadsheet, mark down the exact card in every slot of interest. Even if we are relatively sure that certain cards are in the same rarity bracket, there have been several incidents of apparently short-printed cards, such as the Generations Charizards. Recording exact cards also acts as easy validation of your data collection.
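As a rough illustration of why exact cards are worth recording, here is a minimal Python sketch that tallies per-card counts for one slot and flags packs that were entered twice. The row layout `(source, pack_number, slot, card)`, the function names, and all sample values are hypothetical; they are not the layout of the spreadsheet in the repo, so adapt them to whatever columns you actually use.

```python
from collections import Counter

# Hypothetical rows: (source, pack_number, slot, card).
# Every value here is made up for illustration only.
ROWS = [
    ("video_a", 1, "rare", "Card A"),
    ("video_a", 2, "rare", "Card B"),
    ("video_a", 3, "rare", "Card A"),
    ("video_b", 1, "rare", "Card C"),
    ("video_a", 1, "reverse", "Card D"),
]


def find_duplicates(rows):
    """Return (source, pack_number, slot) keys recorded more than once."""
    counts = Counter((source, pack, slot) for source, pack, slot, _card in rows)
    return sorted(key for key, n in counts.items() if n > 1)


def tally_slot(rows, slot):
    """Count how often each exact card appeared in the given slot."""
    return Counter(card for _src, _pack, row_slot, card in rows if row_slot == slot)


def observed_rates(rows, slot):
    """Convert the tally for a slot into observed fractions of packs."""
    tally = tally_slot(rows, slot)
    total = sum(tally.values())
    return {card: n / total for card, n in tally.items()} if total else {}


if __name__ == "__main__":
    print("duplicates:", find_duplicates(ROWS))
    print("rare slot tally:", tally_slot(ROWS, "rare"))
    print("rare slot rates:", observed_rates(ROWS, "rare"))
```

If one card in a bracket shows up far less often than its neighbours across a large sample, that is the kind of short print a per-rarity tally would hide.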

Once you get an appreciable sample size, you can either edit the spreadsheet on a fork of the repo and push the update as a pull request, or simply submit the sheet directly to me for processing. Note that I will need a certain minimum sample size to displace old data. This depends on the slots involved, but a minimum of two boxes is a good guideline for most rarities, while more modern ultra rares may need a case or two.
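To get a feel for why rarer slots need more packs, the sketch below computes a Wilson score interval for an observed pull rate. This is a standard statistical technique chosen here for illustration, not the method used to process submissions. The figures of 36 packs per box and 6 boxes per case are assumptions that vary by product, and the hit counts are invented.

```python
import math


def wilson_interval(hits, packs, z=1.96):
    """Wilson score interval for a proportion (z=1.96 is roughly 95%)."""
    if packs <= 0:
        raise ValueError("packs must be positive")
    p = hits / packs
    z2 = z * z
    denom = 1 + z2 / packs
    center = (p + z2 / (2 * packs)) / denom
    half = z * math.sqrt(p * (1 - p) / packs + z2 / (4 * packs * packs)) / denom
    return max(0.0, center - half), min(1.0, center + half)


# Assumed product sizes; check the actual set before relying on these.
PACKS_PER_BOX = 36
BOXES_PER_CASE = 6

if __name__ == "__main__":
    # Invented example: the same observed rate (1 hit per 12 packs)
    # seen in two boxes versus two cases.
    for label, boxes in (("2 boxes", 2), ("2 cases", 2 * BOXES_PER_CASE)):
        packs = boxes * PACKS_PER_BOX
        hits = packs // 12
        low, high = wilson_interval(hits, packs)
        print(f"{label}: {hits}/{packs} packs, interval {low:.3f} to {high:.3f}")
```

The interval narrows as the pack count grows, which is why a couple of boxes can settle a common slot while a rare hit may still be too uncertain to displace older data.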

If you are ever unsure, use the subsheets of the data spreadsheet as a guide.