Home - SPRACE/track-ml GitHub Wiki

Welcome to the track-ml wiki!

Dataset:

Original Kaggle Dataset:

  • /data/trackMLDB/analysis/train_1
  • /data/trackMLDB/analysis/train_2
  • /data/trackMLDB/analysis/train_3
  • /data/trackMLDB/analysis/train_4
  • /data/trackMLDB/analysis/train_5

All files of each directory were joined (30 GB each file):

  • /data/trackMLDB/analysis/train_1_real
  • /data/trackMLDB/analysis/train_2_real
  • /data/trackMLDB/analysis/train_3_real
  • /data/trackMLDB/analysis/train_4_real
  • /data/trackMLDB/analysis/train_5_real

The following script perform adjustments on header:

cat train_1/*.csv > train_1_real

sed '/170,171,172,173/d' train_1_real > train_1_realv2

head -n 1 train_1_real > head

cat head train_1_realv2 > train_1_realv3

All the files with all tracks of all events of a single directory:

  • /data/trackMLDB/analysis/train_1_realv3
  • /data/trackMLDB/analysis/train_2_realv3
  • /data/trackMLDB/analysis/train_3_realv3
  • /data/trackMLDB/analysis/train_4_realv3
  • /data/trackMLDB/analysis/train_5_realv3