Home - SPRACE/track-ml GitHub Wiki
Welcome to the track-ml wiki!
Dataset:
Original Kaggle Dataset:
- /data/trackMLDB/analysis/train_1
- /data/trackMLDB/analysis/train_2
- /data/trackMLDB/analysis/train_3
- /data/trackMLDB/analysis/train_4
- /data/trackMLDB/analysis/train_5
All files of each directory were joined (30 GB each file):
- /data/trackMLDB/analysis/train_1_real
- /data/trackMLDB/analysis/train_2_real
- /data/trackMLDB/analysis/train_3_real
- /data/trackMLDB/analysis/train_4_real
- /data/trackMLDB/analysis/train_5_real
The following script perform adjustments on header:
cat train_1/*.csv > train_1_real
sed '/170,171,172,173/d' train_1_real > train_1_realv2
head -n 1 train_1_real > head
cat head train_1_realv2 > train_1_realv3
All the files with all tracks of all events of a single directory:
- /data/trackMLDB/analysis/train_1_realv3
- /data/trackMLDB/analysis/train_2_realv3
- /data/trackMLDB/analysis/train_3_realv3
- /data/trackMLDB/analysis/train_4_realv3
- /data/trackMLDB/analysis/train_5_realv3