Data Samples Preparation - PanDAWMS/InVEx GitHub Wiki
Each data sample uploaded on server and stored in RAM on server-side must be preprocessed and prepared to be transferred to clients.
Data Preparation is implemented in /core/form_reactions.py
.
Procedures csv_file_from_server
and new_csv_file_upload
provide the following actions:
- open data file
- read file
- clean data file from NaNs
- split data file into numeric data (numeric_dataset) and auxiliary data (auxiliary_dataset). Numeric data will be used in clusterization and auxiliary (not numeric) data - will be stored separately and used for data categorization.
- numeric data is normalized - norm_dataset
- calculated statistics of the normalized dataset
- all these datasets are saved in history file with the unique ID