evaluate_family_classifier.py - cmikke97/Automatic-Malware-Signature-Generation GitHub Wiki

In this page

Imported Modules


  • import baker - easy, powerful access to Python functions from the command line - baker documentation
  • import mlflow - open source platform for managing the end-to-end machine learning lifecycle - mlflow documentation
  • import numpy as np - the fundamental package for scientific computing with Python - numpy documentation
  • import pandas as pd - pandas is a flexible and easy to use open source data analysis and manipulation tool - pandas documentation
  • import psutil - used for retrieving information on running processes and system utilization - psutil documentation
  • import torch - tensor library like NumPy, with strong GPU support - pytorch documentation
  • from logzero import logger - robust and effective logging for Python - logzero documentation

  • from nets.Family_Classifier_net import Net as Family_Net
  • from nets.generators.fresh_generators import get_generator

Back to top

Classes and functions

evaluate_network(fresh_ds_path, checkpoint_path, training_run, train_split_proportion, valid_split_proportion, test_split_proportion, batch_size, random_seed, workers) (function, baker command) - Evaluate the model on the family prediction task.

  • fresh_ds_path (arg) - Path of the directory where to find the fresh dataset (containing .dat files)
  • checkpoint_path (arg) - Path to the model checkpoint to load
  • training_run (arg) - Training run identifier (default: 0)
  • train_split_proportion (arg) - Train subsplit proportion value (default: 7)
  • valid_split_proportion (arg) - Validation subsplit proportion value (default: 1)
  • test_split_proportion (arg) - Test subsplit proportion value (default: 2)
  • batch_size (arg) - How many samples per batch to load (default: 250)
  • random_seed (arg) - If provided, seed random number generation with this value (default: None, no seeding)
  • workers (arg) - How many worker (threads) the dataloader uses (default: 0 -> use multiprocessing.cpu_count())

__main__ (main) - Start baker in order to make it possible to run the script and use function names and parameters as the command line interface, using optparse-style options


Back to top

⚠️ **GitHub.com Fallback** ⚠️