evaluate_family_classifier.py - cmikke97/Automatic-Malware-Signature-Generation GitHub Wiki
- import configparser - implements a basic configuration language for Python programs (configparser documentation)
- import json - JSON encoder and decoder (json documentation)
- import os - provides a portable way of using operating-system-dependent functionality (os documentation)
- import sys - system-specific parameters and functions (sys documentation)
- import tempfile - used to create temporary files and directories (tempfile documentation)
- import time - provides various time-related functions (time documentation)
- from copy import deepcopy - creates a new object and recursively copies the elements of the original object (copy documentation)
- import baker - easy, powerful access to Python functions from the command line (baker documentation)
- import mlflow - open source platform for managing the end-to-end machine learning lifecycle (mlflow documentation)
- import numpy as np - the fundamental package for scientific computing with Python (numpy documentation)
- import pandas as pd - flexible and easy-to-use open source data analysis and manipulation tool (pandas documentation)
- import psutil - used for retrieving information on running processes and system utilization (psutil documentation)
- import torch - tensor library like NumPy, with strong GPU support (pytorch documentation)
- from logzero import logger - robust and effective logging for Python (logzero documentation)
- from nets.Family_Classifier_net import Net as Family_Net
- from nets.generators.fresh_generators import get_generator
evaluate_network(fresh_ds_path, checkpoint_path, training_run, train_split_proportion, valid_split_proportion, test_split_proportion, batch_size, random_seed, workers) (function, baker command) - Evaluate the model on the family prediction task.
- fresh_ds_path (arg) - Path of the directory containing the fresh dataset (the .dat files)
- checkpoint_path (arg) - Path to the model checkpoint to load
- training_run (arg) - Training run identifier (default: 0)
- train_split_proportion (arg) - Train subsplit proportion value (default: 7)
- valid_split_proportion (arg) - Validation subsplit proportion value (default: 1)
- test_split_proportion (arg) - Test subsplit proportion value (default: 2)
- batch_size (arg) - How many samples per batch to load (default: 250)
- random_seed (arg) - If provided, seed random number generation with this value (default: None, no seeding)
- workers (arg) - How many workers (threads) the dataloader uses (default: 0, meaning use multiprocessing.cpu_count())
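The split proportions are relative weights rather than fractions, so the defaults 7, 1 and 2 correspond to 70%, 10% and 20% of the samples. The sketch below illustrates that arithmetic and the documented meaning of workers = 0. The helper names split_sizes and resolve_workers are hypothetical, and the rounding rule (remainder goes to the test split) is an assumption; the script itself may compute the subsplits differently.

```python
import multiprocessing


def split_sizes(n_samples, train=7, valid=1, test=2):
    """Turn relative proportions into absolute subsplit sizes (hypothetical helper)."""
    total = train + valid + test
    n_train = n_samples * train // total
    n_valid = n_samples * valid // total
    # Assumption: any rounding remainder is assigned to the test split,
    # so the three sizes always sum to n_samples.
    n_test = n_samples - n_train - n_valid
    return n_train, n_valid, n_test


def resolve_workers(workers=0):
    """Map the documented default (0) to the number of available CPUs."""
    return workers if workers > 0 else multiprocessing.cpu_count()


print(split_sizes(1000))  # (700, 100, 200)
print(resolve_workers(4))  # 4
```

With the default proportions, a fresh dataset of 1000 samples would be divided into 700 training, 100 validation and 200 test samples under these assumptions.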
__main__ (main) - Starts baker so that the script can be run from the command line, with function names and parameters exposed as the command-line interface through optparse-style options
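To illustrate what exposing a function as a command with optparse-style options looks like, here is a minimal sketch. It deliberately uses the standard-library argparse module as a stand-in for baker, covers only a subset of the parameters, and replaces the function body with a placeholder; the paths passed in are hypothetical.

```python
import argparse


def evaluate_network(fresh_ds_path, checkpoint_path, training_run=0, batch_size=250):
    # Placeholder body: the real command loads the checkpoint and evaluates it.
    return {
        "fresh_ds_path": fresh_ds_path,
        "checkpoint_path": checkpoint_path,
        "training_run": training_run,
        "batch_size": batch_size,
    }


def main(argv=None):
    # argparse stands in for baker here; the function name becomes the command
    # and each parameter becomes a --option.
    parser = argparse.ArgumentParser(prog="evaluate_family_classifier.py")
    sub = parser.add_subparsers(dest="command", required=True)
    cmd = sub.add_parser("evaluate_network")
    cmd.add_argument("--fresh_ds_path", required=True)
    cmd.add_argument("--checkpoint_path", required=True)
    cmd.add_argument("--training_run", type=int, default=0)
    cmd.add_argument("--batch_size", type=int, default=250)
    args = vars(parser.parse_args(argv))
    args.pop("command")
    return evaluate_network(**args)


# Hypothetical paths, passed explicitly instead of reading sys.argv.
result = main([
    "evaluate_network",
    "--fresh_ds_path", "fresh_dataset",
    "--checkpoint_path", "model.pt",
])
print(result["batch_size"])  # 250
```

In the actual script, baker derives the command and its options from the function signature, so no explicit parser definition like the one above is needed.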