run_model - PNHP/Regional_SDM GitHub Wiki
The script that runs the random forest model would have these components:
- bring in the presence points and the background points
- create an initial model to look at variable importance, then drop the least important variables (currently the lowest 25%).
- check the number of groups (polygons) and make accuracy assessment decisions based on it (stratify by how many groups?)
- tune mtry
- run a series of models (jack-knife routine) and save out the results. This is external validation.
- complete the final model using all presence points.
- calculate cutoff information, using one or more approaches, and save out the results.
- calculate partial plot information for metadata.
- save the final model for possible retrieval later.
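
Two of the steps above, dropping the lowest 25% of variables and the jack-knife routine, can be sketched in a few lines. This is only an illustration in plain Python, not the project's script. The function names, the variable names, and the importance scores are all hypothetical. It also assumes that each jack-knife run holds out one whole group (polygon) for validation, which the list above suggests but does not state.

```python
import math
from collections import defaultdict


def drop_least_important(importance, fraction=0.25):
    """Return variable names kept after dropping the lowest `fraction` by importance.

    `importance` maps variable name -> importance score (hypothetical values here).
    """
    n_drop = math.floor(len(importance) * fraction)
    ranked = sorted(importance, key=importance.get)  # ascending importance
    dropped = set(ranked[:n_drop])
    return [name for name in importance if name not in dropped]


def leave_one_group_out(group_ids):
    """Yield (held_out_group, train_indices, test_indices) once per distinct group.

    Assumption: one jack-knife run per group, with that group's points held out.
    """
    by_group = defaultdict(list)
    for idx, group in enumerate(group_ids):
        by_group[group].append(idx)
    for group, test_idx in by_group.items():
        train_idx = [i for i, g in enumerate(group_ids) if g != group]
        yield group, train_idx, test_idx


# Made-up importance scores for eight environmental variables.
importance = {
    "elev": 0.30, "slope": 0.05, "precip": 0.22, "tmax": 0.18,
    "soil_ph": 0.02, "canopy": 0.12, "dist_water": 0.08, "aspect": 0.03,
}
kept = drop_least_important(importance)

# Made-up group (polygon) label for each of six presence points.
groups = ["A", "A", "B", "C", "C", "C"]
folds = list(leave_one_group_out(groups))
```

With eight variables, a 25% cut drops the two lowest-scoring ones. With three groups, the routine produces three train/test splits, and each split would get its own model fit and validation result. Background points are not handled in this sketch.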