CommandLinesWorkflows - kpeaton/altanalyze_wikitest GitHub Wiki

Workflow Command Line Options in AltAnalyze

Full analysis pipelines can be run in AltAnalyze from the command line (source code and binary AltAnalyze versions). Below are example command line examples and detailed descriptions of available flags. For more information and links see: http://code.google.com/p/altanalyze/wiki/CommandLineMode

Workflow Examples

Analyzing CEL files – Affymetrix 3’ array using default options and GO-Elite

python AltAnalyze.py --species Mm --arraytype "3'array" --celdir "C:/CELFiles" --groupdir "C:/CELFiles/groups.CancerCompendium.txt" --compdir "C:/CELFiles/comps.CancerCompendium.txt" --output "C:/CELFiles" --expname "CancerCompendium" --runGOElite yes --returnPathways all

Analyzing RNA-Seq (RNASeq) data – BED files using default options

python AltAnalyze.py --species Mm --platform RNASeq --bedDir "C:/BEDFiles" --groupdir "C:/BEDFiles/groups.CancerCompendium.txt" --compdir "C:/BEDFiles/comps.CancerCompendium.txt" --output "C:/BEDFiles" --expname "CancerCompendium" 

Analyzing CEL files – Exon 1.0 array using default options

python AltAnalyze.py --species Mm --arraytype exon --celdir "C:/CELFiles" --groupdir "C:/CELFiles/groups.CancerCompendium.txt" --compdir "C:/CELFiles/comps.CancerCompendium.txt" --output "C:/CELFiles" --expname "CancerCompendium"

Analyzing CEL files – Affymetrix 3’ array using default options and GO-Elite

python AltAnalyze.py --species Mm --arraytype "3'array" --celdir "C:/CELFiles" --groupdir "C:/CELFiles/groups.CancerCompendium.txt" --compdir "C:/CELFiles/comps.CancerCompendium.txt" --output "C:/CELFiles" --expname "CancerCompendium" --runGOElite yes --returnPathways all

Analyzing Filtered Expression file – RNA-Seq using custom options

python AltAnalyze.py --species Mm --platform RNASeq –-filterdir "C:/BEDFiles" --altpermutep 1 --altp 1 --altpermute yes --additionalAlgorithm none --altmethod linearregres --altscore 2 --removeIntronOnlyJunctions yes

Analyzing CEL files – Exon 1.0 array using custom options

python AltAnalyze.py --species Hs --arraytype exon --celdir "C:/CELFiles" --output "C:/CELFiles" --expname "CancerCompendium" --runGOElite no --dabgp 0.01 --rawexp 100 --avgallss yes --noxhyb yes --analyzeAllGroups "all groups" --GEcutoff 4 --probetype core --altp 0.001 --altmethod FIRMA --altscore 8 --exportnormexp yes --runMiDAS no --ASfilter yes --mirmethod "two or more" --calcNIp yes

Analyzing CEL files – HJAY array using custom options

python AltAnalyze.py --species Hs --arraytype junction --celdir "C:/CELFiles" --output "C:/CELFiles" --expname "CancerCompendium" --runGOElite no --dabgp 0.01 --rawexp 100 --avgallss yes --noxhyb yes --analyzeAllGroups "all groups" --GEcutoff 4 --probetype core --altp 0.001 –altmethod "linearregres" --altscore 8 --exportnormexp yes --runMiDAS no --ASfilter yes --mirmethod "two or more" --calcNIp yes --additionalAlgorithm FIRMA --additionalScore 8

Analyzing Expression file – Gene 1.0 array using default options, without GO-Elite

python AltAnalyze.py --species Mm --arraytype gene --expdir "C:/CELFiles/ExpressionInput/exp.CancerCompendium.txt" --groupdir "C:/CELFiles/groups.CancerCompendium.txt" --compdir "C:/CELFiles/comps.CancerCompendium.txt" --statdir "C:/CELFiles/ExpressionInput/stats.CancerCompendium.txt" --output "C:/CELFiles"

Analyzing Filtered Expression file – Exon 1.0 array using default options

python AltAnalyze.py --species Hs --arraytype exon --filterdir "C:/CELFiles/Filtered/Hs_Exon_prostate_vs_lung.p5_average.txt" --output "C:/CELFiles"

Annotate External Probe set results – Exon 1.0 array using default options

python AltAnalyze.py --species Rn --arraytype exon --annotatedir "C:/JETTA_Results/Hs_tumor_progression.txt" --output "C:/JETTA_Results" --runGOElite yes

Filter AltAnalyze results with predefined IDs using default options

python AltAnalyze.py --species Mm --arraytype gene --celdir "C:/CELFiles" --output "C:/CELFiles" --expname "CancerCompendium" --returnAll yes

Operating System Example Folder Locations

  • PC: "C:/CELFiles"

  • Mac OSX: "/root/user/admin/CELFiles

  • Linux: "/hd3/home/admin/CELFiles

When the analysis is finished, the results will be stored just as with the GUI (e.g., alternative exon results in AltResults and gene expression results in ExpressionOutput).

Primary Analysis Variables

No default value for these variables is given and must be supplied by the user if running an analysis. For example, if analyzing CEL files directly in AltAnalyze, you must include the flags --species --arraytype --celdir --expname and --output , with corresponding values. Likewise, when analyzing an existing expression file you must include the flags --species --arraytype --expdir and ---output. Most of the variable values are file or folder locations.

Universally Required Variables

--platform or --arraytype

Platform being analyzed (enter "3'array" if none of the below apply). Values include RNASeq, exon, gene, junction, AltMouse and “3'array”. This variable indicates the general array type correspond to the input CEL files or expression file. An example exon array is the Mouse Affymetrix Exon ST 1.0 array, an example gene array is the Mouse Affymetrix Gene ST 1.0 array and example 3’array is the Affymetrix Mouse 430 version 2.0 array. See the files ArrayFileInfo.txt and array.txt in AltAnalyze/Config/ for details. --species Species codes are provided for this variable (e.g., Hs, Mm, Rn). Additional species can be added through the graphic user interface. --output Required for all analyses. This designates the directory which results will be saved to.

Analysis Specific Required Variables

--expname

Required when analyzing CEL files. This provides a name for your dataset. This name must match any existing groups and comps files that already exist. The groups and comps file indicate which arrays correspond to which biological groups and which to compare. These files must exist in the designated output directory in the folder “ExpressionInput” with the names “groups.expname.txt” and “comps.expname.txt” where expname is the variable defined in this flag. Alternatively, the user can name their CEL files such that AltAnalyze can directly determine which group they are (e.g., wildtype-1.CEL, cancer-1.CEL, cancer-2.CEL). Go here for more info. --celdir --bedDir Required when analyzing CEL files. This provides the path of the CEL files to analyze. These must all be in a single folder. --expdir Required when analyzing a processed expression file. This provides the path of the expression file to analyze. --statdir Optional when analyzing a processed expression file. This provides the path of the DABG p-value file for the designated expression file to analyze (see –expdir). --filterdir Required when aalyzing an AltAnalyze filtered expression file. This provides the path of the AltAnalyze filtered expression file to analyze. --cdfdir Required when directly processing some CEL file types. This variable corresponds to the location of the Affymetrix CDF or PGF annotation file for the analyzed array. If you are analyzing an exon, gene, junction, AltMouse or 3’arrays, AltAnalyze has default internet locations for which to download these files automatically, otherwise, you must download the compressed CDF file from the Affymetrix website (support), decompress it (e.g., WinZip) and reference it’s location on your hard-drive using this flag. If you are unsure whether AltAnalyze can automatically download this file, you can try to exclude this variable and see if annotations are included in your gene expression results file. --csvdir Required when analyzing some expression files or CEL file types. This variable corresponds to the location of the Affymetrix CSV annotation file for the analyzed array. If you are analyzing an exon, gene, junction, AltMouse or 3’arrays, AltAnalyze has default internet locations for which to download these files automatically, otherwise, you must download the compressed CSV file from the Affymetrix website (support), decompress it (e.g., WinZip) and reference it’s location on your hard-drive using this flag. If you are unsure whether AltAnalyze can automatically download this file, you can try to exclude this variable and see if annotations are included in your gene expression results file. --annotatedir Required when annotating a list regulated probe sets produced outside of AltAnalyze. This variable corresponds to the location of the directory containing one or more probe set files. These files can be in the standard JETTA export format, or otherwise need to have probe set IDs in the first column. Optionally, these files can have an associated fold change and p-value (2nd and 3rd columns), which will be reported in the results file. --groupdir Location of an existing group file to be copied to the directory in which the expression file is located or will be saved to. --compdir Location of an existing comps file to be copied to the directory in which the expression file is located or will be saved to.

Optional Analysis Variables

These variables are set as to default values when not selected. The default values are provided in the configurations text file in the Config directory of AltAnalyze (default-.txt) and can be changed by editing in a spreadsheet program.

GO-Elite Analysis Variables

AltAnalyze can optionally subject differentially or alternatively expressed genes (AltAnalyze and user determined) to an over-representation analysis (ORA) along Gene Ontology (GO) and pathways (WikiPathways) using the program GO-Elite. GO-Elite is seamlessly integrated with AltAnalyze and thus can be run using default parameters either the graphic user interface or command line. To run GO-Elite using default parameters in command line mode, include the first flag below with the option yes. --runGOElite

Default is no. Used to indicate whether to run GO-Elite analysis following AltAnalyze. Indicating yes would prompt GO-Elite to run. --mod Default is Ensembl. Primary gene system for Gene Ontology (GO) and Pathway analysis to link Affymetrix probe sets and other output IDs to. Alternative values are EntrezGene. --elitepermut

Default is 2000. Number of permutation used by GO-Elite to calculate an over-representation p-value. When set to FisherExactTest, Fisher Exact test p-values will be calculated instead. --method

Default is z-score. Sorting method used by GO-Elite to compare and select the top score of related GO terms. Alternative values are “gene number” and combined --zscore Default is 1.96. Z-score threshold used following over-representation analysis (ORA) for reported top scoring GO terms and pathways. --elitepval Default is 0.05. Permutation p-value threshold used ORA analysis for reported top scoring GO terms and pathways. --dataToAnalyze Default is both. Indicates whether to perform ORA analysis on pathways, Gene Ontology terms or both. Alternative values are Pathways or Gene Ontology --num Default is 3. The minimum number of genes regulated in the input gene list for a GO term or pathway after ORA, required for GO-Elite reporting. --GEelitepval Default is 0.05. The minimum t-test p-value threshold for differentially expressed genes required for analysis by GO-Elite. --GEeliteptype Default is rawp. Indicates whether to run rawp or adjp (Benjamini-Hochberg) p-value for filtering. --GEelitefold Default is 2. The minimum fold change threshold for differentially expressed genes required for analysis by GO-Elite. Applied to any group comparisons designated by the user. --ORAstat Default is Fisher Exact Test. When the value is set to “Permute p-value”, a permutation p-value will be calculated using the default of provided number of permutations (--elitepermut). The adjusted p-value will be calculated from the selected type of ORAstat. --additional Default is None. When the value is set to one of a valid resource or “all”, GO-Elite will download and incorporate that resource along with the default downloaded (WikiPathways and Gene Ontology). Additional resources include: “miRNA Targets”, ”GOSlim”, ”Disease Ontology”, ”Phenotype Ontology”, ”KEGG”, “Latest WikiPathways”, ”PathwayCommons”, ”Transcription Factor Targets”, ”Domains” and ”BioMarkers” (include quotes). --denom Default is None. This is the folder location containing denominator IDs for corresponding input ID list(s). This variable is only supplied to AltAnalyze when independently using the GO-Elite function to analyze a directory of input IDs (--input) and a corresponding denominator ID list. --returnPathways Default is None. When set equal to “yes” or “all”, will return all WikiPathways as colored PNG or PDF files (by default both) based on the input ID file data and over-representation results. Default value is “None”. When equal to “top5”, GO-Elite will only produce the top 5 (or other user entered number – e.g., top10) ranking WikiPathways.

AltAnalyze Expression Filtering and Summarization

These variables are used to determine the format of the expression data being read into AltAnalyze, the output formats for the resulting gene expression data and filtering thresholds for expression values prior to alternative exon analysis. Since AltAnalyze can process both convention (3’array) as well as RNA-Seq data and splicing arrays (exon, gene, junction or AltMouse), different options are available based on the specific platform.

Universal Array Analysis Variables

--logexp

Default is log for arrays and non-log for RNA-Seq. This is the format of the input expression data. If analyzing CEL files in AltAnalyze or in running RMA or GCRMA from another application, the output format of the expression data is log 2 intensity values. If analyzing MAS5 expression data, this is non-log. --inclraw Default is yes. When the value of this variable is no, all columns that contain the expression intensities for individual arrays are excluded from the results file. The remaining columns are calculated statistics (groups and comparison) and annotations. --vendor Default is Affymetrix. This variable can be set to "Other ID" when analyzing data from proteomics, metabolomics or other data not explicitely listed in the vendor/data-type menu. When entering "Other ID", also include the --platform as the specific ID system (e.g., PubChem).

RNASeq, Exon, Gene, Junction or AltMouse Platform Specific Variables

--dabgp

Default is 0.05. This p-value corresponds to the detection above background (DABG) value reported in the “stats.” file from AltAnalyze, generated along with RMA expression values. A mean p-value for each probe set for each of the compared biological groups with a value less than this threshold will be excluded, both biological groups don’t meet this threshold for a non-constitutive probe set or if one biological group does not meet this threshold for constitutive probe sets. --rawexp Default is 0 for microarrays and 1 for RNASeq reads. More stringent examples for Affymetrix arrays are 70 and 3 for RNASeq (RPKM) . For Affymetrix arrays, this value is the non-log RMA average intensity threshold for a biological group required for inclusion of a probe set. The same rules as the --dabgp apply to this threshold accept that values below this threshold are excluded when the above rules are not met. --avgallss Default is no. For RNA-Seq analyses, default is yes. Indicating yes, will force AltAnalyze to use all exon aligning probe sets or RNA-Seq reads rather than only features that align to predicted constitutive exons for gene expression determination. This option applies to both the gene expression export file and to the alternative exon analyses. --runalt Default is yes. Designating no for this variable will instruct AltAnalyze to only run the gene expression analysis portion of the program, but not the alternative exon analysis portion. --groupStat Default is moderated t-test. To designate an alternate statistic, this variable can be set to one of the following options are paired t-test, Kolmogorov Smirnov, Mann Whitney U, Rank Sums (these variable should be placed in quotes).

AltAnalyze Alternative Exon Statistics, Filtering and Summarization

Universal Array Analysis Variables

--altmethod

Default is splicing-index (exon and gene) and ASPIRE (RNASeq, junction and AltMouse). For exon, gene and junction arrays, the option FIRMA is also available and for RNASeq, junction and AltMouse platforms the option linearregres is available. --altp Default is 0.05. This variable is the p-value threshold for reporting alternative exons. This variable applies to both the MiDAS and splicing-index p-values. --probetype Default is core (exon and gene) and all (junction and AltMouse). This is the class of probe sets to be examined by the alternative exon analysis. Other options include: extended and full (exon and gene) and “exons-only”, “junctions-only”, “combined-junctions” (RNASeq, junction and AltMouse). --altscore Default is 2 (splicing-index) and 0.2 (ASPIRE).This is the corresponding threshold for the default algorithms listed under {{{--altmethod. --GEcutoff Default is 3. This value is the non-log gene expression threshold applied to the change in gene expression (fold) between the two compared biological groups. If a fold change for a gene is greater than this threshold it is not reported among the results, since gene expression regulation may interfere with detection of alternative splicing. --analyzeAllGroups Default is pairwise. This variable indicates whether to only perform pairwise alternative exon analyses (between two groups) or to analyze all groups, without specifying specific comparisons. Other options are “all groups” and both. --altpermutep Default is 0.05. This is the permutation p-value threshold applied to AltMouse array analyses when generating permutation based alternative exon p-values. Alternative exon p-values can be applied to either ASPIRE or linregress analyses. --altpermute Default is yes. This option directs AltAnalyze to perform the alternative exon p-value analysis for the AltMouse array (see –altpermutep). --exportnormexp Default is no. This option directs AltAnalyze to export the normalized intensity expression values (feature expression/constitutive expression) for all analyzed features (probe sets or RNA-Seq reads) rather than perform the typical AltAnalyze analysis when its value is yes. For junction-sensitive platforms, rather than exporting the normalized intensities, the ratio of normalized intensities for the two reciprocal-junctions are exported (pNI1/pNI2). This step can be useful for analysis of exon array data outside of AltAnalyze and comparison of alternative exon profiles for many biological groups (e.g., expression clustering). --runMiDAS Default is yes. This option directs AltAnalyze to calculate and filter alternative exon results based on the MiDAS p-value calculated using the program Affymetrix Power Tools. --calcNIp Default is yes. This option directs AltAnalyze to filter alternative exon results based on the t-test p-value obtained by comparing either the normalized intensities for the array groups examined (e.g., control and experimental) (splicing-index) or a t-test p-value obtained by comparing the FIRMA scores for the arrays in the two compared groups. --mirmethod Default is one. This option directs AltAnalyze to return any microRNA binding site predictions (default) or those that are substantiated by multiple databases (two or more). --ASfilter Default is no. This option directs AltAnalyze to only analyze probe sets or RNA-Seq reads for alternative expression that have an alternative-splicing annotation (e.g., mutually-exclusive, trans-splicing, cassette-exon, alt-5’, alt-3’, intron-retention), when set equal to yes. --returnAll Default is no. When set to yes, returns all un-filtered alternative exon results by setting all associated filtering parameters to the lowest stringency values. This is equivalent to providing the following flags: --dabgp 1 --rawexp 1 --altp 1 --probetype full --altscore 1 --GEcutoff 10000 . Since this option will output all alternative exon scores for all Ensembl annotated RNA-Seq reads or probe sets, the results file will be exceptionally large (>500,000 lines), unless the user has saved previously run alternative exon results (e.g., MADS) to the directory “AltDatabase/filtering” in the AltAnalyze program directory, with a name that matches the analyzed comparison. For example, if the user has a list of 2,000 MADS regulated probe sets for cortex versus cerebellum, then the MADS results should be saved to “AltDatabase/filtering” with the name “Cortex_vs_Cerebellum.txt” and in AltAnalyze the CEL file groups should be named Cortex and Cerebellum and the comparison should be Cortex versus Cerebellum. When the filename for a file in the “filtering” directory is contained within the comparison filename (ignoring “.txt”), only these AltAnalyze IDs or probe sets will be selected when exporting the results. This analysis will produce a results file with all AltAnalyze statistics (default or custom) for just the selected features, independent of the value of each statistic. --additionalAlgorithm Default is splicing-index. For Affymetrix arrays, setting this flag equal to FIRMA changes the individual probe set analysis algorithm from splicing-index to FIRMA for junction arrays. This method is applied to RNA-Seq data and junction arrays following reciprocal-junction analysis (e.g., ASPIRE) in a second run. To exclude this feature, set variable equal to none. --additionalScore Default is 2. Setting this flag equal to another numeric value (range 1 to infinity) changes the non-log fold change for the additional_algorithms. --removeIntronOnlyJunctions Default is no. Indicates whether to remove junctions where both splice sites align to outside of annotated exons. Setting this value to yes will remove these putative junctions prior to analysis. --normCounts Default is RPKM. Indicates whether to normalize exon and/or junctions counts using the methods RPKM. Setting this value to none will use the original counts for gene expression and alternative exon analyses. Setting this variable to none, will directly analyze counts with no normalization. --buildExonExportFile Default is no. Indicates whether to halt an RNA-Seq analysis after reading in the junction.bed files and export a new file with the suffix exon.bed in the folder BAMtoBED. This file is used as described here to obtain exon.bed files for all experimental BAM files. Setting this value to yes will use the original counts for gene expression and alternative exon analyses. --groupStat Default is “moderated t-test”. Indicates the algorithm to employ for all pairwise group comparisons in AltAnalyze. Other options include: “paired t-test”, “Kolmogorov Smirnov”, “Mann Whitney U” and “Rank Sums”. --exonRPKM Default is 0.5. Numerical filter used to threshold exons as expressed when using RPKM normalization for mean RPKM values for each biological group. At least one biological group for a pairwise comparison must meet this threshold to be included in further analyses. Increasing values of this filter will increase the stringency of exon/gene expression. --geneRPKM Default is 1. Numerical filter used to threshold exons as expressed when using RPKM normalization for mean RPKM values for each biological group. At least one biological group for a pairwise comparison must meet this threshold to be included in further analyses. Increasing values of this filter will increase the stringency of exon/gene expression. --exonExp Default is 10. Numerical filter used to threshold genes or exons as expressed based on the mean read counts values for each biological group. At least one biological group for a pairwise comparison must meet this threshold to be included in further analyses. Increasing values of this filter will increase the stringency of exon/gene expression.

AltAnalyze Database Updates

Universal Array Analysis Variables

--update

Default is empty. Setting this flag equal to Official, without specifying a version, will download the most up-to-date database for that species. Other options here are used internally by AltAnalyze.org for building each new database. See the method “commandLineRun” in AltAnalyze.py for more details. --version Default is current. Setting this flag equal to a specific Ensembl version name (e.g. EnsMart49) supported by AltAnalyze will download that specific version for the selected species, while setting this to current will download the current version. --specificArray Default is none. Indicates a sub-type of a particular array platform (e.g., junction – HJAY or hGlue) when building the database. This variable only needs to be set when currently building the hGlue junction database (see BuildingDatabases). --ignoreBuiltSpecies Default is none. Indicating yes will only build species databases for species without already built species directories. This is used during internal database release building to simultaneously build multiple species databases.

Additional Analysis, Quality Control and Visualization Options

In addition to the core AltAnalyze workflows (e.g., normalization, gene expression summarization, evaluation of alternative splicing), additional options are available to evaluate the quality of the input data (quality control or QC), evaluate associated cell types and tissues present in each biological sample (Lineage Profiler), cluster samples or genes based on overall similarity (expression clustering) and view regulation data on biological pathways (WikiPathways). These options can be run as apart of the above workflows or often independently using existing AltAnalyze results or input from other programs.

--outputQCPlots

Default is yes. Instructs AltAnalyze to calculate various QC measures specific to the data type analyzed (e.g., exon array, RNASeq) when a core workflow is run. This will include hierarchical clustering and PCA plots for genes considered to be differentially expressed (see --GEelitefold and --GEelitepval). Outputs various QC output plots (PNG and PDF) to the folder “DataPlots” in the user defined output directory. If run from python source code, requires installation of Scipy, Numpy and Matplotlib. --runLineageProfiler Default is yes. Instructs AltAnalyze to calculate Pearson correlation coefficients for each analyzed user sample relative to all cell types and tissues in the BioMarker database. Resulting z-scores for calculated from the coefficients are automatically visualized on a WikiPathways Lineage network and are hierarchically clustered. Outputs various tables to the folder ExpressionOutput and plots (PNG and PDF) to the folder “DataPlots” in the user defined output directory. Can be run as apart of an existing workflow or independently with the option --input. If run from python source code, requires installation of suds, Scipy, Numpy and Matplotlib. --input Default is None. Including this option indicates that the user is referencing an expression file location that is supplied outside of a normal AltAnalyze workflow. Example analyses include only performing hierarchical clustering, PCA or GO-Elite. --image Default is None. Including this option with any of the variables: WikiPathways, hierarchical, PCA, along with an input expression file location (--input), will prompt creation and export of the associated visualization files. These analyses are outside of the typical AltAnalyze workflows, only requiring the designated input file. If run from python source code, requires installation of suds, Scipy, Numpy and Matplotlib.

Hierarchical Clustering Variables

--row_method

Default is average. Indicates the cluster metric to be applied to rows. Other options include: average, single, complete, weighted and None. None will result in no row clustering. --column_method Default is single. Indicates the cluster metric to be applied to columns. These options are the same as row_method. --row_metric Default is “row_metric”, cosine. Indicates the cluster distance metric to be applied to rows. Other options include: braycurtis, canberra, chebyshev, cityblock, correlation, cosine, dice, euclidean, hamming, jaccard, kulsinski, mahalanobis, matching, minkowski, rogerstanimoto, russellrao, seuclidean, sokalmichener, sokalsneath, sqeuclidean and yule (not all may work). If the input metric fails during the analysis (unknown issue with Numpy), euclidean will be used instead. --column_metric Default is euclidean. Indicates the cluster distance metric to be applied to rows. These options are the same as row_metric. --color_gradient Default is red_white_blue. Indicates the color gradient to be used for visualization as up-null-down. Other options include: red_black_sky, red_black_blue, red_black_green, yellow_black_blue, green_white_purple, coolwarm and seismic. --transpose Default is False. Will transpose the matrix of columns and rows prior to clustering, when set to True.

--compendiumPlatform --compendiumType --exonMapFile --geneExp --labels --contrast --plotType --runMarkerFinder --update_interactions --includeExpIDs --degrees --genes --inputType --interactionDirs --GeneSetSelection --PathwaySelection --OntologyID --dataType --combat --channelToExtract --showIntrons --display --join --uniqueOnly --accessoryAnalysis --inputIDType --outputIDType --FEdir --channelToExtract --AltResultsDir --geneFileDir --modelSize --geneModel --reference

python AltAnalyze.py --expdir /MyExperiment/ExpressionInput/exp.tumors.txt --update markers --species Hs --platform RNASeq --genesToReport 200