Archived: OpenMS workflow parameters for Q Exactive - OpenMS/OpenMS GitHub Wiki
FeatureFinderMultiplex
charge = 1-7
isotopes_per_peptide = 3:6
RT typical
although described otherwise this seems to be an upper bound. At the moment (01/2018) 90s or higher semms to work fine for our Q-Exactive data, shorter RT typical leads to feature splitting of longer features, which is problematic for feature linking of unlabeled data
RT min = 3s
ìntensity cutoff = 1000
peptide similarity = 0.7
averagine similarity =0.6
averagine similarity scaling = 0.95
only relevant if knock-out option is selected (if lower, too many singlets will be reported)
mz_tolerance = 10 ppm
When knock_out
detection is switched on, never use a tolerance below the accuracy of the machine.
knock out = true
true for double/triple labelled data, false for label-free data
missed cleavages = 3
2 or 3 for dimethylation data, 0 for label free data
IDMapper
rt_tolerance = 20 sec
(about 1/2 of feature elution time)
MF: For labelled experiments it could be benefical to increase rt_tolerance, because the controided position in the consensusXML is used
mz_tolerance = 10 ppm
MF: For labelled experiments it could be benefical to increase mz_tolerance, because the controided position in the consensusXML is used
MFA: For TMT both tolerances could be set to 0, since ID´s and Intensities are in the same spectra.
mz_reference = peptides
MFA: Using precursor led to ID´s mapped on false features
see https://github.com/OpenMS/OpenMS/issues/2468
use_centroided_mz = true
MFA: Use true in combination with mz_reference = peptides, to avoid mapping ID´s on isotopic features (Labelfree data).
use_centroided_RT = false
For consensusXML (labelled experiments) mz and RT are automatically centroided, therefore both centroided options do not have any effect.
use subelements = true
use true for labelled data and false for label-free data
annotate ids with subelement = true
use true for labelled data (when MultiplexResolver will be used) and false for label-free data
XTandemAdapter
xtandem_executable: C:\tandem-for-OpenMS\Alanine
missed_cleavages = 0
(if samples digested separately)
precursor_mass_tolerance = 10 ppm
fragment_mass_tolerance = 20 ppm
no_isotope_error = true
(allow-isotope_error: no in older versions)
MSGFplusAdapter
precursor_mass_tolerance = 10 ppm
isotope error range 0:0
when HighRes Precursor Mass Correector was used before search engine (already finds the correct position of the monoisotopic peak)
instrument = Q_exactive
add_features = TRUE
might be a good option as this is needed for downstream analysis with PercolatorAdapter
. Otherwise the search has to be done again
max_mods
for labelled data (e.g. dimethylated) no less than 3. Take into consideration the more mods are considered the longer it will take to finish
Add Output
for mzid
files (see below)
Care should be taken with IDPosteriorErrorProbability
when using MSGF+ and in an ideal setting check for the fit. Alternatively, PeptideProphet/iProphet (from the TPP) can be used (checking as well the fit) and then converting the pep.xml file to .idxml
with IDFileConverter
and filtering for an specific FDR with IDFilter
using the PeptideProphet/iProphet minimum probability needed to achieve the maximum desired FDR. (The complete ROC values are written at the beginning of the pep.xml file)
PeptideIndexer
decoy_string = dec_
we agreed to generate all decoy databases with prefix: dec_
write_protein_sequence = TRUE
necessary to obtain sequence coverage (e.g. as a column in mzTab exporter and text exporter). All protein sequences are stored as well, what increases file size and computation times. Peptide indexer can also be run again after protein inference to obtain the sequence coverage (MF)
write_protein_description = TRUE
to obtain the protein name (e.g. as a column in mzTab exporter and text exporter)(MF)
FIDO adapter
greedy group resolution = TRUE
group level= TRUE
Accuracy = STRICT (or left empty)
For big data sets searched with MSGF+ the time required to complete might be extremely long
FalseDiscoveryRate
q-value = true
in IDFilter afterwards select: score pep = 0.01
and score prot = 0.01
for publication