Retreats Fall2010 - OpenMS/OpenMS GitHub Wiki

Retreat Kühlungsborn 09/2010

Detailed agenda

Monday

  • 11:00-15:30: Arrival, setup (ALL) and shopping (Berlin group)
  • 15:30-18:00: Individual talks: What is my research subject? What do I intend to do in the next 6 months? Propose BSc/MSc thesis topics for OpenMS. Time: 10 minutes each. Prepare PPT slides. (ALL)

Tuesday

Wednesday

  • 08:30-12:00:

  • Feature Finder (Metabolites)

  • Modify CentroidedFeaFi to work with multiple averagine distributions and come up with an easy way to specify them, in particular for stable isotopes and the skewed isotope distributions they cause (Chris)

  • Review applicability to recent labeling techniques (e.g., dimethyl labeling, lanthanide chelates) (Chris)

  • MS to the e (Sandro)

  • Metabolomics Feature Finder: presentation of the current status (Erhan), benchmark against the proteomics FeaFi, comparison to XCMS

  • Discuss TOPPAS workflows

  • Afternoon:

  • 15:00-18:00: File formats

  • Overview of OpenMS formats (Matthias)

  • Review/extend file formats accepted by FileConverter (DTA? mzIdentML? TraML?)

  • Assess mzIdentML completeness (Pieter Neerincx from ML?) (Mathias)

  • Discuss a first implementation of mzQuantML (before the HUPO world meeting in Sydney, two weeks after the retreat!)

  • Discuss integration/use of new formats: mzIdentML, mzQuantML, TraML? Who is going to stay in touch with the PSI working groups?

  • File formats: Emanuele Alpi is looking for a converter from MASCOT 2.2 to mzIdentML; consider such converters as well (Mathias)

  • Add support for intermediate formats (Mascot, ...) to IDFileConverter?

Thursday

  • 08:30-12:00: tbd
  • 13:00-18:00: tbd, Wrap up (all,done)

Friday

  • breakfast, pack, return

Other things and TODOs

  • Fix untested/broken tools (SequestAdapter in particular!). The ID team (Sven, Sandro, Timo) divides up responsibilities. Timo takes on Sequest, with help from Sven and Sandro to get acquainted. Compare the SEQUEST interface across versions (Berlin vs. Zürich [Chris]); if they differ, the adapter might be hard to maintain, so look at Crux.

  • Review deprecations

  • Nonlinear retention time alignment? Estimation of false-matching rates in featurelinker?

  • Dealing with huge (200 GB?) raw data files – where are the bottlenecks, how to resolve that?

  • FeatureXML files can get quite big due to new "convex" Hulls - extend XML format?

  • Quality control tool? Peptizer (Vaudel, Martens) – spectrum ID validation

  • Can we include ProteoWizard in the distro somehow to make sure we can read formats?

  • code stability: not very well tested.

    • Empty input containers often lead to a crash.
    • Almost empty containers might lead to wrong distribution estimates without even a warning.
  • Reorder error messages, from

    FalseDiscoveryRate:
    ..\OpenMS\source\ANALYSIS\ID\FalseDiscoveryRate.C(148): Meta value 'target_decoy' does not exists, reindex the idXML file with 'PeptideIndexer' first (run-id='XTandem_2010-06-23T12:32:29, rank=1 of 1)!
    ..\OpenMS\source\APPLICATIONS\TOPP\FalseDiscoveryRate.C(155): FalseDiscoveryRate failed due to missing information (see above).
    

    to

    FalseDiscoveryRate:
    Meta value 'target_decoy' does not exists, reindex the idXML file with 'PeptideIndexer' first (run-id='XTandem_2010-06-23T12:32:29, rank=1 of 1)! @ ..\OpenMS\source\ANALYSIS\ID\FalseDiscoveryRate.C(148)
    FalseDiscoveryRate failed due to missing information (see above). @ ..\OpenMS\source\APPLICATIONS\TOPP\FalseDiscoveryRate.C(155)
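
The proposed reordering above can be sketched as a small post-processing step. This is only an illustration in Python, not the way OpenMS formats its messages; the function name `move_location_to_end` and the regular expression are assumptions made for this example, and it only handles lines of the form `path(line): message` shown in the first listing.

```python
import re

# Assumed layout, taken from the first listing: "<path>(<line number>): <message>".
# Lines that do not match (e.g. the "FalseDiscoveryRate:" header) are left untouched.
LOCATION_FIRST = re.compile(
    r"^(?P<indent>\s*)(?P<location>\S.*?\(\d+\)): (?P<message>.*)$"
)


def move_location_to_end(line: str) -> str:
    """Rewrite 'path(line): message' as 'message @ path(line)'."""
    match = LOCATION_FIRST.match(line)
    if match is None:
        return line
    return f"{match['indent']}{match['message']} @ {match['location']}"


if __name__ == "__main__":
    original = [
        "FalseDiscoveryRate:",
        r"..\OpenMS\source\APPLICATIONS\TOPP\FalseDiscoveryRate.C(155): "
        "FalseDiscoveryRate failed due to missing information (see above).",
    ]
    for line in original:
        print(move_location_to_end(line))
```

Putting the message first keeps the human-readable part at the start of each line, while the source location stays available for developers at the end.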
    

Open project responsibilities

  • Dimethyl labeling (Lars)
  • Data analysis for Hartmut Schlüter (displacement, iTRAQ) (Chris)
  • Henning Urlaub – RNA-protein cross-link data (Timo)
  • Retention time prediction (comparison with data from Oleg Krokhin, U Manitoba, ?) (Mathias)
  • Accurate precursor mass estimation: estimation from all feature points (MaxQuant), integration into search engines? (Hendrik asks Rene)
  • Nonlinear mass recalibration (Eryk, Erhan)
  • SARA – RT prediction, feature FWHM distribution as a function of RT
  • Stephan Jung (Proteome Science R&D)
  • ET/CID
  • de novo Toolkit
  • Consensus ID
  • TMT labeling for quantification
  • MS/MS 'deconvolution'

Hardware

  • Everyone: notebook + Ethernet cable
  • Tübingen [coordinator: Sven]: switch, possibly Airport, UMTS-WLAN router (Oliver)
  • Berlin:
    • MacBook Air as SVN + FTP + documentation server, DHCP, wiki (Stephan)
    • Switches, Ethernet, power cords, etc.
  • Stuff to take with us (in general)
    • Towels and bed sheets

Software/data

  • Current Contrib and OpenMS revisions
  • Compiler, cmake & co. installed as a public compile platform
  • Wiki (for maintaining a to-do list) -> CAN NOT
  • Upload form or FTP daemon (for data exchange); something more intuitive would be nice (e.g. a Samba drop box)
  • Current guides (Contrib docs, C++ ebook)
  • Current installers
  • MS data