MTE Wishlist - wkiri/MTE GitHub Wiki
MTE Wishlist
This is a list of desired extensions we'd like to make to the MTE, time and resources permitting. Contributions/PRs welcome!
Parsing
- Improve handling of tables and figures; currently the text is extracted but the structure is lost.
- Improve or customize the processing of journal papers to remove page headers and other content that is not meaningful (and currently gets embedded in the middle of sentences).
Database Content
- Add a "see also" table to explicitly connect related terms (elements, minerals, properties) and save the individual user from that effort (e.g., "ol" and "olivine", "pyroxene" and "pyroxenes", "NpOx" and "nanophase oxides").
- Make ADS search for document metadata more robust; currently it fails to find some papers if the title search does not yield an exact match.
- Remove spurious Components/Properties in final database. See https://github.com/wkiri/MTE/issues/47
Coverage
- Add targets from the Mars Science Laboratory mission. Limited content from MSL ChemCam targets from 2014-2016 is available in the MSL Analyst's Notebook, but the official MTE database does not include any MSL content yet.
- Add targets from the Mars 2020 mission.
- Expand coverage of LPSC proceedings (currently goes through 2020). See https://github.com/wkiri/MTE/issues/7
- Expand coverage of journal papers.