Dictionary Plant Secondary Metabolites - petermr/CEVOpen GitHub Wiki
Plant Secondary Metabolites DICTIONARY:
-
I collected 3330 plant secondary metabolites and removed duplicates.
TOTAL Secondary metabolites -3330
WIKIDATA ID -1417
NO WIKIDATA ID -1913
Please, find dictionary. Presently, dictionary contain terms with 1417 WIKIDATA IDs
-
I used wikipedia page (https://en.wikipedia.org/wiki/Secondary_metabolite#Plant_secondary_metabolites) to find out only names of plant secondary metabolite classes. I also used following SPARQL queries
SELECT ?instance_of ?instance_ofLabel WHERE {
SERVICE wikibase:label { bd:serviceParam wikibase:language "[AUTO_LANGUAGE],en". }
?instance_of wdt:P31 wd:Q11173.
?instance_of wdt:P31 **wd:Q331818**.}
[ replacing wd:Q331818 (gibberellins) with wd:Q70702 (alkaloid ),
wd:Q32528 (monoterpene), wd:Q80218 (sesquiterpene), wd:Q3234924 (isoflavonoid) and wd:Q222174 (flavonoid) ]
Most of the Wikidata IDs were searched manually by searching papers.
-
To get names of plant secondary metabolites, I also searched papers (supplementary data) of genome wide association studies of Arabidopsis and Rice as well as Tomato (Introgression Lines) (Alseekh et al., 2015; Matsuda et al., 2014; Wu et al., 2018).
(https://academic.oup.com/plcell/article/27/3/485/6118131,https://www.pnas.org/content/110/50/20320,
https://onlinelibrary.wiley.com/doi/10.1111/tpj.12681,
https://doi.org/10.1016/j.molp.2017.08.012)
-
Here, we are focusing on only PLANT SECONDARY METABOLITES.
-
There are natural product databases available (Mangal et al., 2013; Tung 2014). I used database developed by Mangal et al., 2014. Some of the databases contain microbial natural products also and some require subscription and are not open access.
I got database table from Tung 2014 paper
No. | Database | Reference | |
---|---|---|---|
1. | https://webs.iiitd.edu.in/raghava/npact/browse.php | Mangal et al., 2013 | working |
2. | http://alkamid.ugent.be/ | Boonen et al., 2012 | working |
3. | http://bif.uohyd.ac.in/nemedplant/ | Meetei et al., 2012 | working |
4. | http://kanaya.naist.jp/KNApSAcK_Family/ | Takahashi et al., 2011 | working |
5. | http://bioinformatics.charite.de/supernatural | Dunkel et al., 2006 | working |
6. | http://tcm.cmu.edu.tw/ | Chen et al., 2011 | working |
7. | http://cwtung.kmu.edu.tw/tipdb | Lin et al., 2013 | working |
8. | http://nubbe.iq.unesp.br/nubbeDB.html | Valli et al., 2013 | Not working |
9. | http://www.3dmet.dna.affrc.go.jp/ | Maeda et al., 2013 | Not working |
10. | http://www.ars-grin.gov/duke/ | Duke and Bogenschutz | Not working |
11. | http://www.megabionet.org/tcmid/ | Xue et al., 2013 | Not working |
12. | http://lifecenter.sgst.cn/hit/ | Ye et al., 2011 | Not working |
13. | http://www.hccbif.info/ | Daisy et al., 2011 | Not working |
14. | http://biolinfo.org/cmkb/ | Gaikwad et al., 2008 | Not working |
15. | http://tcm.cz3.nus.edu.sg/group/tcm-id/tcmid.asp | Ji et al., 2006 | Not working |
-
I have tested this dictionary on minicorpus of 10 papers. I used following commands.
getpapers -q "plant secondary metabolites" -o secompoundMAP -x -p -k 10 -f corpora1/log.txt
ami -p "secompoundMAP" section
ami -p "secompoundMAP" search --dictionary secondary.xml
It gave full data tables as