Documentation for docanalysis - petermr/CEVOpen GitHub Wiki

Latest documentation

To run docanalysis

  1. Install docanalysis by using the following command: pip install docanalysis==0.0.8

  2. Execute docanalysis –run_pygetpapers -q “IPCC” -k 10 --project_name ipcc_10 --dictionary “copy path of the xml file” --output entities.esv --entities ORG

  3. Next, run docanalysis –run_sectioning --project_name ipcc_10 --dictionary “copy path of the xml file” --output entities.esv --entities ORG

Incase, you see a prompt to import nltk execute python -m nltk.downloader punkt And re-run the sectioning command.

  1. The output gets saved in a csv file named ‘entities’ in the ipcc_10 folder.