Documentation for docanalysis - petermr/CEVOpen GitHub Wiki
Latest documentation
docanalysis
To run:
- Install docanalysis using the following command:
pip install docanalysis==0.0.8
- Execute:
docanalysis --run_pygetpapers -q "IPCC" -k 10 --project_name ipcc_10 --dictionary "copy path of the xml file" --output entities.csv --entities ORG
- Next, run:
docanalysis --run_sectioning --project_name ipcc_10 --dictionary "copy path of the xml file" --output entities.csv --entities ORG
In case you see a prompt to import nltk, execute:
python -m nltk.downloader punkt
And re-run the sectioning command.
- The output is saved in a CSV file named ‘entities’ in the ipcc_10 folder.
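The steps above can also be driven from a small Python script. The following is only a sketch under stated assumptions: the script name run_docanalysis.py and the helper names build_commands and summarize_csv are hypothetical, the command-line flags are copied from the commands above, and the output is assumed to land at ipcc_10/entities.csv because the text describes a CSV file named ‘entities’ in the ipcc_10 folder. The dictionary path is not filled in; it is taken from the command line.

```python
import csv
import shutil
import subprocess
import sys
from pathlib import Path


def build_commands(dictionary_path, project_name="ipcc_10", query="IPCC",
                   hits=10, output="entities.csv", entities="ORG"):
    """Return the download and sectioning commands as argument lists."""
    # Options shared by both steps, copied from the commands in the text.
    shared = [
        "--project_name", project_name,
        "--dictionary", str(dictionary_path),
        "--output", output,
        "--entities", entities,
    ]
    download = ["docanalysis", "--run_pygetpapers",
                "-q", query, "-k", str(hits)] + shared
    sectioning = ["docanalysis", "--run_sectioning"] + shared
    return [download, sectioning]


def summarize_csv(csv_path):
    """Return (header, data_row_count) for a CSV file."""
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])  # first row is treated as the header
        row_count = sum(1 for _ in reader)
    return header, row_count


if __name__ == "__main__":
    if len(sys.argv) != 2 or shutil.which("docanalysis") is None:
        print("usage: python run_docanalysis.py <path-to-dictionary-xml>")
        print("(docanalysis must be installed and on PATH)")
    else:
        for command in build_commands(sys.argv[1]):
            subprocess.run(command, check=True)  # stop at the first failing step
        # Assumed output location: <project_name>/<output>.
        header, rows = summarize_csv(Path("ipcc_10") / "entities.csv")
        print("columns:", header)
        print("rows:", rows)
```

Building both commands from one shared option list keeps the project name, dictionary, and output file identical across the two steps. The summary only reports the header row and the number of data rows, since the column layout of the output is not described here.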