Automated Actions (Dhruv Satyapanthi) - mgenuardi/TAEP_documentation GitHub Wiki

This section was written by Dhruv Snehal Satyapanthi.

Automated report generation of the Wikidata profiles

Performs:

  1. Read the CSV file with names and descriptions.
  2. Filter out the companies’ entries.
  3. Search the Wikidata profile by name for each entry.
  4. Extract the birth year and death year of a person from its description.
  5. Find the most fitted Wikidata profile by comparing it with those extracted years.
  6. Calculate the match score for each entry.
  7. Create the excel sheet with entries sorted in descending order by score.

Command:

Installation: pip install requests pip install pywikibot pip install regex pip install pandas pip install tqdm pip install fuzzywuzzy

Execution: Requires “TAEP-NamesAndOrgs-Legacy.xlsx” in the same folder python fetchWikiDataQuery.py


Automated OREP

Performs:

  1. Aggregates the image files for each folder.
  2. Splits the respective document into multiple parts.
  3. Combine each part with their respective images.
  4. Combine each part and save the whole as a PDF document.

Command:

Installation: pip install docx2pdf pip install PyPDF2 pip install pandas pip install pillow pip install PyMuPDF

Execution: python saveWord.py