Automated Actions (Dhruv Satyapanthi) - mgenuardi/TAEP_documentation GitHub Wiki
This section was written by Dhruv Snehal Satyapanthi.
Automated report generation of the Wikidata profiles
Performs:
- Read the CSV file with names and descriptions.
- Filter out the entries for companies.
- Search Wikidata by name for candidate profiles for each entry.
- Extract the person's birth year and death year from the entry's description.
- Find the best-fitting Wikidata profile by comparing each candidate with the extracted years.
- Calculate the match score for each entry.
- Create the Excel sheet with entries sorted by score in descending order.
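The year-extraction and scoring steps above can be sketched as follows. This is a minimal illustration and not the code in fetchWikiDataQuery.py: the function names, the candidate dictionary keys ("label", "birth", "death"), and the 60/20/20 weighting are all assumptions, and the standard-library difflib stands in for fuzzywuzzy so the sketch runs without extra packages.

```python
import re
from difflib import SequenceMatcher

# Assumed pattern: any four-digit year from 1000 to 2099 in the description.
YEAR_RE = re.compile(r"\b(1\d{3}|20\d{2})\b")


def extract_years(description):
    """Return (birth_year, death_year) found in free text; either may be None."""
    years = [int(y) for y in YEAR_RE.findall(description)]
    birth = years[0] if years else None
    death = years[1] if len(years) > 1 else None
    return birth, death


def match_score(name, description, candidate):
    """Score one candidate profile from 0 to 100.

    Hypothetical weighting: up to 60 points for name similarity,
    plus 20 points for each year that matches the description.
    """
    birth, death = extract_years(description)
    similarity = SequenceMatcher(
        None, name.lower(), candidate["label"].lower()
    ).ratio()
    score = 60 * similarity
    if birth is not None and birth == candidate.get("birth"):
        score += 20
    if death is not None and death == candidate.get("death"):
        score += 20
    return round(score, 1)


def best_match(name, description, candidates):
    """Return (score, candidate) for the highest-scoring candidate."""
    scored = [(match_score(name, description, c), c) for c in candidates]
    return max(scored, key=lambda pair: pair[0], default=(0.0, None))
```

With scores computed this way, sorting the entries by score in descending order puts the most reliable matches at the top of the sheet, so low-scoring rows can be reviewed by hand.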
Command:
Installation:
pip install requests
pip install pywikibot
pip install regex
pip install pandas
pip install tqdm
pip install fuzzywuzzy
Execution (requires “TAEP-NamesAndOrgs-Legacy.xlsx” in the same folder):
python fetchWikiDataQuery.py
Automated OREP
Performs:
- Aggregate the image files for each folder.
- Split the respective document into multiple parts.
- Combine each part with its respective images.
- Combine all the parts and save the whole as a PDF document.
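The first step, aggregating the image files per folder, might look like the sketch below. It is an assumption-laden illustration rather than the logic in saveWord.py: the function name aggregate_images and the set of image extensions are hypothetical, and only the standard library is used.

```python
from collections import defaultdict
from pathlib import Path

# Assumed set of image extensions; adjust to the formats actually present.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff"}


def aggregate_images(root):
    """Map each folder under `root` to the sorted image files directly inside it."""
    groups = defaultdict(list)
    for path in Path(root).rglob("*"):
        # Keep regular files whose extension (case-insensitive) is an image type.
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
            groups[path.parent].append(path)
    return {folder: sorted(files) for folder, files in sorted(groups.items())}
```

Sorting the files within each folder keeps the image order stable between runs, which matters when the images are later combined with their document parts.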
Command:
Installation:
pip install docx2pdf
pip install PyPDF2
pip install pandas
pip install pillow
pip install PyMuPDF
Execution:
python saveWord.py