7 12 2021 Tech Team Report - QualitativeDataRepository/TechnicalTeam GitHub Wiki
7-12-2021
Logged Tasks
Date | Task | Hours (Main) | Hours (EOLS) | Hours (PII) |
---|---|---|---|---|
28-Jun-2021 | Report, meeting, curation tags updates | 2 | ||
29-Jun-2021 | Anonymization - update PR to not allow if setting doesn't exist, added docs, helped testing for merge | 2 | ||
30-Jun-2021 | Anonymization - investigate issue w.r.t create with user having no affiliation, check Drupal 8 eol info | 2 | ||
6-Jul-2021 | Investigate match with Hypothesis - fix comment text and anchor issues, deploy update, investigate pdf parsing anchor changes/overlap | 5 | ||
7-Jul-2021 | Coord re: AnnoRep/DV API use, investigate pdf parsing | 2 | ||
8-Jul-2021 | Coord re: AnnoRep/DV API use, check that action logging happens for status labels | 1 |
Summary
Operations:
- Checked on the Drupal 8 EOL: Nov 2, 2021
Dataverse:
- Helped move the anonymizedAccess PR through test/review/merge - fixed a couple simple but important bugs, added docs
- Updated curation tags/labels implementation - added a setting for allowed values and changed to require publish permissions on the dataset to add/remove a tag.
Anno-Rep
- Investigated how well anchors matched w.r.t. what Hypothesis generates directly. For .docx, there were some simple differences, i.e. Hypothesis doesn't include \r or \n in the character count or anchor strings. After changes, .docx output matches Hypothesis.
- For pdf, with a new test file, I'm finding larger issues: overlapping tags aren't being handled correctly and when I look for text within a highlighted bounding box, I sometimes get extra letters. It's not yet clear to me whether these are a result of using the free Adobe PDF reader to create the comments, or if something has changed/broken with the parsing. I'm continuing to investigate.
Plans
- Anno-Rep work
- Investigate PDF issues
- Drupal 9 - start going through upgrade processes
- Dataverse
- still want to investigate the guestbook responses re version info not being included.
Still TBD:
- Composer 2-->3