8 15 2022 Tech Team Report - QualitativeDataRepository/TechnicalTeam GitHub Wiki

8-15-2022

Logged Tasks

Date | Task | Hours (Main) | Hours (EOLS) | Hours (PII) | Hours (QDAS)
8-Aug-2022 | Report, meeting | 1
9-Aug-2022 | Configure autovacuum on prod guestbookresponse | 1
10-Aug-2022 | Check autovac progress (none yet), set up/start storj test on dev | 2
11-Aug-2022 | Monitor/update storj testing to use random text files | 1
12-Aug-2022 | Investigate 502/other failures in storj test, update DV code to avoid full-text indexing of 0-byte files | 2

Operations

  • Configured autovacuum on the guestbookresponse table. It triggered after ~2 days and significantly improved the accuracy of download counts (error dropped from several % to <0.1%). I'm reporting this in #8840, with the aim of recommending this change or automating it.
  • Set up a storj test on dev, initially uploading random files and then, to make sure full-text indexing would trigger, small random text (.txt) files. So far, using the old DVUploader, which updates the dataset after every file, I've seen one 502 error, and I'm coordinating with Dan Willoughby to see if storj can find the source on their end. Overall, upload is going very slowly with a few hundred 1 KB files in the dataset. I suspect DataCite's test services are as much to blame as storj (and in fact, this morning I'm seeing failures at DataCite that look like load issues).
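
For the autovacuum item above, per-table tuning in PostgreSQL is done with storage parameters set through ALTER TABLE. The report does not say which parameters or values were used on prod, so the following is only a minimal sketch: the two parameter names are real PostgreSQL storage parameters, but the values and the helper name `autovacuum_sql` are assumptions for illustration.

```python
# Hypothetical settings: the report does not state the values used on prod.
ASSUMED_SETTINGS = {
    "autovacuum_vacuum_scale_factor": 0.01,
    "autovacuum_analyze_scale_factor": 0.01,
}


def autovacuum_sql(table, settings):
    """Build an ALTER TABLE statement that sets per-table autovacuum parameters."""
    if not table.isidentifier():
        raise ValueError(f"unexpected table name: {table!r}")
    if not settings:
        raise ValueError("no settings given")
    # Sort the parameters so the generated statement is deterministic.
    params = ", ".join(f"{name} = {value}" for name, value in sorted(settings.items()))
    return f"ALTER TABLE {table} SET ({params});"


print(autovacuum_sql("guestbookresponse", ASSUMED_SETTINGS))
```

The generated statement would be run by a database administrator (for example via psql). Lower scale factors make autovacuum and autoanalyze trigger after a smaller fraction of the table has changed.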

Dataverse

  • Updated the code to avoid an error in full-text indexing when a file has 0 bytes (as one of my 1000 random test files does).
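
As an illustration of how a 0-byte file can end up in such a test set, and of the kind of guard described above, here is a minimal Python sketch. It is not the Dataverse change itself (Dataverse is written in Java); the function names `write_random_text_files` and `should_full_text_index` are hypothetical, and the size range is an assumption.

```python
import random
import string
import tempfile
from pathlib import Path


def write_random_text_files(directory, count, max_bytes=1024, seed=0):
    """Write `count` .txt files of random ASCII text, each 0..max_bytes long."""
    rng = random.Random(seed)
    alphabet = string.ascii_letters + string.digits + " \n"
    paths = []
    for i in range(count):
        size = rng.randint(0, max_bytes)  # a size of 0 is possible
        text = "".join(rng.choices(alphabet, k=size))
        path = Path(directory) / f"random_{i:04d}.txt"
        path.write_bytes(text.encode("ascii"))
        paths.append(path)
    return paths


def should_full_text_index(path):
    """Hypothetical guard: skip empty files, which have no text to index."""
    return path.stat().st_size > 0


with tempfile.TemporaryDirectory() as tmp:
    files = write_random_text_files(tmp, count=10)
    # Add one explicitly empty file so the guard has something to skip.
    empty = Path(tmp) / "empty.txt"
    empty.write_bytes(b"")
    files.append(empty)
    indexable = [p for p in files if should_full_text_index(p)]
    print(f"{len(indexable)} of {len(files)} files would be full-text indexed")
```

Checking the size before handing a file to the indexer keeps an empty file from causing an indexing error while leaving the file itself in the dataset.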

Discussion

Plans

  • QDAS Previewer
    • Investigate allowing selection/deletion of related codes/excerpts/sources, etc.
    • Updates per request
    • Investigate writing aux file/previewing lower-sensitivity version and/or other write options
  • AnnoRep - continue to explore/fix docx/pdf github issues
    • Deploy updates to dev/stage/prod
  • Dataverse
    • Popup info accessibility - IQSS likes the recommendations from the source I linked to, so this can be implemented along those lines.
    • Still want to investigate why version info is not being included in guestbook responses.
  • TBD: FRDR Security
  • Other tasks as discussed in strategic planning