January 3, 2023 - UTMediaCAT/mediacat-docs GitHub Wiki

Agenda

  • meeting with Irfan in order to think through server issues
  • update on Twitter crawl

server issues:

  • we are now able to access the smaller instance on Graham
    • moved the storage to the smaller instance and all data were there
    • will r-sync over to Arbutus
  • still can't access test_2 which is the larger instance

Twitter crawl

  • finished Washington Post crawl
  • following users no longer exist: annmarieadams WithEdSimon WPLyndaRobinson rbbrenner MattSchudel faaawnt WJuckno
  • earlier batch of users: raulp_213 jooleesah DamonYoungVSB leslieagarrettf

Action Items