November 25, 2022 - UTMediaCAT/mediacat-docs GitHub Wiki

Agenda

  • should I now email Lucas and Shawn to ask about image download
  • look at MediaCAT set up
  • if possible, start MediaCAT WashingtonPost Twitter Crawl
  • look around Arbutus to see if the FoxNews Twitter crawl is working
  • look around Arbutus to see if there's a large instance

Server issues

  • 2 interfaces: CLI & OpenStack
    • Shawn could see how to install OpenStack but couldn't figure out how to connect to Graham
    • has sent Shengsong an email, but if we don't hear back, we'll write to Lucas
  • no FoxNews crawl is running; can't find any newly created instance
  • Alejandro: write to Lucas at Dig Alliance to ask about connecting OpenStack and creating a large instance

MediaCAT set up

  • got the crawler onto the instance that he created but is still getting an error
  • permission errors, problem with tokens; could be a premium feature of a token
  • instructions were pretty good; managed to set it up and get it going
  • looks like the domain crawler can be started

Action Items

  • Shawn will continue to troubleshoot the server issues on both Graham and Arbutus
  • waiting on answers from Lucas and Shengsong to hopefully overcome server issues
  • Shawn will try to set up a large server with 40 vCPUs, 5-6 TB of storage, and roughly 128 GB of RAM

Backburner

  • update server notes and documentation about how to connect to Arbutus