October 14, 2022 - UTMediaCAT/mediacat-docs GitHub Wiki

Agenda

Shawn: access jupyter-lab and try to produce a visualization on arbutus with NYTimes twitter data
- see here for placement: https://docs.google.com/document/d/14yylvd_zl5BaOvD8WbM0AEV8opVIgr_zJ2pXuMFBzcQ/edit#
Fenil: install Twitter crawler on Arbutus and try to crawl FoxNews handles:
- https://docs.google.com/spreadsheets/d/14fOCbBGFMO2okJCEnajk4KpzvzNS3oFxcoTFYIY5h2U/edit#gid=1609056254
Fenil: try to get proquest working for mass download
Fenil & Shawn are meeting on Saturday to go over entering Arbutus.
Alejandro: ask digital alliance to increase Arbutus allowance
Alejandro: ask Shengsong about moving storage to Arbutus, cc Shengsong

Alejandro: send Shawn the updated results from NYT archive crawl
Fenil: update server notes and documentation about how to connect to Graham cloud
Fenil: look into using web interface on Graham cloud to download an image.
Fenil: set up instance with 40 VCPUs & 5-6 TB of storage.
Shawn: set up vector diagram software and Jupyter Lab environment on Arbutus cloud small instance
Fenil: run complex query with all possible text aliases in proquest, send Alejandro number with an example of top 100.