November 11, 2021 - UTMediaCAT/mediacat-docs GitHub Wiki

Agenda

  • Kirsta and John unavailable, John sent an update
  • CSV with additional items?
  • have we looked over the stacked area graph?

Stacked Area graph

  • still checking the numbers

Issues with CSV of Twitter output

  • a lot of CSV's are coming in with null author
    • What Colin sees in interest-output is that many of these
  • should CSV show "found url"s, and if so, shouldn't found URL become a separate row with a hit count, author, etc
  • new twitter crawl seems to include a lot of info that isn't being propagated to the postprocessed output (like, retweet, etc counts)

Pull Requests

Action Items

  • Colin will check what the stacked area chart is counting because the count is currently off
  • Python crawler: Colin will look into this if time allows.