The Storage Loader - thebeansgroup/snowplow GitHub Wiki

HOME > [SNOWPLOW TECHNICAL DOCUMENTATION](Snowplow technical documentation) > [Storage](storage documentation) > The Storage Loader

An overview of how the StorageLoader instruments the loading of data from S3 into Redshift

  • Data from enriched Snowplow event files generated by the Scalding process on EMR is read adn written to Amazon Redshift
  • The enriched event files are then moved from teh in-bucket (which was the archive bucket for the EmrEtlRunner) to the archive bucket (for the StorageLoader)

The StorageLoader is configured via its config file. (Example config files for Redshift users can be found [here] redshift.yml.sample and Postgres users [here] postgres.yml.sample.) For more information, see the guide to setting up the StorageLoader.

⚠️ **GitHub.com Fallback** ⚠️