07 Team Reading Catalog - Observatorio-do-Trabalho-de-Pernambuco/documentation GitHub Wiki
A curated list of recommended resources—books, articles, courses, and more—that can help deepen our data engineering knowledge. Please add or update this list as new materials become relevant.
-
“Designing Data-Intensive Applications” by Martin Kleppmann
- Covers data systems, pipelines, and architectures.
-
“The Data Warehouse Toolkit” by Ralph Kimball
- Fundamentals of dimensional modeling and data warehouse design.
-
“Building Microservices” by Sam Newman
- Helpful if our architecture involves microservices and data flows.
-
Uber Engineering Blog
- Offers insight into large-scale data solutions, real-time analytics, and more.
-
Netflix Tech Blog
- Shares how they handle high-volume streaming and data pipeline optimizations.
-
Coursera: Data Engineering on Google Cloud Platform
- Broad overview of GCP’s data tools and pipelines.
-
Udemy: Data Warehouse & BI Fundamentals
- Intro to data warehousing concepts, ETL, and basic BI techniques.
-
Strata Data Conference
- Leading event for data science, machine learning, and analytics.
-
Data Council
- Community-driven event focusing on data engineering and data science.
- Suggest New Resources: Open the Wiki page, click “Edit,” and add your recommendation.
- Retirement of Outdated Material: If a link or resource becomes obsolete, consider removing or updating it.
- Categorize Appropriately: If adding a new section, maintain the numbering scheme for clarity.