Data Engineering - LaunchCode-Code-Connect/CandidateResources GitHub Wiki
Introduction to Data Engineering
Data Engineering is the act of working with raw data from a data source and processing it so that it's organized and usable for data analysis, business intelligence, or machine learning model training. Data Engineers are the folk who make sure that data analysts and other data professionals have clean data to work with!
Why we recommend Data Engineering
As companies continue to invest in AI and using it to process information, it's increasingly important to ensure that the data they are processing is clean and usable. Data Engineering is going to continue to grow as a field and as a necessary part of the development team for companies, so we highly encourage you to start learning about it!
Skill Update!
Have you completed one of the courses below or a different course or certification? If you are an active candidate working with LaunchCode, please let us know by filling out this form. Our team reviews submissions on a weekly basis and will update your LaunchCode candidate profile with a note to let the Career team know that you have completed a new course!
Getting Started with Data Engineering Resources
Course Name |
Description |
Skills Covered |
Time to Complete |
Data Engineering Core Skills |
This Pluralsight course will take you through all of the basics on how to get started as a Data Engineer |
ETL, Data Modeling, SQL, Data Warehousing, Database Design |
7 hours |
OneLake and Data Engineering with Microsoft Fabric |
This Pluralsight course will teach you more about the Microsoft Fabric platform and how to use it with OneLake. |
Data Warehousing |
1 hour |
Languages and Frameworks for Data Engineering Resources
Python
Course Name |
Description |
Skills Covered |
Time to Complete |
Python for Data Engineers |
This comprehensive Pluralsight path will teach you all the ins and outs of how to use Python for Data Engineering. There are Pluralsight labs included in this course, and we highly recommend following them to get some hands on experience with the concepts you'll learn about! |
Pandas, PySpark, Airflow, ETL, Automation |
10 hours |
Python Data Essentials |
This Pluralsight course will introduce you to coding with Python, specifically for working in a data related career. |
Python, Data Structures |
5 hours |
SQL for Data Engineering Resources
Course Name |
Description |
Skills Covered |
Time to Complete |
SQL For Data Engineering |
This Pluralsight course focuses specifically on how to use your SQL skills in Data Engineering. You'll learn about analysis, query optimization, workflow integration, and more! |
SQL, Query Optimization |
1 hour |
Querying Data from PostgreSQL |
PostgreSQL is being used more and more when it comes to data engineering and data analysis. This Pluralsight course will walk you though how to query data from PostgreSQL |
Data Querying |
2 hours |
Combining and Filtering Data with PostgreSQL 9 |
This course will teach you how to use PostgreSQL to expand your queries to filter and combine data. |
Join Types, SQL Functions |
2 hours |
Additional Languages and Frameworks
Course Name |
Description |
Skills Covered |
Time to Complete |
Apache Spark 3 Fundamentals |
This Pluralsight course will help you build a foundational knowledge of Apache Spark |
Apache Spark |
7 hours |
Terraform - Getting Started |
This Pluralsight course is focused on building the fundamentals of how to use Terraform. It's a toolset that helps automate infrastructure in the public and private cloud. |
Terraform |
5 hours |