Dataprep - bobbae/gcp GitHub Wiki

Introduction

Cloud Dataprep based on Trifacta Wrangler is a serverless data service for visually exploring, cleaning, and preparing structured and unstructured data for analysis, reporting, and machine learning. Cloud Dataprep jobs are executed by Cloud Dataflow workers.

Use Cloud Dataprep to explore and transform raw data from disparate and/or large datasets into clean and structured data for further analysis and processing.

https://www.youtube.com/watch?v=DEh3pZIgJ9k

Another Dataprep

https://dataprep.ai

Dataprep Clean

https://towardsdatascience.com/dataprep-clean-accelerate-your-data-cleaning-83406b4645bf

Tutorials

Quickstart

https://cloud.google.com/dataprep/docs/quickstarts/quickstart-dataprep

No coding dataprep

https://medium.com/flux-tech-blog/dataprep-is-all-you-need-for-a-data-preparation-job-on-gcp-eeb4f547358d

Automate a Cloud Dataprep Pipeline When a File Arrives

https://www.trifacta.com/blog/automate-cloud-dataprep-pipeline-data-warehouse/

ML Automation with Dataprep, BigQuery ML and Cloud Composer

https://medium.com/google-cloud/automation-of-data-wrangling-and-machine-learning-on-google-cloud-7de6a80fde91

Data Driven Price Optimization

https://cloud.google.com/blog/products/data-analytics/centralize-data-sources-into-bigquery-with-dataprep

Qwiklabs

Transform and Clean your Data with Dataprep

https://www.qwiklabs.com/quests/156

GSP050

Working with Dataprep

GSP430

Creating a Data Transformation Pipeline with Cloud Dataprep

GSP279

Streaming IoT data using Data prep