External Resources - vmware/versatile-data-kit GitHub Wiki

Versatile Data Kit Community

Welcome to the Versatile Data Kit community page, where you can find:

How to submit a link to this page

It would be great to hear from you, especially if you have any of the above and want to share it with the rest of the community. Edit the page if you can, or submit a GitHub Issue.

Next Community Meeting

Info, links, and dates here

Community Meeting Recordings

Visit Community Meeting and Open Discussion page Or Community Meeting YouTube Playlist

Featured Videos

Back to top

Title Presenter Date
VDK Introduction playlist - Shorts Agita Jaunzeme 16-October-2023
DAGs in Versatile Data Kit Gabriel Georgiev 22-May-2023
VDK UI - Installation and Getting Started Paul Murphy 28-April-2023
VDK Operations User Interface - Versatile Data Kit Dilyan Marinov 4-April-2023
Open-Source Spotlight - Versatile Data Kit - Gabriel Georgiev Gabriel Georgiev 18-January-2023
Incremental Ingestion with Versatile Data Kit Desislava Borrisova 14-September-2022
Data Transformation with Versatile Data Kit Gabriel Georgiev 16-December-2021
Data Ingestion with Versatile Data Kit Gabriel Georgiev 09-December-2021
Versatile Data Kit Introduction Dimira Petrova 02-December-2021

For all YouTube videos visit the Versatile Data Kit channel

Blog posts

Back to top

Blog/repo name and description Author Site Date
Versatile Data Kit 1.0 Iva Koleva medium.com 29-June-2023
Orchestrating VDK Data Jobs in a directed acyclic graph using VDK DAGs Gabriel Georgiev medium.com 6-June-2023
How to Keep Track of Data Versions Using Versatile Data Kit Angelica Lo Duca towardsdatascience.com 3-May-2023
Comparing Versatile Data Kit with Apache Airflow Gabriel Georgiev medium.com/versatile-data-kit 25-January-2023
Data Engineering Tricks: How To Get Dirty Data Cleaned through VDK Angelica Lo Duca medium.com 20-December-2022
Versatile Data Kit data engineering framework for dbt users Desislava Borisova medium.com 24-October-2022
How to Create a Data Formatting Plugin in VDK Angelica Lo Duca medium.com/versatile-data-kit 13-October-2022
Handling Missing Values in Versatile Data Kit Angelica Lo Duca medium.com/versatile-data-kit 31-August-2022
Integrating Apache Airflow with Versatile Data Kit Gabriel Georgiev medium.com/versatile-data-kit 15-July-2022
How to Build a Web App with Data Ingested through Versatile Data Kit Angelica Lo Duca towardsdatascience.com 23-May-2022
Efficient Data Troubleshooting Antoni Ivanov medium.com/versatile-data-kit 12-April-2022
Using Versatile Data Kit to Ingest and Process Data from REST API Angelica Lo Duca towardsdatascience.com 15-March-2022
From Raw Data to a Cleaned Database: A Deep Dive into Versatile Data Kit Angelica Lo Duca towardsdatascience.com 11-Feburary-2022
The Versatile Data Kit Mantra: Efficient Data Engineering Rumen Barov blogs.vmware.com 23-November-2021
An Overview of Versatile Data Kit Angelica Lo Duca towardsdatascience.com 17-November-2021
Efficient Data Engineering with Versatile Data Kit Antoni Ivanov blogs.vmware.com 7-October-2021

Events

Back to top

Event name Description (link to recording) Speaker Location Date
ISTA DataOps as a Service Antoni Ivanov Online 12-October-2023
KubeNative 2023 Applying DevOps practices in Data and ML Engineering Antoni Ivanov Online 28-September-2023
Data Ceili DataOps Mesh Agita Jaunzeme Trinity College Dublin 9-June-2023
DSC Adria 23 Practical Kimball Data Patterns Antoni Ivanov Zagreb, Croatia 18-May-2023
New Stars Of Data 2023 DevOps best practices for DataOps Mesh Agita Jaunzeme Online 12-May-2023
New Stars Of Data 2023 Preserving customer privacy using Differential Privacy and Versatile Data Kit. Paul Murphy Online 12-May-2023
Conf42 DevOps 2023 DataOps as a Service Antoni Ivanov Online 26-January-2023
Conf42 DevOps 2023 DevOps Best Practices for DataOps Mesh Agita Jaunzeme Online 26-January-2023
DSC Europe 2022 DataOps as a Service Antoni Ivanov and Dimira Petrova Belgrade, Serbia 18-November-2022 12:30 CET
Data Science Summit DevOps for Data as a Service Antoni Ivanov and Dimira Petrova Online 17-November-2022
DSC Europe 2022 Practical Kimball Data Patterns Antoni Ivanov Belgrade, Serbia 16-November-2022 16.30-19.30 CET
DSC Europe 2022 Integrating with pep249 Victor Mishovski Belgrade, Serbia 15-November-2022 12.00 CET
OpenFest Sofia 2022 Close the Gap between Proof-of-Concept & Data Science Product Antoni Ivanov Sofia, Bulgaria 15-October-2022
MLOps meetup Applying DevOps Practices in Data and ML Engineering Antoni Ivanov Online 5-September-2022
SCaLE DevOps for Data as a Service Antoni Ivanov Los Angeles, USA 30-July-2022
Marquez Community Meeting Integration between Marquez and VDK Antoni Ivanov Online 26-May-2022
AMLD Close the Gap between Proof-of-Concept & Data Science Product Workshop Dimira Petrova Lausanne, Switzerland 26-March-2022

Upcoming Events

Back to top

Event name Description (link to recording) Speaker Location Date
DSC Europe Productionizing Jupyter Notebooks Duygu Hasan Online 20-November-2023
DSC Europe Make data central feature Antoni Ivanov Belgrade, Serbia 20-November-2023
VMware Explore Centralized Control and Flexibility in Machine Learning Operations Paul Murphy Barcelona, Spain 6-November-2023
VMware Explore A Case Study in creating maintainable datasets for AI/ML Antoni Ivanov Barcelona, Spain 6-November-2023
VMware Explore Productionizing Jupyter notebooks Duygu Hasan Barcelona, Spain 6-November-2023

Upcoming Blogs

Back to top

Title / Topic Author Platform Date
Batch Processing with VDK Angelica Lo Duca medium.com