Home - vmware/versatile-data-kit GitHub Wiki


Versatile Data Kit - one framework to🧑‍💻 Develop ▶️ Deploy and 📊 Operate
data workflows with Python and SQL


Versatile Data Kit allows you to implement automated pull ingestion and batch data processing. Typical use cases are:

  • ingest data from another system, database, or API;
  • clean and transform ingested data into a schema convenient for reporting (such as a data warehouse model);
  • refresh or aggregate data on a scheduled basis;
  • reuse or expose a simplified / use-case-specific / cleaned-up schema to third parties.

Develop Your First Data Job See all VDK Examples
  • "Hello world!" guide
  • Install VDK
  • Create a data job
  • Run your data job

Complete collection of our
step-by-step tutorials

⚠️ **GitHub.com Fallback** ⚠️