Data Application Specification - stonezhong/DataManager GitHub Wiki
Brief
This document is for data engineers who write Data Applications.
Main entry
def main(spark, input_args, sysops={}):
    ...
- spark: the Spark session
- input_args: the input arguments
- sysops: system options; users should normally ignore this.
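As an illustration of the entry point described above, here is a minimal sketch of a `main` that dispatches on an `action` value. The `import_data` helper, the `action` dispatch, and the returned dictionary are assumptions made for this example; the specification does not say whether the platform uses the return value.

```python
def import_data(spark, app_args):
    # Hypothetical helper, not part of the specification. A real application
    # would use `spark` to load data from app_args["data_root"]; this sketch
    # only reports which market it was asked to import.
    return {"action": "import-data", "market": app_args["market"]}


def main(spark, input_args, sysops={}):
    # Signature copied from the specification; sysops is left untouched.
    app_args = input_args["app_args"]
    action = app_args.get("action")
    if action == "import-data":
        return import_data(spark, app_args)
    # Assumption: fail loudly on an action this application does not handle.
    raise ValueError(f"Unsupported action: {action!r}")
```

Because the sketch never touches `spark`, it can be exercised locally by passing `None` for the session, for example `main(None, {"app_args": {"action": "import-data", "market": "NASDAQ"}})`.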
Input Arguments
The input arguments look like the example below.
- app_args is what the user specifies in the UI.
{
    "pipeline_group_context": {
        "dt": "2020-12-27",
        "xcom": {
            "import-trading-data-nasdaq": null
        }
    },
    "application_id": "0d8d2c02-0a32-4aa5-9301-505de05c30a5",
    "app_args": {
        "action": "import-data",
        "market": "NASDAQ",
        "data_root": "oci://BetaTradings@idrnu3akjpv5"
    },
    "dc_config": {
        "url_base": "http://dmdemo1:8888/api",
        "username": "stonezhong",
        "password": "****"
    }
}
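The following sketch shows one way an application might read the fields from the example above. It assumes `input_args` arrives as a plain Python dictionary with the keys shown, and that `dt` is an ISO-formatted date as in the example. The `parse_input_args` helper and the shape of its result are hypothetical and not part of DataManager.

```python
from datetime import date


def parse_input_args(input_args):
    # Hypothetical helper: collects the fields shown in the example above.
    # Missing sections are treated as empty instead of raising KeyError.
    context = input_args.get("pipeline_group_context") or {}
    dc_config = input_args.get("dc_config") or {}
    dt_text = context.get("dt")
    return {
        # Assumption: "dt" is an ISO date string such as "2020-12-27".
        "dt": date.fromisoformat(dt_text) if dt_text else None,
        "xcom": context.get("xcom") or {},
        "application_id": input_args.get("application_id"),
        "app_args": input_args.get("app_args") or {},
        "dc_url_base": dc_config.get("url_base"),
    }


sample_input_args = {
    "pipeline_group_context": {
        "dt": "2020-12-27",
        "xcom": {"import-trading-data-nasdaq": None},
    },
    "application_id": "0d8d2c02-0a32-4aa5-9301-505de05c30a5",
    "app_args": {
        "action": "import-data",
        "market": "NASDAQ",
        "data_root": "oci://BetaTradings@idrnu3akjpv5",
    },
    "dc_config": {
        "url_base": "http://dmdemo1:8888/api",
        "username": "stonezhong",
        "password": "****",
    },
}

parsed = parse_input_args(sample_input_args)
print(parsed["dt"], parsed["app_args"]["market"])
```

Gathering the lookups in one place keeps `main` short and makes it easy to see which parts of the input the application depends on.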