Versatile Data Kit 0.11 #1667
antoniivanov
announced in
Announcements
Major features include:
Introduce data quality checks POC (for scd1 template)
Allow quality checks to be made before the data is inserted into the target table.
Currently, the checks done in the processing step do not cover whether the semantics of the data are correct. As a result, bad data could go into the target table, which may be unwanted behavior.
Example:
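As a rough illustration of the idea only (this is not the scd1 template's actual interface), a check can be run over the incoming rows and the insert skipped if it fails. The names `load_with_quality_check`, `no_negative_amounts`, and the `amount` column are hypothetical.

```python
from typing import Callable, Dict, Iterable, List

Row = Dict[str, float]  # hypothetical row representation for this sketch


def load_with_quality_check(
    rows: Iterable[Row],
    check: Callable[[List[Row]], bool],
    insert: Callable[[List[Row]], None],
) -> int:
    """Run `check` on the rows first; call `insert` only if the check passes."""
    materialized = list(rows)
    if not check(materialized):
        # The target is never touched when the check fails.
        raise ValueError("data quality check failed; target table left unchanged")
    insert(materialized)
    return len(materialized)


def no_negative_amounts(rows: List[Row]) -> bool:
    """Hypothetical semantic check: every amount must be non-negative."""
    return all(row["amount"] >= 0 for row in rows)


# A plain list stands in for the target table in this sketch.
target_table: List[Row] = []
loaded = load_with_quality_check(
    [{"id": 1, "amount": 10.0}, {"id": 2, "amount": 5.5}],
    no_negative_amounts,
    target_table.extend,
)
```

With rows that violate the check, the function raises and `target_table` stays as it was.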
Jobs Query API (GraphQL) wildcard matching filter for team and job names
When querying information about jobs, users of the Jobs Query API can now use wildcard matches:
Wildcard matching, for example *search*, in GraphQL filters for job name and team name
Exact matching of search strings, as before
Provide User Agent when using VDK CLI
Users want to determine where requests originate when analyzing and browsing the telemetry data about VDK Control Service usage.
or in config.ini
If not set, it defaults to
"vdk-control-cli/{version} ({os.name}; {sys.platform})" + {python version}
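A minimal sketch of how a string in that format could be assembled in Python. The function name `default_user_agent` is hypothetical, and the exact way the Python version is appended (here a space followed by `python/<version>`) is an assumption, since the format above only says that it is added.

```python
import os
import platform
import sys


def default_user_agent(version: str) -> str:
    """Build a user agent following the documented default format."""
    base = f"vdk-control-cli/{version} ({os.name}; {sys.platform})"
    # Assumption: the separator and the "python/" prefix are illustrative only.
    return f"{base} python/{platform.python_version()}"


user_agent = default_user_agent("0.11")
print(user_agent)
```

The printed value depends on the operating system and interpreter it runs on.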
New plugin: vdk-notebook
A new VDK plugin that supports running data jobs which consist of .ipynb files. See the VDK Notebook plugin page for more information.
vdk-ipython
This extension introduces a magic command for Jupyter. The command lets users load job_input for their current data job and use it freely while working in Jupyter.
See the VDK ipython plugin page for more information.
What's Changed
New Contributors
Full Changelog: v0.10...v0.11
This discussion was created from the release Versatile Data Kit 0.11.