Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for Databricks Notebook Task Lineage as Assets #19910

Open
Prajwal214 opened this issue Feb 21, 2025 · 0 comments
Open

Support for Databricks Notebook Task Lineage as Assets #19910

Prajwal214 opened this issue Feb 21, 2025 · 0 comments
Assignees

Comments

@Prajwal214
Copy link
Contributor

Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
-- Currently, OpenMetadata ingests Databricks workflows (pipelines) and displays the sequence of tasks within the "List" and "DAG" sections. However, the individual tasks are not treated as separate assets, and their lineage is not explicitly captured.

On Databricks, each task has its own lineage, showing both input and output tables. Users expect to see the same level of granularity in OpenMetadata, with tasks represented as assets and their lineage captured accordingly.

Describe the solution you'd like
A clear and concise description of what you want to happen.

  • Each task within a Databricks pipeline should be ingested as an individual asset.
  • The lineage of each task should include its source and destination tables, similar to what is visible in Databricks.
  • The DAG representation should accurately reflect the relationships between tasks and their respective lineage.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Integration
Development

No branches or pull requests

2 participants