Skip to content

Commit df6e6a9

Browse files
authored
docs(search): improve docs for upstream and downstream advanced query (#15167)
1 parent 660a26b commit df6e6a9

File tree

1 file changed

+5
-3
lines changed

1 file changed

+5
-3
lines changed

docs/how/search.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -166,10 +166,12 @@ If you want to:
166166
- These 2 filters will be supported starting from release `0.3.14.x` of DataHub Cloud.
167167
- Note that it does not check whether the field is zero. It just checks for whether the metadata was emitted or not.
168168

169-
- Find the number of upstreams or downstreams a dataset has.
169+
- Find the number of upstreams via `upstreamCountFeature` field or downstreams via `downstreamCountFeature` field. Only 1 hop lineage is considered.
170170

171-
- `/q upstreamCountFeature:>2`
172-
- `/q downstreamCountFeature:<3`
171+
- `/q upstreamCountFeature:>2` -> Greater than 2 upstreams at 1 hop
172+
- `/q downstreamCountFeature:<3` -> Less than 3 downstreams at 1 hop
173+
- `/q upstreamCountFeature:<=10` -> Less than or equal to 10 upstreams at 1 hop
174+
- `/q upstreamCountFeature:[5 TO *]` -> To find out where at 1 hop an asset has Greater than or equal to 5 upstream lineage
173175
- The advantage of `upstreamCountFeature` over `hasUpstreams` is that it considers whether the upstreams and downstreams are valid URNs.
174176
- The disadvantage of `upstreamCountFeature` over `hasUpstreams` is that these are updated once a day and are not real-time like `hasUpstreams`.
175177
- The reason `upstreamCountFeature` is useful is that after lineage is emitted once it will probably not change drastically for most of the tables. So this information will be almost up-to-date for all tables with a lag of around 24 hours.

0 commit comments

Comments
 (0)