-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: add internal lineage RFC to open source #9
Conversation
Signed-off-by: feng-tao <fengtao04@gmail.com>
this RFC is not ready for review |
@@ -0,0 +1,133 @@ | |||
- Feature Name: (fill me in with a unique ident, my_awesome_feature) | |||
- Start Date: (fill me in with today's date, YYYY-MM-DD) | |||
- RFC PR: [amundsen-io/rfcs#0000](https://github.com/amundsen-io/rfcs/pull/0000) (after opening the RFC PR, update this with a link to it and update the file name) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- RFC PR: [amundsen-io/rfcs#0000](https://github.com/amundsen-io/rfcs/pull/0000) (after opening the RFC PR, update this with a link to it and update the file name) | |
- RFC PR: [amundsen-io/rfcs#0009](https://github.com/amundsen-io/rfcs/pull/9) |
## Support Lineage In Amundsen | ||
|
||
|
||
This RFC drafts how lineage should be supported in Amundsen and why the feature is important for the trust in data initiative. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This RFC drafts how lineage should be supported in Amundsen and why the feature is important for the trust in data initiative. | |
This RFC drafts how lineage should be supported in Amundsen and why the feature is important. |
* Column lineage: It tracks information in fine detail, at the level of particular columns rather than entire tables. It shows how data of a given column is generated based on upstream columns, and how it affects which downstream columns. | ||
* _Service and event lineage_: which service produces and generates the message for a given event table. | ||
|
||
Currently Amundsen doesn’t support showing Table level lineage in the UI but instead redirect users to use , which is a 3rd party tool at Lyft, to refer to the lineage information. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
to use X ,
Something missing here, right?
active-rfcs/0003-lineage.md
Outdated
|
||
|
||
|
||
* The link to in-house lineage tool doesn’t always work. If the table/dataset is not indexed in in-house lineage tool, the user will get a 404 (e.g [user report](https://lyft.slack.com/archives/CBHCUQB2Q/p1584997367078000)). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We'll probably want to remove that report from our internal Slack
active-rfcs/0003-lineage.md
Outdated
|
||
|
||
* The link to in-house lineage tool doesn’t always work. If the table/dataset is not indexed in in-house lineage tool, the user will get a 404 (e.g [user report](https://lyft.slack.com/archives/CBHCUQB2Q/p1584997367078000)). | ||
* The UI in in-house lineage tool is hard to browse the actual lineage as it displays the full lineage instead one level at a time (E.g [graph](https://in-house lineage tool.lyft.net/lineage-report/relation/warehouse.core.fact_rides.html?cwRelationContent=graph&cwRelationGraph=dataflow) ) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Another Lyft only link
* Other popular OSS projects (e.g Apache Atlas) provide the UI support on lineage which is a feature gap for Amundsen adoption in the community. | ||
|
||
|
||
### Why is lineage important for trust in data? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
### Why is lineage important for trust in data? | |
### Why is lineage important? |
close this pr to favor of #24 which is a more updated version. |
Move the internal google doc RFC to open source, I will refine it or others will help to refine it once we have the UX mock.
Signed-off-by: feng-tao fengtao04@gmail.com