Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: add internal lineage RFC to open source #9

Closed
wants to merge 2 commits into from

Conversation

feng-tao
Copy link
Member

Move the internal google doc RFC to open source, I will refine it or others will help to refine it once we have the UX mock.

Signed-off-by: feng-tao fengtao04@gmail.com

Signed-off-by: feng-tao <fengtao04@gmail.com>
@feng-tao feng-tao requested a review from a team as a code owner October 23, 2020 06:18
@feng-tao feng-tao removed the request for review from a team October 23, 2020 06:18
@feng-tao
Copy link
Member Author

this RFC is not ready for review

@@ -0,0 +1,133 @@
- Feature Name: (fill me in with a unique ident, my_awesome_feature)
- Start Date: (fill me in with today's date, YYYY-MM-DD)
- RFC PR: [amundsen-io/rfcs#0000](https://github.com/amundsen-io/rfcs/pull/0000) (after opening the RFC PR, update this with a link to it and update the file name)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- RFC PR: [amundsen-io/rfcs#0000](https://github.com/amundsen-io/rfcs/pull/0000) (after opening the RFC PR, update this with a link to it and update the file name)
- RFC PR: [amundsen-io/rfcs#0009](https://github.com/amundsen-io/rfcs/pull/9)

## Support Lineage In Amundsen


This RFC drafts how lineage should be supported in Amundsen and why the feature is important for the trust in data initiative.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
This RFC drafts how lineage should be supported in Amundsen and why the feature is important for the trust in data initiative.
This RFC drafts how lineage should be supported in Amundsen and why the feature is important.

* Column lineage: It tracks information in fine detail, at the level of particular columns rather than entire tables. It shows how data of a given column is generated based on upstream columns, and how it affects which downstream columns.
* _Service and event lineage_: which service produces and generates the message for a given event table.

Currently Amundsen doesn’t support showing Table level lineage in the UI but instead redirect users to use , which is a 3rd party tool at Lyft, to refer to the lineage information.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

to use X ,

Something missing here, right?




* The link to in-house lineage tool doesn’t always work. If the table/dataset is not indexed in in-house lineage tool, the user will get a 404 (e.g [user report](https://lyft.slack.com/archives/CBHCUQB2Q/p1584997367078000)).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We'll probably want to remove that report from our internal Slack



* The link to in-house lineage tool doesn’t always work. If the table/dataset is not indexed in in-house lineage tool, the user will get a 404 (e.g [user report](https://lyft.slack.com/archives/CBHCUQB2Q/p1584997367078000)).
* The UI in in-house lineage tool is hard to browse the actual lineage as it displays the full lineage instead one level at a time (E.g [graph](https://in-house lineage tool.lyft.net/lineage-report/relation/warehouse.core.fact_rides.html?cwRelationContent=graph&cwRelationGraph=dataflow) )
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another Lyft only link

* Other popular OSS projects (e.g Apache Atlas) provide the UI support on lineage which is a feature gap for Amundsen adoption in the community.


### Why is lineage important for trust in data?
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
### Why is lineage important for trust in data?
### Why is lineage important?

Signed-off-by: feng-tao <fengtao04@gmail.com>
@feng-tao
Copy link
Member Author

close this pr to favor of #24 which is a more updated version.

@feng-tao feng-tao closed this Feb 22, 2021
@feng-tao feng-tao deleted the tfeng_add_lineage_rfc branch March 10, 2021 17:28
@Golodhros Golodhros added Status: Rejected closed without being merged and removed Status: Draft labels May 28, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Status: Rejected closed without being merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants