Amundsen is a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Think of it as Google Search for data.
- Deployed using Kubernetes and Running in Staging --> Production
- Feedback from Data Engineers, Data Scientists, Data Analysts & Researchers
- Access by Non Technical Users as well
- Deployed and made changes in the code!
- Using ElasticSearch external & Neo4j from Helm
- OSS Community is Awesome
- Slack Community is very active
- Check RFCs and PR for upcoming features
- Code is very well documented and it really easy to make changes
For any further discussion please reach out in Linkedin or Telegram https://t.me/datainfra
Process is Straight forward using Docker https://www.amundsen.io/amundsen/installation/. This is the easiest point to start and understand the setup. You can write your custom Data Uploader to verify
- Configuration Changes need building Docker Image and then using it inside Kubernetes. This is not easy if you are changing stuff frequently.
- Many New/POC features need config update.
- Main UI is clean and very easy to understand
- Tags are not easily searchable, sometimes nothing comes up
- Search is quick and clean. Shows enough infomartion
- Resource UI has enough kwds to search but can be improved