Dead link data incorporation #3585
Labels
💻 aspect: code
Concerns the software code in the repository
✨ goal: improvement
Improvement to an existing user-facing feature
🧭 project: thread
An issue used to track a project and its progress
🧱 stack: catalog
Related to the catalog and Airflow DAGs
Description
Summary
Develop and document a process for handling dead links in the Catalog, in order to make dead link validation in the API faster.
Details
Dead link validation is a critical part of the API, and it sometimes slows down API responses. Currently, we do not have a well-described process for removing dead links from the catalog. We need:
removed_from_source set, perhaps using the batched_update DAG?
DeletedMedia tables?
Documents
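As a rough illustration of the first option (flagging dead links via removed_from_source rather than deleting rows), here is a minimal sketch. It is not the batched_update DAG and does not use its interface. It runs against an in-memory SQLite database, and the `image` table layout, the `mark_removed_from_source` and `build_demo_db` helpers, and the example URLs are all hypothetical, chosen only to show the idea of updating flagged rows in small batches.

```python
import sqlite3
from typing import Iterable, Iterator, List


def chunked(items: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def mark_removed_from_source(
    conn: sqlite3.Connection,
    dead_identifiers: Iterable[str],
    batch_size: int = 2,
) -> int:
    """Flag the given rows as removed, committing one batch at a time.

    Returns the number of rows that were actually changed.
    Assumption: `image` and its columns are a hypothetical stand-in schema.
    """
    updated = 0
    for batch in chunked(list(dead_identifiers), batch_size):
        placeholders = ", ".join("?" for _ in batch)
        with conn:  # commits the batch, or rolls it back on error
            cursor = conn.execute(
                "UPDATE image SET removed_from_source = 1 "
                f"WHERE identifier IN ({placeholders}) "
                "AND removed_from_source = 0",
                batch,
            )
            updated += cursor.rowcount
    return updated


def build_demo_db() -> sqlite3.Connection:
    """Create a tiny in-memory table for the demonstration (hypothetical data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE image ("
        "identifier TEXT PRIMARY KEY, "
        "url TEXT NOT NULL, "
        "removed_from_source INTEGER NOT NULL DEFAULT 0)"
    )
    conn.executemany(
        "INSERT INTO image (identifier, url) VALUES (?, ?)",
        [
            ("a", "https://example.com/a.jpg"),
            ("b", "https://example.com/b.jpg"),
            ("c", "https://example.com/c.jpg"),
        ],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    demo = build_demo_db()
    changed = mark_removed_from_source(demo, ["a", "c", "unknown-id"])
    print(f"rows flagged: {changed}")
```

Flagging keeps the row available for audit or later restoration, while moving rows to the DeletedMedia tables would remove them from the main table entirely. Which of the two fits the catalog better is the open question this project is meant to answer.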
Milestones/Issues
Prior Art
This project combines the project ideas of Establish Guidelines and Practices for Dead links and Set up Dead links Removal Pipeline Using Redis Cache.