You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Did a harvest source clear and reharvest. The issue is back but the duplicate dataset changed to international-summary-statistics international-summary-statistics-2ddc8
The root cause of the issue is that package international-summary-statistics has a harvest object associated with it on the UI and SOLR but that harvest object has no package_id in the DB. This discrepancy makes duplicate dataset keeps coming back after dedupe process.
Manually api calls have fixed the packages for the the harvest source federal-reserve.
To address the issue, we need to improve dedupe process to detect and delete packages associated with harvest objects that has no pacakge_ids.
After dedupe, a duplicate of https://catalog.data.gov/dataset/national-settlement-service-data keeps coming back after next harvest job.
How to reproduce
Expected behavior
The dataset number should not change.
Actual behavior
One duplicate created for dataset https://catalog.data.gov/dataset/national-settlement-service-data
Sketch
Three approaches to fix the issue
ckanext-datajson
.The text was updated successfully, but these errors were encountered: