Allow for negative lists #2089

Popolechien · 2024-09-24T09:36:44Z

Currently WP1 allows us to build a ZIM file based on a selection of articles. But we have an increasing number of deployments where would-be institutional users have to back out because some content is deemed inappropriate:

prisons, where apparently inmates will look up their fellow prisoners and shiv whoever has an entry to their name for sexual crimes;
schools, where kids' first instinct is to look up 69, panda porn and so on.

Ideally we should be able to run a bespoke recipe and attach a .tsv list of articles to be skipped entirely during the scraping process. Having looked at the prison case, I would actually recommend omitting every article that contains any of the given strings, so that even when someone has no article of their own, a cursory mention elsewhere cannot be found through search.
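To make the two behaviours concrete, here is a rough Python sketch. It is not the scraper's actual code: the function names, the `match_body` flag, and the in-memory `articles` dict are all made up for illustration, and it assumes the first column of the .tsv holds the term to block.

```python
def load_ignore_terms(tsv_text):
    """Return the first column of each non-empty TSV line as a set of terms."""
    terms = set()
    for line in tsv_text.splitlines():
        first = line.split("\t")[0].strip()
        if first:
            terms.add(first)
    return terms


def should_skip(title, body, terms, match_body=False):
    """Decide whether an article should be left out of the build.

    match_body=False: skip only articles whose title is on the list.
    match_body=True:  also skip any article whose title or text contains
                      one of the terms (case-insensitive substring match).
    """
    if title in terms:
        return True
    if not match_body:
        return False
    haystack = (title + "\n" + body).casefold()
    return any(term.casefold() in haystack for term in terms)


def filter_articles(articles, terms, match_body=False):
    """Return a new dict containing only the articles that are kept.

    `articles` is a hypothetical {title: text} mapping standing in for
    whatever the scraper would fetch.
    """
    return {
        title: body
        for title, body in articles.items()
        if not should_skip(title, body, terms, match_body)
    }
```

One caveat with the broader mode: plain substring matching over-blocks. A short term such as "69" would also drop every article that mentions "1969", so a real list would probably need whole-word matching or longer, more specific strings.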

kelson42 · 2024-09-24T18:44:40Z

This is called "Article List to ignore", and it is already implemented.

Duplicate of #1706

Popolechien added the enhancement label Sep 24, 2024

kelson42 closed this as completed Sep 24, 2024

kelson42 self-assigned this Sep 24, 2024

kelson42 added this to the 1.14.0 milestone Sep 24, 2024

kelson42 added the duplicate label Sep 24, 2024
