
feat: 🎸 replace Queue.add_job with Queue.upsert_job #694

Merged
merged 3 commits into from
Jan 23, 2023

Conversation

severo (Collaborator) commented on Jan 23, 2023

upsert_job ensures there is only one waiting job for a given set of parameters. On every call to upsert_job, all previous waiting jobs for the same set of parameters are cancelled, and a new one is created with a fresh "created_at" date, which means it is placed at the end of the queue. This helps with datasets that are updated very often (e.g., every minute): they will only be processed when workers become available.

It's a quick PR to fix the issue that the queues are growing faster than the workers can drain them, and that most of these jobs would later be skipped anyway. It's better to reduce the number of results in the queries by reducing the number of waiting jobs.

@severo severo merged commit 984f0b5 into main Jan 23, 2023
@severo severo deleted the reduce-size-of-queue branch January 23, 2023 15:17
severo (Collaborator, Author) commented on Jan 23, 2023

See for example: https://huggingface.co/datasets/atokforps/latent_v1_fullrun_alpha3_13/commits/main. Two commits are pushed every minute. The queues contain thousands of waiting jobs.

[Screenshot, 2023-01-23 16:18: queue sizes before the fix]

severo (Collaborator, Author) commented on Jan 23, 2023

The issue is effectively fixed.

[Screenshot, 2023-01-23 16:46: queue sizes after the fix]

For the "/first-rows" queue, we will first have to wait for /splits to be run again on those datasets.

severo (Collaborator, Author) commented on Jan 23, 2023

OK, #695 helped reduce the number of waiting jobs for /first-rows. There is no way to go further, since there are no more duplicates (for example, allenai/nllb has 2,656 splits, and thus up to 2,656 jobs).
