Missing info from README #6

Closed
manuelmeurer opened this issue Nov 13, 2012 · 11 comments

@manuelmeurer

I just found this project while googling how to make sure certain Sidekiq jobs are not executed multiple times. sidekiq-unique-jobs seems to do exactly that... awesome!

I think there is some info missing in the README though, specifically:

  • Are worker arguments taken into account? So if I have a HardWorker and I call HardWorker.perform_async('bob', 5) multiple times, that job should obviously only be queued once. But what if I call HardWorker.perform_async('bob', 5) and HardWorker.perform_async('jane', 10)? Are both those jobs queued? I suppose so but I'm not 100% sure.
  • Why is the expiration parameter needed? Does it mean that by default the same job cannot be enqueued again up to 30min after it was removed from the queue?

I think both these points (and possibly more) should be explained in the README.
I'm happy to prepare a pull request for it, if you answer my questions in here.

Thanks for your work on this!

@philostler
Contributor

+1

@varunlalan

+1

@mhodgson

mhodgson commented Apr 9, 2013

+1

@abacha

abacha commented Apr 25, 2013

+1

@mhenrixon
Owner

Ok, so be gentle with me while I try to explain. I'll be the first one to admit I suck at both documentation and explaining this. Work in progress, so to speak.

  • Worker arguments are taken into account. You can also select which arguments count towards uniqueness by specifying a lambda or a class method to handle this (that option might have been added after you asked your question, by the way). See the sketch below this list.
  • The expiration doesn't need to be set. Think of it as a simple timeout for uniqueness: if you set it to two hours, no jobs with the same arguments (or unique arguments) will be scheduled until that time has passed.
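
A minimal sketch of both points. The unique_args lambda matches what is described above, but EmailWorker, its arguments, and the exact option name are made up for illustration; check the README for your version of the gem:

class EmailWorker
  include Sidekiq::Worker
  # unique: true turns on uniqueness checking.
  # unique_args picks which arguments count towards uniqueness;
  # here only the first argument (the user id) matters.
  sidekiq_options unique: true, unique_args: ->(args) { [args.first] }

  def perform(user_id, message)
    # ...
  end
end

EmailWorker.perform_async(1, "hi")  # enqueued
EmailWorker.perform_async(2, "hi")  # enqueued, different user id
EmailWorker.perform_async(1, "bye") # dropped as a duplicate of the first job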

Unfortunately it seems like workers calling nested workers still causes jobs to be duplicated (like in #10). If anyone wants to take a stab at reproducing the problem in a test, we should be able to fix it.

@nberger

nberger commented Jun 20, 2013

I don't understand what exactly the expiration parameter does, either. Does it only affect jobs scheduled with #perform_in, but not with #perform?

If it affects #perform, I think a better default should be 0, instead of the current 30 * 60 (30 minutes).

@mhenrixon
Owner

@nberger it only affects jobs scheduled with perform_in or perform_at.
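
In other words, something like the following (ReportWorker is hypothetical, and unique_job_expiration is the option name used later in this thread):

class ReportWorker
  include Sidekiq::Worker
  # The expiration only matters for scheduled jobs: a job scheduled in the
  # future holds its uniqueness lock for this window (two hours here).
  sidekiq_options unique: true, unique_job_expiration: 2 * 60 * 60

  def perform(report_id)
    # ...
  end
end

ReportWorker.perform_in(60 * 60, 42) # scheduled, takes the lock
ReportWorker.perform_in(10 * 60, 42) # dropped while the lock is held
ReportWorker.perform_async(42)       # immediate jobs are unaffected by the expiration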

@astjohn

astjohn commented Feb 27, 2014

@mhenrixon I also wouldn't mind a brief explanation in the README of exactly how the uniqueness is established. It would be nice not to have to dig through the code to ensure locking is performed properly. I would much rather take your word for it!

That said, I noticed that setex is used to increment a counter on a key that is the arguments to the worker. Is that correct? I noticed a pattern in the Redis documentation using setnx (whose documentation now recommends plain old set to implement a locking system instead).

A small explanation of the locking procedure and how it is thread safe would definitely help me, and I'm sure many others, gain even more confidence in using the gem. Any chance you could clarify it for me? Thanks!
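
For context, the pattern the Redis docs now recommend looks roughly like this with redis-rb. This is a generic sketch of SET with NX and EX, not the gem's actual internals; the key layout is made up:

require "digest"
require "json"
require "redis"
require "securerandom"

redis = Redis.new
args_digest = Digest::MD5.hexdigest(["bob", 5].to_json)
lock_key    = "uniquejobs:HardWorker:#{args_digest}" # hypothetical key layout
token       = SecureRandom.hex(16)

# SET with nx: (only set if the key is absent) and ex: (a TTL in seconds)
# acquires the lock atomically, replacing the old SETNX + EXPIRE dance:
if redis.set(lock_key, token, nx: true, ex: 30 * 60)
  # Lock acquired: safe to enqueue the job.
else
  # A job with the same arguments already holds the lock: skip enqueueing.
end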

@tyetrask

Hello everyone,

I had a few questions about this gem and this issue thread seems to be somewhat centered around my questions.

Essentially, if I have a worker ("DoStuff") and I queue a job for that worker with the following:

DoStuff.perform_async("with unique argument")

and then I run that same command again with the same arguments, I don't want the second instance of the job added to the queue if the first instance has not completed yet.

The way I've read the documentation, I expect the job not to be duplicated no matter how long it has been if a job with the same worker, queue, and arguments is still waiting to be processed.

What I'm currently experiencing is that the job won't add to the queue if it's within the expiration time. However, if the first job has not been completed, but the unique job expiration time has passed (10 minutes, in my case) and I run it again, it does add the duplicate job even if the first one has not completed!

Is this the expected behavior of the gem? If not, is there a configuration option I am missing?

Here is an example of a worker and the options I'm using:

class DoStuff
  include Sidekiq::Worker
  sidekiq_options queue: "queue_1",
                  unique: true,
                  unique_job_expiration: 60 * 10 # uniqueness lock expires after 10 minutes

  def perform(arguments)
    # Do some unique things.
  end
end

Thanks,
Tye

@mhenrixon
Owner

@tyetrask yeah, that is sort of expected. I suggest you try something like sidekiq-throttler instead; that should better help you achieve what you want. I am opening an issue for deciding on how to proceed with this.

@tyetrask

Hey @mhenrixon, thanks for the information! We needed to move forward with our project, so we ended up writing middleware that behaves the way we needed. I appreciate all of the work on this and will be keeping an eye on it in the future. Thanks again!
