
Negative Loss on gpu ALS model #367

Closed
sikhad opened this issue Jul 14, 2020 · 6 comments · Fixed by #663

Comments

sikhad commented Jul 14, 2020

I'm getting a negative loss value when running ALS using GPU (loss = -.0346) regardless of varying all parameters. When running the same data/parameters on CPU, I'm getting a positive loss. I'm confused why loss could be negative.

It's a ~6500 x 1m csr matrix.

import implicit

params = {'factors': 64,
          'use_gpu': True,
          'use_native': True,
          'use_cg': True,
          'regularization': 0,
          'num_threads': 0,
          'iterations': 5,
          'calculate_training_loss': True}

# initialize a model
model = implicit.als.AlternatingLeastSquares(**params)

# train the model on a sparse matrix of item/user/confidence weights
# (csr is the ~6500 x 1m scipy.sparse CSR matrix described above)
model.fit(csr, show_progress=True)

benfred commented Jan 13, 2022

It's looking like the GPU loss calculation might be buggy (see also #441).

benfred changed the title from "Negative Loss" to "Negative Loss on gpu ALS model" on Jan 22, 2022
benfred added a commit that referenced this issue Jun 6, 2023
The GPU ALS model would sometimes return incorrect results with the
`calculate_training_loss` parameter enabled. This happened when
number_of_users * number_of_items was bigger than 2**31, due to
an overflow in the loss function calculation.

Fix, and add tests that would have caught this bug.

Closes #367
Closes #441
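
For illustration, here is a minimal NumPy sketch of the kind of 32-bit overflow the commit message describes (an illustrative assumption, not the library's actual CUDA code): the reported ~6500 x 1m matrix has about 6.5 billion user/item cells, which does not fit in a signed 32-bit integer, so a 32-bit product wraps around to a negative value.

import numpy as np

n_users, n_items = 6500, 1_000_000   # roughly the matrix size reported above

# correct count in 64-bit arithmetic: well above 2**31
total_64 = np.int64(n_users) * np.int64(n_items)

# the same product forced into 32-bit wraps around and goes negative
total_32 = (np.array([n_users], dtype=np.int32) *
            np.array([n_items], dtype=np.int32))[0]

print(total_64)   # 6500000000
print(total_32)   # -2089934592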

benfred commented Jun 6, 2023

There was a bug with the calculate_training_loss parameter when number_of_items * number_of_users was bigger than 2**31. This will be fixed by #663 in the next release.

Thanks for reporting - sorry about the lengthy delay in getting this resolved.
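
As a quick check (a sketch against the snippet above, not part of the implicit API), you can tell whether a dataset falls into the affected regime by comparing the product of its dimensions against 2**31; Python ints don't overflow, so the comparison itself is safe:

# csr is the sparse matrix passed to model.fit() above
n_rows, n_cols = csr.shape
print(n_rows * n_cols > 2**31)   # True for the ~6500 x 1m matrix reported here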

gallir commented Jun 6, 2023

Thanks. Will you release a new pip module version?

benfred commented Jun 6, 2023

@gallir - I'm working on getting a new version together; I also want to get changes like #661 and #656 pushed out as well.

I'd also like to fix the conda packaging errors with this version - once I have a handle on that, I'll push out a new release.

benfred commented Jun 13, 2023

@gallir - the fix is in v0.7.0

gallir commented Jun 13, 2023

v0.7.0

Thank you very much. I had modified your build yml to use your latest version; it worked better than before: https://github.com/gallir/implicit
