Re-implementation of the LHCb Collider DY Datasets #1826

Radonirinaunimi · 2023-10-25T20:13:48Z

This PR implements the LHCb Collider DY datasets in the new commondata format. The table summarizes the status of the implementation and provides general comments on the comparisons w.r.t. the old implementation. The dataset names in the table are the new ones; the mapping to the old ones is tracked in dataset_names.yml.

Dataset Name	Comparison vs. Old	Status
LHCB_Z0_7TEV_DIELECTRON_Y	☑️	☑️
LHCB_Z0_8TEV_DIELECTRON_Y	☑️	☑️
LHCB_Z0_13TEV_Y-DIELECTRON	☑️	☑️
LHCB_Z0_13TEV_Y-DIMUON	☑️	☑️
LHCB_Z0_7TEV_MUON_Y	☑️	☑️
LHCB_DY_7TEV_MUON_Y	☑️	☑️
LHCB_Z0_8TEV_MUON_Y	☑️	☑️
LHCB_DY_8TEV_MUON_Y	☑️	☑️
LHCB_WPWM_7TEV_MUON_Y	☑️	☑️
LHCB_WPWM_8TEV_MUON_Y	☑️	☑️
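
For reference, the old-to-new name mapping for these ten datasets can be written as a plain lookup table. This is only a sketch: the pairs are the ones quoted in the chi2 comparison in this thread, while the dictionary `OLD_TO_NEW` and the helper `new_name` are hypothetical names for illustration and do not reproduce the actual syntax of dataset_names.yml.

```python
# Old -> new dataset names, copied from the pairs listed in this thread.
# OLD_TO_NEW and new_name are illustrative names, not part of the codebase.
OLD_TO_NEW = {
    "LHCBWZMU7TEV": "LHCB_DY_7TEV_MUON_Y",
    "LHCBWZMU8TEV": "LHCB_DY_8TEV_MUON_Y",
    "LHCBZEE2FB_40": "LHCB_Z0_8TEV_DIELECTRON_Y",
    "LHCB_Z_13TEV_DIELECTRON": "LHCB_Z0_13TEV_Y-DIELECTRON",
    "LHCB_Z_13TEV_DIMUON": "LHCB_Z0_13TEV_Y-DIMUON",
    "LHCBZMU7TEV": "LHCB_Z0_7TEV_MUON_Y",
    "LHCBZMU8TEV": "LHCB_Z0_8TEV_MUON_Y",
    "LHCBWMU7TEV": "LHCB_WPWM_7TEV_MUON_Y",
    "LHCBWMU8TEV": "LHCB_WPWM_8TEV_MUON_Y",
    "LHCBZ940PB": "LHCB_Z0_7TEV_DIELECTRON_Y",
}


def new_name(old_name):
    """Return the new-format name for an old dataset name.

    Raises KeyError for names that are not in the table, so that a
    missing mapping is noticed instead of silently passed through.
    """
    return OLD_TO_NEW[old_name]
```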

Final Report: https://vp.nnpdf.science/4ub8cAYxTa2xPP6oM6X4_w==

New jet data

scarlehoff

I know this is just a draft, but since I was already reviewing some other dataset PR I thought it was better to add some quick comments so that they are automatically applied to the other datasets that are going to be implemented!

buildmaster/LHCB_DY_13TEV_DIELECTRON/metadata.yaml

Radonirinaunimi · 2023-10-26T11:50:36Z

Thanks for the quick feedback @scarlehoff. I was indeed about to address some of the points you raised above after reviewing everything and testing with #1678. I haven't done that yet, but I already addressed most of your comments in cc85bb9.

Maybe as a general comment, it would be useful to have #1708 up to date, to include kinematic_override for example? (This was the reason why I forgot about it although we discussed it.)

scarlehoff · 2023-10-26T12:03:10Z

kinematic_override might not be necessary (at the moment, for the ones that don't use it, I asked @t7phy to just put identity for the time being until I decide what to do with the xq plot), but when doing a re-implementation of the old commondata the whole plotting file needs to be ported over.
Another option is of course modifying the kinematics so the override is not necessary, but I think for the re-implementation it is better to leave them as close to the original as possible.

cschwan · 2023-10-27T07:37:18Z

Commit eb45404 adds the dataset that I've been working on, which still has lots of unfilled fields (marked with TODO). If someone could give this a review I'd be grateful!

buildmaster/ATLAS_DY_7TEV_LOMASS_EXT/metadata.yaml

Radonirinaunimi

Thanks @cschwan! I've tried to address the TODOs below. A minor comment: you should also add the mapping to the old names in dataset_names.yml.

buildmaster/ATLAS_DY_7TEV_LOMASS_EXT/metadata.yaml

Co-authored-by: Tanjona Rabemananjara <rrabeman@nikhef.nl>

scarlehoff

Hi @Radonirinaunimi, many thanks for this.

I've had a first go at this set of datasets. I've added the possibility of using extra_labels with label_based figure_by to #1678, so the datasets in this branch now work.

The tests I'm doing for every dataset are three:

1. Check that the central data is exactly the same.
2. Check that the computed chi2 agrees exactly (meaning that the covmat is read and created equally).
3. Finally, produce a report.
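
The first two checks can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the actual test code: the helpers `chi2` and `same_dataset` are hypothetical, each implementation is represented simply as a `(central_values, covmat)` pair, and loading the old and new commondata is not shown.

```python
import numpy as np


def chi2(data, theory, covmat):
    """Return (D - T)^T C^{-1} (D - T) for a given covariance matrix."""
    diff = np.asarray(data, dtype=float) - np.asarray(theory, dtype=float)
    # Solve C x = diff instead of inverting C explicitly.
    return float(diff @ np.linalg.solve(covmat, diff))


def same_dataset(old, new, theory):
    """Compare two implementations given as (central_values, covmat) pairs.

    Hypothetical helper: returns a dict with one boolean per check.
    """
    old_data, old_cov = old
    new_data, new_cov = new
    return {
        # Check 1: the central values must match exactly.
        "central": bool(np.array_equal(old_data, new_data)),
        # Check 2: same chi2 for the same theory prediction, up to rounding.
        "chi2": bool(
            np.isclose(
                chi2(old_data, theory, old_cov),
                chi2(new_data, theory, new_cov),
                rtol=1e-10,
            )
        ),
    }
```

Comparing the chi2 rather than only the central values is what makes the second check sensitive to how the uncertainties are read and combined into the covmat.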

Test 1 is passed by every dataset.
Test 2 is passed for the following pairs:

"LHCBWZMU7TEV", "LHCB_DY_7TEV_MUON_Y"
"LHCBWZMU8TEV", "LHCB_DY_8TEV_MUON_Y"
"LHCBZEE2FB_40", "LHCB_Z0_8TEV_DIELECTRON_Y"
"LHCB_Z_13TEV_DIELECTRON", "LHCB_Z0_13TEV_Y-DIELECTRON"
"LHCB_Z_13TEV_DIMUON", "LHCB_Z0_13TEV_Y-DIMUON"

while for these

"LHCBZMU7TEV", "LHCB_Z0_7TEV_MUON_Y",
"LHCBZMU8TEV", "LHCB_Z0_8TEV_MUON_Y",
"LHCBWMU7TEV", "LHCB_WPWM_7TEV_MUON_Y",
"LHCBWMU8TEV", "LHCB_WPWM_8TEV_MUON_Y",
"LHCBZ940PB", "LHCB_Z0_7TEV_DIELECTRON_Y"

the chi2 values differ. Please have a look at them!
I've had a look at the covmat and even the diagonal is different.
In particular, for LHCB_Z0_7TEV_DIELECTRON_Y there's something going on because the new covmat has values like 10^6. There might be an order-of-magnitude problem in some uncertainties?
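
A quick way to localize this kind of discrepancy is to compare the diagonals of the two covmats bin by bin. The following is a minimal sketch, assuming both matrices are already available as NumPy arrays with positive variances; `diagonal_mismatch` is a hypothetical helper name.

```python
import numpy as np


def diagonal_mismatch(old_cov, new_cov, rtol=1e-8):
    """List the bins whose variances differ between two covariance matrices.

    Returns (bin_index, log10(new_variance / old_variance)) tuples. A value
    near 6 means the new uncertainty is roughly 10^3 times the old one.
    """
    old_var = np.diag(np.asarray(old_cov, dtype=float))
    new_var = np.diag(np.asarray(new_cov, dtype=float))
    differs = ~np.isclose(old_var, new_var, rtol=rtol, atol=0.0)
    return [
        (int(i), float(np.log10(new_var[i] / old_var[i])))
        for i in np.flatnonzero(differs)
    ]
```

An empty list means the diagonals agree within `rtol`; off-diagonal differences would need a separate comparison.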

I've done the report for the 5 datasets that fully agree so that you can have a look and check that everything is correct. I cannot see any obvious problems, but of course you've been implementing them so you might know better!
The report is ordered by pairs (so first/second, third/fourth, etc.).

https://vp.nnpdf.science/vY3OvQMqTxybygGVncMkvQ==

Edit: If there is no obvious fix to the 5 that produce a different covmat I would be happy with separating the 5 that are 100% ok and merging them by themselves.

buildmaster/LHCB_Z0_13TEV/metadata.yaml

Co-authored-by: Juan M. Cruz-Martinez <juacrumar@lairen.eu>

Radonirinaunimi · 2023-11-13T20:58:08Z

@scarlehoff, if you run the checks for the remaining datasets everything should now be OK. I wasn't able to produce the report which includes all the datasets myself due to the pandoc stacksize error (that I need to find a solution to).

There were basically two main problems. For LHCB_Z0_7TEV_DIELECTRON_Y, the issue was that the normalization uncertainty was off by a factor of $10^3$. For the rest, since these are separations of the NC and CC components of the same measurement, I previously only dumped the uncertainties corresponding to the process bins. Now all the correlated uncertainties are dumped, and that fixed the issue.
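
To illustrate why a factor of $10^3$ in one uncertainty shows up as $10^6$ in the covmat, here is a minimal sketch with toy numbers (not taken from any LHCb measurement). It assumes the covariance is built from uncorrelated statistical errors plus fully correlated systematic shifts; `build_covmat` is a hypothetical helper.

```python
import numpy as np


def build_covmat(stat, correlated_shifts):
    """Covariance from uncorrelated statistical errors plus correlated systematics.

    stat:              shape (ndata,), absolute statistical uncertainties
    correlated_shifts: shape (ndata, nsys), absolute shifts, each column
                       fully correlated across bins
    """
    stat = np.asarray(stat, dtype=float)
    shifts = np.asarray(correlated_shifts, dtype=float)
    return np.diag(stat**2) + shifts @ shifts.T


# Toy inputs: three bins with a 2% normalization uncertainty.
central = np.array([10.0, 20.0, 30.0])
stat = np.array([0.5, 0.8, 1.0])
norm = 0.02 * central  # absolute shifts

good = build_covmat(stat, norm[:, None])
bad = build_covmat(stat, 1e3 * norm[:, None])  # same source, wrong by 10^3

# The systematic part of every covmat entry grows by (10^3)^2 = 10^6.
sys_ratio = (bad - np.diag(stat**2)) / (good - np.diag(stat**2))
```

Because each correlated source enters as an outer product, dropping some of the correlated columns changes both the diagonal and the off-diagonal entries, which is consistent with the diagonal mismatch reported above.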

scarlehoff · 2023-11-14T11:16:29Z

Thank you very much! I find agreement for all of them and the plots generally look the same as before afaics

Here's a report with all datasets. Please check that everything looks as you expected. By that I mean, plotting options, axis scale and all that.
(I've used the QCD cfactor for the prediction, and NRM for the ones that needed it as well).

https://vp.nnpdf.science/4ub8cAYxTa2xPP6oM6X4_w==

Once you confirm this is ok, this is ready to merge. However, after looking at #1785 I've realized the old-new mapping syntax might need to change a bit. If it does, I'll add the appropriate change here (and update the docs), so there is nothing to do from your side, but I just wanted to mention it!

(pinging @enocera in case he wants to have a second look or wants to add anything to any of it before merging)

enocera · 2023-11-14T11:19:45Z

@Radonirinaunimi Thanks for your work. @scarlehoff I'd like to have a second look at the implementation before merging, if you don't mind.

Radonirinaunimi · 2023-11-14T13:10:47Z

As far as I am concerned, this is basically ready. So once @enocera approves (thanks for also having a look) we can merge this.

scarlehoff

I've repeated the checks with the final version of this branch:

- The central values are the same
- The covariance matrices are the same
- The computed chi2 is the same
- The t0 chi2 is the same (so multiplicative and additive are equivalent in the new and old implementation)
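
The last check is sensitive to how each systematic is labelled. A minimal sketch of the idea, assuming the usual t0 prescription in which multiplicative uncertainties are rescaled by the t0 predictions while additive ones are kept as absolute shifts; the function `covmat` and all numbers are illustrative only.

```python
import numpy as np


def covmat(stat, add_abs, mult_rel, reference):
    """Covariance with multiplicative sources rescaled by `reference`.

    stat:      (ndata,) absolute uncorrelated uncertainties
    add_abs:   (ndata, nadd) absolute additive shifts
    mult_rel:  (ndata, nmult) multiplicative shifts as fractions
    reference: (ndata,) central data for the experimental covmat,
               t0 predictions for the t0 covmat
    """
    mult_abs = mult_rel * reference[:, None]
    shifts = np.hstack([add_abs, mult_abs])
    return np.diag(stat**2) + shifts @ shifts.T


# Toy numbers only.
central = np.array([10.0, 20.0])
t0_pred = np.array([11.0, 19.0])
stat = np.array([0.5, 0.5])
rel = np.full((2, 1), 0.02)  # a single 2% source
empty = np.zeros((2, 0))     # placeholder for "no sources of this type"


def as_mult(reference):
    """The source labelled as multiplicative."""
    return covmat(stat, empty, rel, reference)


def as_add(reference):
    """The same source stored as absolute shifts and labelled as additive."""
    return covmat(stat, rel * central[:, None], empty, reference)


same_exp = np.allclose(as_mult(central), as_add(central))
same_t0 = np.allclose(as_mult(t0_pred), as_add(t0_pred))
```

With these toy inputs `same_exp` is `True` while `same_t0` is `False`: a mislabelled source is invisible in the experimental covmat but changes the t0 one, which is why agreement of the t0 chi2 confirms the additive/multiplicative treatment.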

The report in this comment #1826 (comment) is unchanged since @Radonirinaunimi has not updated any metadata.yaml files (only the uncertainties) in his last commits.

So this one is perfectly ready to go from my side.

Radonirinaunimi · 2023-12-05T21:02:22Z

Thanks a lot for these additional checks @scarlehoff! This is completely done from my side as well.

scarlehoff and others added 9 commits October 9, 2023 14:54

add dataset_names.yml

c7bd39a

atlas jets

730a872

cms jets

01a655f

address comments atlas jet

d41ab11

address comments atlas dijet

c95aa17

address comments cms jet

f15340f

Merge pull request #1821 from NNPDF/ncd_new_jets

b2e9195

New jet data

Implementing subsets of LHCb data

af37420

Update dataset namings

655c1b9

Radonirinaunimi added the data toolchain label Oct 25, 2023

Radonirinaunimi assigned cschwan, Radonirinaunimi, niclaurenti and peterkrack Oct 25, 2023

Radonirinaunimi marked this pull request as draft October 25, 2023 20:14

scarlehoff reviewed Oct 26, 2023

Address comments from pre-review

cc85bb9

Radonirinaunimi and others added 2 commits October 26, 2023 16:42

Combined di-electron and di-muon for Z 13TeV

77a6e4e

Add tentative implementation for ATLAS DY 7 TeV low-mass measurement

eb45404

scarlehoff reviewed Oct 27, 2023

buildmaster/ATLAS_DY_7TEV_LOMASS_EXT/metadata.yaml (3 review comments, outdated and resolved)

Radonirinaunimi commented Oct 27, 2023

cschwan and others added 6 commits October 27, 2023 10:14

Apply suggestions from code review

4d5538d

Co-authored-by: Tanjona Rabemananjara <rrabeman@nikhef.nl>

Rename NC DY -> Z0

bea9feb

fix setname in renaming NC DY -> Z0

58b3a21

add NC & CC DY productions in muon rapidity at 7TeV

41717e2

add NC & CC DY productions in muon rapidity at 8TeV

db3dbab

fix some metadata entries in NC & CC DY at 7 TeV

a08cd76

scarlehoff reviewed Nov 13, 2023

buildmaster/LHCB_Z0_13TEV/metadata.yaml (1 review comment, outdated and resolved)

scarlehoff mentioned this pull request Nov 13, 2023

Status of the new commondata format implementation #1709

Closed

77 tasks

Radonirinaunimi and others added 2 commits November 13, 2023 12:21

Fix incorrect set name

15013e3

Co-authored-by: Juan M. Cruz-Martinez <juacrumar@lairen.eu>

fixed remaining problematic datasets

8197945

scarlehoff and others added 7 commits November 14, 2023 15:35

swap old and new and add an example with a variant

ac996c0

add dataset_names.yml

4702dc3

atlas jets

6bbea80

cms jets

12abba3

address comments atlas jet

6567d91

address comments atlas dijet

f56f44c

address comments cms jet

e64d092

scarlehoff force-pushed the new_commondata_collected branch from b2e9195 to e64d092 Compare November 17, 2023 16:20

scarlehoff and others added 3 commits November 17, 2023 17:22

Merge branch 'new_commondata_collected' into collider_dy_ncd

ce5aa48

Fix minor details in descriptions

7268200

fix ambiguities in defining syst treatments

666c786

scarlehoff approved these changes Dec 5, 2023

replace sqrt_s -> sqrts

f414e35

scarlehoff added a commit that referenced this pull request Feb 1, 2024

merged #1826, collider DY LHCb

eaa1a2f

scarlehoff force-pushed the new_commondata_collected branch from e64d092 to 929b692 Compare February 1, 2024 14:26

scarlehoff closed this Feb 1, 2024

scarlehoff added a commit that referenced this pull request Feb 6, 2024

merged #1826, collider DY LHCb

740cc8e

scarlehoff added a commit that referenced this pull request Feb 7, 2024

merged #1826, collider DY LHCb

76d31f5

scarlehoff added a commit that referenced this pull request Feb 7, 2024

merged #1826, collider DY LHCb

d657ed7

scarlehoff added a commit that referenced this pull request Feb 12, 2024

merged #1826, collider DY LHCb

136a36f

scarlehoff deleted the collider_dy_ncd branch November 14, 2024 10:30
