Making theory order local per dataset #99

jacoterh · 2024-11-26T10:54:15Z

This PR removes the pQCD order key from the runcard and replaces it with a per-dataset specification, e.g.:

datasets:

  ATLAS_CMS_SSinc_RunI: {"order": "NLO_QCD"}
  ATLAS_CMS_tt_AC_8TeV: {"order": "NLO_QCD"}
  ATLAS_SSinc_RunII: {"order": "NLO_QCD"}
  ATLAS_STXS_runII_13TeV: {"order": "NLO_QCD"}
  ATLAS_WH_Hbb_13TeV: {"order": "NLO_QCD"}

We need this feature because we now have several higher-order corrections that we want to distinguish: NLO_QCD, NLO_EW and NLO_QCD_EW. The theory files need updating accordingly, which is done in PR#25 on smefit_database.

LucaMantani · 2024-11-26T13:04:12Z

@jacoterh Correct me if I'm wrong: this will be fully compatible with the current tables as long as we specify "NLO" or "LO" for each dataset, right?

jacoterh · 2024-11-26T13:11:55Z

Yes, that's right. The only thing that matters is that the order specified for each dataset also appears in the theory JSON. This can be any key, as you point out, but PR#25 on smefit_database actually updates the keys in the JSON files to our new convention (i.e. NLO_QCD, NLO_EW, etc.).
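The compatibility condition described in this reply can be sketched as a small check. This is only an illustration, not the code in src/smefit/loader.py: the function name `check_order_available`, the operator name `Op1`, and the assumed layout of the theory JSON (one top-level key per perturbative order) are hypothetical.

```python
def check_order_available(dataset_name, order, theory):
    """Return the theory entry for `order`, or raise if the key is missing.

    `theory` stands in for the parsed theory JSON of one dataset. The
    assumption here is that each perturbative order is a top-level key.
    """
    if order not in theory:
        available = ", ".join(sorted(theory))
        raise KeyError(
            f"Order '{order}' not found in the theory table of {dataset_name}; "
            f"available keys: {available}"
        )
    return theory[order]


# Hypothetical old-style table: it keeps working as long as the runcard
# uses exactly the same strings ("LO", "NLO") as the table.
old_style_theory = {"LO": {"Op1": [0.1]}, "NLO": {"Op1": [0.2]}}
predictions = check_order_available("dataset_1", "NLO", old_style_theory)
```

With such a table, asking for `NLO_QCD` would raise a `KeyError` until the table keys are renamed to the new convention.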

arossia94 · 2024-11-26T13:18:32Z

@jacoterh, is there any default behavior for the case in which one forgets to specify the order for a dataset? That could be useful in case one reuses an old runcard.
I'd print a warning to the user and either exclude the dataset or use it with LO (assuming all datasets will have at least LO).

arossia94

Minor requests regarding default and limit behaviors.
Should we also update the documentation within this same PR before forgetting about this change?

jacoterh · 2024-11-27T14:45:53Z

Should be ready to go. Updated behaviour:

datasets:
  - dataset_1
  - dataset_2: {"order": "NLO_QCD"}
  - dataset_3: {"order": "NLO_EW"}

where the predictions of dataset_1 are at LO (default behaviour), those of dataset_2 at NLO_QCD and those of dataset_3 at NLO_EW.
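A minimal sketch of how such mixed entries could be resolved, assuming the YAML above is parsed into plain strings and single-key mappings. The helper `resolve_order` and the constant `DEFAULT_ORDER` are hypothetical names, not taken from the smefit code.

```python
DEFAULT_ORDER = "LO"  # assumed fallback, matching the default described above


def resolve_order(entry, default_order=DEFAULT_ORDER):
    """Return (name, order) for one runcard entry.

    An entry is either a plain dataset name or a single-key mapping
    of the form {name: {"order": ...}}.
    """
    if isinstance(entry, str):
        return entry, default_order
    if isinstance(entry, dict) and len(entry) == 1:
        name, options = next(iter(entry.items()))
        return name, (options or {}).get("order", default_order)
    raise TypeError(f"Unsupported dataset entry: {entry!r}")


# The runcard snippet above, as it would look after YAML parsing.
datasets = [
    "dataset_1",
    {"dataset_2": {"order": "NLO_QCD"}},
    {"dataset_3": {"order": "NLO_EW"}},
]
resolved = [resolve_order(entry) for entry in datasets]
```

The type checks in this sketch are the cost of allowing both strings and mappings in the same list.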

jacoterh · 2024-11-27T23:15:59Z

Updated syntax (no mixed types anymore):

datasets:
  - name: dataset_1
  - name: dataset_2
    order: NLO_QCD
  - name: dataset_3
    order: NLO_EW

This syntax is future-proof, as it naturally supports additional properties we might want to specify per dataset, e.g. cuts.
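To illustrate why a uniform list of mappings extends naturally, here is a sketch that reads each entry into a small record. The `DatasetEntry` class, the `extra` field and the `parse_entries` helper are hypothetical, and the `"LO"` default is an assumption carried over from the earlier comment.

```python
from dataclasses import dataclass, field


@dataclass
class DatasetEntry:
    name: str
    order: str = "LO"  # assumed default when the runcard gives no order
    extra: dict = field(default_factory=dict)  # any further per-dataset keys


def parse_entries(raw_entries):
    """Turn a list of {"name": ..., "order": ...} mappings into records."""
    entries = []
    for raw in raw_entries:
        raw = dict(raw)  # copy so the caller's runcard is left untouched
        name = raw.pop("name")
        order = raw.pop("order", "LO")
        entries.append(DatasetEntry(name=name, order=order, extra=raw))
    return entries


# The runcard snippet above, as it would look after YAML parsing.
raw = [
    {"name": "dataset_1"},
    {"name": "dataset_2", "order": "NLO_QCD"},
    {"name": "dataset_3", "order": "NLO_EW"},
]
entries = parse_entries(raw)
```

Any key other than `name` and `order` ends up in `extra`, so a new per-dataset property would not require changing the shape of the list.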

LucaMantani · 2024-11-28T08:45:10Z

So now you made it a list of dictionaries, right? It's more similar to the NNPDF format. I am in favour of this, but I thought we wanted to keep the old runcards working?

jacoterh · 2024-11-28T09:03:46Z

That's right, it's a list of dictionaries, so it'd be equivalent to doing

datasets:
  - {name: dataset_1}
  - {name: dataset_2, order: NLO_QCD}
  - {name: dataset_3, order: NLO_EW}

This breaks compatibility with the old runcards, but for this we can provide a short script to convert them. In any case it's a small difference.
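Such a conversion could look roughly like the sketch below. It works on an already parsed runcard (a plain dict) and assumes that the old global key is called `order` and that `datasets` is a list of names. Both are assumptions for illustration; the function `convert_runcard` is hypothetical and not part of this PR.

```python
def convert_runcard(old_runcard, order_key="order"):
    """Return a new-style runcard built from an old-style one.

    The global order (if present) is copied onto every dataset entry and
    the global key is dropped. The input dict is not modified.
    """
    new_runcard = {k: v for k, v in old_runcard.items() if k != order_key}
    global_order = old_runcard.get(order_key)

    new_datasets = []
    for name in old_runcard.get("datasets", []):
        entry = {"name": name}
        if global_order is not None:
            entry["order"] = global_order
        new_datasets.append(entry)

    new_runcard["datasets"] = new_datasets
    return new_runcard


old = {"order": "NLO", "use_quad": False, "datasets": ["dataset_1", "dataset_2"]}
new = convert_runcard(old)
```

The order string is copied unchanged, so it still has to match a key in the theory tables; mapping old keys to the new convention would be a separate step.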

LucaMantani

I have one comment on the order datasets are looped through but for the rest it looks fine.

LucaMantani · 2024-11-28T22:12:56Z

src/smefit/loader.py


    _logger.info(f"Applying cutoff scale: {cutoff_scale} GeV.")
-    for sset in np.unique(datasets):
+    for sset in datasets:


Here and in similar loops (the one in load_rge_mat, for example), are datasets now looped over without being put in alphabetical order? I am a bit worried about mismatches: are we guaranteed that things are ordered correctly?

I see your concern, but datasets is a list which has a well defined ordering, so there's no need to sort anything (as opposed to dictionaries). But it was a list also before, and yet it got sorted. Perhaps @giacomomagni can comment if/why this was necessary at the time?

I see, so the order is always the one put in the card basically.

I guess the unique was there just to be sure a dataset is not loaded twice, but yes from the DataTuple you can always infer the order.
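The difference discussed in this thread can be seen in a short sketch. `np.unique` returns the sorted unique values, which is mimicked here with `sorted(set(...))` to keep the example free of dependencies. If protection against duplicates were still wanted without changing the runcard order, `dict.fromkeys` would be one option. The dataset names are placeholders.

```python
datasets = ["dataset_3", "dataset_1", "dataset_3", "dataset_2"]

# Stand-in for np.unique(datasets): duplicates removed, result sorted.
sorted_unique = sorted(set(datasets))

# Order-preserving alternative: keeps the first occurrence of each name,
# in the order given in the runcard.
ordered_unique = list(dict.fromkeys(datasets))

# Plain iteration, as in the new loop: runcard order, duplicates kept.
as_given = list(datasets)
```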

giacomomagni · 2024-11-29T09:17:50Z

src/smefit/loader.py

    use_quad,
    use_theory_covmat,
    use_t0,
    use_multiplicative_prescription,
+    default_order="LO",


Why do you need a default "LO"? Do you need to load a dataset without any theory being loaded?

This is just the default theory order that gets loaded if one doesn't specify it for a particular dataset. Of course, the default can be set to NLO as well, but not all predictions are available at NLO.

Sure, but sorry to be picky: why do you want to have a default?
I mean, if you load a dataset, you need to specify an order.

giacomomagni · 2024-11-29T09:29:34Z

src/smefit/loader.py


    _logger.info(f"Applying cutoff scale: {cutoff_scale} GeV.")
-    for sset in np.unique(datasets):
+    for sset in datasets:


I guess the unique was there just to be sure a dataset is not loaded twice, but yes from the DataTuple you can always infer the order.

jacoterh added 3 commits November 26, 2024 10:50

making order part of dataset entry

ef26286

updating tests

215de08

codefactor suggestions

8ab83b0

jacoterh marked this pull request as ready for review November 26, 2024 11:40

jacoterh requested review from LucaMantani and arossia94 November 26, 2024 11:40

jacoterh added the enhancement and data labels Nov 26, 2024

arossia94 reviewed Nov 26, 2024

src/smefit/loader.py (outdated, resolved)

arossia94 reviewed Nov 26, 2024

tests/test_fisher.py (outdated, resolved)

arossia94 reviewed Nov 26, 2024

tests/test_optimize.py (outdated, resolved)

arossia94 requested changes Nov 26, 2024

jacoterh added 4 commits November 27, 2024 10:28

adding assert statement to check for dict

3fddae5

updating structure of dataset entry in runcard

f23f0db

updating tests to new dataset format

aa52b49

codefactor suggestions

e6308b0

jacoterh force-pushed the NLO-keys branch from 06eff31 to e6308b0 Compare November 27, 2024 14:35

jacoterh added 2 commits November 27, 2024 15:12

small fix in variable name

13fe21f

updating rge to new dataset format

769418b

jacoterh mentioned this pull request Nov 27, 2024

NLO EW ZH production at FCCee LHCfitNikhef/smefit_database#25

Merged

6 tasks

making all dataset entries of same type

1851cbd

LucaMantani added 2 commits November 28, 2024 22:55

Merge branch 'main' into NLO-keys

2b24927

Fixed bug introduced by mistake

f40c768

LucaMantani added 2 commits November 28, 2024 23:02

Remove too-many-statements

020f609

docstring typo

f8c40c4

LucaMantani reviewed Nov 28, 2024

Adapted logging

ea2ddbc

LucaMantani approved these changes Nov 29, 2024

giacomomagni reviewed Nov 29, 2024

arossia94 approved these changes Nov 29, 2024

Update src/smefit/loader.py

85e9bde

Co-authored-by: Giacomo Magni

jacoterh merged commit ec2b607 into main Nov 29, 2024
5 checks passed

jacoterh deleted the NLO-keys branch November 29, 2024 10:31
