Skip to content

Commit

Permalink
feat: update list of branches (#17)
Browse files Browse the repository at this point in the history
* re-derive list of branches with representative data file
* update notebook reading branches and processing them with coffea to use new branches
* add some extra debugging utility & performance metrics
  • Loading branch information
alexander-held authored Apr 10, 2024
1 parent a369a0f commit 8ffe060
Show file tree
Hide file tree
Showing 7 changed files with 3,997 additions and 3,577 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -168,3 +168,4 @@ cython_debug/
servicex.yaml
*.pstats
servicex/test.py
*.html
9 changes: 7 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,8 +10,6 @@ ATLAS does not have released OpenData, so there isn't an AGC we can copy and try

## Description of files

* `size_per_branch.ipynb`: produce breakdown of branch sizes for given file
* `branch_sizes.json`: output of , produced by `size_per_branch.ipynb`
* `materialize_branches.ipynb`: read list of branches, distributable with Dask (use for benchmarking)

## Usage
Expand Down Expand Up @@ -45,6 +43,13 @@ with additional files:
rucio list-file-replicas mc20_13TeV:mc20_13TeV.364126.Sherpa_221_NNPDF30NNLO_Zee_MAXHTPTV500_1000.deriv.DAOD_PHYSLITE.e5299_s3681_r13145_p6026 --protocol root --pfns --rses MWT2_UC_LOCALGROUPDISK
```
### Branch list determination
Branches to be read are determined with a 2018 data file.
* `input_files/size_per_branch.ipynb`: produce breakdown of branch sizes for given file
* `input_files/branch_sizes.json`: output of notebook above
## Acknowledgements
[![NSF-1836650](https://img.shields.io/badge/NSF-1836650-blue.svg)](https://nsf.gov/awardsearch/showAward?AWD_ID=1836650)
Expand Down
945 changes: 0 additions & 945 deletions branch_sizes.json

This file was deleted.

967 changes: 967 additions & 0 deletions input_files/branch_sizes.json

Large diffs are not rendered by default.

2,593 changes: 2,593 additions & 0 deletions input_files/size_per_branch.ipynb

Large diffs are not rendered by default.

629 changes: 429 additions & 200 deletions materialize_branches.ipynb

Large diffs are not rendered by default.

2,430 changes: 0 additions & 2,430 deletions size_per_branch.ipynb

This file was deleted.

0 comments on commit 8ffe060

Please sign in to comment.