# PyHEP Higgs Discovery Talk
## Logistics
* [Conference Site](https://indico.cern.ch/event/1019958/) July 5-9 (Mon-Fri)
* [Talk Abstract](https://indico.cern.ch/event/1019958/abstracts/128850/)
* **Abstract**
CERN's Open Data Portal has 1000's of datasets, in many formats. The ServiceX project reads experiment data files of various formats and translates them into columnar formats. This talk will demonstrate taking the CMS and ATLAS demonstration Higgs samples from the CERN Open Data Portal and using Servicex, coffea, and other python tools like awkward array to re-discover the Higgs from both experiments. Differences between working with data from the two experiments will be used to illustrate how the tool-chain works together to produce final plots
* Time Restrictions for Talk: 3:00 PM - 8:30 PM ( 3:30 PM - 7:30 PM - but might cut it close) Geneva Time
* (GitHub repo for all of this work)[https://github.com/gordonwatts/pyhep-tail-of-two-higgs]
## Goal of Talk
1. Show off ServiceX Backend working on two different data formats (Flat ROOT Tuple and CMS Run 1 AOD).
3. What the `func_adl` looks like
We do this by doing two demos during the talk:
1. [ATLAS H to 2 Lepton + Mising ET](http://opendata.atlas.cern/release/2020/documentation/physics/DL2.html)
1. [CMS Run 1 AOD H to 4 Lepton](https://github.com/cms-opendata-analyses/HiggsExample20112012/)
## Plan
Some general thoughts:
* Run this is jupyter notebooks, as this will in the end be a Jupyter Notebook talk (which means running this live!)
* We should watch Jim's video on [How to Give a Good Jupyter Notebook Talk](https://indico.cern.ch/event/1044648/)
Process:
1. Start by putting the two demo's in two different notebooks.
2. Create a repo that holds all of this work
* Done. Repo can be found [here](https://github.com/gordonwatts/pyhep-2021-SX-OpenDataDemo).
4. We need ServiceX instance running on River that are publically accesible, and have the OpenData DID Finder installed.
ToDo:
* [ ] Get public ServiceX instances with OpenDID finder installed and running and publically accessible.
* Asked ServiceX folks to integrate the code
* [ ] Fetch the data via ServiceX we are going to need
* [ ] Use awkward to process the data