CMS Open Data Workshop 2021

CMS open data

The CMS Open Data workshop, launched last year with the intent of making the CMS Open Data more accessible to theorists, phenomenologists and other interested researchers, goes one step further with its second edition!

Since 2014, the CMS Collaboration has pioneered the release of LHC research quality data for public use by making a significant amount of this data accessible under the Creative Commons CC0 waiver via the CERN Open Data portal. With the supplement of new releases, last year CMS brought all of the proton-proton collision data recorded from 2010, 2011, 50% of the amount from 2012, and its first batches of open data from heavy-ion collisions at the LHC to the public domain. 

Even if there are examples of peer-reviewed publications which used these Open Data, an important question arises: how to make these Open Datasets easier to work with by non-CMS analysts?

In September 2020, the Collaboration initiated a first-of-its-kind CMS Open Data Workshop where theorists, phenomenologists, and other interested researchers could learn more about how to interact with and analyse these vast datasets. 

The goal: to lower the threshold for access to these data to bridge the technical gap that usually exists between the scientific creativity of an external analyst and the nuts-and-bolts details of a full analysis with CMS Open Data.

"I would like to express my immense gratitude to the workshop organizers for putting this event together. It came at a crucial time of my studies, and has been very helpful for me as an independent researcher without help and expertise usually more accessible to people working in official CMS groups. Thank you all so much!" said an attendee last year.

This year the second version of that workshop will take place and will benefit new participants and also attendees of the first iteration. The Registration deadline is 2nd July, 2021.

All attendees will be led through the steps of how to login to the CERN virtual machines, run the CMS analysis software, and perform the basics of an analysis. A set of pre-exercises and assignments are provided and required for participants so that they can make the most of the workshop. All exercises will be hands-on and participants should be prepared to dive into the data right away.

Time will also be spent brainstorming with attendees about how the entire process of accessing and analysing the data could be made more useful for the broader HEP community.

So, what are you waiting for? Register for the workshop!

All information here: 

The views expressed in CMS blogs are personal views of the author and do not necessarily represent official views of the CMS collaboration.