A collection of low-level information useful for classifying a particle jet's flavour, i.e. related to the jet's constituents like charged or neutral particle flow candidates, or secondary vertex information. This is the first openly available dataset of that kind that has been derived with experiment-specific software run on top of CERN Open Data and then stored in experiment-independent format, making ROOT or similar frameworks obsolete. Get ready to jump into this fruitful particle physics task without worrying about containers, special tools and physics terminology.
Contribution: Prepared the dataset with a framework to extract and calculate low-level information based on jet constituent properties. A set of plugins and scripts have been written to facilitate the production with high-performance computing infrastructures, for experts with CMSSW experience. To facilitate exchange with other (academic) communities, I derived a novel file structure that is fully independent of experiment-specific software and which does not require any experience with ROOT. I then executed production and conversion of the samples. The file structure has been exchanged with the co-author, whose feedback was included for the public version of the dataset. I modified an exemplary notebook that shows how to use the files directly on the kaggle platform, increasing the datasets's usability score.