ATLAS Web>HiggsAnalysisAtATLASUsingRooStats (2016-05-06, WilliamBreadenMadden)

Higgs analysis at ATLAS using RooStats

This page contains basic information on getting started with Higgs analysis at ATLAS using RooStats. Only the most meager of attempts is made to keep this documentation current.

Higgs analysis at ATLAS using RooStats

What is RooStats?

RooStats is a project to create statistical tools built on top of the RooFit library, which is a data-modelling toolkit. It is distributed in ROOT. Specifically, it has been distributed in the ROOT release since version 5.22 (December 2008). The latest version of ROOT is recommended as the RooStats project develops quickly.

setting up RooStats

There are three main options available for acquiring ROOT with RooStats included.

option 1: Download the latest ROOT release binaries.

The latest ROOT binaries for various operating systems are accessible here.

option 2: Build the ROOT trunk from source.

Follow the appropriate instructions here to build the ROOT trunk.

shell script: building ROOT with RooFit and RooStats

%CODE{"bash"}% #!/bin/bash

################################################################################ # This script builds the latest version of ROOT in Ubuntu. Specifically, first # the ROOT prerequisites are installed, then the most common ROOT optional # packages are installed. Next, the latest version of ROOT in the CERN Git # repository is checked out. Finally, ROOT is compiled. After compiling is # complete, ROOT environment variables should be set up as necessary. ################################################################################

echo -e "\nstart ROOT installation\n" read -s -n 1 -p "Press any key to continue." echo

# Specify the time. date

# Install ROOT prerequisites. echo "install ROOT prerequisites..." sudo apt-get -y install subversion sudo apt-get -y install dpkg-dev sudo apt-get -y install make sudo apt-get -y install g++ sudo apt-get -y install gcc sudo apt-get -y install binutils sudo apt-get -y install libx11-dev #sudo apt-get -y install libxpm-dev sudo apt-get -y install libgd2-xpm-dev sudo apt-get -y install libxft-dev sudo apt-get -y install libxext-dev

# Install optional ROOT packages. echo "install optional ROOT packages..." sudo apt-get -y install gfortran sudo apt-get -y openssl-dev sudo apt-get -y install libssl-dev #sudo apt-get -y install ncurses-dev sudo apt-get -y install libpcre3-dev sudo apt-get -y install xlibmesa-glu-dev sudo apt-get -y install libglew1.5-dev sudo apt-get -y install libftgl-dev sudo apt-get -y install libmysqlclient-dev sudo apt-get -y install libfftw3-dev sudo apt-get -y install cfitsio-dev sudo apt-get -y install graphviz-dev sudo apt-get -y install libavahi-compat-libdnssd-dev #sudo apt-get -y install libldap-dev sudo apt-get -y install libldap2-dev sudo apt-get -y install python-dev sudo apt-get -y install libxml2-dev sudo apt-get -y install libkrb5-dev sudo apt-get -y install libgsl0-dev sudo apt-get -y install libqt4-dev

# Check out the latest ROOT trunk. Save the download in the ~/root directory. # This should take only a moment. echo "check out the latest ROOT trunk..." cd ~/ git clone http://root.cern.ch/git/root.git

# Configure for the compilation. Specifically, the system architecture is # defined and building of the libRooFit advanced fitting package is enabled. cd ~/root while true; do read -p "Specify the computer bit architecture you want to compile ROOT for (64/32): " computerArchitecture if [ "${computerArchitecture}" == "32" ]; then echo "configure ROOT compile for 32 bit computer architecture..." #./configure linux --enable-roofit --enable-minuit2 ./configure linux --enable-roofit --enable-minuit2 --enable-python --with-python-incdir=/usr/include/python2.6 --with-python-libdir=/usr/lib/i386-linux-gnu break elif [ "${computerArchitecture}" == "64" ]; then echo "configure ROOT compile for 64 bit computer architecture..." #./configure linuxx8664gcc --enable-roofit --enable-minuit2 ./configure linuxx8664gcc --enable-roofit --enable-minuit2 --enable-python --with-python-incdir=/usr/include/python2.6 --with-python-libdir=/usr/lib/x86_64-linux-gnu break fi echo "invalid input" done # See other possible configurations using the following command: ./configure --help

# Specify the time. date

while true; do read -p "Do you want to continue to compile ROOT now? (y/n): " yOrn if [ "$(echo "${yOrn}" | sed 's/$.*$/\L\1/')" == "y" ]; then break elif [ "$(echo "${yOrn}" | sed 's/$.*$/\L\1/')" == "n" ]; then echo "exit installation script..." exit 0 fi echo "invalid input" done

# Compile. echo "compile ROOT..." time make

# Move ROOT to the install directory (e.g., /usr/local/) and set up ROOT environment variables in the specified configuration file (e.g., /etc/bash.bashrc, ~/.bashrc). installationDirectory="/usr/local" configurationFile="/etc/bash.bashrc" while true; do read -p "Do you want to continue to move ROOT to the directory "${installationDirectory}" and set up the ROOT environment variables in the file "${configurationFile}"? (y/n): " yOrn yOrnLowercase="$(echo "${yOrn}" | sed 's/$.*$/\L\1/')" if [ "${yOrnLowercase}" == "y" ]; then break elif [ "${yOrnLowercase}" == "n" ]; then echo "exit installation script..." exit 0 fi echo "invalid input" done echo "move ROOT to the directory "${installationDirectory}"..." sudo mv ~/root "${installationDirectory}"

echo "Set up ROOT environment variables in the file "${configurationFile}"..." echo -e "\n# ROOT environment variables" >> "${configurationFile}" echo "export ROOTSYS="${installationDirectory}"/root" >> "${configurationFile}" echo "export PATH=\$PATH:\$ROOTSYS/bin" >> "${configurationFile}" echo "export LD_LIBRARY_PATH=\$LD_LIBRARY_PATH:\$ROOTSYS/lib" >> "${configurationFile}"

# Specify the time. date

echo -e "\nROOT install complete\n" %ENDCODE%

option 3: Build the RooStats branch.

The RooStats branch can be built in order to have the latest development of RooStats (that has not yet been incorporated into a ROOT version). Instructions can be found here.

RooFit

general description

The RooFit library provides a toolkit for modelling the expected distribution of events in a physics analysis. Models can be used to perform unbinned maximum likelihood fits, produce plots and generate "toy Monte Carlo" samples for various studies.

The core functionality of RooFit is to enable the modelling of 'event data' distributions, in which each event is a discrete occurrence in time and has one or more measured observables associated with it. Experiments of this nature result in datasets of Poisson (or binomial) statistics. The natural modeling language for such distributions is probability density functions (PDFs), F(x;p), that describe the probability density of the distribution of observables x in terms of the function parameter p.

In RooFit, every variable, data point, function and PDF is represented by a C++ object. So, for example, in constructing a RooFit model, the mathematical components of the model map to separate C++ objects. Objects are classified by the data or function type that they represent, not by their respective role in a particular setup. All objects are self-documenting. The name of an object is a unique identifier for the object while the title of an object is a more elaborate description of the object.

Here are a few examples of mathematical concepts that correspond to various RooFit classes:

mathematical concept	RooFit class
variable	RooRealVar
function	RooAbsReal
PDF	RooAbsPdf
space point (set of parameters)	RooArgSet
list of space points	RooAbsData
integral	RooRealIntegral

Composite functions correspond to composite objects. The ArgSet class is dependent on argument order while the ArgList class is not.

example code: defining a RooFit variable

%CODE{"c++"}% // general form for defining a RooFit variable: RooRealVar x(

generic example
type of histogram	histogram naming convention
Phenomenon histogram	<phenomenon name>_m<mass point>
Phenomenon upward systematic histogram	<phenomenon name>_m<mass point>_sys_<systematic name>_up
Phenomenon downward systematic histogram	<phenomenon name>_m<mass point>_sys_<systematic name>_do

specific example
type of histogram	histogram name
ttH histogram	ttH_m110
ttH upward luminosity systematic histogram	ttH_m110_sys_Lumi_up
ttH downward luminosity systematic histogram	ttH_m110_sys_Lumi_do
ttH upward JES systematic histogram	ttH_m110_sys_JES_up
ttH downward JES systematic histogram	ttH_m110_sys_JES_do
WW Herwig 105987 upward luminosity systematic histogram	WW_Herwig_105987_m110_sys_Lumi_up
WW Herwig 105987 downward luminosity systematic histogram	WW_Herwig_105987_m110_sys_Lumi_do
WW Herwig 105987 upward JES systematic histogram	WW_Herwig_105987_m110_sys_JES_up
WW Herwig 105987 downward JES systematic histogram	WW_Herwig_105987_m110_sys_JES_do

objective	code
set the prefix for output files	void SetOutputFilePrefix(const std::string& prefix);
set the parameter of interest for the measurement	void SetPOI(const std::string& POI);
set a parameter in the model to be constant	void AddConstantParam(const std::string& param);
set the value of a parameter in the model	void SetParamValue(const std::string& param, double val);
set the low and high bins for all observables	void SetBinLow(int BinLow); void SetBinHigh(int BinHigh);
set the luminosity and its relative error	void SetLumi(double Lumi); void SetLumiRelErr(double LumiRelErr);
set whether the model should save plots and tables or should export the workspace	void SetExportOnly(bool ExportOnly);
add a channel object to a model (a measurement)	void AddChannel(RooStats::HistFactory::Channel chan);
open all specified ROOT files and copy and save all necessary histograms	void CollectHistograms();
save a measurement (a model) to a ROOT file (for possible future modification and use in creating new models)	void writeToFile(TFile* file);

objective	code
create a channel and give it a name	Channel::Channel(const std::string& name);
set the channel histogram using the name and path of a histogram in a ROOT file	void SetData(std::string HistoName, std::string InputFile, std::string HistoPath="");
set the value of the single bin of a channel with only one bin (creating a 1 bin histogram)	void SetData(double value_1);
create or load a histogram in memory by supplying a pointer to the histogram as the data	void SetData(TH1* data_1);
create a HistFactory data object and load it directly (useful in configuring an object and using it multiple times	void SetData(const RooStats::HistFactory::Data& data);

objective	code
create a sample object specifying the name	Sample(std::string Name);
create a sample object specifying the name, the histogram name, the histogram file and the histogram path in the file	Sample(std::string Name, std::string HistoName, std::string InputFile, std::string HistoPath="");
add a sample object to a channel object	void AddSample(RooStats::HistFactory::Sample sample);
independently set a histogram object	void SetHisto(TH1* histogram_1);
independently set a value	void SetValue(Double_t value_1);
set a sample to be "normalised by theory" (its normalisation scales with luminosity)	void SetNormalizeByTheory(bool norm);

Higgs analysis at ATLAS using RooStats

What is RooStats?

setting up RooStats

option 1: Download the latest ROOT release binaries.

option 2: Build the ROOT trunk from source.

shell script: building ROOT with RooFit and RooStats

option 3: Build the RooStats branch.

RooFit

general description

example code: defining a RooFit variable

RooPlot

PDFs

example code: create a Gaussian PDF using RooStats and plot it using the RooPlot class

example code: telling a RooFit PDF what to normalise over

datasets

general description

RooDataSet (unbinned data)

example code: generating toy Monte Carlo, storing it as unbinned data and then plotting it

example code: plotting unbinned data (a RooDataSet) using a specified binning

importing data from ROOT trees (how to populate RooDataSets from TTrees)

RooDataHist (binned data)

importing data from ROOT TH histogram objects (take a histogram and map it to a binned data set) (how to populate RooDataHists from histograms)

example code: import a ROOT histogram into a RooDataHist (a RooFit binned dataset)

fitting

fitting a model to data

fitting a PDF to unbinned data

example code: fit a Gaussian PDF to data

The RooFit workspace

general description

example code: using the Workspace Factory to create a Gaussian PDF

What's in the RooFit workspace?

example code: What's in the workspace?

visual representations of the model/PDF contents

Graphviz

example code: examining PDFs and creating graphical representations of them

Model Inspector

using the Model Inspector

How do I get it?

links

accessing the RooFit workspace

example code: accessing the workspace

example code: accessing both data and PDF from a workspace stored in a file

links for RooFit

RooStats

general description

example code: create a simple model using the RooFit Workspace Factory. Specify parts of the model using ModelConfig. Create a simple dataset. Complete a confidence interval test using the ProfileLikelihoodCalculator of RooStats

links for RooStats

ModelConfig

HistFactory

general description

XML approach to HistFactory

prepareHistFactory

HistFactorySchema.dtd

hist2workspace

XML files

general description

conventions

top-Level XML file

general description

specific instructions

example file: $ROOTSYS/tutorials/histfactory/example.xml

channel XML files

general description

specific instructions

systematic uncertainties

example file: $ROOTSYS/tutorials/histfactory/example_channel.xml

caveats

slash suffix in HistoPath attribute

colon characters in Name attributes

guidance in writing the XML configuration files

histograms

samples

create XML configuration files automatically

C++ approach to HistFactory

HistFactory class tree structure

HistFactory configuration in C++

example (early in project development): creation of the Measurement and a Channel and, thence, creation of the channel samples, including signal and backgrounds

example HistFactory model construction using C++

details on HistFactory usage in C++

details on the object measurement