Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
<-- p { margin-bottom: 0.21cm; }h1 { margin-bottom: 0.21cm; }h1.western { font-family: "Liberation Serif",serif; }h1.cjk { font-family: "DejaVu Sans"; }h1.ctl { font-family: "DejaVu Sans"; }h2 { margin-bottom: 0.21cm; }h4 { margin-bottom: 0.21cm; }h5 { margin-bottom: 0.21cm; }h3 { margin-bottom: 0.21cm; }h3.western { font-family: "Liberation Serif",serif; }h3.cjk { font-family: "DejaVu Sans"; }h3.ctl { font-family: "DejaVu Sans"; }pre.cjk { font-family: "DejaVu Sans",monospace; }a:link { } --> | ||||||||
Line: 389 to 389 | ||||||||
To set up a neural net for the analysis of a particular kind of data it is necessary to train it with sample data; this process will adjust the "weights" on each variable that the neural net analyses in the ntuple, in order to optimise performance. These weights can then be viewed as a scatter plot in ROOT. | ||||||||
Changed: | ||||||||
< < | Specifying files as Signal/Background | |||||||
> > | Specifying files as Signal/Background or as real data | |||||||
Changed: | ||||||||
< < | The input datasets need to be specified in a number of peripheral files, so that the ANN can distinguish between signal and background files. Errors for each process also need to be specified - how this is done is detailed in that section. The relevant files for adding processes are atlastth_histlist_flat-v15.txt, AtlasttHRealTitles.txt, FlatAtlastthPhysicsProc1.txt and FlatSysSetAtlastth1.txt. There are also some files that are produced through the action of genemflat_batch_Complete2_SL5.sh. At several points in these files, there are common structures for inputting data, relating to ListParameter and ColumnParameter: | |||||||
> > | The input datasets need to be specified in a number of peripheral files, so that the ANN can distinguish between signal and background MC files or real data files. There is only one place where data files need to be specified differently to MC - FlatAtlastthPhysicsProc1.txt - and if you are running the fit with data and not pseudodata, this is determined through one single flag, set in genemflat - see here![]() | |||||||
ListParameter <tag> <onoff> <colon-separated-parameter-list> | ||||||||
Line: 410 to 410 | ||||||||
atlastth_hislist_flat-v15.txt | ||||||||
Changed: | ||||||||
< < | This file provides a map for the ANN, giving it the output file names (and in which directory they are to be stored, relative to ${template_area} - set in genemflat) and the tree structure where the final result of the ANN will be stored in the output (in the example below, the output file is ${template_area}/11602-filter.root, and the result graph will be FlatPlotter/NNScoreAny_0_0_0 0). The number to the left of the file name indicates which process it is - this is established using a file called TMVAsteer.txt, which is created through the running of genemflat_batch_Complete2_SL5.sh), and corresponds to the variable my_Eventtype in the input files (this can also be influenced in genemflat). | |||||||
> > | This file provides a map for the ANN, giving it the output file names (and in which directory they are to be stored, relative to ${template_area} - set in genemflat) and the tree structure where the final result of the ANN will be stored in the output (in the example below, the output file is ${template_area}/11602-filter.root, and the result graph will be FlatPlotter/NNScoreAny_0_0_0 0). The number to the left of the file name indicates which process it is - this is established using a file called TMVAsteer.txt, which is created through the running of genemflat_batch_Complete2_SL5.sh), and corresponds to the variable my_Eventtype in the input files (this can also be influenced in genemflat). Thus it is possible to assign multiple files to the same process (e.g. a file for the electron stream and muon stream are both assigned to Data), by giving them a common number at the start of the line. | |||||||
0 116102-filter.root FlatPlotter/NNScoreAny_0_0_0 0 | ||||||||
Line: 471 to 471 | ||||||||
This file contains various parameters:
ColumnParameter BackgroundList 0 tt0j=0 | ||||||||
Changed: | ||||||||
< < | ColumnParameter SignalList 1 ttH=1 | |||||||
> > | ColumnParameter SignalList 1 ttH=1 ColumnParameter DataList 1 Data=11 | |||||||
Changed: | ||||||||
< < | Here you specify once again the numbers assigned to the processes by my_Eventtype (for tt0j it equals zero), and list things as BackgroundList or SignalList. The number after 'BackgroundList' or 'SignalList' is unique for each process (to preserve the uniqueness of <tag>:<sequence>), but it must be sequential, running from 0 to n-1 (where you have n samples). It also does not need to correspond to my_Eventtype, however, for completeness' sake within this file I have set it as such. The number at the end of this declaration (ttjj=0 in this case) needs to be sequential - it instructs the net of the order in which to process the samples, so it must go from 0 to n-1 (when you have n samples). | |||||||
> > | Here you specify once again the numbers assigned to the processes by my_Eventtype (for tt0j it equals zero), and list things as BackgroundList, SignalList or DataList. The number after 'BackgroundList' or 'SignalList' is unique for each process (to preserve the uniqueness of <tag>:<sequence>), but it must be sequential, running from 0 to n-1 (where you have n samples) - apart from for DataList entries (as shown above). It also does not need to correspond to my_Eventtype, however, for completeness' sake within this file I have set it as such. The number at the end of this declaration (tt0j=0 in this case) needs to be sequential - it instructs the net of the order in which to process the samples, so it must go from 0 to n-1 (when you have n samples). It must match up with the numbers provided in atlastth_histlist_flat-v16.txt so that processes and data can be matched to the various individual files. | |||||||
ColumnParameter PseudoDataList 0 tt0j=0 | ||||||||
Line: 690 to 691 | ||||||||
***NOTE*** The flags DoTraining and DoTemplates had previously (until release 00-00-21) been set on the command line. They were moved from the command line when the other flags were introduced. | ||||||||
Added: | ||||||||
> > | If you wish the fit to be run using data and not pseudodata, then the flag is set in FlatFitSteer.txt, which is created in genemflat:
GeneralParameter bool 1 PseudoData=1If this flag is set to 1 then pseudodata is used, 0 causes data to be used. | |||||||
teststeerFlatPlotterATLAStthSemileptonic-v16.txt and teststeerFlatReaderATLAStthSemileptonic-v16.txtGeneralParameter bool 1 LoadGlobalOnEachEvent=0 |