Single-top Search using Matrix Element,
Likelihood Function, and Neural Networks techniques (supporting material)
Common Pseudo-experiment study:
The question arises to which extent the results of the Matrix Element (ME), the
Likelihood Function (LF), and the Neural Networks (NN) techniques are compatible.
Given the fact that all three analyses
employ common selection (ie common dataset and MC samples), the compatibility question
can be answered by means of simulated experiments (``pseudo-experiments'').
The discriminants considered are:
- NN analysis outputs: Ot
, Os , Ost
- ME Extended Probability
Discriminant: Pst
- LF analysis: Lt , Ls
They are essentially used in four analyses (two combined s+t and two separate s-
vs t-channel searches), the discriminants of which are:
- s+t analysis NN:
Ost
- s+t analysis ME:
Pst
- s vs t analysis NN:
(Ot, O
s)
- s vs t analysis LF:
(Lt , Ls)
Every MC event is run through the ME, LF, and NN analyses, and the four (two 1d and two
2d) discriminants listed above are stored. Once this is done and the
four discriminants for every event have been recorded,
we proceed
to form simulated S+B datasets based on the 955/pb luminosity predictions. A pseudo-experiment
is constructed by randomly drawing MC events from the signal and background samples
according to the 955/pb expected contributions (used as means of Poisson distributions).
Once a pseudo-experiment is formed, the four discriminant distributions are constructed, and
then fitted as weighted sums of signal and background reference histograms
(5-component fit to b-like, c-like, q-like, ttbar-like, and signal-like templates).
The same fitting program is used for any and all of the four discriminant
distributions, and all
fits are found to be unbiased.
The systematic uncertainties are taken into account in a correlated way, ie
the same variations in JES, ISR/FSR/pdf etc are used for the ME, LF, separate
NN, and combined NN fits.
Let S1, S2, S3, and S4 denote the fitted signal cross sections in units of SM cross-sections
for the four analyses, respectively, and let e1, e2, e3, e4 denote the uncertainties returned by
these fits. In the infinite statistics limit the fits would return S1,S2,S3,S4=1
and e1,e2,e3,e4=0.
The Table below shows the correlations between pairs of the four fit results.

The following two plots show the fits S1, S2, S3, S4 in units of the SM
single-top cross section, along with the fit uncertainties e1, e2, e3, e4,
without and
with the inclusion of the systematic
uncertainties. In the "without" case, only the background rates are included as
nuisance parameters.
Starting from the fitted cross-sections Si (i=1,..,4)
we form a best linear unbiased estimate (BLUE) [1] for the single-top cross-section.
We do this by constructing a covariance matrix from the
statisitical (and systematic) correlations. We invert this covariance matrix to
obtain
a weight for each i and combine the four results with these weights to obtain our best
cross section estimate. Once we have this, we can calculate a chisquare value for each
pseudo-experiment (and for the data). We can then ask the question: in what fraction
of pseudo-experiments does the corresponding chisquare exceed the value
observed in the data. This fraction will define our compatibility measure.
For the four analyses, this fraction is 0.65%.
[1] L. Lyons, D. Gibaut and P. Clifford, NIM A 270, 110 (1988).
L. Lyons, A. Martin and D. Saxon, Phys. Rev. D 41, 3 (1990).
A. Valassi, NIM A 500, 391 (2003)