MTM2.5: A Top Mass Measurement in the Lepton + Jets Channel with 1.9 fb-1

Lina Galtieri, Paul Lujan, Jeremy Lys (LBNL), John Freeman, Pedro Movilla Fernandez (FNAL), Jason Nielsen (UC Santa Cruz), Igor Volobouev (Texas Tech)

Measured value:
mt = 172.7 ± 1.2 (stat.) ± 1.3 (JES) ± 1.2 (syst) GeV/c2 = 172.7 ± 2.1 (total) GeV/c2

Abstract

We report an update on our measurement of the top quark mass obtained from proton-antiproton collisions at a center-of-mass energy of 1.96 TeV at the Fermilab Tevatron using the CDF II detector. We use a matrix element integration method for the signal and a neural network discriminant to identify background events. We employ an "effective propagator" in the matrix element to take into account the assumptions about the event kinematics used in the analysis. We compute a 2D likelihood as a function of mt and JES, an overall factor that scales all jet energies. We apply a cut to the likelihood in order to reduce the effect of badly reconstructed events, and then extract a value for the top mass. This measurement updates our previous measurement, which was performed on a 1.7 fb-1 data set, to a 1.9 fb-1 sample. It requires events with a lepton and four high-energy jets in the |η| ≤ 2 region, with at least one jet tagged as coming from a b quark. We observe 318 events passing all of our cuts and obtain a final value of mt = 172.7 ± 1.2 (stat.) ± 1.3 (JES) ± 1.2 (syst) GeV/c2 = 172.7 ± 2.1 (total) GeV/c2.

Event selection

In our analysis, we look for events in which a ttbar pair is produced, each top quark decays into a W boson and a b quark, and then one W decays into a lepton (meaning, in this paper, an electron or muon) and a neutrino while the other W decays into a quark-antiquark pair; this is called the "lepton + jets" channel.

We identify top mass candidates in this channel by requiring four high energy jets from the four quarks and a W decay into a lepton and a neutrino. Specifically, for the lepton we require either an identified electron with ET > 20 GeV or an identified muon with pT > 20 GeV/c in the central region of the detector. The neutrino is identified by requiring a missing ET > 20 GeV in the event. For the jets, we require exactly 4 jets with ET > 20 GeV and |η| ≤ 2, where the jet energies have been corrected for non-uniform detector response, calorimeter stability, and nonlinear response to particle momenta. The missing ET is also corrected for muons and jet response. In addition, at least one of the jets must be tagged as a b-jet using a secondary vertex tagging algorithm. With these selection criteria, we observe a total of 371 events in the data.
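The selection criteria above can be summarized as a short predicate. This is a minimal sketch rather than the analysis code: the event record layout and the field names (`lepton_et_or_pt`, `missing_et`, `jets`, `et`, `eta`, `b_tag`) are hypothetical, lepton identification and centrality are assumed to be handled upstream, and "exactly 4 jets" is read as exactly four jets passing the ET and |η| thresholds.

```python
def passes_selection(event):
    """Return True if a (hypothetical) event record passes the lepton + jets cuts."""
    # Lepton: identified central electron (ET > 20 GeV) or muon (pT > 20 GeV/c).
    # Identification and centrality are assumed to be done upstream.
    if event["lepton_et_or_pt"] <= 20.0:
        return False
    # Neutrino signature: corrected missing ET > 20 GeV.
    if event["missing_et"] <= 20.0:
        return False
    # Exactly four jets with corrected ET > 20 GeV and |eta| <= 2.
    jets = [j for j in event["jets"] if j["et"] > 20.0 and abs(j["eta"]) <= 2.0]
    if len(jets) != 4:
        return False
    # At least one of the four jets has a secondary-vertex b tag.
    return any(j["b_tag"] for j in jets)


# Made-up event, for illustration only.
candidate = {
    "lepton_et_or_pt": 35.2,
    "missing_et": 41.0,
    "jets": [
        {"et": 88.1, "eta": 0.4, "b_tag": True},
        {"et": 61.7, "eta": -1.1, "b_tag": False},
        {"et": 44.9, "eta": 1.8, "b_tag": False},
        {"et": 27.3, "eta": -0.2, "b_tag": False},
    ],
}
print(passes_selection(candidate))
```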

The background to this signal consists of three main sources: events where a W is produced in conjunction with heavy flavor quarks, events where a W is produced with light flavor quarks which are mistagged, and QCD events where a jet is misidentified as an electron. There are also smaller contributions from diboson (WW, WZ, or ZZ) production, events where a Z produces a lepton pair, and single-top production; we do not consider these events directly, but rather increase the contribution from W+light (for diboson and Z) and W+heavy flavor (for single-top) to include the contributions from these sources.

We use a variety of Monte Carlo samples to test and calibrate our method and evaluate the backgrounds to ttbar production. For signal events, we use ttbar events generated at a variety of top masses from 152 GeV/c2 to 190 GeV/c2 by the PYTHIA generator. The non-W QCD background is derived from data with non-isolated leptons, while the other backgrounds are generated using the ALPGEN generator with parton showering by PYTHIA, except for the single top samples which are generated using MadEvent with parton showering by PYTHIA. Overlaps in the W+partons samples are removed using the ALPGEN jet-parton matching along with a jet-based heavy flavor overlap removal algorithm.

The background estimate is shown in the table below:

| Background | 1 tag | ≥ 2 tags |
|---|---|---|
| non-W QCD | 13.81 ± 11.49 | 0.48 ± 1.50 |
| W+light mistag | 16.34 ± 3.56 | 0.31 ± 0.09 |
| diboson (WW, WZ, ZZ) | 3.29 ± 0.26 | 0.27 ± 0.03 |
| Z → ee, μμ, ττ | 2.19 ± 0.26 | 0.19 ± 0.03 |
| Sum of above 3 | 21.82 ± 3.58 | 0.77 ± 0.10 |
| W+bbbar | 13.75 ± 5.53 | 2.78 ± 1.14 |
| W+ccbar, c | 12.31 ± 4.99 | 0.58 ± 0.24 |
| Single top s-chan | 1.49 ± 0.14 | 0.52 ± 0.07 |
| Single top t-chan | 1.53 ± 0.12 | 0.41 ± 0.05 |
| Sum of above 4 | 29.08 ± 10.24 | 4.29 ± 1.36 |
| Total background | 64.71 ± 16.25 | 5.54 ± 2.56 |
| Events observed | 284 | 87 |
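As a quick consistency check, the central values of the subtotal and total rows in the table above can be reproduced by simple addition. The sketch below does only that; the dictionary layout and names are illustrative, and the uncertainties are deliberately left out because the source does not say how they are correlated or combined.

```python
# (1-tag, >=2-tag) central values from the background table, in events.
non_w_qcd = (13.81, 0.48)
light_group = {
    "W+light mistag": (16.34, 0.31),
    "diboson": (3.29, 0.27),
    "Z": (2.19, 0.19),
}
heavy_group = {
    "W+bbbar": (13.75, 2.78),
    "W+ccbar, c": (12.31, 0.58),
    "single top s-chan": (1.49, 0.52),
    "single top t-chan": (1.53, 0.41),
}


def column_sums(rows):
    """Sum each tag column over the rows of one group."""
    return tuple(sum(values[col] for values in rows.values()) for col in (0, 1))


light_sum = column_sums(light_group)
heavy_sum = column_sums(heavy_group)
total = tuple(non_w_qcd[c] + light_sum[c] + heavy_sum[c] for c in (0, 1))

print("Sum of above 3:   %.2f  %.2f" % light_sum)
print("Sum of above 4:   %.2f  %.2f" % heavy_sum)
print("Total background: %.2f  %.2f" % total)
```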

The transfer functions, acceptance, and normalization described below are obtained from ttbar events generated by the HERWIG generator, so we also cross-check our method against HERWIG events.

Signal likelihood calculation

Our signal likelihood calculation is performed by integrating over the matrix element using the following formula:

[Equation: signal likelihood integration formula]


This likelihood gives us the probability that we observe in our detector an event with kinematic variables y as a function of the true top mass mt and the jet energy scale JES by integrating over the unknown parton-level quantities x. Specifically:

In order to make the integration computationally tractable, we make a few simplifying assumptions. Specifically, we assume that the lepton angle and momentum are perfectly measured, that the four parton angles are measured perfectly by the jet angles, that the b quark from the hadronically decaying top is on mass shell, and that the other three quark masses are 0. This reduces the number of integration variables from 22 to 7, but the assumptions are not exactly satisfied in real events. To compensate, we alter the distributions of M_W^2 and M_t^2 corresponding to the propagator terms in the matrix element so as to take these effects into account; we call the resulting distributions "effective propagators". As shown below, these effective propagators are broader than the Breit-Wigner distributions that would apply if the assumptions held exactly.

[Figures: hadronic-side effective propagator; leptonic-side effective propagator]

Background handling

In order to distinguish between signal and background events, we employ a neural network discriminant. Our discriminant uses ten variables: the pT of the 4 leading jets, the missing ET, the lepton ET, HT (the scalar sum of the jet transverse energies, missing energy, and lepton energy), aplanarity, DR, and HTZ. The neural network is trained on Monte Carlo events with a signal mass of 170 GeV/c2 against a W+bbbar background and then checked to confirm that the output does not change significantly with different signal masses and background types. The result is shown below.

[Figure: Neural network discriminant. The solid lines show the output for different signal masses, while the dashed lines show the output for different background types.]

Overall our discriminant shows good stability with respect to signal mass and background types. For a given event, we calculate the background fraction for that event fbg(q) = B(q)/(S(q)+B(q)), where q is the neural network output for that event. Note that the distributions for B(q) and S(q) are normalized to the overall expected background and signal fractions.
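The per-event background fraction can be illustrated with binned templates. The following is a minimal sketch under stated assumptions: S(q) and B(q) are represented as histograms already normalized to the expected signal and background yields, the discriminant output is assumed to lie in [0, 1], and the function name, binning, and template contents are made up for illustration.

```python
def background_fraction(q, signal_hist, background_hist, q_min=0.0, q_max=1.0):
    """Return f_bg(q) = B(q) / (S(q) + B(q)) from equally binned templates."""
    n_bins = len(signal_hist)
    # Locate the bin containing q, clamping values at or beyond the edges.
    index = int((q - q_min) / (q_max - q_min) * n_bins)
    index = max(0, min(index, n_bins - 1))
    s = signal_hist[index]
    b = background_hist[index]
    # An empty bin carries no information; treat it as zero background fraction.
    return b / (s + b) if (s + b) > 0.0 else 0.0


# Made-up templates (expected events per bin), for illustration only.
signal_template = [10.0, 30.0, 60.0, 150.0]
background_template = [30.0, 20.0, 10.0, 5.0]

for q in (0.1, 0.6, 0.95):
    print(q, background_fraction(q, signal_template, background_template))
```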

Our background handling proceeds in two steps. Our method does not include an explicit background likelihood, as we do not integrate over the background matrix elements. Instead, we treat all events under the assumption that they are signal. We expect the observed events to contain signal and background in their expected fractions, so the combined log-likelihood of all observed events includes a contribution from background. To recover the likelihood for the signal events, we subtract off the expected contribution from the background events:

log Lsig(mt, JES) = Σi[log Li(mt, JES)] - nbg log Lavg(mt, JES | background)

where the Li are the individual likelihoods for each event and Lavg is the average likelihood for background events, as obtained from Monte Carlo. We can rewrite this slightly using the individual background fraction for each event:

log Lsig(mt, JES) = Σi[log Li(mt, JES) - fbg(qi) log Lavg(mt, JES | background)]
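The per-event form of the subtraction can be sketched at a single (mt, JES) grid point as follows. This is an illustration only: the function name and argument layout are hypothetical, and the numbers are made up. When the per-event background fractions sum to nbg, the result agrees with the first form of the equation.

```python
def signal_log_likelihood(event_log_l, event_f_bg, avg_bg_log_l):
    """Background-subtracted log-likelihood at one (mt, JES) grid point.

    event_log_l  : log L_i for each observed event at this grid point
    event_f_bg   : background fraction f_bg(q_i) for each event
    avg_bg_log_l : log L_avg for background events (from Monte Carlo)
    """
    return sum(
        log_l - f_bg * avg_bg_log_l
        for log_l, f_bg in zip(event_log_l, event_f_bg)
    )


# Made-up values, for illustration only.
log_l = [-2.0, -3.0, -2.5]
f_bg = [0.1, 0.5, 0.4]
log_l_avg_bg = -4.0

per_event_form = signal_log_likelihood(log_l, f_bg, log_l_avg_bg)
# First form: sum of log L_i minus n_bg * log L_avg, with n_bg = sum of f_bg.
global_form = sum(log_l) - sum(f_bg) * log_l_avg_bg
print(per_event_form, global_form)
```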

The second step in our background handling is to de-weight events which have been identified by our discriminant as being more likely to be background. We perform this de-weighting by averaging each event's likelihood curve with a uniform distribution U, weighted by the event's background fraction. So, for a single event, we have:

log L(mt, JES) → log [L(mt, JES)(1 - fbg(q) κ) + fbg(q) κ U]

In this equation, κ is an adjustable parameter that can be tuned to perform the optimum amount of de-weighting. However, in our studies, we determined that the optimum value of κ in the current analysis is 0; that is, the benefits of de-weighting background events are never greater than the penalties from accidentally de-weighting signal events in our case. Thus, for this version of the analysis, we do not perform the above de-weighting, only the subtraction of the expected background.
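The de-weighting formula and its κ = 0 limit can be checked with a few lines of code. This is a sketch for one event at one grid point; the function name is hypothetical and the value used for the uniform density U is a placeholder, since the source does not quote it.

```python
import math


def deweighted_log_likelihood(likelihood, f_bg, kappa, uniform):
    """log[L (1 - f_bg * kappa) + f_bg * kappa * U] for one event at one grid point."""
    weight = f_bg * kappa
    return math.log(likelihood * (1.0 - weight) + weight * uniform)


# Made-up inputs, for illustration only.
likelihood = 0.02
uniform = 0.005  # placeholder value for the uniform density U
f_bg = 0.6

# With kappa = 0 the expression reduces to the plain log-likelihood,
# which is the setting adopted in this version of the analysis.
print(deweighted_log_likelihood(likelihood, f_bg, 0.0, uniform), math.log(likelihood))
# With kappa > 0 a background-like event is pulled toward the flat distribution.
print(deweighted_log_likelihood(likelihood, f_bg, 1.0, uniform))
```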

Finally, in addition to background events, there is another class of events not handled well by our signal integration. These are events which contain a true ttbar pair, but where the four observed jets do not correspond to the four quarks produced in the ttbar decay; we call these events "bad signal" events. These can arise in a variety of ways (extra jets from radiated gluons, misidentified dilepton or all-hadronic events, W → τ decay, etc.) and overall comprise roughly 35% of our total signal. In order to deal with these events, we implement a cut on the log of the peak value of the likelihood curve; studies have shown that the optimal value of this cut is 6. We find that this cut removes about 22% of bad signal events and about 29% of background events while retaining 94-96% of good signal events. The table below shows the efficiency of the cut for "good signal", "bad signal", and background events. For a signal mass of 170 GeV/c2, 63.9% of 1-tag and 69.4% of >1-tag events are "good signal".

| Type of event | 1-tag | >1-tag |
|---|---|---|
| Good signal | 94.0% | 96.4% |
| Bad signal | 78.5% | 78.3% |
| Background | 71.1% | 70.5% |
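As a worked example, the cut efficiencies in the table can be combined with the quoted good-signal fractions at 170 GeV/c2 to estimate how the signal composition changes after the cut. This is a back-of-the-envelope sketch that assumes every signal event is either "good" or "bad"; the function name is hypothetical.

```python
def good_fraction_after_cut(good_fraction, eff_good, eff_bad):
    """Fraction of surviving signal events that are 'good signal'."""
    good = good_fraction * eff_good
    bad = (1.0 - good_fraction) * eff_bad
    return good / (good + bad)


# Inputs taken from the text and table: good-signal fraction before the cut,
# then the cut efficiencies for good and bad signal.
one_tag = good_fraction_after_cut(0.639, 0.940, 0.785)
multi_tag = good_fraction_after_cut(0.694, 0.964, 0.783)

print("1-tag good-signal fraction after cut:  %.3f" % one_tag)
print(">1-tag good-signal fraction after cut: %.3f" % multi_tag)
```

Under this assumption the good-signal fraction rises from 63.9% to about 68% for 1-tag events and from 69.4% to about 74% for >1-tag events.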

Method validation

To test and calibrate our method, we perform our integration on Monte Carlo samples at a variety of signal top masses with background events included in the expected fraction. For a given top mass, 2000 pseudo-experiments (PEs) are performed, where each pseudo-experiment contains on average 315.2 events (the expected number of observed events after applying the likelihood cut) randomly drawn from the signal and background pools according to their expected fractions; the number of events drawn from each pool is Poisson-fluctuated around its expected value.
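The pseudo-experiment construction described above can be sketched as follows. Several details are assumptions made for illustration: events are drawn from each pool with replacement, the split of the 315.2 expected events into signal and background (265.2 and 50.0 here) is a placeholder rather than a number from the analysis, and the function names are hypothetical. The Poisson sampler uses Knuth's multiplication method so that the example needs only the standard library.

```python
import math
import random


def poisson(mean, rng):
    """Draw a Poisson-distributed integer using Knuth's multiplication method."""
    limit = math.exp(-mean)
    count = 0
    product = rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def draw_pseudo_experiment(signal_pool, background_pool, mean_signal, mean_background, rng):
    """Build one pseudo-experiment with Poisson-fluctuated signal and background counts."""
    n_signal = poisson(mean_signal, rng)
    n_background = poisson(mean_background, rng)
    # Sampling with replacement is an assumption of this sketch.
    return rng.choices(signal_pool, k=n_signal) + rng.choices(background_pool, k=n_background)


rng = random.Random(2008)
signal_pool = ["sig-%d" % i for i in range(1000)]      # stand-ins for MC signal events
background_pool = ["bkg-%d" % i for i in range(1000)]  # stand-ins for background events

# Placeholder split of the 315.2 expected events; not taken from the analysis.
pe = draw_pseudo_experiment(signal_pool, background_pool, 265.2, 50.0, rng)
print(len(pe))
```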

For a given pseudo-experiment, we combine the individual event likelihoods, subtract off the expected background contribution as described above, and then extract the overall top mass using the "profile likelihood" method; that is, for each value along the mt axis, we select the value along the JES axis where the likelihood is maximized:

Lprof(mt) = max_{j ∈ JES} L(mt, j)
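On a discrete grid, the profile likelihood amounts to taking a row-wise maximum. The sketch below works with log-likelihoods (maximizing log L over JES selects the same point as maximizing L); the grid values come from a toy surface invented for illustration, and the function name is hypothetical.

```python
def profile_over_jes(log_l_grid):
    """For each mt grid point (row), keep the largest log L along the JES axis."""
    return [max(row) for row in log_l_grid]


mt_values = [170.0, 171.0, 172.0, 173.0, 174.0, 175.0]
jes_values = [-1.0, -0.5, 0.0, 0.5, 1.0]

# Toy log-likelihood surface with a correlated maximum at mt = 173, JES = 0.5.
log_l_grid = [
    [
        -0.5 * ((mt - 173.0 - (jes - 0.5)) / 1.2) ** 2 - 0.5 * ((jes - 0.5) / 0.8) ** 2
        for jes in jes_values
    ]
    for mt in mt_values
]

profile = profile_over_jes(log_l_grid)
best_index = max(range(len(profile)), key=profile.__getitem__)
print("profile log L:", [round(value, 3) for value in profile])
print("best mt on grid:", mt_values[best_index])
```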

We then extract our result and statistical uncertainty from the resulting 1-D likelihood curve. For an ensemble of 2000 PEs, we then compute the measured mass (determined by the mean of the ensemble), bias, expected statistical uncertainty, and pull.
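The ensemble quantities can be computed along the following lines. This sketch assumes the conventional pull definition, (measured mass - input mass) / reported uncertainty, and takes the expected uncertainty as the ensemble mean of the reported uncertainties; the source does not spell out either choice, and the function name and toy numbers are illustrative.

```python
import statistics


def ensemble_summary(measured_masses, uncertainties, input_mass):
    """Summarize an ensemble of pseudo-experiments for one input top mass."""
    pulls = [
        (mass - input_mass) / sigma
        for mass, sigma in zip(measured_masses, uncertainties)
    ]
    mean_mass = statistics.mean(measured_masses)
    return {
        "measured_mass": mean_mass,
        "bias": mean_mass - input_mass,
        "expected_uncertainty": statistics.mean(uncertainties),
        "pull_mean": statistics.mean(pulls),
        "pull_width": statistics.pstdev(pulls),
    }


# Tiny made-up ensemble, for illustration only (a real ensemble has 2000 PEs).
summary = ensemble_summary(
    measured_masses=[171.0, 173.0, 172.0, 172.0],
    uncertainties=[1.0, 1.0, 1.0, 1.0],
    input_mass=172.0,
)
print(summary)
```

A pull width near 1 indicates that the reported uncertainties describe the actual spread of the measurements; a width different from 1 is the kind of effect a calibration would correct.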

The plots below show the results of this test. The upper-left plot shows the measured mass as a function of input mass, while the upper-right plot shows the measured bias as a function of input mass. The lower-left plot shows the pull widths as a function of input mass, while the lower-right plot shows the expected uncertainty as a function of input mass.

[Figures: best mass, bias, pulls, and expected error, each as a function of input mass]

From the results of the above test, we adopt the following calibration constants:

Data results

In the 1.9 fb-1 data sample, we find a total of 371 events passing all of our selection cuts before the likelihood cut, 284 single-tag and 87 multiple-tag events. After applying the likelihood cut, we have 237 single-tag events and 81 multiple-tag events for a total of 318 events in our final likelihood. Applying the background subtraction, taking the profile likelihood, and applying the above calibration factors, we obtain a measurement of:

mt = 172.7 ± 1.8 (stat. + JES) GeV/c2

We can separate this uncertainty into a statistical uncertainty and an uncertainty due to JES by comparing this with the 1-dimensional result, which yields:

mt = 172.7 ± 1.2 (stat.) ± 1.3 (JES) GeV/c2
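The quoted numbers are consistent with combining independent uncertainties in quadrature. The check below is an arithmetic illustration using the rounded values from the text; treating the JES component as the quadrature difference between the combined and statistical uncertainties is an assumption, since the source only says the two results are compared.

```python
import math

combined_stat_jes = 1.8  # GeV/c2, from the 2D (mt, JES) likelihood
stat_only = 1.2          # GeV/c2, from the 1-dimensional result
systematic = 1.2         # GeV/c2, total systematic uncertainty

# JES component as the quadrature difference (assumption of this sketch).
jes_component = math.sqrt(combined_stat_jes ** 2 - stat_only ** 2)

# Total uncertainty from the three quoted components.
total = math.sqrt(stat_only ** 2 + 1.3 ** 2 + systematic ** 2)

print("JES component: %.2f GeV/c2" % jes_component)
print("Total:         %.2f GeV/c2" % total)
```

With these rounded inputs the JES component comes out at about 1.34 GeV/c2 and the total at about 2.14 GeV/c2, in line with the quoted ± 1.3 (JES) and ± 2.1 (total).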

We can also perform separate measurements on the 1-tag and >1-tag samples, which yield mt = 172.1 ± 2.3 GeV/c2 and mt = 174.2 ± 2.9 GeV/c2, respectively.

The plots below show the overall likelihood in data events. The plot on the left shows the likelihood over most of the range used in our integration. The right plot shows the contours corresponding to a 1-sigma, 2-sigma, and 3-sigma uncertainty around the peak. Calibration has been applied to the mt axis; since we do not make a JES measurement, we do not calibrate the JES axis.

[Figures: data likelihood; data contours]

We can also compare the observed uncertainty with the expected uncertainty from pseudo-experiments. The plot below shows this comparison for PEs at a signal mass of 172 GeV/c2; 50% of pseudo-experiments had a smaller uncertainty than the uncertainty measured in data.
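The comparison reduces to counting how many pseudo-experiments report an uncertainty below the observed one. A minimal sketch, with a hypothetical function name and a made-up list of PE uncertainties:

```python
def fraction_below(pe_uncertainties, observed_uncertainty):
    """Fraction of pseudo-experiments with an uncertainty smaller than the observed one."""
    smaller = sum(1 for value in pe_uncertainties if value < observed_uncertainty)
    return smaller / len(pe_uncertainties)


# Made-up PE uncertainties in GeV/c2, for illustration only.
toy_pe_uncertainties = [1.5, 1.6, 1.7, 1.75, 1.85, 1.9, 2.0, 2.1]
print(fraction_below(toy_pe_uncertainties, 1.8))
```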

[Figure: expected uncertainty from PEs]

Another comparison of interest is to compare the likelihoods observed in data with the likelihoods observed in Monte Carlo, to check the validity of our likelihood cut as applied to data. The plot on the left shows the top mass value at the peak of the likelihood curve for events which pass the likelihood cut. The plot on the right shows the value of the log-likelihood at the peak of the curve for all events; the cut at 6 is shown as the dashed line on the plot. In both cases the Monte Carlo is normalized to the number of data events (371 pre-cut and 318 post-cut).

[Figures: top mass value at peak of likelihood curve; log-likelihood value at peak]

Systematics

Our systematics are summarized in the table below.

| Systematic source | Systematic uncertainty (GeV/c2) |
|---|---|
| Calibration | 0.06 |
| MC generator | 0.19 ± 0.36 |
| ISR | 0.26 ± 0.37 |
| FSR | 0.13 ± 0.38 |
| Residual JES | 0.53 |
| b-JES | 0.36 |
| Lepton PT | 0.11 |
| Permutation weighting | 0.03 |
| Multiple interactions | 0.05 |
| PDFs | 0.25 |
| Background fraction | 0.33 |
| Background composition | 0.39 |
| Background average shape | 0.31 |
| Background Q2 | 0.07 ± 0.20 |
| Gluon fraction | 0.14 |
| b-tag ET dependence | 0.16 |
| Total | 1.16 |
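The quoted total can be reproduced by adding the individual sources in quadrature. The sketch below assumes that, for entries written as "shift ± uncertainty", the larger of the two numbers enters the sum; the source does not state this rule, but it is the choice that reproduces the quoted 1.16 GeV/c2 (using the shifts alone gives about 1.01). The dictionary layout is illustrative.

```python
import math

# (shift, statistical uncertainty on the shift or None), in GeV/c2.
systematics = {
    "Calibration": (0.06, None),
    "MC generator": (0.19, 0.36),
    "ISR": (0.26, 0.37),
    "FSR": (0.13, 0.38),
    "Residual JES": (0.53, None),
    "b-JES": (0.36, None),
    "Lepton PT": (0.11, None),
    "Permutation weighting": (0.03, None),
    "Multiple interactions": (0.05, None),
    "PDFs": (0.25, None),
    "Background fraction": (0.33, None),
    "Background composition": (0.39, None),
    "Background average shape": (0.31, None),
    "Background Q2": (0.07, 0.20),
    "Gluon fraction": (0.14, None),
    "b-tag ET dependence": (0.16, None),
}


def contribution(shift, stat_uncertainty):
    """Assumed rule: take the larger of the shift and its statistical uncertainty."""
    return shift if stat_uncertainty is None else max(shift, stat_uncertainty)


total_systematic = math.sqrt(
    sum(contribution(shift, stat) ** 2 for shift, stat in systematics.values())
)
print("Total systematic: %.2f GeV/c2" % total_systematic)
```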

Here is a brief summary of the systematic uncertainties:

Conclusion

We have measured the mass of the top quark using a total of 1.9 fb-1 of integrated luminosity. We find a total of 318 events passing all of our cuts, from which we extract a measurement of:

mt = 172.7 ± 1.2 (stat.) ± 1.3 (JES) ± 1.2 (syst) GeV/c2 = 172.7 ± 2.1 (total) GeV/c2