Search for Boosted Top Quarks in High Transverse Momentum Jets with 5.95 fb -1 of CDF Run II Data.


Raz Alon, Ehud Duchovni and Gilad Perez (Weizmann Institute of Science)


Pekka Sinervo (University of Toronto),   [Contact]

Abstract [Link to public note]

We present the results of a search for boosted top quarks in a sample of high transverse momentum jets observed in the CDF detector in a sample of 5.95 fb-1 and reconstructed with the Midpoint cone jet algorithm (without search cones). We observe 57 candidate events in a sample where we require two massive jets or one massive jet with significant missing transverse energy, with an estimated background of 46 ± 8.5 (stat) ± 13.8 (syst) events. We use these data to set an upper limit on the production cross section for Standard Model ttbar events with at least one top quark produced with pT > 400 GeV/c of 38 fb at 95% confidence level (C.L.). This can be compared with the comparable Standard Model predictions of 4.55 fb. Finally, we place an upper limit on the production of a pair of massive objects with masses near the top quark mass with at least one jet with pT > 400 GeV/c of 20 fb at 95% C.L.


Link to Previous Version

Introduction

This note follows on CDF Note 10199 [1] ) that documents a study of the highly boosted jets using a 5.95 fb-1 sample of jet data. In this note, we extend the study to look more closely at the potential signals for top quark production in this sample, focusing on top quarks produced with pT > 400 GeV/c.

Data Sample

Our data sample consists of events collected with the jet100 trigger and selected to have at least one jet with pT > 300 GeV/c and |η| < 0.7. We considered jets reconstructed with the Midpoint algorithm with cone sizes of 0.4, 0.7 and 1.0.

Expected Sources of Events

The high pT jet sample is dominated by the QCD production of light quarks and gluons at all pT scales. The relative rate of top quark pair production rises as one increases the minimum pT requirement. Without any suppression of other sources, ttbar production contributes approximately ~ 1.5% at the pT > 400 GeV/c. With light quark, bottom quark and gluon contributions suppressed by a factor of 250, ttbar production represents over 50% of the signal for objects with pT > 400 GeV/c.

NNLO Prediction for High pT Top Quark Production

The most recent NNLO calculation of the ttbar differential cross section [2] has been updated with the MSTW 2008 parton distribution functions and a top quark mass of mtop = 173 GeV/c2. The calculation itself includes next-to-leading-order (NLO) corrections to the leading-order diagrams along with next-to-next-to-leading-order (NNLO) soft-gluon corrections. No rapidity cut was placed on this cross section though the author believes this would have a neglible effect on the overall rate. The scale used is μ2 = pT2 +Mtop2.

This calculation for the pT distribution yields a total cross section of 8.15 pb and a cross section for pT > 400 GeV/c of 4.55+0.50-0.41 fb. Said another way, the fraction of top quarks produced with pT > 400 GeV/c is 5.58x10-4.

In our calculations of expected ttbar contributions, we will employ the Kidonakis and Vogt cross section of 4.55+0.50-0.41 fb for top quarks produced with pT > 400 GeV/c. With this cross section, the PYTHIA MC sample for this pT range has a sensitivity of 888 fb-1.

Event Reconstruction and Selection

We used a data sample collected with an inclusive jet trigger with a nominal transverse energy threshold of 100 GeV. The data sample corresponds to 5.95 fb-1 of Run II data. The entire inclusive jet sample consisted of 75,764,270 events for 5.95 fb-1. This corresponds to an effective triggered cross section of 12.7 nb.

More details of the event selection are provided in CDF Note 10199 [1].

Monte Carlo Modeling

Theoretical Expectations

With the selection described above, we believe that the event sample is dominated by jets produced by QCD scattering. The requirements that each event have a high quality primary vertex and that the calorimeter energy deposition associated with the leading jet be confirmed with charged tracks or the presence of both EM and HAD energy essentially eliminates all potential physics backgrounds and instrumental effects. We place a maximim value on SMET > 10 to reject backgrounds from cosmic rays.

The only other significant source of events to this sample is top quark pair production. Although the rate of top quarks is also expected to be of order 5 fb for pT > 400 GeV/c, these events will be unusual in that they will produce a small number of events with two massive objects. We therefore have considered these events as a potentially significant contribution to the event sample at high jet mass where the QCD rates are expected to be significantly reduced.

Top Quark Production and Decay

Top quark production is dominantly a pair-production process (ttbar) with the transverse momentum of the top quark being approximately half the mass of the quark, but with a long tail to higher transverse momentum. It is this tail that in principle contributes to any analysis looking at very boosted objects.

In order to understand the nature of this process and its characteristics when we require a central, high pT jet in the event, we used a standard top quark MC sample with 4.75 million events. As noted earlier, in order to normalize this sample, we employ the top quark cross section prediction by Kidonakis and Vogt prediction for top quarks with pT > 400 GeV/c. There are 4041 ttbar events with at least one top quark with pT > 400 GeV/c.

Given our event selection starts with a high pT jet in the central region, we make the same requirements on the top quark MC sample. We observe 1027 jets in this MC sample, which corresponds to an observed cross section for jets meeting these requirements of 1.15±0.14 fb, where the uncertainty includes the statistical uncertainty of the MC sample (almost neglible) and the uncertainty on the top quark cross section (which dominates). We note that this is approximately 600 times smaller than the observed rate of such jets in the data. As there are 866 events that are responsible for these jets, there are 161 events in this sample with two jets with pT > 400 GeV/c. This corresponds to an expected ttbar event rate in this sample of 0.9 fb.

Selection of an Enriched Top Quark Sample

Employing Both mjet1 and mjet2

We start with 4230 events with a high pT leading jet requirement of jet pT > 400 GeV/c and |η| < 0.7.

A simple strategy to detect the presence of ttbar production when one is searching for fully-hadronic ttbar decays is to use both candidate jets in an equivalent manner. We start from the observation, illustrated in the figure below, that for QCD dijet events the masses of the two leading jets are uncorrelated. We can therefore use the observed distribution in either mjet1 or mjet2 of events in the low jet mass peak (defined here to be 30 to 50 GeV/c2) relative to events in the top mass window of 130 to 210 GeV/c2 to estimate the QCD background in the signal region where both jet masses are between 130 and 210 GeV/c2.

We find that there are 370 events with both jets in the mass region 30 to 50 GeV/c2 (region A). We also find 47 events with mjet1 ∈ (130, 210) and mjet2 ∈ (30, 50) (region B). We find 102 events in the region mjet2 ∈ (130, 210) and mjet1 ∈ (30, 50) (region C). Note that the difference in rate between regions B and C is due to the pT cut placed on the leading jet.

We also use in this calculation the parameter Rmass=(NBNC)/(NAND) which was introduced in [4]. The value used for it is 0.89 coming from a POWHEG calculation, with a statistical uncertainty of 0.03 and a systematical uncertainty of 0.03 resulting from the difference between the POWHEG result and the lowest matched MC result in [4].

With these data, we estimate the number of QCD background events in the signal region (region D) to be 14.6±2.76 (stat). We observe 31 events in the signal region. This calculation is summarized in the table below.

Applying the same selection to our ttbar MC sample, we find 452 events in the signal region out of the 4041 ttbar MC events that have a leading jet with pT > 400 GeV/c. If we use the sensitivity of the MC sample of 888 fb-1, we would expect to see

events in the signal region.

Employing mjet1 and Missing ET Significance

In order to observe ttbar events where one top quark has decayed semileptonically, we turn to the sample of high pT jet events where a recoil jet has not been identified as a potential top quark candidate through its mass. In this case, there is a correlation in the signal events with high mjet1 and SMET , forming a signal region defined by mjet1 ∈ (130, 210) GeV/c2 and SMET ∈ (4, 10).

By assuming that mjet1 is independent of SMET , we can perform a calculation similar to that used in the two high mass jet case. We define region A to be the one with mjet1 ∈ (30, 50) and SMET ∈ (2, 3), region B as mjet1 ∈ (130, 210) and SMET ∈ (2, 3), region C to be mjet1 ∈ (30, 50) and SMET ∈ (4, 10) and region D to be the signal region. We find that there are 256 events in region A, 42 events in region B and 191 events in region C. With these event counts, we predict 31.3±8.1 (stat) events in region D (the signal region).

Applying the same selection to our ttbar MC sample, we find 283 events in the signal region out of the 1390 ttbar MC events that have a leading jet with pT > 400 GeV/c. If we use the the sensitivity of the MC sample of 631 fb-1, we would expect to see

events from ttbar production in the signal region.

We observe 26 events in this signal region, consist with the background estimate and also consistent with the number of expected background and signal events. This calculation is summarized in the table below.

Combination Channels

Combining the results of the two channels, we find 57 candidate events with an expected background from QCD jets of 46±8.5 events (the uncertainty is only statistical). The systematic uncertainty on the background rate is dominated by the uncertainty on the jet mass scale (see the next subsection), and results in a background estimate of 46±8.5 (stat)±13.8 (syst) events. The statistical significance of this result is modest, given the lack of any excess in the lepton+jets channel, and is even less if systematic uncertainties are taken into account. Given this relatively modest significance, we cannot claim observation of high pT top quark production.

However, we do find that we can still set interesting limits on top quark production using these two channels. First, we estimate the systematic uncertainties associated with this measurement.

Systematic Uncertainties

The largest systematic uncertainties affecting this analysis arise from uncertainty on the jet mass scale. The other sources of uncertainty we have considered are the integrated luminosity in the sample and the uncertainty in the top quark acceptance due to the uncertainty in the jet energy scale and the top mass used to create the MC samples.

We estimate the effect of the uncertainty on the jet mass scale by shifting the upper mass window by ±10.2 GeV/c2 and observing how the QCD background estimate changes. This results in a systematic uncertainty of -24% to +35% on the combined background rate of 57 events.

The jet energy scale uncertainty results in a systematic uncertainty on the top quark acceptance, which we determine by shifting the jet pT scale by 3% (the efficiency is sensitive to the jet energy scale simply because, for example, an underestimate in the jet energy scale would drop the observed rate of events and vice-versa). The resulting change in the top quark acceptance is 24.5%, using the pT distribution from the Kidonakis and Vogt calculation.

We find that the acceptance has an uncertainty of only 0.3% arising from the uncertainty on the top quark mass used to create the MC samples.

As mentioned we assign a systematic uncertainty of 0.03 on Rmass.

Finally, we incorporate a systematic uncertainty on the integrated luminosity of ±6%.

Together, these result in overall systematic uncertainties on the total cross section limit of ±44%.

Limits on ttbar Production

In principle, we can use the event rates observed in each channel as independent observations and combine them using a maximum likelihood technique or other statistical procedure to estimate the signal rate. However, given that we expect comparable signal-to-noise and acceptance in each channel, we combine the total number of candidate events and total background rate and use these to set an upper limit on ttbar production for top quarks with pT > 400 GeV/c. We calculate the 95% C.L. limit, folding in the systematic uncertainties, using the pseudo-experiment calculation developed by T. Junk and implemented in mclimit.C [3].

The resulting upper limit, taking into account the efficiency of 0.182 and the integrated luminosity of 5.95 fb-1, is 38 fb at 95% C.L. on standard model ttbar production for top quark pT > 400 GeV/c. This is approximately an order of magnitude higher than the estimated Standard Model rate, and is limited by the QCD background rates.

We note that the similar calculation using either of the two channels alone set upper limits that are within 50% of this limit.

Finallty, we calculate the "expected limit" by using the background estimated from the data-driven technique and assuming an observation of ttbar events at the expected level of 4.9 events. The mclimit calculation yields an upper limit of 33 fb at 95% C.L., which is lower than the observed limit since we see an excess of events above the expected signal plus background in the data.

It is interesting to set a limit on the fully hadronic channel, as this creates a selection that is sensitive to pair production of two massive objects near the mass of the top quark. We have 31 events with two massive jets with mjet ∈ (130, 210) GeV/c2, with a background estimate of 14.6±2.76 (stat)±4.4 (syst) events. As we are interested in beyond-SM contributions to this final state, we now include in the background estimate the expected ttbar contribution of 3±0.8 events. If we use the acceptance for top quark pair production in this channel (0.112) but then take out the top quark hadronic branching fraction of 4/9 = 0.44, employ the systematic uncertainties described earlier, and use the mclimits calculation, we set an upper limit of 20 fb at 95% C.L.

Conclusion

We present the first search for very high pT top quark production using data gathered with an inclusive jet trigger. We find a modest excess of events - 57 candidate events with an estimated background of 46±16.2 events - either in a configuration with two high pT jets each with masses between 130 and 210 GeV/c2 or where we observe one massive jet recoiling against a second jet with significant missing transverse energy.

We expect approximately 5 signal events from Standard Model top quark production. The data are not sufficiently significant to support a claim for observation of top quark production. However, we do set a 95% C.L. upper limit on the rate of top quark production for top quarks with pT > 400 GeV/c of 38 fb at 95% C.L.

We use these data to also search for pair production of a massive object with masses comparable to that of the top quark with at least one of the objects having pT > 400 GeV/c. We set an upper limit on the pair production of 20 fb at 95% C.L.

Results

Total Number of Observed Events in Signal Region 57
Predicted Background from QCD Jets in Signal Region 46±8.5(stat)±13.8(syst)
Expected Number of ttbar Events in Signal Region 4.9
95% C.L. Upper Limit on SM Top Quark Production with pT > 400 GeV/c 38 fb
95% C.L. Expected Upper Limit 33 fb
Total Number of Observed Events in Hadronic Signal Region 31
Predicted Background from QCD Jets in Hadronic Signal Region 14.6±2.76(stat)±4.4(syst)
Expected Number of ttbar Events in Hadronic Signal Region 3
95% C.L. Upper Limit on Heavy Object Pair Production with pT > 400 GeV/c In Hadronic Channel 20 fb
Table of Uncertainties
Table of Event Count - All Hadronic
Table of Event Count - Semileptonic
The data distribution of mjet2 vs mjet1, pTjet2 > 100 GeV/c, SMET < 4, cone R=1.0
The data distribution of SMET vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The data distribution of SMET (in the range 4-10) vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The ttbar MC distribution of mjet2 vs mjet1, pTjet2 > 100 GeV/c, SMET < 4, cone R=1.0
The ttbar MC distribution of SMET vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The ttbar MC distribution of SMET (in the range 4-10) vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The QCD MC distribution of mjet2 vs mjet1, pTjet2 > 100 GeV/c, SMET < 4, cone R=1.0
The QCD MC distribution of SMET vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The QCD MC distribution of SMET (in the range 4-10) vs mjet1, pTjet2 > 100 GeV/c, cone R=1.0
The QCD and ttbar MC distributions of mjet1 requiring pTjet2 > 100 GeV/c, SMET < 4, cone R=1.0
The QCD and ttbar MC distributions of mjet2 requiring pTjet2 > 100 GeV/c, SMET < 4, cone R=1.0
The QCD and ttbar MC distributions of mjet1 requiring pTjet2 > 100 GeV/c, 4 < SMET < 10, cone R=1.0
The QCD and ttbar MC distributions of mjet2 requiring pTjet2 > 100 GeV/c, 4 < SMET < 10, cone R=1.0
The QCD and ttbar MC distributions of pTjet2, cone R=0.7
The QCD and ttbar MC distributions of ηjet2, cone R=0.7
The QCD and ttbar MC distributions of SMET, cone R=0.7

References

  1. CDF Note 10199 (January 2011).
  2. N. Kidonakis and R. Vogt, "Next-to-Next-to-Leading Order Soft-Gluon Corrections in Top Quark Hadroproduction," Phys. Rev. D 68, 114014 (2003) [hep-ph/0308222].
  3. T. Junk, CDF Note 8128 (October 2007).
  4. Y. Eshel et al, "Implications of the Measurement of Ultra-Massive Boosted Jets at CDF", [hep-ph/1101.2898v2]

The above results were blessed on December 16, 2010. Created by Raz Alon. Last updated on May 20, 2011. [Contact]
★Blessed on January 11, 2011.
☆Blessed on May 19, 2011.