Primary Authors: Douglas Glenzinski, Matthew Herndon, Walter Hopkins, Teruki Kamon, DaeJung Kong, Vyacheslav Krutelyov, Cheng-Ju Lin, David Sperka, Julia Thom, Satoru Uozumi
This webpage provides some additional information of the analysis described in http://arxiv.org/abs/1107.2304, submitted to PRL. Slides of the Joint Experimental Theoretical Physics seminar (Wine and Cheese) held at Fermilab, July 15th, are here.
Processes involving Flavor Changing Neutral Currents (FCNC) provide excellent opportunities to search for evidence of new physics since in the standard model they are forbidden at tree level and can only occur through higher order loop diagrams. Two such processes are the decays Bs (Bd) → μ+μ-. The SM predictions for these branching fractions are BR(Bs → μ+μ-) = (3.2 ± 0.2)× 10-9 and BR(Bs → μ+μ-) = (1.00 ±0.1)×10-10 (1).
These predictions are one order of magnitude smaller than the current experimental sensitivity. Previous bounds from the CDF collaboration, based on 3.7 fb-1 of integrated luminosity, are BR(Bs → μ+μ-) < 4.3×10-8 and BR(Bd → μ+μ-) < 7.6×10-9 at 95% C.L. A description of the previous CDF analysis is given here.
Enhancements to Bs → μ+μ- occur in a variety of different new-physics models. For example, in supersymmetry (SUSY) models, new supersymmetric particles can increase BR(Bs → μ+μ-) by several orders of magnitude at large tanβ, the ratio of vacuum expectation values of the Higgs doublets (2). In the minimal supersymmetric standard model (MSSM), the enhancement is proportional to tan6β. For large tanβ, this search is one of the most sensitive probes of new physics available at the Tevatron experiments.
This measurement uses 7 fb-1 of integrated luminosity collected by the CDF detector and supersedes our previous published result (http://arxiv.org/abs/0712.1708), which used 2/fb of data.
In addition to increasing the size of the data set, the sensitivity of this analysis is improved another 20% by including events which cross regions of the COT where the trigger efficiency is rapidly changing (4) and by including events with muon stubs in the CMX miniskirt region. Other improvements include the use of an improved neural-network (NN) discriminant that provides approximately twice the background rejection for the same signal efficiency as shown in this plot.
The events are collected using a set of dimuon triggers and must satisfy either of two sets of requirements corresponding to different topologies: CC events have both muon candidates detected in the central region (often labeled "CMU" by CDF), while CF events have one central muon and another muon detected in the forward region (aka CMX).
We use a new NN that has 14 input variables compared to 3 input variables of the previously used NN. The signal discrimination power of the 14 variables and some additional kinematic variables (pT(B) and pT(higher pT muon)) are shown here, here, and here. We check the MC modeling of our 14 input variables using a sample of B+ → J/Ψ K+ → μ+μ- K+ events collected on the same triggers and satisfying the same set of baseline requirements. The kaon is required to have pT>1 GeV/c. The comparisons are shown here, here, and here. For these plots, in order to better mimic the resolutions for the B→μ+μ- decays, the vertex variables use only the two muons from the J/Ψ to mimic the resolutions of the Bs→μ+μ- decay while the pT(B) and isolation variables use the 3-track information.
The baseline selection requires high quality muon candidates with transverse momentum relative to the beam direction of pT > 2.0 (2.2) GeV/c in the central (forward) region. The muon pairs are required to have an invariant mass in the range 4.669 < Mμμ < 5.969 GeV/c2 and are constrained to originate from a common well measured three-dimensional (3D) vertex. A likelihood method together with a dE/dx based selection are used to further suppress contributions from hadrons misidentified as muons. Only a fraction of the total number of background and simulated signal events are used to train the NN. The remainder are used to test for NN overtraining and to determine the signal and background efficiencies.
Several tests are done to ensure νNN (the neural network output) is independent of Mμμ. We train the NN with the inner and outer part of our sideband and then compare the NN output of the two trained NN. In the resulting plots for CC and CF show no signs of mass bias. We also check the NN output as a function of dimuon mass and find no correlation between mass and NN output. All selection criteria were finalized before revealing the content of the signal regions. The optimization used the expected upper limit on the branching fraction as a figure of merit. To exploit the difference in the Mμμ distributions between signal and background and the improved suppression of combinatorial background at large νNN , the data is divided into sub-samples in the (νNN , Mμμ) plane. The CC and CF samples are each divided into 40 sub-samples. There are eight bins in νNN with bin boundaries 0.70, 0.76, 0.85, 0.90, 0.94, 0.97, 0.987, 0.995 and 1. Within each νNN bin we employ five Mμμ bins, each 24 MeV/c2 wide, centered on the world average Bs (Bd) mass.
We use candidate B+→ J/Ψ K+ events collected on the same triggers as a relative normalization to estimate the BR(Bs → μ+μ-) as:
BR(Bs → μ+μ-) = NBs/(αBs⋅εrecoBs) ⋅ (αB+εrecoB+)/ (NB+) ⋅ εtrigB+/εtrigBs ⋅ εNNBs fu/fs ⋅ BR(B+→ J/Ψ K+)⋅ BR(J/Ψ→μ+μ-),
where NBs is the number of candidate Bs → μ+μ- events, αBs is the geometric and kinematic acceptance of the di-muon trigger for Bs → μ+μ- decays, εrecoBs is the reconstruction efficiency for Bs → μ+μ- events in the acceptance, εtrigBs is the trigger efficiency for Bs → μ+μ-, with NB+, αB+, εtrigB+, and εrecoB+ similarly defined for B+→ J/Ψ K+ decays; the ratio fu/fs accounts for the different b-quark fragmentation probabilities and is (0.402 ± 0.013)/(0.112 ± 0.013) = 3.589 ± 0.374 (4), where the (anti-)correlation between the uncertainties has been accounted for. The final two terms are the relevant branching ratios BR(B+→ J/Ψ K+) ⋅ BR(J/Ψ → μ+μ-) = (1.01 ±0.03)×10-3 ⋅ (5.93 ±0.06)×10-2 =(6.01 ± 0.21)×10-5 (4). A summary of the values for all the parts of the equation is given in this table. The analysis described is also sensitive to Bd→μ+μ- decays. The BR(Bd → μ+μ-) is estimated from the same equation, substituting Bs for Bd, and changing fu/fs to fu/fd = 1. All other aspects are the same as the Bs → μ+μ- search.
The backgrounds in this analysis are categorized as combinatoric and B→ h+h'- peaking background.
The combinatoric background is estimated by fitting a fixed slope first order polynomial to the mass sidebands with dimuon mass greater than 5 GeV/c2. The lower sideband goes from 4.669 GeV/c2 to 5.169 GeV/c2 while the upper mass sideband goes from 5.469 GeV/c2 to 5.969 GeV/c2. The slope is attained from the dimuon mass shape for all NN bins combined (νNN>0.7). We then allow the normalization of the pol1 to float and fit for each NN bin. The individual fits to the separate NN bins can be seen in these figures: Lower NN bins, CC, Higher NN bins, CC, Lower NN bins, CF, Higher NN bins, CF For the three highest NN bins we also assign a shape systematic based on our ignorance of the background shape.
We fit a completely floating pol1 to sidebands with dimuon mass >5GeV/c2, and an exponential to the entire sideband region and compare the results with our standard fixed slope pol1. We take the largest difference as our systematic. The resulting relative errors are shown in this table.
The final expected number of combinatoric background events in all the NN bins for both Bs and Bd signal regions are given in this table.
Our other main source of backgrounds is a the peaking B→ hh background where both hadrons pass our muon ID. This background is more signicant in the Bd mass window (an order of magnitude larger than in the Bs window). The peaking B→ h background is estimated by Monte Carlo and D*-tagged D0→ Kπ events. We measure the probability that a pion or kaon will satisfy our muon identification criteria using a data sample of D*-tagged D0→ Kπ events. We use a MC sample of B→ hh events to estimate the acceptance, the pT(hadron) distribution, and the shape of the invariant mass distribution (assuming the muon mass for both legs). All the reconstruction efficiencies are taken from the data in the manner described above. The estimated B→ hh background in each NN bin is shown here.
We use 4 control regions to test our background estimates:
The first 3 samples are dominated by combinatoric backgrounds with negligible B→ hh contributions. Due to the looser muon-id requirements, the FM+ sample has a significant B→ hh contribution. For each control sample we compare the number of predicted background events to the number observed in each NN bin for the CC and CF channels separately. For these cross checks we use a "signal" region defined as 5.169 < Mμμ < 5.469 GeV/c2. The results are shown for CC and CF channels. The B→ hh contribution to the FM+ sample is broken out in this table. These comparisons give us confidence in our background estimates.
The number of observed events is compared to the number expected in all 80 sub-samples for the Bd search region in this table and is summarized in this plot.The data are consistent with the background expectations and yield an observed limit of BR(Bd→μ+μ-) < 6.0 (5.0) × 10-9 at 95% (90%) C.L. An ensemble of background-only pseudo-experiments are employed to estimate the significance as a p-value. The effects of systematic uncertainties are included in the pseudo-experiments by allowing them to float within Gaussian constraints. The p-value is obtained by comparing the log-likelihood ratio, -2ln(Q), observed in the data with the distribution from an ensemble of MC pseudo-experiments. Here Q=L(s+b|data)/L(b|data), where L(h|x) is the product of Poisson probabilities over all NN and mass bins. The systematic uncertainties are included as nuisance parameters. The likelihood is minimized with respect to the nuisance parameters. The resulting p-value for background-only pseudoexperiments is 23.3%.
The result for the Bs region are shown in this table and is summarized in this plot. There is a excess of events concentrated in the νNN>0.97 region. The p-value for background-only pseudoexperiments is 0.27. The excess is concentrated in bins with νNN > 0.97. The excess in the 0.97 < νNN < 0.987 bin appears to be a statistical fluctuation of the background as there is no significant expectation of Bs → μ+μ- signal consistent with the observation in the two highest NN bins. If we consider only the two highest NN bins the p-value becomes 0.66%. If we include Bs→μ+μ- events in the pseudo-experiments at the SM level (BR=3.2× 10-9) we obtain a p-value of 1.9% (4.1%) using all (only the highest 2) NN bins. Using all NN bins this changes to 2.1% if we use the SM BR +1σ of it's uncertainty (3.4× 10-9).
We use the log-likelihood fit described above to determine the BR(Bs→μ+μ-) most consistent with the data in the Bs search region. From the resulting Δχ2 distribution the BR(Bs → μ+μ-) is taken as the value at the minimum and the uncertainty as the BR corresponding to 1 unit change in Δχ2, BR(Bs → μ+μ-)=(1.8+1.1-0.9)× 10-8. For comparison, we also used a Bayesian posterior technique. The resulting central value and 68% CL obtained from this fit is consistent with that obtained from the Δχ2. Additionally we set bounds at 90% (95%) C.L. on the braching fraction of Bs → μ+μ- of 4.6 × 10-9 < BR(Bs → μ+μ-) < 3.9 × 10-8 (2.8 × 10-9 < BR(Bs → μ+μ-) < 4.4 × 10-8).
We also derive an upper limit at 95% (90%) C.L. of BR(Bs → μ+μ-) < 4.0 × 10-8 (3.5 × 10-8) with the CLs methodology.
(1) A. Buras, B. Duling, T. Feldmann, T. Heidsieck, C. Promberger, S. Recksiegel, 1002.2126 or JHEP 09, 106 (2010).
(2) S. R. Choudhury and N. Gaur, Phys. Lett. B 451, 86 (1999); K.S. Babu and C. Kolda, Phys. Rev. Lett. 84, 228 (2000).
(3) Public CDFNOTE 9860.
(4) W. M. Yao et al. [Particle Data Group], J. Phys. G 33, 1 (2010).
The list of Figures and Tables is here.
Date: 2011-09-27 12:26:45 CDT