..
國立臺灣大學理學院物理學系 碩士論文
Department of Physics College of Science
National Taiwan University Master Thesis
Belle II 實驗第一級觸發器中二維軌跡探測器之實現 Implementing the 2D track reconstruction for the
Level 1 trigger of the Belle II experiment
盛子安 Tzu-An Sheng
指導教授:張寶棣博士 Advisor: Paoti Chang, Ph.D.
中華民國 107 年 7 月
July, 2018
..
..
誌謝
碩士班的光陰,三分積澱在這卷論文的算式圖表中,剩下的七分化 作煙塵,隨風搖落在經途草木之下。這些無緣成為論文題目的研究歷 練,縱然有欠完整,也依舊滋養了百木;儘管遠僻,也絕不失蔥鬱。但 願有朝一日,我會慶幸曾在這片森林裡踟躕過。
在這裡特別向這幾條研究的「歧途」上,提攜我的貴人們致謝。感 謝張寶棣、徐靜戈、王名儒三位老師對我物理分析技巧的琢磨與指導,
也感謝李彥頡老師在我們合作的研究中,給予我熱情與鼓勵。感謝賴 昀樅悉心的教學與敦促,更感謝曾衍銘對我傾囊相授研究經驗與心法。
感謝黃坤賢活絡了學生之間的交流討論,也感謝張祐豪、張硯詠帶給 實驗室無盡的精神食糧。感謝柯尹拉 (Suman Koirala) 數度肯定我另闢 蹊徑的分析程式架構,也感謝裴思達 (Stathes Paganis) 老師向我分享 高能物理實驗的樂趣。感謝稻見武夫老師提供我在研究之外,能站在 講台上與學生們切磋習題的經驗,也感謝陳昱潭屢次在我困於研究生 活的水火中時遞出的關懷。感謝劉建宏與趙元對我在管理組內伺服器 與在台大架設測試站時大力相助,也感謝黃子娟、周建宏對我實驗上 的的諸多幫助與指導。限於篇幅,感謝所有和我一起在碩士班修課、談 天與苦惱的好友。
最後深深感謝父母在我稍長的求學期間,對我的支持與縱容。
..
..
Acknowledgements
I would like to thank my advisor, Prof. Paoti Chang, who taught me about particle physics and strives for funding. I am also grateful to Dr. Jing-Ge Shiu for his great mentoring.
I am indebted to Dr. Sara Pohl, who raised the theoretical perfor- mance of the 2D tracker, helped me with the code, and never ceases to amaze me with her thoughtfulness in this research. I also appre- ciate Dr. Yun-Tsung Lai’s perseverance to smooth the data transmis- sion. Thank Dr. Yoshihito Iwasaki for his good leadership. Thank Dr. Hideyuki Nakazawa for the help with the data taking and his kind support during my stay at KEK. Thank Dr. Jae-Bak Kim for all his in- spiring ideas. Thank Prof. Jeri M.C. Chang for bringing the research topic to me.
Thank Shiu-san, Nakazawa-san and Jeffery Chiang for their pa- tience as editors and readers of the draft.
..
..
摘要
位處日本筑波的 B 介子工廠:KEKB 正負電子加速器與 Belle 實驗,
透過研究 B 介子衰變中,弱作用之電荷對稱宇稱破壞的現象,奠定了 小林──益川理論的實驗基礎,並且促成 2008 年的諾貝爾物理獎。為 了從稀有衰變中探究粒子物理標準模型以外的新物理,此工廠正升 級為 SuperKEKB 加速器與 Belle II 實驗,將加速器瞬時亮度提升至 8 × 1035cm−2s−1(原先的 40 倍)。然而在 Belle II 偵測器中,資料擷 取的速率上限僅為每秒 3 萬次,並不能紀錄新亮度之下所有的對撞事 例。實際上,具研究價值的Υ 介子、B 介子及τ 子等事例僅佔所有對撞 事件的數個百分比。另外還有許多偵測器反應並非對撞事件,而是源 自加速器中帶電粒子簇的散射、同步輻射、或是粒子與真空管線中殘 餘空氣分子碰撞等背景雜訊。為了在資料擷取的速限之下盡可能紀錄 所有珍貴的事例,Belle II 實驗勢必得仰賴一套基於硬體的即時觸發系 統,提供高效率、低延遲、無死區時間的事例判別,使資料擷取系統得 以忽略背景事例,不至受到掣肘。
由於多數背景事例不會在碰撞點附近產生具高橫向動量的帶電粒 子,這樣的粒子便成為判別背景事例的關鍵。因此,Belle II 實驗將帶 電粒子軌跡觸發器重新改造,以因應加速器亮度提升。在高能加速器 實驗中,透過辨認帶電粒子通過偵測器時,在數十處感應線留下的電 流訊號,我們得以重建帶電粒子的空間軌跡。由於偵測器內通有縱向 磁場,我們亦可藉螺旋軌跡推知粒子的動量。掌握帶電粒子的數量、動 量等資訊,並輔以量能器能量團與帶電粒子軌跡的空間對應關係,便 能輕易地區分目標事例與背景事例的差別。
.. 帶電粒子留下的電流訊號經過數位化後成為擊打訊號,輸入至軌跡 觸發器。軌跡觸發器首先將偵測器中相鄰的擊打訊號組成區段擊打訊 號。每個帶電粒子軌跡由最多 9 層的區段擊打訊號所組成,其中 5 層 包含了三維的粒子螺旋軌跡在偵測器橫段面上所投影出的二維圓弧軌 跡訊號。這 5 層訊號的幾合位置透過共形變換以及霍夫變換後,在軌 跡參數空間中形成許多三角函數曲線。藉由尋找參數空間中 4 條以上 來自不同層的曲線交點,可知幾何空間中四層以上共圓弧的區段擊打 訊號,與該圓弧所對應的粒子橫向動量之大小及方向。將橫向動量與 剩餘四層包含粒子螺旋軌跡縱向資訊的區段擊打訊號結合後,即可推 知完整的三維軌跡。
前述尋找區段擊打訊號、尋找二維軌跡及尋找三維軌跡的步驟皆由 各別的硬體模組所實現。另外,軌跡觸發器還包含了整合最前端感應 線擊打訊號的模組。各模組之間由光纖傳輸連接。本論文著重於將上 述由二維區段擊打訊號尋找二維軌跡的演算法,以現場可程式化邏輯 閘陣列實現。實現後的邏輯延遲為 11 個時脈周期(相當於 350 奈秒,
不包含傳輸所需的延遲)。透過測量宇宙射線事例,並與更精密的軟體 軌跡重建方法比較後,我們推估對於所有橫向動量在0.5 GeV以上、與 碰撞點徑向距離小於 1 公分、含有 4 個以上區段擊打訊號、並且不受 前端模組錯誤影響的所有軌跡,二維軌跡尋找效率在一個標準差之下 的信心區間完全落在 98% 以上。
本論文同時紀錄了二維軌跡擬合的實現方法。這個方法利用軌跡 偵測器中由高能帶電粒子碰撞氣體分子游離出的電子,以及由該電子 游離出的次級電子在電場中的的飄移速度,通過測量飄移時間,推算 出更精確的軌跡區段擊打位置,並且以最小平方法擬合得出更精密的 二維軌跡。由於這個步驟將會併至更後端的三維軌跡擬合模組中實現,
並且包含大量需藉由查表實現的運算步驟,因此在不喪失計算精確度 的前提下降低記憶體用量便成為最大的挑戰。我們發展了複和式的查 表方法,並利用三角函數的對稱性減低記憶體用量。另外,本論文也 包含數項對建立光纖傳輸資料流穩定性的改善。尤其透過以特定時間
.. 間隔重置位於晶片同一側的光纖收發器,我們得以在更高的傳輸速率 下提升建立傳輸資料流的穩定性。
關鍵字: Belle II 實驗;模式辨認;粒子軌跡;CP 破壞;觸發器;現 場可程式化邏輯閘陣列
..
..
Abstract
The Belle experiment at the KEKB collider in Tsukuba, Japan is a B meson factory designed to operate at a center-of-mass energy of 10.58 GeV, the mass value of Υ(4S). It is undergoing an upgrade that will boost its instantaneous luminosity to 8 × 1035cm−2s−1 (40 times higher than before), whereas the maximum acceptable event rate for the data acquisition system is only 30 kHz. Most of the detector re- sponses arise from the scattered particles with other particles in the accelerated bunch, or with the residual gas molecules in the vacuum beam pipe. Furthermore, only a few percent of the total number of e+-e− collisions correspond to Υ, B or τ events. The rest are consid- ered backgrounds and must be either suppressed or prescaled in real time without losing too many signal events. To achieve this goal, a hardware-based online trigger system with good background suppres- sion, high efficiency, low latency and no dead time is indispensable.
In experimental particle physics, tracking refers to the pattern recog- nition process that searches for the trajectories of charged particles by analyzing the traces they leave on the detector. Once the trajectory, or the track, is reconstructed, the momentum and the charge is also determined. High-precision tracking provides crucial information for telling signals from backgrounds, since most background events don’t produce charged particles with enough transverse momenta near the collision point. As a result, the track trigger in Belle II is redesigned
.. to accommodate the dramatic increase of luminosity and background rate.
The track trigger starts from relating adjacent wire hits in space and in time from a drift chamber, grouping them into maximally 9 seg- ments of a track. Out of the 9 segment hits, 5 are groups of sense wires parallel to the beam axis, and thus their positions contain information of the track projected onto the 2-dimensional plane perpendicular to the beam axis. The track trigger then detects the coincidence of several axial track segments by transforming their radial and angular posi- tions to a parameter space with a conformal map followed by a Hough map, and looking for their intersections there. Each segment in one layer of the detector cylinder contributing to the track is extracted. Af- terwards, it fits these positions with the drift length, and reconstructs the track’s projection in the plane perpendicular to the beam axis. Fi- nally, by combining the 2D track information with the remaining track segments which contains the information of longitudinal position, the vertex position along the beam axis is reconstructed. Each of these steps is a separate module in the track trigger system. This thesis fo- cuses on implementing the steps of finding and reconstructing the 2D track using an algorithm developed by our collaborator.
The 2D tracker module is implemented on 4 printed circuit boards with field programmable gate array (FPGA) and 10 Gbps optical I/O connection to both upstream and downstream modules. It has a la- tency of 11 data clocks (352 ns) excluding the transmission time. The lower bound of the 1-𝜎 confidence interval of its tracking efficiency is measured to be more than 98% for cosmic ray tracks with radial im- pact parameters smaller than 1 cm, 𝑝t > 0.5 GeV, with at least 4 track segment hits, and coming from regions with expected track segment finding efficiency.
.. This thesis also outlines the implementation of the 2D fitter, which involves fitting an arc to the positions of the axial track segment hits corrected by their drift lengths. As the fitting contains many fixed- point arithmetic operations implemented as look-up tables, it is crucial to reduce the usage of the block RAM while maintaining similar arith- metic precision. Composite look-up tables, which increase the preci- sion in the worst-performing part of the arithmetic function’s range by sacrificing the unnecessary precision in other parts, are developed to meet the requirement. Lastly, several improvements are made to stabilize the buildup process of the optical transmission data flow. In particular, an automatic way to reset different optical transceivers on the same side of the die, separated with an adjustable time interval, is tested to make the buildup more stable at the full 10 Gbps lane rate.
Keywords: Belle II; tracking; CP violation; trigger; FPGA
..
..
Contents
誌謝 iii
Acknowledgements v
摘要 vii
Abstract xi
1 Introduction 1
1.1 A matter-antimatter asymmetric universe . . . 2
1.1.1 Current situation . . . 2
1.1.2 Degree of the asymmetry . . . 6
1.1.3 Against a symmetric Universe . . . 7
1.2 𝒞𝒫 violation in the Standard Model . . . 10
1.2.1 Quark flavor mixing . . . 13
1.3 B-meson decays as probes for new physics . . . 18
1.3.1 𝒞𝒫 violation and neutral B meson mixing. . . 19
1.3.2 𝒞𝒫 violation observable . . . 24
1.3.3 Highlight of the recent B measurements . . . 34
1.4 The Belle II Experiment . . . 37
1.4.1 Distinctiveness of an e+e−machine . . . 37
1.4.2 The Belle II detector . . . 38
1.4.3 The SuperKEKB accelerator . . . 42
.. 2 The Level 1 Trigger in Belle II 43
2.1 Accelerator reviewed . . . 44
2.1.1 RF acceleration and beam dynamics . . . 44
2.1.2 Main structure of the accelerator . . . 49
2.1.3 The nano-beam scheme . . . 51
2.2 Beam background source . . . 53
2.3 Event rate at Belle II. . . 56
2.4 Data acquisition in Belle II . . . 57
2.5 Requirements of Level 1 Trigger System . . . 59
2.5.1 Event time decision . . . 59
2.5.2 Requirements from the FEE and the DAQ system . . . 60
2.6 Structure of the Level 1 Trigger System . . . 61
2.7 The track trigger . . . 62
2.7.1 A closer look at the tracking detector . . . 64
2.7.2 Track reconstruction at the first level trigger . . . 65
2.7.3 The Track Segment Finder . . . 69
2.7.4 The 2-dimensional (2D) tracker . . . 69
3 High level algorithm 73 3.1 The 2D finder . . . 74
3.1.1 Input and output . . . 74
3.1.2 Conformal mapping and Hough mapping . . . 74
3.1.3 Discretization . . . 78
3.1.4 Clustering . . . 78
3.1.5 Peak finding . . . 81
3.2 The 2D selector . . . 81
3.3 The 2D fitter . . . 82
3.3.1 The 2D fitter with drift time information . . . 82
3.3.2 Principle of the 2D fitter . . . 82
3.3.3 Treatment of the priority position . . . 85
.. 4 Implementation 87
4.1 Hardware specification . . . 87
4.1.1 Field programmable gate array . . . 87
4.1.2 The printed circuit board . . . 89
4.1.3 Clock signals . . . 91
4.1.4 Parallelism . . . 92
4.2 I/O definition . . . 93
4.3 Decoder . . . 98
4.4 Persistor . . . 99
4.4.1 Timing clones . . . 100
4.5 Finder. . . 101
4.5.1 Mapping . . . 101
4.5.2 Voting . . . 101
4.5.3 Clustering . . . 102
4.5.4 Peak Finding . . . 104
4.6 Selector . . . 107
4.6.1 Track parameter extraction . . . 107
4.6.2 TS association . . . 108
4.6.3 Persistence suppression . . . 108
4.7 Hierarchical view of the core logic . . . 111
4.7.1 Core logic latency . . . 111
4.8 Fitter . . . 113
4.8.1 Representation of numbers . . . 113
4.8.2 Numerical operation . . . 113
4.8.3 Design of the 2D fitter . . . 114
4.8.4 LUT functions . . . 115
4.8.5 Numerical error of the 2D fitter . . . 122
4.9 Common FPGA modules in the UT3 . . . 122
4.9.1 VME interface . . . 123
.. 4.9.2 GTH optical I/O . . . 124
4.9.3 Belle2Link interface . . . 124
4.10 Implementing an FPGA design . . . 125
4.10.1 Timing closure . . . 126
5 Validation 129 5.1 Fast trigger software simulation . . . 130
5.1.1 Efficiency and resolution. . . 131
5.2 HDL simulation . . . 132
5.3 Local cosmic ray test . . . 134
5.3.1 Testing condition . . . 134
5.3.2 Output of the 2D tracker . . . 135
5.4 Global cosmic ray test. . . 138
5.4.1 Performance in the Global Cosmic Ray Test 1 . . . 141
5.4.2 Performance in the Global Cosmic Ray Test 2 . . . 143
6 Resetting the High-speed optical transmission 155 6.1 Start-up instability of the full-speed GTH transmission . . . 156
6.1.1 Problem in the reset sequence . . . 157
6.1.2 Coupling between different GTH quads . . . 159
6.2 New reset for the full-speed GTH transmission . . . 161
7 Conclusion 163 7.1 Open issues . . . 164
7.2 Prospect . . . 165
A Track trajectory parameterization 167 B Estimation of the statistical uncertainty 169 B.1 Uncertainty of the efficiency . . . 169
B.2 Uncertainty of the resolution . . . 170
.. C Change of the 2D tracker parameters 173 D Measurements of the baryon–antibaryon asymmetry 175 D.1 Acoustic peaks in the CMB anisotropies . . . 175 D.2 Light element abundance of Big Bang Nucleosynthesis . . . 180
Bibliography 185
..
..
List of Figures
1.1 He/He flux ratio. . . 3
1.2 The global CKM fit in the ̄𝜌- ̄𝜂 plane . . . 19
1.3 Box diagrams of the B0self interaction . . . 20
1.4 Box diagrams of the B0-B0mixing . . . 21
1.5 Feynman diagrams for B → Kπ, ππ . . . 27
1.6 Penguin contribution to B → ϕK . . . 31
1.7 The sin(2𝛽) values for decay modes related to b → s penguins . . . 32
1.8 Cross section of e+e−→ hadrons . . . 33
1.9 Background-subtracted Δt distributions and asymmetries . . . 33
1.10 Current status of 𝑅Dand 𝑅D∗ measurements . . . 35
1.11 The 𝑃5′ angular observable in bins of 𝑞2 from LHCb Run 1 data . . . 37
1.12 Belle II detector . . . 39
2.1 Phase space plots of the beam particle . . . 47
2.2 Lattice design of the arc cell . . . 48
2.3 The SuperKEKB accelerator . . . 49
2.4 Schematic drawing around the positron target . . . 50
2.5 RF system in the main ring required to produce the ultimate lumi- nosity . . . 51
2.6 Beam size near the collision point . . . 53
2.7 Crossing angle at SuperKEKB . . . 53
2.8 Horizontal and vertical collimators at SuperKEKB . . . 54
2.9 Scaler rates as a function of time after injection . . . 55
.. 2.10 Simulated background CDC wire hit rate . . . 57 2.11 Bunches in the storage rings of SuperKEKB . . . 57 2.12 Overview of the Belle II data acquisition system . . . 58 2.13 The Level 1 trigger system . . . 63 2.14 𝑧-vertex distribution of the tracks in Belle random trigger events . . 63 2.15 CDC in the Belle II and the Belle detector . . . 65 2.16 Wire configuration in the CDC . . . 66 2.17 Measured drift time v.s. drift length in the CDC . . . 66 2.18 3D view of the CDC wires related to a track . . . 67 2.19 CDC sub-trigger system . . . 68 2.20 A simulated event with background hits in the Central Drift Chamber 71 3.1 Transformation from the geometrical space to the parameter space 75 3.2 Conformal map . . . 75 3.3 The curves in the parameter space . . . 77 3.4 Voting in the accumulator space. . . 77 3.5 Tracking efficiency depending on 𝑝tand number of TS hits . . . 78 3.6 Connected and disconnected cells. . . 79 3.7 Disconnected squares . . . 80 3.8 Clustering with the seed square . . . 80 3.9 The relation used in the 2D fitter . . . 83 4.1 The main boad of UT3 . . . 90 4.2 Acceptance range of the first 2D tracker . . . 93 4.3 Bit map of the 2D tracker input from Track Segment Finder . . . . 95 4.4 Bit map of the 2D tracker output . . . 97 4.5 Timing clones due to persistence and a trigger threshold of 4 TS hit 100 4.6 Logic diagram of the voting process . . . 102 4.7 Clustering of the squares in a block . . . 103 4.8 Logic diagram of finding the upper-right corner cell . . . 105
.. 4.9 Priority of choosing the corner square. Smaller numbers take prece- dence. . . 105 4.10 Decomposition of the cluster center . . . 106 4.11 Technology schematic diagram of the TS linking process using 2
clock cycles. . . 109 4.12 Technology schematic diagram of the TS linking process using only
1 clock cycle . . . 109 4.13 Persistence suppression . . . 110 4.14 Rules regarding whether to send new output . . . 110 4.15 Pipeline stages of the 2D tracker . . . 112 4.16 Schematic of the 2D fitter . . . 116 4.17 (Continued from Fig. 4.16) Schematic of the 2D fitter. . . 117 4.18 LUT function for tan−1𝑥 . . . 118 4.19 Composite LUT for tan−1𝑥. . . 119 4.20 Composite LUT made of 2 single LUTs . . . 119 4.21 Numerical error of tan−1𝑥 with single and composite LUT . . . 120 4.22 The LUT function for cos 𝑥 . . . 121 4.23 The LUT function for tan−1𝑥 . . . 121 4.24 Numerical error of the azimuthal angle to the Hough circle center 122 4.25 Numerical error of the transverse momentum . . . 123 4.26 RTL schematic of the logic in Fig. 4.8. . . 126 4.27 Technology schematic diagram of the logic in Fig. 4.8. . . 127 5.1 Track finding efficiency measured on single track events . . . 131 5.2 Track parameter resolution for two-track events . . . 132 5.3 Waveform of HDL simulation with SL-shifted hits as input . . . 133 5.4 Waveform of HDL simulation with TS-shifted hits as input . . . 133 5.5 TS acceptance of the 2D tracker with partial hit map input . . . 135 5.6 A single-track cosmic ray event . . . 136 5.7 Another single-track cosmic ray event . . . 137
.. 5.8 A multi-track cosmic ray event from a single source . . . 138 5.9 Waveform diagram of the event in Figure 5.8. . . 139 5.10 Another multi-track cosmic ray event from a single source . . . 139 5.11 Yet another multi-track cosmic ray event from a single source . . . 140 5.12 Track finding efficiency depending on 𝑑0in GCR1 . . . 142 5.13 Efficiency of the 2D Tracker in GCR2 . . . 145 5.14 Efficiency of the axial Track Segment Finders in run 1103 . . . 146 5.15 A track with small slope . . . 147 5.16 The poorly reconstructed track by the offline tracking software . . . 148 5.17 The track with incorrect low transverse momentum in the offline
reconstruction . . . 148 5.18 Efficiency of the 2D Tracker with |𝑧0| < 40 cm in GCR2 . . . 149 5.19 Number of matched 2D trigger tracks per reconstructed track . . . 149 5.20 𝜙0 resolution in GCR2 . . . 150 5.21 𝑝tresolution of the 2D tracker in GCR2 . . . 151 5.22 Crosstalk effect in the CDC . . . 153 5.23 Instances of fake track segment hits . . . 154
6.1 GTH Transceiver Reset Following the Assertion of GTHRESET when in Full Line Rate Mode . . . 158 6.2 Init sequence in Fig. 6.1. . . 159 6.3 The die view of the Virtex-6 FPGA . . . 160 6.4 States of the v4 (new) reset_logic.vhd. . . 161 6.5 States of the v3 (original) reset_logic.vhd . . . 162 7.1 Belle II at the start of data taking . . . 166
A.1 Definition of the track parameters . . . 168 B.1 𝜙0and 𝑝tdistribution of the 2D tracker in GCR2 fitted with double-
Gaussian. . . 172
.. D.1 Sensitivity of the acoustic peaks in the temperature spectrum to the baryon density Ω𝑏ℎ2 . . . 179 D.2 Planck 2015 temperature power spectrum . . . 180 D.3 The primordial abundances of4He, D,3He and7Li as predicted by
the standard model of Big-Bang nucleosynthesis . . . 183
..
..
List of Tables
2.1 Total cross section and trigger rates at ℒ = 8 × 1035cm−2s−1 . . . . 56 2.2 Requirements of the L1 trigger system . . . 61 2.3 Main parameters of the Belle and Belle II Central Dirft Chamber . . 64 4.1 Virtex-6 FPGA Feature Summary . . . 89 4.2 TS acceptance range for each 2D module (UT3) . . . 94 4.3 Content of the 2D tracker output to the 3D tracker and the Neuro-
Trigger . . . 96 4.4 Block RAM usage of the 2D fitter . . . 122 4.5 Summary of Non-default Compiling Options . . . 128 5.1 Summary of the 2D tracker performance in the fast simulation . . . 132 5.2 Conditions of data applied for the 2D tracker performance study . 140 5.3 CDC offline tracking resolution in GCR1 . . . 141 5.4 Track parameters of the 2D tracker measured in GCR2 . . . 151 6.1 Footprint comparison of the two reset modules . . . 162 C.1 Comparison between the old and the new 2D Tracker . . . 173
..
..
Chapter 1
Introduction
In another moment Alice was through the glass, and had jumped lightly down into the Looking-glass room.
Lewis Carroll, Through the Looking-Glass
Despite the advance of experimental high energy physics and cosmology all these years, several fundamental questions still perplex even the brightest minds among physicists:
• Why is there more matter than antimatter in the observed Universe?
• What is the underlying mechanism that produces the quark lepton mass hi- erarchy?
• What is the presumed dark matter that hold stars in galaxies together?
Many exotic theories have tried to solve the aforementioned questions, but none of them are supported by empirical observation so far. In contrast, the Stan- dard Model of particle physics cannot addresses these mysteries, but has success- fully described almost every phenomenon in the laboratory. The Belle II experi- ment attempts to break this tie and shed light on these questions.
Several hints of discrepancy of the Standard Model detected in the preced- ing Belle experiment, and also in BaBar and LHCb, will be carefully examined
.. with increased luminosity. These include, but are not limited to, the ratios of the branching fractions 𝑅D∗ = ℬ(B → D∗τ−ντ)/ℬ(B → D∗ℓ𝜈ℓ) and 𝑅D = ℬ(B → Dτ−ντ)/ℬ(B → Dℓ𝜈ℓ) [1], the ratios of the branching fractions 𝑅K = ℬ(B+ → K+μ+μ−)/ℬ(B+ → K+e+e−) [2] and 𝑅K∗ = ℬ(B0 → K∗0μ+μ−)/ℬ(B0 → K∗0e+e−) [3], and the 𝑃5′angular observable [4] of the decay B0 → K∗0μ+μ−.
If any of these hints turns out to be an anomaly of the Standard Model, it will certainly knock on the gate of new physics, hopefully bridging the knowledge gap from the particle world to the cosmos.
1.1 A matter-antimatter asymmetric universe
The existence of antimatter arises naturally when special relativity and quan- tum mechanics are combined [5]. Since the first discovery of positron (the an- tiparticle of electron) [6], generations of experiments at ever more powerful ac- celerators have established that all particles are created and destroyed together with their antiparticles. As the laws of physics treats matter and antimatter al- most equally, it becomes peculiar that everything tangible to us is only made of matter—from the Earth, the moon, all the way to the planets and asteroids in the solar system1. The puzzle has led to an endeavor lasting over half a century to seek the direct evidence of antimatter fluxes in cosmic rays. In this section, we mainly follow Ref. [7] (but with updated data) to review the observational evidence of a matter-antimatter asymmetric universe.
1.1.1 Current situation
Although positrons and antiprotons have been observed in cosmic rays outside the atmosphere, they can be easily produced as secondary particles following the collisions of energetic cosmic-ray particles with nuclei in the interstellar gas [8]. If the e+and p fluxes are higher than expected, they are more likely to be produced
1If the planets and asteroids were composed of antimatter, the spaceships would have disap- peared upon landing.
.. in some matter reactions rather than indicating the existence of cosmological anti- matter. On the other hand, the expected ratio of secondary antiheliums2(3He and
4He) produced in cosmic ray interactions to the number of heliums is no more than 10−9-10−12 [11, 9, 10], so an observation of antihelium in the cosmic rays would provide unmistakable evidence to the existence of primordial antimatter3.
Recent measurements with balloon flights (The BESS collaboration [13]) and satellites (The precursor flight of the Alpha Magnetic Spectrometer, AMS-01 [14]
and the PAMELA collaboration [15]) found no evidence of antihelium in cosmic rays, and concluded that the flux ratio of antihelium to helium is less than 10−6 over a wide range of energy. The sensitivity is expected to reach 10−9 following the release of AMS-02 data [16] (also shown in Fig. 1.1b).
(a) Upper limits at 95% confidence level for PAMELA, AMS-01, BESS and earlier results [15]. The 𝑥–axis is the rigidity 𝑅 =
𝑝𝑐
𝑍𝑒. For example, the rigidity of a proton with a momentum of 1 GeV is 1 V.
(b) Calculated flux Ratios [17]. The two up- per curves correspond to the case of the maximal possible mass of antimatter glob- ular cluster 𝑀max = 105𝑀⊙, and the two lower curves to the case of the minimal pos- sible mass of such cluster 𝑀min = 103𝑀⊙. The real line is the expected sensitivity of AMS-02 [18].
Figure 1.1: He/He flux ratio
2The annihilation of dark matter might also produce excess of3He over3He [9,10].
3The presence of heavier antinuclei with atomic number 𝑍 < −2 indicates their ultimate source must be stellar objects (stars, supernovae, pulsars, etc.), since they could not be synthesized in the big bang, nor could they be produced in the collision of high-energy proton with the interstellar gas [12,7].
.. Lacking further direct evidence of antimatter, people turn to the observation of radiation from distant objects. Since the antiparticle of photon is just itself, a stellar object made of matter or antimatter produces identical signal. However, interacting matter and antimatter produces annihilation signals. By constraining these products (and thus the annihilation rate), upper limits to the amount of antimatter can be obtained.
The primary products of a nucleon-antinucleon annihilation are charged and neutral pions. A typical decay scheme is [7]
𝑁 + 𝑁 →
⎧{
⎨{
⎩
π0→ γ + γ π±→ μ±+ νµ(νµ)
↰ e±+ νe(νe) + νµ(νµ)
The γ-ray from π0decay provides the most prominent signature of annihilation.
The γ-ray energy spectrum depends on the decay topology and ranges from 50- 600 MeV [19, 20], and there are typically 3-4 γ with average energy of 200 MeV.
Thus, the annihilation rate per unit volume corresponds to a γ-ray emissivity 𝑆γ = 𝑔γ𝑆, where 𝑔γ = 3-4. The observation of the cosmic diffuse gamma-ray background (CDG) implies [7]
𝑆 ≲ 10−32cm−3s−1.
On the other hand, 𝑆 is related to the mean squared intergalactic gas density ⟨𝑛2⟩, the antimatter fraction 𝑓, the annihilation cross section 𝜎, and the gas velocity 𝑣 by
𝑆 = 𝑓 ⟨𝑛2⟩ 𝜎𝑣.
The number density of hydrogen atom 𝑛H can be obtained independently from the measurement of 21-cm spectrum, while (𝜎𝑣) depends on the temperature of the interstellar gas. In Ref. [7], the lifetime of an antiparticle in the interstellar gas
.. 𝑡a = (𝑛H𝜎𝑣)−1 is estimated to be 300 yr-30 Myr, corresponding to upper limits of 𝑓 ∼ 10−10-10−15.
If antimatter exists in some region of the Universe, what scale of those region describes the observational data? Due to the short lifetime (~300 yr), they cannot coexist with matter in the Galaxy before the gravitational collapse leads to the formation of stars. If antimatter objects somehow condensed prior to annihilation, they still collide with interstellar medium as the stars rotate along the center of the Galaxy. Using the accretion cross section and the observed total γ-ray luminosity of the Galaxy at ℒ ≃ 2 × 1042s−1[21], Ref. [7] concludes that there must be fewer than 107antistars in the Galaxy (or 𝑓 ≲ 10−4). On the scale of clusters of galaxies, the two-body collisions of baryons in the intracluster gas, responsible for creating the x-rays via thermal bremsstrahlung emission, would ensure the production of annihilation γ-ray proportional to the x-ray flux. Assuming that cosmic rays exist in other galaxies with characteristics (mean path length, etc.) similar to those in our own Galaxy [12], the ratio of the x-ray flux 𝐹Xto the γ-ray flux 𝐹γprovides an upper bound to 𝑓 [22]
𝑓 ≤ 2.6 × 10−18𝑇𝐹γ 𝐹X.
Using data from 55 x-ray emitting clusters of galaxies [23], together with the EGRET upper bound to the γ-ray flux [24], it was found that 𝑓 < 10−6in these sam- ples. In addition, the analysis of the Bullet Cluster (colliding clusters) gives an up- per limit of 𝑓Bullet < 3×10−6[22], implying that these cluster are entirely composed of either matter or antimatter. Therefore, if there exist antimatter-dominated re- gions, they must be separated from matter-dominated regions on scales greater than the scales of clusters of galaxies (~Mpc4) or the Bullet Cluster (tens of Mpc).
Can these regions be separated with large voids between them, such that no annihilation signals may be observed? According to the Big Bang cosmological
41 parsec(pc) = 1 AU/ tan 1″≈ 3.26 lightyear is the length of the longer leg of a right triangle with a shorter leg of 1 AU and a smaller angle of 1 arcsecond. This definition is related to one of the earliest methods to measure the distance from Earth to a star, which records the difference in angle between two measurements of the same star separated by 6 months.
.. model, the cosmic microwave background (CMB) is caused by photons decou- pled from matters (last scattering) at the same time as electrons and protons in the cooled down plasma combined into neutral atoms (recombination). The thickness (half width) of the last scattering surface implies that decoupling took place dur- ing a finite period of ≈ 105years, which would dilute anisotropy at scales smaller than 15 Mpc. This corresponds to the smallest resolvable structure in the CMB.
[25] points out that the observed uniformity of the CMB (to parts of 10−5) requires that such voids of matter must be smaller than 15 Mpc. Following this line of thoughts, [25] calculated the relic CDG flux produce by the inevitable encounter of matter and antimatter at the boundaries of these patches. The resultant scale of antimatter-dominated regions in accordance with CDG data [26] is larger than 103Mpc (comparable to the observable universe), and thus a matter-antimatter symmetric universe is ruled out.
To sum up, many observational data suggest the lack of antimatter in the ob- servable Universe, while no evidence of antimatter have been found.
1.1.2 Degree of the asymmetry
The baryon–antibaryon asymmetry of the universe can be described by the ratio of baryon number density 𝑛𝐵 to the average photon number density 𝑛γ (or the total entropy density 𝑠5) as
𝜂 = 𝑛𝐵
𝑛γ = 𝑛𝐵− 𝑛𝐵̄
𝑛γ . (1.1)
Its value is related to the baryon mass density parameter Ω𝐵by6 [28]
𝜂 = 2.74 × 10−8Ω𝐵ℎ2,
5As the baryons and antibaryons annihilate, 𝑛γwill evolve, but the entropy 𝑠 remains a con- stant. Thus, the baryon asymmetry is also expressed in terms of 𝜂/𝑠.
6The baryon number density is 𝑛𝐵=𝑚𝜌𝐵𝐵, where 𝜌𝐵is the baryon mass-energy density, and 𝑚𝐵the average mass per baryon. The matter density is often expressed as the ratio to the critical density 𝜌crit = 8𝜋𝐺3𝐻2 by Ω𝐵 = 𝜌𝐵/𝜌crit. The Universe after big bang nucleosynthesis contains roughly 75% of protons and 25% of heliums (see Eq. (D.11)), yielding an average baryon mass of 𝑚𝐵≈ 0.938 MeV.
.. where the Hubble parameter 𝐻0 = 100ℎ km s−1Mpc−1stands for the present rate of expansion.
According to the standard cosmological model (also ΛCDM, or the concor- dance cosmology) [29], 𝜂 can be determined from either the acoustic peaks in the angular power spectrum of CMB or the abundance of light elements after the Big Bang nucleosynthesis (BBN). Appendix Dgives an introduction to these two methods.
The ΛCDM best fit of the Planck temperature power spectrum combined with low-ℓ likelihood in temperature and polarization data (Fig. D.2) [30] determined
𝜂 = (6.09 ± 0.06) × 10−10.
On the other hand, BBN constrains 𝜂 to [28,31]
5.8 ≤ 𝜂 × 1010 ≤ 6.6 (95% CL).
1.1.3 Against a symmetric Universe
In the early stages (𝑡 ⪅ 10−5s) of the Universe, the high temperature and high density hold nuclei, antinuclei and photons in equilibrium [7]
𝑁 + 𝑁 ⇌ γ + γ, (1.3)
The cosmic photons are mainly contributed by the CMB [27], which agrees well with the black body radiation. Therefore, the photon number density is given by the Bose-Einstein statistics
𝑛γ ≈ 20.3𝑇03= 413 cm−3, (1.2)
where 𝑇0= 2.73 K is the present photon temperature. The present baryon to photon ratio is thus
𝜂 = 𝑛𝐵
𝑛γ = Ω𝐵𝜌crit/𝑚𝐵
20.3𝑇03 = 2.74 × 10−8Ω𝐵ℎ2 .
.. so that the baryon to photon ratio 𝜂 in Eq. (1.1) is directly related to the baryon- antibaryon asymmetry in the early Universe [32]
𝜂 = 𝑛𝐵− 𝑛𝐵 𝑛γ ∣
𝑇 =3 K
≈ 𝑛𝐵− 𝑛𝐵 𝑛𝐵+ 𝑛𝐵∣
𝑇 ⪆1 GeV
.
Can the observed 𝜂 ≈ 6 × 10−10arises from a Universe started out with equal number of matter and antimatter (Δ𝐵 = 0)? As long as the equilibrium in Eq. (1.3) can be maintained, the ratio of (anti)nuclei to photons is given by Eq. (D.7) and Eq. (1.2) [7]
𝑛𝑁
𝑛γ ≈ 2 (𝑚
𝑇 )3/2exp (−𝑚 𝑇 ) .
Similar to the BBN. the nucleon number density decreases as the Universe cools down, until the number density is so small that 𝑁 –𝑁 annihilation effec- tively ceased. After such a critical time 𝑡c, the baryon number density freezes out and remains constant in a comoving volume, turning into the η as observed to- day7. The critical time and baryon to photon ratio can be estimated reasonably well by equating the age of the Universe to the annihilation lifetime [7]
𝑡c≈ 0.002 s, 𝑇c≈ 20 MeV, 𝜂𝑐 ≈ 2 × 10−18 (1.4)
A more careful treatment regarding some nuclei going out of equilibrium un- der expansion gives 𝑛𝑁/𝑛γ ≈ 4.6 × 10−19 [33]. Either way, such a simple model fails miserably to explain 𝜂 by 9-10 orders of magnitude. Therefore, 𝜂 is often taken as the relic abundance of baryons over antibaryons. Namely, there are ~109times more baryons to antibaryons before the freeze-out. Almost all the antibaryons annihilated with baryons, and the remaining baryons form the structures we see today8.
How does the baryon number asymmetry arise? It can be an initial condition
7While additional photons can be created when e±pairs annihilate, η can only become smaller (𝜂0< 𝜂c).
8Even if η can really arise from a symmetric Universe, baryons and antibaryons still need to be separated before they annihilate down to the concentration in Eq. (1.4). This indicates that symmetry must be broken at some scale.
.. at the big bang, but this is a widely unfavored assumption for aesthetic reasons.
Furthermore, any baryon to photon ratio preserved in fermions will be diluted by
~60 e-folds when the Universe undergoes inflation and is reheated afterwards [34]
9. It would be difficult to explain the observed flatness and homogeneity without inflation. In light of this, theories of baryogenesis investigate ways to dynamically generate the baryon asymmetry in the presence of inflation. They are all based on 3 sufficient conditions to generate baryon asymmetry in a Δ𝐵 = 0 Universe, discovered by Sakharov [35].
1. The baryon number is not conserved
2. The 𝒞 and 𝒞𝒫 symmetries are violated
3. Departure from thermal equilibrium
The first condition is self-evident. If the second condition does not hold, then any process that generates more matter than antimatter will be balanced by a sym- metric process that generate antimatter at a equal rate. Intuitively, since the mass of a particle and its antiparticle is equal under 𝒞𝒫𝒯 asymmetry, the thermal equi- librium of 𝐵, which only depends on its mass as the chemical potential is 0 by the first condition, must be equal to ̄𝐵 [36]. Thus, the third condition must hold after an excess of baryon is generated by an 𝐵-violating process; otherwise, the asym- metry will be washed out.
The Standard Model of particle physics satisfies (at least qualitatively) all three conditions [37]. There are the sphaleron process for baryon number violation, the Kobayashi-Maskawa mechanism for 𝒞𝒫 violation, and a spontaneous electroweak symmetry breaking. Sec. 1.2 introduces the mechanism of 𝒞𝒫 violation in Stan- dard Model.
9It is pointed out that asymmetry preserved in a bosonic field can in principle survive inflation, but it requires super-Plankian field values and significant tuning to prevent the asymmetry from being washed out [34].
..
1.2 𝒞𝒫 violation in the Standard Model
If, in some cataclysm, all of scientific knowledge were to be destroyed, and only one sentence passed on to the next generations of creatures, what statement would contain the most information in the fewest words? I believe it is the atomic hypothesis (or the atomic fact, or whatever you wish to call it) that all things are made of atoms —little particles that move around in perpetual motion, attracting each other when they are a little distance apart, but repelling upon being squeezed into one another. In that one sentence, you will see, there is an enormous amount of information about the world, if just a little imagination and thinking are applied.
Richard P. Feynman [38, section 1-2]
So our problem is to explain where symmetry comes from. Why is nature so nearly symmetrical? No one has any idea why. The only thing we might suggest is something like this: There is a gate in Japan, a gate in Neiko10, which is sometimes called by the Japanese the most beautiful gate in all Japan; it was built in a time when there was great influence from Chinese art.
This gate is very elaborate, with lots of gables and beautiful carving and lots of columns and dragon heads and princes carved into the pillars, and so on.
But when one looks closely he sees that in the elaborate and complex design along one of the pillars, one of the small design elements is carved upside down; otherwise the thing is completely symmetrical. If one asks why this is, the story is that it was carved upside down so that the gods will not be jealous of the perfection of man. So they purposely put an error in there, so that the gods would not be jealous and get angry with human beings.
We might like to turn the idea around and think that the true explanation of the near symmetry of nature is this: that God made the laws only nearly symmetrical so that we should not be jealous of His perfection!
Richard P. Feynman [38, section 52-9]
10Probably Nikko (日光) in Japan’s Tochigi Prefecture.
.. The Standard Model11 is the most successful model of particle physics that explains and predicts phenomena at energies below 1 TeV, almost free of anoma- lies. It provides an effective mathematical description of the elementary particles which make up atoms, and of the interaction between these particles. Atoms are bound states of positively charged nucleus and negatively charged electrons or- biting around them. The electromagnetic force accounts for the attraction and repulsion between electric charges. The atomic nucleus consists of protons and neutrons, which are the bond states of up-quarks and down-quarks by strong force. At energy below about 1 GeV, the strong interaction grow stronger as the distance increases such that all the quarks end up being “confined” in mesons (the bound state of a quark and an anti-quark) or baryons (the bound state of 3 quarks). The weak force are involved in the decay of isotopes and nuclear reaction, often producing neutrinos along the way. Both electrons and neutrinos fall into the category of leptons, which don’t feel strong force. Carrying no electric charge, neutrinos are also not affected by electromagnetic force. Gravitational force is ne- glected since it is negligible at the scale of particle physics, besides it is difficult to be quantized. Most common matters and nuclear reactions only involve electrons, electron neutrino, up-quarks, and down-quarks. Together, they are known as the first generation of elementary particles.
High-energy colliders and cosmic rays revealed that there are 2 copies of each quark or lepton in the first generation. Except for the difference in mass, they are identical in every other way. Each generation has a mass about 1 to 2 orders of magnitude larger than the previous generation. Whether there is a fundamental reason behind this mass hierarchy is not well understood.
Each kind of interaction between elementary particles is governed by a quan- tum field theory. The theory that describes the strong, weak and electromagnetic
force is the Yang-Mills gauge theory of (spontaneously broken) SU(3)C⊗ SU(2)L⊗ U(1)Y
11This brief introduction is tended heavily toward flavor physics, and only the bare minimum to explain the 𝒞𝒫 violation is given here. For a complete overview of the Standard Model, see, for instance, Ref. [39,40]. See also Ref. [41, Appendix B]
.. symmetry12. The SU(3)C part corresponds to the strong, or color, interaction, and is known as quantum chromodynamics (QCD). The SU(2)L⊗ U(1)Ypart de- scribes the electroweak interaction, and it is spontaneously broken to U(1)EMbe- low its critical energy. U(1)EMdescribes the electromagnetic force, and is know as the quantum electrodynamics (QED). When there is a local gauge transformation invariance, it requires a new gauge field, which leads to an interaction force, medi- ated by a spin-1 gauge boson (a quantum of the gauge field). The force-mediating gauge boson of the SU(3)C strong interaction is called gluon. For SU(2)L, it is the W±boson. Photons are the gauge bosons of U(1)EM, and it is a linear combina- tion of the original gauge bosons of SU(2)L and U(1)Y. Another such mixture is Z0, the neutral gauge boson of the weak interaction. Finally, the Higgs boson cor- responds to a scalar field that is responsible for the aforementioned spontaneous symmetry breaking, which give mass to all the fermions, the W±and the Z0.
Just like scalars, vectors and tensors in 3-dimensional space form different representations under the SO(3) ordinary spatial rotational group, fermions are grouped into multiplets (representations) of the gauge group. Scalars don’t trans- form under rotation; likewise, particles without color charges (the leptons) form color singlets and don’t feel the color (strong) force. Vectors exchange compo- nents under rotation; similarly, particles with color charges (the quarks) form color triplets and change to particles with different colors under the color gauge transformation. However, other quantum numbers like the flavors or the weak hy- percharges are not affected by the strong interaction. Particles with certain charges are coupled to each other by the corresponding interaction, and the strength is de- termined by the coupling constant of that interaction.
Any Dirac fermion (spinor) may be expressed as the combination of a right- handed and a left-handed component
𝜓 = 1 + 𝛾5
2 𝜓 + 1 − 𝛾5
2 𝜓 ≡ PR𝜓 + PL𝜓 ≡ 𝜓R+ 𝜓L,
12Here C refers to color, L to left, and Y to (weak) hypercharge.
.. where the matrix 𝛾5 = 𝑖𝛾0𝛾1𝛾2𝛾3is the chirality operator and 𝛾𝜇are the γ-matrices.
Since the weak interaction is determined from experiment to be of the form of 𝑉 (vector) − 𝐴(axialvector)
𝑗𝜇 ∝ ̄𝑢1(𝛾𝜇− 𝛾𝜇𝛾5)𝑢2 = 2 ̄𝑢1𝛾𝜇1 − 𝛾5
2 𝑢2 = 2 ̄𝑢1𝛾𝜇PL𝑢2,
and that QED conserves chirality, only 𝑢L doesn’t vanish in the matrix element.
Thus, only the left-handed chiral components of particles participate in charged current weak interactions. Similarly, only the right-handed chiral components of antiparticles participate in charged current weak interactions. In other words, while the right-handed particles form singlets under SU(2)L
uR, d
R, c
R, s
R, t
R, b
R and e
R, μ
R, τ
R, (νe)R, (νµ)R, (ντ)R,
the left-handed particles form doublets (hence the subscript L in SU(2)L)
⎛⎜
⎜
⎝ uL
dL
⎞⎟
⎟
⎠
⎛⎜
⎜
⎝ cL
sL
⎞⎟
⎟
⎠
⎛⎜
⎜
⎝ tL
bL
⎞⎟
⎟
⎠
and ⎛⎜⎜
⎝ eL
(νe)L
⎞⎟
⎟
⎠
⎛⎜
⎜
⎝ νL
(νµ)L
⎞⎟
⎟
⎠
⎛⎜
⎜
⎝ τL
(ντ)L
⎞⎟
⎟
⎠ .
1.2.1 Quark flavor mixing
Mixing between different generations arises from the explicit breaking of cus- todial SU(2) symmetry through the Yukawa couplings of the quarks. To illustrate, the Standard Model Lagrangian can be divided into 3 parts, ℒSM = ℒkinetic + ℒHiggs+ ℒYukawa, and the quark Yukawa interaction is given by
−ℒquarksYukawa= 𝑌𝑖𝑗d𝑄IL𝑖𝜑𝐷IR𝑗+ 𝑌𝑖𝑗u𝑄IL𝑖𝜀𝜑∗𝑈R𝑗I + h.c.,
where 𝑖, 𝑗 = 1, 2, 3 are generation labels, 𝑌uand 𝑌dare 3×3 complex matrices, φ is the Higgs field, and ε is the rank-2 antisymmetric tensor. 𝑄ILare left-handed quark doublets, and 𝐷IR(𝑈RI) are right-handed down(up)-type quark singlets, all in the weak eigenstates. When φ acquires a vacuum expectation value, 𝜑 = (0, 𝑣/√
2),
.. the Yukawa interactions give rise to quark mass terms
−ℒ𝑞M = (𝑀𝑑)𝑖𝑗𝐷IL𝑖𝐷IR𝑗+ (𝑀𝑢)𝑖𝑗𝑈L𝑖I 𝑈R𝑗I + h.c.,
with the 3 × 3 mass matrices
𝑀𝑑 = 𝑣
√2𝑌𝑑, 𝑀𝑢 = 𝑣
√2𝑌𝑢
and 𝑈L𝑖I , 𝐷IL𝑖being parts of the same SU(2)Ldoublet, 𝑄IL𝑖. One can use unitary ma- trices 𝑉L𝑢(𝑑)and 𝑉R𝑢(𝑑)to change the mass matrices from the basis of flavor eigen- states to that of mass eigenstates
𝑉L𝑢(𝑑)𝑀𝑢(𝑑)𝑉L𝑢(𝑑)† = diag (𝑚u(d), 𝑚c(s), 𝑚t(b)) ,
where the mass 𝑚𝑞 are real. Then, the doublet in the interaction basis (with su- perscript I) are expressed in terms of the mass basis (no superscript) as
𝑄IL=⎛⎜⎜
⎝ 𝑈L𝑖I 𝐷L𝑖I
⎞⎟
⎟
⎠
= (𝑉L𝑢†)𝑖𝑗⎛⎜⎜
⎝
𝑈L𝑗 (𝑉L𝑢𝑉L𝑑†)𝑗𝑘𝐷L𝑘
⎞⎟
⎟
⎠ .
By convention, (𝑈L𝑢†)𝑖𝑗 is pulled out, so that the transformation only acts on the down-type quarks. Hence, the charged-current weak interaction in ℒkineticis mod- ified by the product of the diagonalizing matrices, or the Cabbibo-Kobayashi- Maskawa (CKM) matrix [42]
𝑉 = 𝑉L𝑢𝑉L𝑑† =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
𝑉ud 𝑉us 𝑉ub 𝑉cd 𝑉cs 𝑉cb 𝑉td 𝑉ts 𝑉tb
⎞⎟
⎟⎟
⎟⎟
⎟
⎠ .
It is the misalignment between these two bases that leads to the quark mixing.
Being the product of unitary matrices, the CKM matrix is itself unitary (𝑉 𝑉†= 𝐼). Out of the free parameters of 3 real numbers and 6 complex phases, 5 phases
.. can be rotated away without making any observable effect13, leaving 3 real Eu- ler angles 𝜃12, 𝜃13, 𝜃23 and 1 irreducible complex phase δ. This corresponds to 3 rotations in real, 3-dimensional space [45]
𝑈12 =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
𝑐12 𝑠12 0
−𝑠12 𝑐12 0
0 0 1
⎞⎟
⎟⎟
⎟⎟
⎟
⎠
, 𝑈13 =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
𝑐13 0 𝑠13
0 1 0
−𝑠13 0 𝑐13
⎞⎟
⎟⎟
⎟⎟
⎟
⎠
, 𝑈23 =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
1 0 0
0 𝑐23 𝑠23 0 −𝑠23 𝑐23
⎞⎟
⎟⎟
⎟⎟
⎟
⎠ ,
and another unitary matrix with the 𝒞𝒫-violating phase
𝑈𝛿=
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
1 0 0 0 1 0 0 0 𝑒𝑖𝛿
⎞⎟
⎟⎟
⎟⎟
⎟
⎠ .
Here, 𝑐𝑖𝑗 = cos 𝜃𝑖𝑗and 𝑠𝑖𝑗= sin 𝜃𝑖𝑗are the cosines and sines of the rotation angles.
This complex phase is the only source of 𝒞𝒫 violation in the Standard Model (ne- glecting the θ-term of the strong interaction). The canonical way to parametrize the CKM matrix is [46]
𝑉CKM = 𝑈23𝑈𝛿†𝑈13𝑈𝛿𝑈12
=
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
𝑐12𝑐23 𝑠12𝑐13 𝑠13𝑒−𝑖𝛿
−𝑠12𝑐13− 𝑐12𝑠23𝑠13𝑒𝑖𝛿 𝑐12𝑐23− 𝑠12𝑠23𝑠13𝑒𝑖𝛿 𝑠23𝑐13 𝑠12𝑠23− 𝑐12𝑐23𝑠13𝑒𝑖𝛿 −𝑐12𝑠23− 𝑠12𝑐23𝑠13𝑒𝑖𝛿 𝑐23𝑐13
⎞⎟
⎟⎟
⎟⎟
⎟
⎠ .
These 4 parameters are not predicted by the Standard Model, and thus have to be
13We are free to transform the quark fields as 𝑑𝑗→ 𝑒𝑖𝜑𝑑𝑗𝑑𝑗, 𝑢𝑗 → 𝑒𝑖𝜑𝑢𝑗𝑢𝑗. This has no observ- able effect (up to redefining the Yukawa coupling constants), except that the CKM matrix elements are now
𝑉𝑗𝑘𝑒𝑖(𝜑𝑗𝑑−𝜑𝑘𝑢). (1.5)
There are 5 independent phase differences in these expressions. Thus, up to 5 complex phases in the CKM matrix elements 𝑉𝑗𝑘can be eliminated by choosing the appropriate phases 𝜑𝑑𝑗 and 𝜑𝑢𝑘
[43,44].
.. determined by various experimental measurements. A recent fitting result is [31]
𝑉CKM =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
0.97434+0.00011−0.00012 0.22506 ± 0.00050 0.00357 ± 0.00015 0.22492 ± 0.00050 0.97351 ± 0.00013 0.00411 ± 0.0013
0.00875+0.00032−0.00033 0.0403 ± 0.0013 0.99915 ± 0.00005
⎞⎟
⎟⎟
⎟⎟
⎟
⎠ .
From these numbers, it follows that 1 ≫ 𝜃12 ≫ 𝜃23 ≫ 𝜃13. That is, the quarks are only slightly mixed. This leads to an alternative parametrization using the ex- pansion of a small parameter 𝜆 = |𝑉us| ≈ 0.23 [47], so that the hierarchy becomes more visible
𝑉CKM =
⎛⎜
⎜⎜
⎜⎜
⎜
⎝
1 − 𝜆2/2 𝜆 𝜆3Α[𝜌 − 𝑖𝜂(1 − 𝜆2/2)]
−𝜆 1 − 𝜆2/2 − 𝑖𝜂Α2𝜆4 𝜆2Α(1 + 𝑖𝜂𝜆2)
𝜆3Α(1 − 𝜌 − 𝑖𝜂) −𝜆2𝐴 1
⎞⎟
⎟⎟
⎟⎟
⎟
⎠
+ 𝒪(𝜆4) + 𝑖𝒪(𝜆5),
where all the rest parameters 𝐴, 𝜌 and η are of order 1. All the calculation in section 1.3adopts this parametrization, in which case all effects of 𝒞𝒫 violation in the Standard Model is proportional to 𝜂. Of course, the physics prediction would be the same in any other convention.
As physical quantities are independent of phase convention, the magnitude of 𝒞𝒫 violation can be defined as the Jarlskog parameter [48]
ℐ𝑚 (𝑉𝑖𝑗𝑉𝑘𝑙𝑉𝑖𝑙∗𝑉𝑘𝑗∗) = 𝐽
3
∑
𝑚,𝑛=1
𝜀𝑖𝑘𝑚𝜀𝑗𝑙𝑛,
which is invariant under the phase transformation in Eq. (1.5). In terms of the above parametrization,
𝐽 = 𝑐12𝑐23𝑐213𝑠12𝑠23𝑠213sin 𝛿 ≃ 𝜆6𝐴2𝜂 = (3.04+0.21−0.20× 10−5) .
In addition, if the d, s, b quarks were degenerate in mass, we could redefine the states so that each quark only couples to the same generation. Therefore, a basis-