• 沒有找到結果。

阿秒脈衝激發和遠紅外雷射驅動中稀有氣體原子的光子發射譜中的半脈衝週期震盪: 自作用修正的隨時變密度泛函理論計算

N/A
N/A
Protected

Academic year: 2022

Share "阿秒脈衝激發和遠紅外雷射驅動中稀有氣體原子的光子發射譜中的半脈衝週期震盪: 自作用修正的隨時變密度泛函理論計算"

Copied!
66
0
0

加載中.... (立即查看全文)

全文

(1)

國立臺灣大學理學院應用物理所 碩士論文

Graduate Institute of Applied Physics College of Science

National Taiwan University Master Thesis

阿秒脈衝激發和遠紅外雷射驅動中稀有氣體原子的光子發射 譜中的半脈衝週期震盪:

自作用修正的隨時變密度泛涵理論計算

Subcycle Dynamics of Photon Emission Spectra of Rare Gases Atoms Excited by Attosecond Pulses and Driven by Near-Infrared

Laser Fields:

Self-Interaction-Free Time-Dependent Density-Functional-Theory Approach

周繼暉 Chi-Hui Chou

指導教授:朱時宜 博士 Advisor: Shih-I Chu, Ph.D.

中華民國 103 年 7 月

July, 2014

(2)
(3)

誌謝

首先要感謝指導教授朱時宜老師,老師是個有耐心和學術涵養豐富的教授,

感謝他細心的指導,能夠容忍並且修正我犯一些錯誤,讓我能夠提升自己,老師 儘管在強場物理已發展出精確的理論,仍然有熱誠於研究新的且更完整的理論,

那精神值得我們晚輩學習。

我也要感謝 Dr. John Heslar, 我的論文主要延伸至他的工作,感謝他在數值方 法還有 Fortran 上給我很多幫助,他平時幽默風趣。Dr. Ho, Dr. Tong, Dr. Telnov 等 訪問學者也給予我一些幫助。

實驗室的學長和博士後也給我很多幫助,能夠給我在學術和生涯一些建議的 佑航,幫我處理計算主機、latex 及繪圖軟體等問題的聖崙,研究上很認真的 Dr.

Lee,經常邀大家吃飯的曄琳,來自澳門和時常討論時事的焌梃,英文很好和在 CUDA 常和我討論的昶明,和我一起進來實驗室的書豪。還有幫我們處理瑣事的 助理秀翎和曉佩,幫實驗室丟垃圾的阿姨,感謝你們讓我能在碩士二年能夠順利。

除了實驗室外,我還要感謝碩一時一起打電動和討論功課的焌佑,已經結婚 且帶小孩的榮伸;以及室友們,身在美國深造的家銓,將要入職場的浩明,打理 環境的伯棟,熱愛調酒的柏聿,讓我在生活上能夠安穩的做研究。

最後感謝我的家人以及我教會的朋友們,你們在背後支持我,鼓勵我能夠堅 持自己的夢想,並且教導我學術研究以外的事。碩士結束只是個一個階段的開始,

希望我自己能夠完成我的夢想以及面對挑戰。

(4)

中文摘要

本文應用了自作用修正的隨時變密度泛涵理論和非微擾的數值計算方法,計 算阿秒脈衝激發和遠紅外雷射驅動中氫、氦和氖原子的光子發射譜,藉由改變阿 秒脈衝和遠紅外雷射的時間差,我們可以觀察到半雷射周期的震盪以及能階的改 變。這種現象已從吸收光譜實驗上觀察到,我們也計算出激發態的電子機率隨時 間差有著相同的週期震盪,我們從兩個光子吸收觀點來解釋這種現象。

關鍵字:阿秒脈衝、自作用修正時變密度泛涵理論、半雷射週期現象

(5)

Abstract

We present an ab initio method to study the sub-cycle dynamics of hydrogen, helium and neon atoms in near-infrared(NIR) laser fields subject to excitation by a single ex- treme ultraviolet attosecond pulse(SAP). We extended the self-interaction-free time- dependent density functional theory(TD-KLI-SIC) to describe multi-electron system and solve the time-dependent Kohn-Sham equations by time-dependent generalized pseudospectral(TDGPS) method. We calculated the photon emission spectra and population of several excited states as the function of the time delay between the NIR pulse and SAP. The phenomena can be explain by two-photon absorption.

keyword: attosecond, time-dependent density functional theory, subcycle dynamics

(6)

Contents

Contents ii

List of Figures iv

1 Introduction 1

1.1 Attosecond Physics . . . 1

1.2 Real-time observation with attosecond technology . . . 2

1.3 Sub-cycle AC stark shift . . . 3

1.4 Purpose of this work . . . 5

2 Theory and Method 6 2.1 Time-dependent Generalized Pseudospectral Method . . . 6

2.1.1 The eigenvalues problem . . . 6

2.1.2 Time propagation . . . 11

2.2 Time-Dependent Density Functional Theory . . . 15

2.2.1 Density Functional Theory and Kohn-Sham scheme . . . 15

2.2.2 Optimized Effective Potential method and Krieger-Li-Iafrate approximation . . . 17

2.2.3 KLI-SIC method . . . 18

2.2.4 TD-KLI-SIC method . . . 20

2.3 Implement of numerical methods on graphics processing unit . . . 22

2.3.1 GPU architecture . . . 23

(7)

2.3.2 Implementation on GPU . . . 26 2.3.3 Result . . . 27

3 Result and Discussion 28

3.1 Hydrogen . . . 28 3.2 Helium . . . 38 3.3 Neon . . . 44

4 Conclusions and Perspectives 50

Bibliography 51

(8)

List of Figures

1.1 Characteristic length and time scales for structure and dynamics.[1] . 2

1.2 Illustration of pump-probe (time-resolved) spectroscopy.[1] . . . 3

2.1 The basis function in correspond to domain.Reproduced from[2]. . . . 8

2.2 Data distribution of wave function. Nl and Nr is the total number of partial wave and the grid points of wave function. . . 12

2.3 Illustration of matrix-vector multiplication. . . 13

2.4 Illustration of the transformation in Eq. 2.29 . . . 14

2.5 Mask function for one dimensional spatial grid with absorbing bound- ary condition. Reproduced from[3]. . . 15

2.6 The effective potential rVef f with the LSDA and LSDA-KLI-SIC in neon and argon(left to right). . . 19

2.7 GPU architecture. . . 23

2.8 SM architecture. . . 24

2.9 GPU Memory. . . 25

2.10 Runtime of the TDGPS and TD-KLI-SIC on one GPU and 16-cores CPU with different atoms. (nvidia Kepler K20(GPU) and Intel(R) Xeon(R) CPU E5-2690 2.9GHz(16 cores CPU)) . . . 27

3.1 Illustration of SAP and NIR with time delay -5 fs. . . 29

(9)

3.2 Photon emission energy spectrum of the exicted states (2p[3s− 5s]

and 2p[3d− 4d] as a function of the time delay between the NIR pulse and SAP. The yellow color indicates the highest energy emitted. The color bars are represented by the log10S(ω) of the spectral density in Eq. 3.7 . . . 31 3.3 Photon emission energy spectrum of the 1s2p excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 32 3.4 Photon emission energy spectrum of the 1s3p excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 32 3.5 Photon emission energy spectrum of the 1s4p excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 33 3.6 (left) Energy emitted near 1s2p transition (right) Energy emitted near

1s3p transition . . . 34 3.7 Illustration of Autler-Townes effect in hydrogen. When SAP comes

first, the electron can be exicted to np orbitals. The NIR is not weak for excited states, so it can make 2p3s and 2p3d transition happen. . 34 3.8 Population of several excited states as a function of the time delay

between the NIR pulse and SAP. The center frequency of NIR is 800 nm. . . 35 3.9 Illustration of two photon absorption. . . 36 3.10 Population of several excited states as a function of the time delay

between the NIR pulse and SAP.The center frequency of NIR is 800 nm. . . 37 3.11 Photon emission energy spectrum of the exicted states (1s[2p− 5p]

as a function of the time delay between the NIR pulse and SAP. The yellow color indicates the highest energy emitted. The color bars are represented by the log10S(ω) of the spectral density in Eq. 3.7 . . . . 39 3.12 Photon emission energy spectrum of the 1s2p excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 40

(10)

3.13 Photon emission energy spectrum of the 1s3p excited state as a func- tion of the time delay between the NIR pulse and SAP. . . 40 3.14 Photon emission energy spectrum of the 1s4p excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 41 3.15 (left) Energy emitted near 1s2p transition (right) Energy emitted near

1s3p transition . . . 42 3.16 Population of several excited states as a function of the time delay

between the NIR pulse and SAP. . . 42 3.17 Photon emission energy spectrum of the excited states (2p[3s− 5s]

and 2p[3d− 4d] as a function of the time delay between the NIR pulse and SAP. The yellow color indicates the highest energy emitted. The color bars are represented by the log10S(ω) of the spectral density in Eq. 3.7 . . . 45 3.18 Photon emission energy spectrum of the 2p3s excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 46 3.19 Photon emission energy spectrum of the 2p4s excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 46 3.20 Photon emission energy spectrum of the 2p3d excited state as a func-

tion of the time delay between the NIR pulse and SAP. . . 47 3.21 Photon emission energy spectrum of the 2p5s , 2p4d and 2p6s(down

to up) excited states as a function of the time delay between the NIR pulse and SAP. . . 47 3.22 Population of several excited states as a function of the time delay

between the NIR pulse and SAP. . . 49

(11)

Chapter 1 Introduction

In this chapter, we introduce the basic concept about attosecond physics and the purpose of our work.

1.1 Attosecond Physics

Attosecond(as,10−18 second) science and technology has been popular topic in recent year since the first generation of attoscecond pulse in experiments[4, 5]. This tech- nique allow us to study the electronic processes in atoms, molecules and surfaces at the attosecond timescales. There are two main progresses in attosecond science. The first one is that development of attosecond source. We want to generate more intense and shorter attosecond pulse. In our group, we focus on High-Order Harmonic gen- eration which can generate attosecond pulse. The second one is that application of attosecond pulse. There are many experiments on which they use attosecond pulse to discover ultra-fast the dynamic of electron[6, 7, 8, 9, 10, 11, 12].The complete discussion in attosecond physics can be found in [1]

(12)

1.2 Real-time observation with attosecond technol- ogy

In Fig. 1.1, this defines the characteristic time scale for motion of atoms in molecules to hundreds of femtosecond. The motion of individual electrons in semiconductor nanostructures, molecular orbitals, and the inner shells of atoms occurs from ten femtosecond to one attosecond. Motion of nuclei is even faster, on a zeptosecond time scale(10−21 second). The main application of attosecond pules is to observe ultrafast process in attosecond time scales.

Figure 1.1: Characteristic length and time scales for structure and dynamics.[1]

Real-time observation of ultrafast motion requires the ability to trigger and probe the process under scrutiny. Dynamical information is provided by an observable varying as a function of the delay between triggering and probing events in a pump- probe measurement in Fig. 1.2, where pump pulse triggers the process and probe pulse images the process at some delay time. This quantity varies on the time scale at which the motion occurs, affording the observer real-time access to the

(13)

process. If the observable can reveal the information on location of the moving particles, a series of freeze-frame pump-probe images allows retrieval of the ultrafast motion. Real-time observation and control of atomic motion relies on femtosecond laser techniques, using cycle-averaged quantities such as field amplitude and carrier frequency for triggering, probing and controlling dynamics. Attosecond technology allow us to improve the resolution of probing and control by orders of magnitude with sub-fs XUV pulse.

Figure 1.2: Illustration of pump-probe (time-resolved) spectroscopy.[1]

1.3 Sub-cycle AC stark shift

Stark shift is the shift of atomic energy level in a external field, which can be static(DC) or dynamic(AC). With the existing theory[13], when the monochromatic field with wL far from atomic resonance, the AC stark shift is equivalent to the quadratic DC Stark shift. The energy shift is proportional to the cycle average of the laser field’s square:

M Ea =−αaL)

2 L(t)2i = −αaL0(t)2

4 (1.1)

where εL(t) = ε0cos(ωLt) is the laser electric field. The polarizability αa = P

k6=a

ωka|dka|2 ωka2 − ω2L

depends on both the dipole matrix elements dka coupling a with

(14)

other states k and the detuning of the laser frequency wL. For highly excited states, we have

alim→∞αaL) = 1

ωL2 (1.2)

which leads to the well-known upshift of ionization threshold by the ponderomotive energy: Up ε202

L

.

Let|ni be the eigenstates of the time-independent Hamiltonian H0, such that H0|ni = En0|ni(n = 1, 2, . . . ). (1.3) The energy of |ni under the external field ε(t) = ε0(t) cos(ωLt), based on second- order time-dependent perturbation theory, is given by

En(t) = En0 + ε(t)dnn− iX

k6=n

Z t 0

dt0ε(t)ε(t0)e−iωkn(t−t0)|dnk|2 (1.4)

Supposed that laser pulse envelop is ε0(t) = εpe|t|τp with duration τp, the integra- tion in Eq. 1.4 can be simplified for multicycle pulses:

δEn(t) = 1

2ε0(t)2X

k6=n

[ωnk|dnk|2 ωkn2 − ω2L

cos2Lt)− iωL|dnk|2 ωkn2 − ωL2

sin(2ωLt)] = 1

2ε0(t)2ncos2Lt)− iγnsin(2ωLt)].

(1.5)

where γn=P

k6=nωL|dkn|2 ω2kn−ω2L

specifies the subcycle changes in the population of n as it couples to k.

However, the average-cycle shift can be verified by using long-pulse( ns) but lacked time resolution. Recently, pump-probe measurements using probe laser pulses longer than the oscillation period of the Stark field revealed Stark shifts with time resolution 10 fs[14], but only the average-cycle shift can be measured.

To resolve the subcycle ac stark shift, we probe the atoms with single extreme ultraviolet attosecond pulse(SAP) with duration nearly 20 times smaller than the NIR Stark laser period. When the atom absorb an XUV photon, electrons can be moved to one of the excited states or to the continuum states because the wide

(15)

frequencies range of XUV pulse. When the NIR field act on the electron, the effects of the NIR field can be observed through the changes in the photo emission spectra and population of excited states. In most experiments, they use the transient absorption technique to observe the subcycle oscillation[15, 16, 17, 18]. But for our work, we calculate the photo emission spectra and observe the similar features.

1.4 Purpose of this work

We give the main targets of this work.

1.We perform 3D calculation of the rare gas atoms (H,He,Ne) in the NIR field subject to excitation by SAP. We have calculated the photon emission spectra and the population of excited states with respect to time delay between the SAP and NIR fields and then we found the subcycle oscillation.

2.We include that multi-electron correlation effect by using time dependent den- sity functioanl theory (TDDFT). Unlike other theory which only makes single-active- electron(SAE) model[6, 8, 9, 15, 17, 19] or time independent model potential[20].

(16)

Chapter 2

Theory and Method

In this chapter, we introduce the numerical method for solving PDE and the theory for many-electron system. In addition, we also give simple introduction about graphic processing unit(GPU), on which we can accelerate our calculations.

2.1 Time-dependent Generalized Pseudospectral Method

There are many numerical methods for solving PDE, finite difference(FD), finite element(FE), and spectral method[2, 21, 22]. GPS is under the category of spectral method. Spectral method is more accurate and efficient than FD and FE because it needs less grid points and converges faster. Moreover, the grid points of GPS are more denser near the origin, so we can describe Coulomb potential better than equal-spacing grid points.

2.1.1 The eigenvalues problem

The basic idea of spectral method is to approximate function f (x) by order N poly- nomial with orthogonal basis φj(x)

A(x)' XN

j=0

ajφj(x) (2.1)

(17)

We need the coefficient aj for defining the function, so it requires the values of function at N + 1 grid points xj.

aj = XN

i=0

ωiA(xij(xi) (2.2)

The approximation of function can be expressed by

A(x) = XN

j=0

A(xj)gj(x) (2.3)

where gj(x) is the cardinal function given by

gj(x) = XN

i=0

ωiA(xij(x) (2.4)

With the definition of the cardinal function and the collocation points{xi}, we can obtain the differential operator matrix

d

dxA(x)|x=xi ' XN

j=0

A(xj) d

dxgj(x)|x=xj = XN

j=0

(DN)ijA(xj) (2.5) The value of weight and the form of the cardinal function and the differential operator matrix are depend on basis function and grid points. How to choose the basis function and grid points is dependent on the problems, mainly the boundary condition and domain. We list the basis function in Fig 2.1

(18)

Figure 2.1: The basis function in correspond to domain.Reproduced from[2].

In this work, we choose the Legendre polynomials. And then we choose the grid points from different boundary condition[21].

Legendre-Gauss

xj(j = 0, . . . , N )zeros ofPN +1;

ωj = 2

(1− x2j)[PN +10 (xj)]2, j = 0, . . . , N.

Legendre-Gauss-Radau

xj(j = 0, . . . , N )zeros ofPN + PN +1; ω0 = 2

(N + 1)2

ωj = 1− xj

(N + 1)2[PN(xj)]2, j = 1, . . . , N.

Legendre-Gauss-Lobatto

x0 =−1, xN = 1, xj(j = 1, . . . , N − 1)zeros ofPN +10 ;

(19)

ωj = 2

N (N + 1)[PN +1(xj)]2, j = 0, . . . , N.

We use Legendre-Gauss-Lobatto grid points for the radical coordinate and Legendre- Gauss grid points for the angular coordinate.

For atomic or molecule calculation including Coulomb potential, the general prob- lem is the singularity at r = 0 and the long-range potential. We truncates the semi-infinite (0,∞) to the finite interval [rmin, rmax] to avoid the problem, but it needs a lot of grid points for convergence and accuracy and cost much time for time propagation.

The generalized pseudospectral method offers us the solution of above problem.

This method can be used in time-dependent problem. Follow the discussion, we use nonlinear mapping function[23, 24] to map the [−1, 1] to [rmin, rmax]

r(x) = rm 1 + x

1− x + α (2.6)

where rm and α = 2rrm

max is the mapping parameters. Here, we begin with the time- independent Schrödinger equation for Hydrogen atom.

ψ(r, θ, φ) = X

n,l,m

Cnlm

ϕnl(r)

r Ylm(θ, φ) (2.7)

Hˆl0ϕnl(r) = (−1

22+ Vl(r))ϕnl(r) = Enlϕnl(r) (2.8) where

Vnl(r) =−1

r +l(l + 1)

2r2 (2.9)

Applying the nonlinear mapping for Eq. 2.6.

1 2[ 1

r0(x)2 d2

dx2 r00(x) r0(x)3

d

dx]ϕ(r(x)) + V (r)ϕ(r(x)) = Eϕ(r(x)) (2.10) This equation is not symmetrical, so we choose ϕ(r(x)) =p

r0(x)f (x)

1 2

1 r0(x)

d2

dx2f (x) + Vm(x)r0(x)f (x) + V (x)r0(x)f (x) = Er0(x)f (x) (2.11)

(20)

where

Vm(x) = 3(r00)2− 2r000r0

8r04 (2.12)

For special mapping Eq. 2.6, Vm(x) = 0. With a lot of effort, we finally translate the problem into the symmetric eigenvalue problem.

1 2

1 r0(x)

d2 dx2

1

r0(x)A(x) + V (x)A(x) = EA(x) (2.13) where

A(x) = r0(x)f (x) =p

r0(x)ϕ(x) (2.14)

We throw Eq. 2.3 into Eq. 2.13

1 2

1 r0(xi)

X

j

A(xj) r0(xj)

d2gj(x) d2x

xi

+ V (xi)X

j

A(xj)gj(xi) = EX

j

A(xj)gj(xi) (2.15)

where[24]

gj00(xi) = d(2)ij PN(xi)

PN(xj) (2.16)

and

d(2)ij = 2

(xi− xj)2[i6= j, (ij) 6= (0N), (ij) 6= (N0)] (2.17)

d(2)0N = d2N 0= N (N + 1)− 2

4 (2.18)

d(2)jj =−N (N + 1)

3(1− xj)2[j 6= 0, j 6= N] (2.19)

d(2)00 = d2N N = N (N + 1)[N (N + 1)− 2]

24 (2.20)

And the cardinal function has the property

gj(xi) = δij (2.21)

and choose

Ai = s

2

(N + 1)(N + 2)

A(xi)

PN +1(xi) (2.22)

(21)

The whole equation is simplified to matrix form X

j

[(D2)ij + V (xiij]Aj = EAj (2.23) The final wave function on the collection points is

ψ(r(xi)) = Ai rp

ωir0(xi) (2.24)

where the ωi is the weight of the cardinal function. And then we can check the wave function normalized.

< ψ|ψ >=

Z AiAi

r2ωir0(xi)r2dr

=X

i

AiAi

r2ωir0(xi)r2dr dxωi

=X

i

AiAi

(2.25)

Finally, we solve the Schrödinger equation and get the eigenvalues and eigenfunc- tions of the system. Moreover, there are something we should be careful.The real wave function is in Eq. 2.24 although the summation of AiAi is normalized by most eigensolver. It’s convenient to see Ai as the wave function numerically because some variables are eliminated by integral elements in Eq. 2.25.

2.1.2 Time propagation

The next step is to do time propagation by second-order split operator technique[25]

in spherical coordinates:

ϕ(r, t + δt)' exp(−i bH0δt/2) exp(−ibV (r, θ, t + δt)δt) exp(−i bH0δt/2)ϕ(r, t) + O(δt3) (2.26) The first term and third term is from time independent Hamiltonian. The the second term is the time dependent potential, namely the laser field.

(22)

Figure 2.2: Data distribution of wave function. Nl and Nr is the total number of partial wave and the grid points of wave function.

We define the initial wave function for hydrogen.

ψ(ri, l = 0) = A(l = 0)i

ψ(ri, l = 1 . . . Nl− 1) = 0 (2.27)

where the i is the index of radical part of the wave function and l is the partial wave number from angular quantum number. Nl and Nr is the total number of partial wave and the grid points of wave function. In general, we set Nl = 32 and Nr = 256 for the hydrogen atom.

We define the exp(−i ˆH0δt/2) by the eigenvalues and the eigenfunctions of Hamilto- nian.

Sij(l) =X

n

< ri, l|n, l > exp(−iEnlδt/2) < n, l|rj, l > (2.28) where the eigenfunctions are < ri, l|n, l > and eigenvalues are Enl.

(23)

Figure 2.3: Illustration of matrix-vector multiplication.

We perform the operationexp(−iH0δt/2)ψ(r, 0)in Fig. 2.1.2, composed of matrix- vector multiplications for each l. The process can be paralleled by openmp or other parallel methods.

Before we do exp(−iV (r, θ, t)δt), we need to transform the representation(r, l) to (r, θ). The θj is the Gaussion-Legendre points we mention before.

< ri, θj|ψ >=X

l

< ri, θj|ri, l >< ri, l|ψ > (2.29) The Eq. 2.29 can be illustrated in Fig. 2.1.2.

exp(−iV (ri, θj, t)δt)ψ(ri, θj) (2.30) And we transform the representation (r, θ) back to (r, l) before we perform the oper- ation exp(−i ˆH0δt/2) again. To prevent reflection, the wave function are multiplied by mask function after each time step. We partition our finite spatial grid into an

(24)

Figure 2.4: Illustration of the transformation in Eq. 2.29

(25)

inner region, which is large enough to completely contain the finite system of interest , and a border region, where outgoing flux is to be absorbed.

Figure 2.5: Mask function for one dimensional spatial grid with absorbing boundary condition. Reproduced from[3].

Throughout all processes above, we get the next time step wave function and we can calculate the observables.

O(t) =< ψ(r, t)| ˆO|ψ(r, t) > (2.31) O(t) is the observables,like number of electron,electron density, dipole moment in length form and in acceleration form. Finally, we don’t need to save the time depen- dent wave function.

ψ(r, l) = ψ(r, l, δt) (2.32)

where we set δt as the initial wave function and then to get 2δt, . . . , tf inal. The TDGPS method can be applied to many kinds of system.

2.2 Time-Dependent Density Functional Theory

2.2.1 Density Functional Theory and Kohn-Sham scheme

Density functional theory is the most popular method for many-electron system, atoms, molecules, and solid s. In 1964, Hohenberg and Sham develop the basic theorem of density functional theory[26]. We give simple concept about the theorem.

First theorem, we can represent the energy as a functional of the electron density

(26)

for given potential. We don’t need to use wave functionΨ(r1, r2, . . . , rN), which has 3N variables in N-electron system, but use electron densityρ(r), which has only 3 variables in N-electron system. That avoid the main computational difficulty. Second theorem, The energy functional is minimized by the ground state density. We can find the ground state potential by variantional principle. In Hohenberg-Kohn theorem, if we know the exact form of the universe functional, we can find the ground state by minimizing the functional. But we don’t know the exact functional and systematic way to find such functional. In 1965, Kohn and Sham develop systematic way to approximate the functional and find the ground state density.

Kohn and Sham develop another method[27] for the functional, by using non- interacting system as auxiliary system. Therefore, the ground-state wave function is the single slater-determinant.

Ψ = 1

√N !det[ϕ1ϕ2. . . ϕN] (2.33) And the density is

ρ(r) =X

σ Nσ

X

i=1

(r)|2 = ρα(r) + ρβ(r) (2.34)

The energy functional is

E[ρα, ρβ] = Ts[ρ] + J [ρ] + Excα, ρβ] + Z

vext(r)ρ(r)d3r (2.35) Ts[ρ] is the non-interacting kinetic energy functional, the J [ρ] is the Hartree en- ergy functional and the Exc[ρ] is called exchange-correlation energy functional. The exchange energy is from the Pauli-expulsion and the correlation is from the ap- proximation of single slater-determinant. The complexity of system is inside the exchange-correlation functional. And finally we take the density derivative of the energy functional, we get the Schrödinger-like equation

HˆKSϕ(r) = [−1

22+ veff,σ(r)]ϕ(r) = εϕ(r), i = 1, 2, . . . , Nσ,

(2.36)

(27)

where vef f,σ is the effective KS potential and σ is the spin index. The effective potential is

vef f,σ = vext(r) + δJ [ρ]

δρσ(r) + δExcα, ρβ]

δρσ(r) (2.37)

where vxc,σ(r) is the exchange-correlation potential vxc,σ(r) = δExcα, ρβ]

δρσ(r) (2.38)

The KS equations are solved self-consistently. One guesses the initial density at first and then solves the KS equation to get the new density from new orbitals until the convergence.

2.2.2 Optimized Effective Potential method and Krieger-Li- Iafrate approximation

The self-interaction is from the classical Coulomb repulsion. The effect of self- interaction should be cancelled by exchange-correlation functional, but for most of exchange-correlation functionals, the self-interaction correction is not consider. One of the most important error is the incorrect long-range tail of Kohn-Sham potential, which will affect the ionization energy. Therefore, the self-interaction correction is crucial for excited states.

Perdew and Zunger proposed the self-interaction correction(SIC)[28] by giving the approximate exchange-correlation energy functional Excα, ρβ],

ExcSICα, ρβ] = Excα, ρβ]X

σ Nσ

X

i=1

{J[ρ] + Exc, 0]} (2.39) where ρ is the one-electron density of the ith KS spin orbital.

However, the SIC energy functional is explicit orbital-dependence, so for each electron orbital they have different potentials. That cause each orbital to be nonorthogonal and be complicated.

(28)

Another approach is the optimized effective potential method[29, 30] .In this approach, one solves the set of one-electron equations, similar to the KS equations in Eq. 2.36.

HˆOEPϕ(r) = [−1

22+ vOEPσ (r)]ϕ(r) = εϕ(r), i = 1, 2, . . . , Nσ

(2.40)

The optimized effective potential vσOEP(r) is obtained by the orbitals{ϕ} in Eq. 2.40 which minimized the energy functional E[ϕ, ϕ],

δExc[}]

δvOEPσ (r) = 0 (2.41)

Eq. 2.41 can be written as, using chain rule for functional derivative, X

j

Z

d3r0δEOEP, ϕi0β] δϕ(r0)

δϕ(r0)

vOEPσ (r) + c.c. = 0 (2.42) Eq. 2.42 leads to an integral equation that is complicated. Krieger, Li and Iafrate[31, 32, 33] make an approximate procedure to simplify the original OEP in- tegral equations into the set of linear equations. Although the KLI procedure can’t reach the exact exchange functional, it reduces the computational difficulty and the its result is pretty close to OEP method.

2.2.3 KLI-SIC method

The OEP method and KLI approximation uses the exchange part of the density functional contains a Hartree-Fock-like nonlocal functional.

Exexact[}] = −1 2

X

σ Nσ

X

i,j=1

Z d3r

Z

d3r0ϕ(r0(r0(r)ϕ(r)

|r − r0| (2.43)

Even though Eq. 2.43 provides more accurate exchange potential, it’s computa- tionally more expensive than the traditional DFT functional with only local func- tional. Therefore,we present the extension of KLI procedures to the SIC term[34, 35]

(29)

in Eq. 2.39. This new KLI-SIC procedure can speed up the static DFT calcula- tion and time dependent DFT calculation. This KLI-SIC procedure make the self- interaction-free effective potential orbital-independent. In other word, this avoid the problems with respect to nonorthogonal spin-orbitals. And the KLI-SIC procedure give the optimized effective potential with the correct long-range behavior(−1/r)in Fig. 2.6 and surprisingly improvement of ionization energy and excited states.

-10 -9 -8 -7 -6 -5 -4 -3 -2 -1 0

0 1 2 3 4 5 6 7 8

r*Veff(r) (a.u.)

r (a.u.)

LSDA-KLI-SIC LSDA

-18 -16 -14 -12 -10 -8 -6 -4 -2 0

0 1 2 3 4 5 6 7 8

r*Veff(r) (a.u.)

r (a.u.)

LSDA-KLI-SIC LSDA

Figure 2.6: The effective potential rVef f with the LSDA and LSDA-KLI-SIC in neon and argon(left to right).

Define the total energy functional with SIC to be ESICOEP[, ϕ}] = EOEP[, ϕ}] −X

σ

X

i

{J[ρ] + Exc, 0]} (2.44)

where EOEP[, ϕ}] is normal energy functional in Eq. 2.35.Following the OEP- KLI procedure, one finds that

vSIC,σOEP (r) = vext(r) +

Z ρ(r0)

|r − r0|d3r0 +δExcα, ρβ]

δρσ(r) + vSIC,σ(r), (2.45) where

vSIC,σ =X

i

ρ(r)

ρσ {v(r) + viSIC,σ− v}, (2.46) v =

Z ρ(r0)

|r − r0|d3r0 δExc, 0]

δρ(r) (2.47)

(30)

and

viSIC,σ =< ϕ|vSIC,σ(r)|ϕ > (2.48) v =< ϕ|v(r)|ϕ > (2.49) The value of the vSIC,σ is unknown, but we can solve it through linear equations

NXσ−1 i=1

ji,σ− Mji,σ)(viSIC,σ− v) = vs− v (2.50)

where

Mji,σ =

Z ρ(r)ρ

ρσ d3r (2.51)

and

vs =< ϕ|

Nσ

X

j=1

ρ(r)v(r)

ρσ(r) > (2.52) The highest occupied orbital dominates the potential at the long-range. We choose vi=NSIC,σσ = vNσ to make sure the potential has correct asymptotic behavior.

2.2.4 TD-KLI-SIC method

We extend the KLI-SIC method into the time dependent system. The basic theorem of time dependent density functional theory is from Runge and Gross[36], mainly the similar structure of HK theorem and KS scheme. We’ll give the main theorems of TDDFT.More detail proofs and discussions can be found in.

First theorem, there is a one-to-one correspondence between time dependent den- sity and time dependent potential for any fixed initial states. In general, potential, hamiltonian and wave function is the functional of the time dependent density. This is the basic existence theorem of TDDFT.

Second theorem, we define the action A of the many-body system as the functional of many-body wave function,

A[ψ] = Z t2

t1

dt < ψ(t)|i∂

∂t − ˆH(t)|ψ(t) > (2.53)

(31)

If the variation of the action is δA[ψ] =

Z t2

t1

dt < δψ(t)|i∂

∂t− ˆH(t)|ψ(t) > + Z t2

t1

dt < ψ(t)|i∂

∂t − ˆH(t)|δψ(t) >

(2.54) We do integration by parts in the second term:

δA[ψ] = Z t2

t1

dt < δψ(t)|i∂

∂t− ˆH(t)|ψ(t) > + Z t2

t1

dt < (i∂

∂t− ˆH(t))ψ(t)|δψ(t) > +i < ψ(t)|δψ(t) > t2

t1

(2.55) The third term is zero because of the boundary condition on t1 and t2. The δA[φ] = 0 leads to the time dependent Schrödinger equation (i∂t − ˆH(t))φ(t) = 0. With the first theorem, the wave function is the functional of density, so we define

A[ρ] = Z t2

t1

dt < ϕ[ρ](t)|i∂

∂t− ˆH(t)|ϕ[ρ](t) > (2.56) And we rewrite the Eq. 2.56 as

A[ρ] = A0[ρ]− Z t2

t1

dt Z

d3rρ(r, t)v(r, t) (2.57) The action A0 is the universal functional from kinetic and electron-electron interac- tion term. The time dependent density can be solved from the variational principle.

δA[ρ]

δρ(r, t) = 0 (2.58)

Eq. 2.58 leads to the set of one-electron time dependent Schrödinger-like equation(TD Kohn-Sham equation).

(1

22+ vef f,σ(r, t))ϕ(r, t) = i∂

∂tϕ(r, t) (2.59) where the effective potential is

vef f(r, t) = v(r, t) +

Z ρ(r0, t)

|r − r0|dr0+ δAxc[ρ]

δρ(r, t) (2.60)

The last term is the time dependent exchange-correlation potential.

(32)

The OEP method is also available in the time dependent system[37].

Axc[ϕ]

vef f,σOEP[(r, t)] (2.61)

We present the TD-KLI-SIC method with adiabatic approximation[38]. The action is defined as

ASICxc = Z t2

t1

dtExcSIC0]|ρ0−→ρ(r,t) (2.62) The adiabatic approximation means that the system is only dependent on instant time. The derivative of the action ASICxc leads to the time dependent potential,

vxc,σSIC(r, t) = δExcα, ρβ]

δρσ(r, t) + vSIC,σ(r, t) (2.63) where

vSIC,σ(r, t) =X

i

ρ(r, t)

ρσ(r, t){v(r, t) + viSIC,σ(t)− v(t)}, (2.64) v(r, t) =−

Z ρ(r0, t)

|r − r0| d3r0−δExc, 0]

δρ(r, t) (2.65)

We solve the vSIC,σ(r, t) with the same procedures as before.

Finally, we solve the TDKS Eq. 2.60 with TDGPS method. We define ˆH0 as Hˆ0 =1

22 Z

r + vσ,ef f(r, 0) (2.66)

And

V (r, t) = v[ρ]ˆ σ,ef f(r, t)− vσ,ef f(r, 0) (2.67) Therefore, we have to solve the static Kohn-Sham equation by the self-consistency.

And we follow the TDGPS method that we discuss before to propagate the wave function.

2.3 Implement of numerical methods on graphics processing unit

In this section, I’ll give simple introduction about GPU and how to implement TDGPS on GPU. I recommend this book for beginner[39].

(33)

2.3.1 GPU architecture

Fig. 2.8 shows the architecture of the CUDA-capable graphic processing unit(GPU), which is composed of many streaming multiprocessors(SMs). CUDA is the abbre- viation of compute unified device architecture developed by NVIDIA. CUDA is the hardware and software architecture for the programmers who can develop and ex- ecute the programs in C, C++, Fortran and other languages. The programmer organizes the threads in blocks and grids of blocks in the program, which is called kernel, compile and execute. The programmer don’t worry about how the GPU ex- ecutes the threads and only focus on how to organize the threads. When executing a kernel, the machine will distribute the threads to SMs in the blocks. And for each SMs, the threads in the same block will be distribute to stream processors(SPs) in a SM in Fig. 2.7. In the Tesla K20, there are 13 SMs and 192 SPs in single SM. In general, a group of 32 threads forms a warp to hide the latency, so the threads in a warp was executed by the same SM. If threads are not organized well, the perfor- mance of the GPU would be bad. Therefore, we need to know the architecture in order to exploit the the power from GPU.

Figure 2.7: GPU architecture.

(34)

Figure 2.8: SM architecture.

GPU has several memory spaces, mainly global, shared, register. The memory is crucial for speeding up the programs. The register memory is for each thread. It’s very small, of scale of ten bytes but the fastest. The shared memory is for each block, so the threads in the same block share the shared memory which is of the scale of kilo-bytes. The global memory is of the scale of giga-bytes and it can be accessed by any thread but it’s slower than shared memory and register memory. Even though the global memory is not faster, it’s still far faster than CPU memory. Among all the memory access, the slowest one is the communication between CPU and GPU, so we minimize data transferring between them.

(35)

Figure 2.9: GPU Memory.

(36)

2.3.2 Implementation on GPU

Algorithm 1 pseudocode for TDGPS on GPU Input: ψ(r, 0),exp(−iH0δt/2),Grid points(ri, θj),δt Output: O(t)(Observables in each time step)

1: Allocate and Transfer GPU device memory for ψ(r, t),exp(−iH0δt/2),Grid points(ri, θj)

2: for t = 0 to t = tmax do

3: Operation ψ1(r, t) = exp(−iH0δt/2)ψ(r, 0)

4: Operation ψ2(r, t) = exp(−iV (r, θ, t)δt)ψ1(r, t)

5: Operation ψ(r, δt) = exp(−iH0δt/2)ψ2(r, t)

6: Operation < ψ(r, δt)| ˆO|ψ(r, δt) >

7: ψ(r, 0) = ψ(r, δt)

8: end for

9: Transfer O(t) from GPU to CPU

Here we show the pseudocode for TDGPS on GPU. Because we solve Schrödinger equation at only one time, we can do that on CPU. Next step is to allocate and transfer GPU device memory for wave functions, operators and grid points. This is a basic and important programming technique. Just like preparing ingredients for the cooking. The crucial part is to perform time propagation on GPU. Fortunately, we use the CUBLAS library to perform matrix-matrix and matrix-vector multiplications and it really reduces the difficulty of programming. But we still have to program some parts which the CUBLAS can’t include. Finally, we transfer data from GPU to CPU and print the results. For TDDFT, we solve the static Kohn-Sham equations on CPU because we do that only one time. And in the each time propagation, we have to calculate the effective potential which is the functional of the time dependent density.

(37)

2.3.3 Result

Figure 2.10: Runtime of the TDGPS and TD-KLI-SIC on one GPU and 16-cores CPU with different atoms. (nvidia Kepler K20(GPU) and Intel(R) Xeon(R) CPU E5-2690 2.9GHz(16 cores CPU))

Here we show that we speed up our program on GPU. We use the intel mkl library and openmp on multi-core CPU and use the CUBLAS library and parallel the programs on GPU. In our workstation, there are four GPU cards and we can also execute all of these cards at the same time. But I want to say there are some limitations on GPU. There only 5 GB on each GPU card, so we can’t put the data more than 5 GB on single GPU card. Even for four cards, the maximum memory is still only 20 GB which is not large enough for some AMO problems with thousands grid points.

How to reduce the data on time propagation is the main question in our research.

(38)

Chapter 3

Result and Discussion

In this chapter, we show the result of the calculation and give explanations of the phenomena.

3.1 Hydrogen

Hydrogen atoms is the simplest system in all atoms. We don’t need to use any approximations about multi-electron effect(only one electron).

We solve the time-dependent SchrÃűdinger equation by TDGPS method.

i∂

∂tϕ(r, t) = [−1

22 1

r + vext(r, t)]ϕ(r, t) (3.1) The laser fields are polarized along z-axis:

vext(r, t) =−z[εX(t) + εL(t)] (3.2) The SAP field can be defined as follow:

εX(t) = FXexp(−2 ln(2)(t− td)2

τX2 ) cos(ωX(t− td)) (3.3) Here, FX is the peak field strength of the SAP, τX = 140as is its full width at half maximum(FWHM), and ωX = 13.6 eV is its central frequency(here we choose the laser frequency as the ionization energy of 1s orbital because we want to excite

(39)

atoms). The SAP peak intensity is 1 × 1010W/cm2. The parameter td represents the time delay between the NIR and SAP; the negative time delay refers to the SAP arriving first. The NIR field has the form:

εL(t) = FLexp(−2 ln(2)t2

τL2 ) cos(ωLt) (3.4)

Here, FLis the peak field of the NIR pulse, τL fs is its FWHM, and ωL is the central frequency of the NIR field(here we choose the laser wavelength as 800 nm [ωL= 1.55 e.V] and 656 nm [ωL = 1.89 e.V]). The NIR laser peak intensity is 1× 1012W/cm2.

-0.006 -0.004 -0.002 0 0.002 0.004 0.006

-20 -15 -10 -5 0 5 10 15 20

-6e-05 0 6e-05

εNIR(t) (a.u.) εSAP(t) (a.u.)

Time (fs) Delay

SAP NIR

Figure 3.1: Illustration of SAP and NIR with time delay -5 fs.

After the time-propagation procedure, the dipole moment and the dipole accel- eration can be expressed as follow:

d(t) =hϕ(r, t)|r|ϕ(r, t)i (3.5)

a(t) =hϕ(r, t)|∇(1

r − vext(r, t))|ϕ(r, t)i (3.6) The spectral density of the radiation energy is given by the following expression:

(40)

S(ω) = 2 3πc3|

Z

−∞

a(t) exp(−iωt)dt|2 (3.7)

Here ω is the frequency of radiation, c is the velocity of light. S(ω) has the meaning of the energy emitted per unit frequency.

In the calculation, we use 128 radial and 32 angular grid points and the time step ω

L1024 (nearly 0.1 a.u.). The maximum radius is 60 a.u. and we place absorber between 40 a.u. and 60 a.u. describe the ionization process. The time delay was varied in steps of 4td = 20as within the range of −20fs 6 td 6 20fs (2048 steps in total).

(41)

Figure 3.2: Photon emission energy spectrum of the exicted states (2p[3s− 5s] and 2p[3d − 4d] as a function of the time delay between the NIR pulse and SAP. The yellow color indicates the highest energy emitted. The color bars are represented by the log10S(ω) of the spectral density in Eq. 3.7

In Fig. 3.2 we show the 3D plot of the photon emission spectrum as a function of td for the excited states 1snp(n ≤ 5). The higher excited states (1s4p and 1s5p) are shifted by the pondermotive potential Up of the NIR field, where Up = ε2L

(2ωL)2; for the field strength and frequency used, Up = 0.17 eV.

(42)

Figure 3.3: Photon emission energy spectrum of the 1s2p excited state as a function of the time delay between the NIR pulse and SAP.

Figure 3.4: Photon emission energy spectrum of the 1s3p excited state as a function of the time delay between the NIR pulse and SAP.

(43)

Figure 3.5: Photon emission energy spectrum of the 1s4p excited state as a function of the time delay between the NIR pulse and SAP.

The density plots of the photon emission spectrum in Fig. 3.3-3.5 depict the transition from 1s2p, 1s3p and 1s4p as the function of td. We can observe the oscillation structure in the region where the NIR and SAP overlap. The period of the oscillation is 1.1 fs, which is half of the NIR laser optical cycle. This phenomenon was also observed in theoretical calculation where they use absorption spectra[12].

In Fig. 3.3 and Fig. 3.4 we observe the splitting of the lines near td∼ 10fs. The electron absorbs one XUV photon to np states and then absorbs more NIR photons to forbidden states, ns or nd. If we try the NIR with wavelength of 656nm, the transition is more obvious. The splitting has been known as result of Autler-Townes effect[40]. We can identify this splitting by the Hamiltonian without some of the excited states. For 1s2p transition, we choose the td = 10f s and remove 3s and 3d states in the Hamiltonian and for 1s3p transition we choose the td= 10f s and remove 2s states in the Hamiltonian in Fig. 3.2. The splitting disappears in both of them and make sure this splitting can be explained in terms of two-photon absorption and emission process. The SAP excites the ground state to 1snp states; then the NIR

參考文獻

相關文件

(c) If the minimum energy required to ionize a hydrogen atom in the ground state is E, express the minimum momentum p of a photon for ionizing such a hydrogen atom in terms of E

Wang, Solving pseudomonotone variational inequalities and pseudocon- vex optimization problems using the projection neural network, IEEE Transactions on Neural Networks 17

volume suppressed mass: (TeV) 2 /M P ∼ 10 −4 eV → mm range can be experimentally tested for any number of extra dimensions - Light U(1) gauge bosons: no derivative couplings. =&gt;

Define instead the imaginary.. potential, magnetic field, lattice…) Dirac-BdG Hamiltonian:. with small, and matrix

For ASTROD-GW arm length of 260 Gm (1.73 AU) the weak-light phase locking requirement is for 100 fW laser light to lock with an onboard laser oscillator. • Weak-light phase

incapable to extract any quantities from QCD, nor to tackle the most interesting physics, namely, the spontaneously chiral symmetry breaking and the color confinement.. 

• Formation of massive primordial stars as origin of objects in the early universe. • Supernova explosions might be visible to the most

(Another example of close harmony is the four-bar unaccompanied vocal introduction to “Paperback Writer”, a somewhat later Beatles song.) Overall, Lennon’s and McCartney’s