

$$\sum_{r\in\mathbb{Z}} C_{k_1,k_2}\, H_1(2+\chi_r)\, s^{-\chi_r} \prod_{i=2}^{k_1+k_2-1}(i+\chi_r) + O\bigl(|s|^{1-k}\bigr)$$

uniformly as $|s|\to 0$ with $|\arg s|\le \pi-\epsilon$. Finally, we apply the inverse Laplace transform and Proposition 1 of [98] and obtain that, as $z\to\infty$,

$$\tilde{C}^{[k_1,k_2]}(z) = z^{k_1+k_2-1}\,C_{k_1,k_2}\bigl(C_{kps}+\varpi_{kps}(\log_2 z)\bigr) + O\bigl(|z|^{k_1+k_2-2+\epsilon}\bigr).$$

In particular,

$$\tilde{V}^{[k]}(z) = \frac{z^{2k-1}}{\log 2}\,C_{k,k}\bigl(C_{kps}+\varpi_{kps}(\log_2 z)\bigr) + O\bigl(|z|^{2k-2+\epsilon}\bigr) \qquad \text{as } z\to\infty.$$

Remark 22. Note that from the expression of $C_{k_1,k_2}$, we have $C_{k_1,k_2}^2 = C_{k_1,k_1}C_{k_2,k_2}$. Thus,

$$\rho\bigl(P_n^{[k_1]},P_n^{[k_2]}\bigr) = \frac{\operatorname{Cov}\bigl(P_n^{[k_1]},P_n^{[k_2]}\bigr)}{\sqrt{\operatorname{Var}\bigl(P_n^{[k_1]}\bigr)\operatorname{Var}\bigl(P_n^{[k_2]}\bigr)}} \sim \sqrt{\frac{n^{2k_1+2k_2-2}\,C_{k_1,k_2}^{2}\,\bigl(C_{kps}+\varpi_{kps}(\log_2 n)\bigr)^2}{n^{2k_1+2k_2-2}\,C_{k_1,k_1}C_{k_2,k_2}\,\bigl(C_{kps}+\varpi_{kps}(\log_2 n)\bigr)^2}} = 1.$$

Remark 23. Since we already know that $P_n^{[1]}$ satisfies a central limit theorem [98], together with the result in the above remark and applying a similar argument as in [76], we obtain that

$$\left(\frac{P_n^{[1]}-\mathbb{E}\bigl(P_n^{[1]}\bigr)}{\sqrt{\operatorname{Var}\bigl(P_n^{[1]}\bigr)}},\ \ldots,\ \frac{P_n^{[k]}-\mathbb{E}\bigl(P_n^{[k]}\bigr)}{\sqrt{\operatorname{Var}\bigl(P_n^{[k]}\bigr)}}\right) \stackrel{d}{\longrightarrow} (X,\ldots,X),$$

where $X$ is a standard normal random variable and $\stackrel{d}{\longrightarrow}$ denotes weak convergence.

4.3.3 Total Steiner k-distance

Let $S_n^{[k]}$ be the total Steiner $k$-distance. Then, using the same idea as for the $k$-th total path length, we consider four cases:

1. All $k$ nodes are from one subtree. In this case the contribution is $S^{[k]}_{B_n} + S^{[k]}_{n-B_n}$, where $B_n$ denotes the size of the first subtree.

2. The $k$ nodes are chosen from both subtrees and the root is not chosen; the number of nodes chosen from the first subtree then ranges from $1$ to $k-1$.

3. The root is chosen and the other $k-1$ nodes are all from one subtree; the contribution involves $P^{[k-1]}_{B_n}$ (and, symmetrically, $P^{[k-1]}_{n-B_n}$).

4. The root is chosen and the other $k-1$ nodes are from both subtrees; the number of non-root nodes chosen from the first subtree then ranges from $1$ to $k-2$.

Note that, as for the $k$-th total path length, we have a system of recurrences for the total Steiner $k$-distance. Similar to the analysis of the $k$-th total path length, we let $\tilde g^{[k]}(z)$ be the Poisson generating function of the mean of the total Steiner $k$-distance, $\tilde W^{[k_1,k_2]}(z)$ be the Poissonized covariance of the total Steiner $k_1$-distance and the $k_2$-th total path length, and $\tilde V_S^{[k]}(z)$ be the Poissonized variance of the total Steiner $k$-distance. With the help of computer algebra systems, we obtain the differential-functional equations satisfied by these functions.

The rest of the analysis is very similar to that for the $k$-th total path length, so we skip the details. Since the leading terms of the resulting expansions for the mean and the variance are exactly the same as for the $k$-th total path length, the same arguments as for $P_n^{[k]}$ give the results stated in Theorem 4.3.1.

Chapter 5

A General Framework for Central Limit Theorems

5.1 Framework for m-ary Tries

In this section, we will discuss a general framework for the limiting distribution of additive shape parameters in random digital trees. For $m$-ary tries and PATRICIA tries, an additive shape parameter is defined as follows: $X_n$ is a sequence of random variables satisfying the distributional recurrence

$$X_n \stackrel{d}{=} \sum_{r=1}^{m} X^{(r)}_{I_n^{(r)}} + T_n, \qquad (n\ge n_0), \tag{5.1}$$

where $n_0\ge 0$ is an integer, $X_n$, $X_n^{(1)},\ldots,X_n^{(m)}$, $(I_n^{(1)},\ldots,I_n^{(m)})$, $T_n$ are independent and $X_n^{(i)}$ has the same distribution as $X_n$. The random model we use is the Bernoulli model introduced in Chapter 2. For digital search trees and bucket digital search trees, the distributional recurrence is

$$X_{n+b} \stackrel{d}{=} \sum_{r=1}^{m} X^{(r)}_{I_n^{(r)}} + T_{n+b}, \qquad (n\ge n_0), \tag{5.2}$$

where $b\ge 1$ is an integer. The remaining notation is as in the trie case.
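For concreteness, here is a standard instance of (5.1) (a textbook example we add for illustration, not one analyzed in this section): the external path length $X_n$ of a random binary trie built from $n$ strings satisfies
$$X_n \stackrel{d}{=} X^{(1)}_{I_n^{(1)}} + X^{(2)}_{I_n^{(2)}} + n, \qquad (n\ge 2),$$
that is, (5.1) with $m = 2$, $n_0 = 2$ and toll $T_n = n$, since splitting at the root adds one to the depth of each of the $n$ strings.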

Thanks to the development of related mathematical techniques, including Poissonization, Poissonized variance with correction, the Mellin transform and the contraction method, we have many tools to characterize the asymptotics of additive shape parameters under the Bernoulli model. The authors of [77] and [75] proposed a systematic way to derive the asymptotics of the mean and the variance as well as the limit laws of additive shape parameters of random tries. It turns out that the same method works for random digital search trees as well.

Definition 5.1.1. If a set $P = \{p_1,\ldots,p_m\}$ satisfies $p_i\in(0,1)$ for all $1\le i\le m$ and $\sum_i p_i = 1$, then we say that $P$ is a probability family.

For a probability family $P=\{p_1,\ldots,p_m\}$, if there exist a constant $a\in\mathbb{R}$ and a sequence $\{k_i\}_{i=1}^m$ with $k_i\in\mathbb{N}$ for all $1\le i\le m$ such that $p_i = a^{k_i}$ for all $i$, then we say that $P$ is periodic. Otherwise, $P$ is said to be aperiodic.
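As a quick illustration of the definition (an example we add for concreteness): the family $P = \{1/2, 1/4, 1/4\}$ is periodic, since $p_i = a^{k_i}$ with $a = 1/2$ and $(k_1,k_2,k_3) = (1,2,2)$. In contrast, $P = \{1/3, 2/3\}$ is aperiodic: $1/3 = a^{k_1}$ and $2/3 = a^{k_2}$ would force $(2/3)^{k_1} = (1/3)^{k_2}$, i.e. $2^{k_1} = 3^{k_1-k_2}$, which is impossible for positive integers $k_1, k_2$ by unique factorization.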

For a probability family $P=\{p_1,\ldots,p_m\}$, we define the function
$$\Lambda(s) = 1 - p_1^{-s} - \cdots - p_m^{-s}.$$
We let $\mathcal{Z}$ be the set of roots of $\Lambda(s) = 0$ and define
$$\mathcal{Z}_{<\alpha} = \mathcal{Z}\cap\{\Re(z)<\alpha\} \qquad\text{and}\qquad \mathcal{Z}_{=\alpha} = \mathcal{Z}\cap\{\Re(z)=\alpha\}.$$

Then from [55] and [67], we have the following properties.

Theorem 5.1.2. Depending on the real part of the solutions of $\Lambda(s)=0$, we have three cases:

(i) If $\Re(s) < -1$, then $\Lambda(s)=0$ has no solutions. In other words, $\mathcal{Z}_{<-1}=\emptyset$.

(ii) If $\Re(s) = -1$, then $\mathcal{Z}_{=-1} = \{-1\}\cup S$ where
$$S = \begin{cases} \bigl\{-1+\chi_k \,\bigm|\, \chi_k = 2k\pi i/\log a,\ k\in\mathbb{Z}\setminus\{0\}\bigr\}, & P \text{ is periodic};\\[4pt] \emptyset, & P \text{ is aperiodic}.\end{cases}$$

(iii) If $\Re(s) > -1$, then there exists a positive constant $\eta$ such that for any two solutions $\omega_1$, $\omega_2$, we have $|\omega_1-\omega_2| > \eta$.
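As an example (added for illustration): for the symmetric binary family $P = \{1/2, 1/2\}$ we have $\Lambda(s) = 1 - 2\cdot 2^{s} = 1 - 2^{s+1}$, whose roots are exactly
$$s_k = -1 + \frac{2k\pi i}{\log 2}, \qquad k\in\mathbb{Z}.$$
Hence $\mathcal{Z}_{<-1} = \emptyset$ and $\mathcal{Z}_{=-1} = \{-1\}\cup S$ with $a = 1/2$, in accordance with parts (i) and (ii) of the theorem.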

Lemma 5.1.3. Let $\tilde f(z)$ and $\tilde h(z)$ be entire functions satisfying a functional equation of the form
$$\tilde f(z) = \sum_{r=1}^{m} \tilde f(p_r z) + \tilde h(z), \tag{5.3}$$
where $\{p_1,\ldots,p_m\}$ forms a probability family. We denote by $h = -\sum_{r=1}^{m} p_r\log p_r$ the entropy of the family. If $\tilde h(z)\in\mathscr{JS}_{\alpha,\gamma}$ with $0\le\alpha<1$ and $\tilde f(0)=\tilde f'(0)=0$, then
$$\tilde f(z) = \frac{1}{h}\sum_{\omega_k\in\mathcal{Z}_{<-\alpha-\epsilon}} G(\omega_k)\,z^{-\omega_k} + O\bigl(z^{\alpha+\epsilon}\bigr),$$
where the sum expression is infinitely differentiable and
$$G(\omega) = \int_0^{\infty} z^{\omega-1}\tilde h(z)\,dz = \mathscr{M}[\tilde h;\omega].$$

Proof. Since $\tilde h(z)\in\mathscr{JS}_{\alpha,\gamma}$ with $0\le\alpha<1$, the Mellin transform $G(\omega)=\mathscr{M}[\tilde h;\omega]$ exists in a suitable strip, and taking the Mellin transform of (5.3) gives $\mathscr{M}[\tilde f;\omega] = G(\omega)/\Lambda(\omega)$. By the converse mapping theorem, Theorem 3.3.10 and Theorem 5.1.2, we get the desired result.
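To make the shape of the expansion concrete (an illustration we add, under the assumptions of the lemma): for the symmetric binary family, $h = \log 2$ and all roots lie on $\Re(\omega)=-1$, so the lemma specializes to
$$\tilde f(z) = \frac{z}{\log 2}\Bigl(G(-1) + \sum_{k\ne 0} G\bigl(-1+\tfrac{2k\pi i}{\log 2}\bigr)\, e^{-2k\pi i\log_2 z}\Bigr) + O\bigl(z^{\alpha+\epsilon}\bigr),$$
that is, linear growth modulated by a periodic function of $\log_2 z$, the familiar fluctuations in digital-tree asymptotics.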

Now, we consider the moment generating function of $X_n$, $M_n(y) := \mathbb{E}\bigl(e^{X_n y}\bigr)$. Taking the first derivative at $y=0$ in the distributional recurrence, we get that the mean $\mu_n = M_n'(0) = \mathbb{E}(X_n)$ satisfies
$$\mu_n = \sum_{r=1}^{m}\mathbb{E}\bigl(\mu_{I_n^{(r)}}\bigr) + \mathbb{E}(T_n), \qquad (n\ge n_0). \tag{5.4}$$
For simplicity, we assume in the sequel that $n_0 = 2$; for more general cases, our method will also apply with slight modifications.

Now, we utilize the idea of Poissonization, which was already used in previous sections. We let
$$\tilde f_1(z) = e^{-z}\sum_{n\ge 0}\mu_n\frac{z^n}{n!};$$
then (5.4) yields that
$$\tilde f_1(z) = \sum_{r=1}^{m}\tilde f_1(p_r z) + \tilde h_1(z), \tag{5.5}$$
where $\tilde h_1(z)$ denotes the Poisson generating function of $\mathbb{E}(T_n)$.

Next, we utilize the idea of Poissonized variance with correction and let
$$\tilde V_X(z) = \tilde f_2(z) - \tilde f_1(z)^2 - z\tilde f_1'(z)^2, \qquad \tilde V_T(z) = \tilde h_2(z) - \tilde h_1(z)^2 - z\tilde h_1'(z)^2,$$
where $\tilde f_2(z)$ and $\tilde h_2(z)$ are the Poisson generating functions of the second moments of $X_n$ and $T_n$, respectively. From (5.5), we derive that
$$\tilde V_X(z) = \sum_{r=1}^{m}\tilde V_X(p_r z) + \tilde V_T(z) + \tilde\phi_1(z) + \tilde\phi_2(z), \tag{5.6}$$
where $\tilde\phi_1(z)$ and $\tilde\phi_2(z)$ are correction terms built from $\tilde f_1$ and $\tilde h_1$.
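To see why the correction term $-z\tilde f_1'(z)^2$ is included (a sanity check we add): take the deterministic parameter $X_n = n$. Then $\mu_n = n$ and $\mathbb{E}(X_n^2) = n^2$, so $\tilde f_1(z) = z$ and $\tilde f_2(z) = z + z^2$, giving
$$\tilde V_X(z) = (z+z^2) - z^2 - z\cdot 1^2 = 0,$$
as it should be for a deterministic quantity; without the correction term, the Poissonized variance would equal $z$, reflecting only the fluctuation of the Poisson model itself.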

Before we go on to derive asymptotic expressions, we introduce the Hadamard product of Poisson generating functions.

Definition 5.1.4. Given two Poisson generating functions
$$\tilde F_1(z) = e^{-z}\sum_{n\ge 0} a_n\frac{z^n}{n!} \qquad\text{and}\qquad \tilde F_2(z) = e^{-z}\sum_{n\ge 0} b_n\frac{z^n}{n!},$$
we define the Hadamard product of these two functions as
$$\tilde F_3(z) := \tilde F_1(z)\odot\tilde F_2(z) = e^{-z}\sum_{n\ge 0}\frac{a_n b_n}{n!}\,z^n.$$

Note that this definition differs from the usual one since we consider exponential generating functions.

Subsequently, we will use Hadamard products to handle the function $\tilde\phi_1$ in (5.6). For this, we need the following theorem, which shows that JS-admissibility is closed under the Hadamard product.

Theorem 5.1.5. If $\tilde F_1\in\mathscr{JS}_{\alpha_1,\beta_1}$ and $\tilde F_2\in\mathscr{JS}_{\alpha_2,\beta_2}$, then $\tilde F_3\in\mathscr{JS}_{\alpha_{12},\beta_{12}}$. More precisely, we have
$$\tilde F_3(z) = \tilde F_1(z)\tilde F_2(z) + z\tilde F_1'(z)\tilde F_2'(z) + O\bigl(|z|^{\alpha_{12}-2}(\log^+|z|)^{\beta_{12}}\bigr),$$
uniformly as $|z|\to\infty$ and $|\arg(z)|\le\theta$, where $0<\theta<\pi/2$.

Proof. See the proof of Proposition 3.5 of [75].
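As a small check of Theorem 5.1.5 (an example we add): take $a_n = b_n = n$, so that $\tilde F_1(z) = \tilde F_2(z) = z$. Then
$$\tilde F_3(z) = e^{-z}\sum_{n\ge 0}\frac{n^2}{n!}\,z^n = z + z^2 = \tilde F_1(z)\tilde F_2(z) + z\tilde F_1'(z)\tilde F_2'(z),$$
so in this case the expansion of the theorem is even exact.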

We can now state the result on the asymptotic expressions of the mean and variance. (This result was first obtained by Fuchs et al. in [75].)

Proposition 5.1.6. If $\tilde h_1(z)\in\mathscr{JS}_{\alpha_1,\beta_1}$ with $0\le\alpha_1<1$, then
$$\mathbb{E}(X_n) = \frac{1}{h}\sum_{\omega_k\in\mathcal{Z}_{<-\alpha_1-\epsilon}} G_E(\omega_k)\,n^{-\omega_k} + O\bigl(n^{\alpha_1}\bigr),$$
where the sum expression is infinitely differentiable and
$$G_E(\omega) = \mathscr{M}[\tilde h_1;\omega] = \int_0^{\infty}\tilde h_1(z)\,z^{\omega-1}\,dz.$$

Moreover, if $\tilde V_T(z)\in\mathscr{JS}_{\alpha_2,\beta_2}$ with $0\le\alpha_2<1$ and $\tilde h_2(z)\in\mathscr{JS}$, then
$$\operatorname{Var}(X_n) \sim \frac{1}{h}\sum_{\omega_k\in\mathcal{Z}_{=-1}} G_V(\omega_k)\,n^{-\omega_k},$$
where the sum expression is infinitely differentiable and
$$G_V(\omega) = \mathscr{M}[\tilde V_T+\tilde\phi_1+\tilde\phi_2;\omega] = \int_0^{\infty}\bigl(\tilde V_T(z)+\tilde\phi_1(z)+\tilde\phi_2(z)\bigr)\,z^{\omega-1}\,dz.$$

Proof. The expression for the mean follows directly from (5.5), Lemma 5.1.3 and depoissonization.

For the variance, we start from (5.6) and apply Theorem 5.1.5 to handle the Hadamard products appearing in $\tilde\phi_1(z)$.

Now, we turn to $\tilde\phi_2(z)$. First, by applying the Mellin transform to (5.5), we get that, for $-2<\Re(\omega)<-1$,
$$\mathscr{M}[\tilde f_1;\omega] = \frac{G_E(\omega)}{\Lambda(\omega)}.$$
Thus, by the inverse Mellin transform,
$$\tilde f_1(z) = \frac{1}{2\pi i}\int_{(-3/2)}\frac{G_E(\omega)}{\Lambda(\omega)}\,z^{-\omega}\,d\omega,$$
and the part of this integral entering $\tilde\phi_2$ has no poles at $\Re(\omega)=-1$. As a result, $\tilde\phi_2(z) = o(|z|)$ as $z\to\infty$, which in turn shows that $\mathscr{M}[\tilde\phi_2;\omega]$ has no poles at $\Re(\omega)=-1$. Now, the converse mapping theorem proves the claimed expansion for $\tilde V_X(z)$. Moreover, by JS-admissibility, the expansion holds for $\operatorname{Var}(X_n)$ as well.

Now, we can state the general central limit theorem.

Theorem 5.1.7. Suppose that $\tilde h_1(z)\in\mathscr{JS}_{\alpha_1,\beta_1}$ with $0\le\alpha_1<1/2$, $\tilde h_2(z)\in\mathscr{JS}$ and $\tilde V_T(z)\in\mathscr{JS}_{\alpha_2,\beta_2}$ with $0\le\alpha_2<1$. Moreover, we assume that $\|T_n\|_s = o(\sqrt{n})$ with $2<s\le 3$ and $\operatorname{V}(X_n)\ge cn$ for all $n$ large enough and some $c>0$. Then, as $n\to\infty$,
$$\frac{X_n-\mathbb{E}(X_n)}{\sqrt{\operatorname{V}(X_n)}} \stackrel{d}{\longrightarrow} \mathcal{N}(0,1).$$

Proof. From Proposition 5.1.6, we get that
$$\mathbb{E}(X_n) = \frac{1}{h}\sum_{\omega_k\in\mathcal{Z}_{<-\alpha_1-\epsilon}} G_E(\omega_k)\,n^{-\omega_k}+O\bigl(n^{\alpha_1}\bigr), \qquad \operatorname{V}(X_n) \sim \frac{1}{h}\sum_{\omega_k\in\mathcal{Z}_{=-1}} G_V(\omega_k)\,n^{-\omega_k}.$$

From the assumption, we can choose $\epsilon$ such that $\alpha_1+\epsilon<1/2$. Next, we set
$$\varpi_1(x) = \sum_{\omega_k\in\mathcal{Z}_{<-\alpha_1-\epsilon}} \frac{G_E(\omega_k)}{h}\,x^{-\omega_k}, \qquad \varpi_2(x) = \sum_{\omega_k\in\mathcal{Z}_{=-1}} \frac{G_V(\omega_k)}{h}\,x^{-\omega_k-1}.$$

To apply the contraction method, we need to verify the following conditions:

(a) $\displaystyle\left(\frac{I_n^{(r)}\,\varpi_2\bigl(I_n^{(r)}\bigr)}{n\,\varpi_2(n)}\right)^{1/2} \xrightarrow{\;L_s\;} A_r$, where $\displaystyle\sum_{r=1}^{m} A_r^2 = 1$ and $\mathbb{P}(\exists r: A_r=1)<1$.

(b) $\displaystyle (n\,\varpi_2(n))^{-1/2}\left(T_n - \varpi_1(n) + \sum_{r=1}^{m}\varpi_1\bigl(I_n^{(r)}\bigr)\right) \xrightarrow{\;L_s\;} 0$.

We begin with the verification of (a). By the strong law of large numbers and the dominated convergence theorem,
$$\frac{I_n^{(r)}}{n} \xrightarrow{\;L_p\;} p_r, \qquad 1\le r\le m. \tag{5.7}$$

Moreover, by the definition of $\varpi_2(x)$, we have
$$\varpi_2(p_r n) = \varpi_2(n) \ \text{ for all } 1\le r\le m, \qquad \varpi_2'(x) = O\bigl(x^{-1}\bigr).$$

By the Taylor series expansion of $\varpi_2$,
$$\varpi_2\bigl(I_n^{(r)}\bigr) = \varpi_2(n) + O\bigl(n^{-1}\bigl|I_n^{(r)}-p_r n\bigr|\bigr). \tag{5.8}$$

Combining (5.7) and (5.8), we get
$$\left(\frac{I_n^{(r)}\,\varpi_2\bigl(I_n^{(r)}\bigr)}{n\,\varpi_2(n)}\right)^{1/2} \xrightarrow{\;L_s\;} \sqrt{p_r},$$
so that $A_r = \sqrt{p_r}$; since $\sum_{r=1}^{m} p_r = 1$ and $p_r\in(0,1)$ for all $r$, condition (a) is verified.

Now, we turn to the verification of condition (b). Note that from the assumptions on $\|T_n\|_s$ and $\operatorname{V}(X_n)$, the term $T_n$ can be dropped from (b). Therefore, we only need to check that
$$(n\,\varpi_2(n))^{-1/2}\left(\varpi_1(n) - \sum_{r=1}^{m}\varpi_1\bigl(I_n^{(r)}\bigr)\right) \xrightarrow{\;L_s\;} 0. \tag{5.9}$$
We restrict ourselves to the event $A_n$ on which $\bigl|I_n^{(r)}-p_r n\bigr|\le n^{2/3}$ for all $1\le r\le m$ (say); Chernoff's bound yields that
$$\mathbb{P}(A_n^c) = O\bigl(e^{-c' n^{1/3}}\bigr)$$
for some $c'>0$.

Again, we compute the Taylor expansion of $\varpi_1$ (on $A_n$):
$$\varpi_1\bigl(I_n^{(r)}\bigr) = \varpi_1(p_r n) + \varpi_1'(p_r n)\bigl(I_n^{(r)}-p_r n\bigr) + O\bigl(n^{-1}\bigl|I_n^{(r)}-p_r n\bigr|^2\bigr). \tag{5.10}$$
Note that $\varpi_1(n) = \sum_{r=1}^{m}\varpi_1(p_r n)$, since $\sum_{r} p_r^{-\omega_k} = 1$ for every root $\omega_k\in\mathcal{Z}$. We estimate the remaining terms in (5.10) individually. First, we consider
$$\varpi_1'(p_r n)-\varpi_1'(p_s n) = \sum_{\omega_k\in\mathcal{Z}_{<-\alpha_1-\epsilon}} \frac{G_E(\omega_k)}{h}\,(-\omega_k)\,n^{-\omega_k-1}\bigl(p_r^{-\omega_k-1}-p_s^{-\omega_k-1}\bigr) = o(1).$$

Together with the assumption on $\operatorname{V}(X_n)$, this shows that the first-order terms in (5.10) contribute $o(\sqrt{n})$ in $L_s$ (5.11); similarly, the remainder terms contribute $o(\sqrt{n})$ as well (5.12). Substituting (5.11) and (5.12) back into (5.9) shows that (b) holds.
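As an illustration of how Theorem 5.1.7 is applied (a standard example, added here for concreteness): the size $X_n$ (number of internal nodes) of a random binary trie satisfies (5.1) with $m = 2$ and $T_n = 1$ for $n\ge 2$, so $\|T_n\|_s = O(1) = o(\sqrt{n})$; moreover, the mean and the variance of the size are known to be of linear order, so $\operatorname{V}(X_n)\ge cn$ holds for large $n$. Hence Theorem 5.1.7 yields a central limit theorem for the trie size.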