運動賽事期間隊伍能力的變化分析

(1)

國立臺灣大學理學院應用數學科學研究所碩士論文

Institute of Applied Mathematical Sciences College of Science

National Taiwan University Master Thesis

運動賽事期間隊伍能力的變化分析

Analyzing Dynamic Abilities of Teams in Sports Events

管敏仁 Min-Ren Guan

指導教授：江金倉博士

Advisor: Chin-Tsang Chiang, Ph.D.

(2)

國立臺灣大學碩士學位論文

口試委員會審定書

-� 動賽事期間隊伍能力的變化分析

nalyzing Dynamic Abilities of Teams in Sports Event

論文亻系嵒敏仁君(R07246013)在國立全滑大學教學學系、所完成之碩士學位論文，於民國109年7月22日承下列考拭委員-L ^-=

面過及口讠式及格，特此證明

口試委

5L仝定（簽名）

4 勺巨

L

、h�., ,,

系主任、所表（簽名）

（走否須簽幸依各院系所規定）

(3)

中文摘要

在成對比較類型的運動數據分析中，實務工作者和研究者們認知球隊在連續比賽中，由於受傷、團隊心理和團隊進步會導致團隊能力變化。描述分數差或比賽結果最常用的框架主要是對主場隊伍和客場隊伍的能力差做適當的轉換。在這樣的考慮下，隊伍能力的變化可以在頻率派學者或貝氏的觀點下進一步建模。通過整合這些特點到模型的構造中，我們對團隊能力提出更通用的動態模型。此外，我們還制定了一些準則來從競爭模型中選出擁有較好季後賽預測力的模型。我們透過美國國家籃球協會 2009-2010 賽季到 2018-2019 賽季的數據來調查提案的實用性。

關鍵字：成對比較、動態能力、混合效應模型、模型選擇、預測率、預測均方差

(4)

Abstract

In paired-comparison sports data analysis, practitioners and researchers have iden- tiﬁed the varying abilities of teams due to injuries, team psychology, and team improvement in the course of sequential competitions. The most commonly used framework to describe the score difference or the match outcome is mainly based on an appropriate transformation of the difference in abilities of the home team and the visiting team. Under such consideration, the abilities of teams can be further modelled with dynamic effects in the frequentist or Bayesian perspective. By integrating these features into a model formulation, we propose more general dynamic models for the abilities of teams. In addition, some criteria are developed to select a better predictive model for playoffs among competing models. The practicality of our proposal is also investigated by the data from the 2009-2010 season to the 2018-2019 season of the National Basketball Association.

KEY WORDS: Paired comparisons; Dynamic abilities; Mixed effects models;

Model selection; Proportion of correct predictions; Prediction mean squared error.

(5)

List of Figures

6.1 Proportion of correct predictions of M·PCP (red line), M·PMSE

(blue line), andM·BIC(green line). Black lines are the highest and lowest proportions of correct predictions amongM·for· = 1, 2, and T . . . . 20 6.2 Selected plots of minimized sum of squares value to λ. . . 22 6.3 Selected plots of minimized sum of squares value to λ. . . 22

(7)

List of Tables

6.1 The estimated proportion of correct predictions from the Dynamic Bradley-Terry models (cf. [6]) and the selected proposed models on playoffs data.. . . 21 6.2 Mean of proportion of correct predictions of Dynamic Bradley-

Terry model (cf. [6]) and proposed models over the ten seasons. . 22 6.3 Ratio of ˆσ_h² to ˆσ₀² and ˆσ_h² to ˆσ₀²(rounded to 3 decimal places) . . . 23 6.4 Mean of PCP and BIC over ten seasons . . . 24

(8)

Chapter 1 Introduction

How to assess abilities of sports teams has been of great interest to researchers and practitioners. National Collegiate Athletic Association (NCAA) established a ranking system reﬂecting the abilities of teams to select teams for playoffs. Pre- dictions of future outcomes can be made by the abilities of participating teams, which are highly concerned by practitioners.

Paired comparison models have been commonly used for sports events. A season of basketball matches in NBA league can be regarded as a series of paired comparisons. The advantage of paired comparisons is reducing the effects of con- founding. For example, two teams share the same referee in a match, whereas one team may played with several different referees throughout the whole season and there may be judgement biases among referees. Existing Paired comparison models for sports events characterize the score difference or outcome to be related with home team’s ability and visiting team’s ability by a linear model or generalized linear model respectively.

Previous studies proposed a variety of paired comparison models for sports events including random/fixed effects models with/without dynamic effects on the abilities. In the spirit of existing models, we further propose two flexible models under different cases and many of existing models can be unified in the proposed models. We connect Bayesian and frequentist viewpoints by mixed effects models.

(9)

1. Introduction 2

The dynamic scheme of abilities is more general by considering ﬁxed dynamic scheme and random processes for the abilities simultaneously. We provide model selection criteria to select a better model and setup for the prediction purpose. Two measures of predictive ability are used to compare the predictive performances of competing models.

In section 2, several existing paired comparison models are introduced. Sec- tion 3 describes the proposed models under different setups. Section 4 intro- duces the estimation method, which consists of least squares method, maximizing observed likelihood, and maximizing posterior likelihood. The measures of goodness-of-ﬁt and predictive ability are also introduced. Section 5 presents an application to the National Basketball Association.

(10)

Chapter 2 Existing Paired Comparison Models

Let m be the number of matches; T the number of teams; Yithe score difference of match i, i = 1, . . . , m; a_kand b_kthe home ability and visiting ability of team k respectively, k = 1, . . . , T ; hi and vithe home team and visiting team in match i respectively; and t_ithe time of match i.

The ﬁrst paired comparison model for sports events proposed by [1] did not consider the dynamic effects and the home ability and visiting ability were con- sidered to be the same. That is, ak = b_k , αk,∀i = 1, . . . , m, and k = 1, . . . , T , which leads to the following model:

Y_i = α_h_i− αvi+ ε_i, i = 1, . . . , m. (2.1) [2] improved model (2.1) by considering the home court advantage θ, i.e.,

a_k, αk+ θ and b_k , αk, which leads to the following model:

Y_i = θ + α_h_i− αvi+ ε_i, i = 1, . . . , m. (2.2) [3] considered team-speciﬁc home court advantages θ_k, i.e., a_k , αk + θ_k and b_k , αk, which leads to the following model:

Y_i = θ_h_i + α_h_i − αvi+ ε_i, i = 1, . . . , m. (2.3) The ﬁrst model considering the dynamic abilities was proposed by [4]. The model suggested that there is a deviation of performance Sk(ti) from a team’s

(11)

2. Existing Paired Comparison Models 4

underlying ability in each game. In the formulation of (2.2), let a_k(t_i), θ+αk(t_i) and bk(ti), αk(ti). The model leads to

α_k(t_i) = α_k+ S_k(t_i), (2.4) where Sk(t_i) follows a random process. [5] proposed a model for ordered cate- gories:

P (Y_i ≤ r) = F (θr+ α_hi(t_i)− αvi(t_i)), r = 1, . . . , k (2.5) with αk(ti) following a random process and αk(0) = αkfor all k. [4] and [5] both assumed that the dynamic effects on ability depends on some random processes, wheras [6] proposed a ﬁxed dynamic scheme for the dynamic evolution of ability.

Let λ₁, λ₂ ∈ [0, 1] and Yi = 1 if the home team won, and Y_i = 0 if the visiting team won. t⁽_i⁻¹⁾ denotes the time of the previous home match in which hi was also the home team, t^′(−1)_i denotes the time of the previous away match in which vi was also the visiting team. [6] proposed the following dynamic Bradley-Terry model:

P (Yi = 1|Yi−1 = yi−1, . . . , Y1 = y1) = exp{ahi(t_i)− bvi(t_i)} 1 + exp{ahi(t_i)− bvi(t_i)}

with _









a_h_i(t_i) = λ₁γ₁y(t⁽_i⁻¹⁾) + (1− λ1)a_h_i(t⁽_i⁻¹⁾), b_v_i(t_i) = λ₂γ₂(1− y(t^′(−1)i )) + (1− λ2)b_v_i(t^′(−1)_i ),

(2.6)

where y(ti) denotes the outcome of the match at time t_i. [6] assumed that all teams started with the same home and visiting underlying abilities γ₁r¯_h and γ₂r¯_v respectively, where ¯r_hand ¯r_vare the average win rates of home matches and away matches over the previous regular season respectively.

(12)

Chapter 3 Proposed Models for Dynamic Effects

Based on existing models, a paired comparison model for sports events can be formulated as

Yi = ahi(ti)− bvi(ti) + εi, i = 1, . . . , m. (3.1) The main issue is how to model the dynamic evolutions of abilities ahi(ti) and b_v_i(t_i) for i = 1, . . . , m. Let Y_i denote the score difference of match i. Y_h(t_i) and Yv(ti) denote the scores of the home team and the visiting team of match i respectively. For the regression formulation, we deﬁne the following notations:

Y =







Y1

... Ym







, ε =







ε1

... εm







, a =







a1

... aT







, b =







b1

... bT







, γ₁ =







γ11

... γ1T







, and γ₂ =







γ21

... γ2T







.

3.1 Background

In the spirit of [6], we propose a more general model for abilities (hereinafter referred to M1):









a_h_i(t_i) = λ₁γ_1h_iy_h(t⁽_i⁻¹⁾) + (1− λ1)a_h_i(t⁽_i⁻¹⁾), b_v_i(t_i) = λ₂γ_2v_iy_v(t^′(−1)_i ) + (1− λ2)b_v_i(t^′(−1)_i ).

(3.2)

(13)

3. Proposed Models for Dynamic Effects 6

The underlying abilities a_kand b_kare ﬁxed unknown parameters for k = 1, . . . , T . The dynamic Bradley-Terry model is a special case of model M1 under the following three conditions: (i) underlying abilities are set to be the average win rates of home matches and away matches over the previous regular season respectively.

(ii) γ₁₁ = · · · = γ1T and γ₂₁ = · · · = γ2T. (iii) The score difference is replaced with the outcome.

In model M1, the underlying abilities a and b are involved in the updating scheme with scores. Thus the effects of underlying abilities will decrease as the season goes on. However, model (2.4) provides a different aspect: The effects of underlying abilities should be the same throughout the season. The dynamic effects are explained as the deviations of the actual performances from the underlying abilities. Considering the feature, we propose a model in which the underlying abilities are not involved in the updating scheme (hereinafter referred to M2):











a_h_i(t_i) = a_h_i+ γ_1h_iS_h_i(t_i) and b_v_i(t_i) = b_v_i+ γ_2v_iS_v_i(t_i), S_h_i(t_i) = (1− λ1)S_h_i(t⁽_i⁻¹⁾) + λ₁y_h(t⁽_i⁻¹⁾),

S_v_i(t_i) = (1− λ2)S_v_i(t^′(−1)_i ) + λ₂y_v(t^′(−1)_i ),

(3.3)

where S_v_i(0) = 0 and S_h_i(0) = 0.

There are two meaningful cases of model M2: λ₁ = λ₂ = 1 and λ₁ = λ₂ = 0 (hereinafter referred to M2-i and M2-ii respectively). The former implies that the dynamic effects only count on the result of the previous match. That is,

S_h_i(t_i) = y_h(t⁽_i⁻¹⁾) and S_v_i(t_i) = y_v(t^′(−1)_i ). (3.4) The case of λ1 = λ2 = 0 means that there is no dynamic effect, i.e.,

S (t )≡ 0 and S (t )≡ 0. (3.5)

(14)

Above models are in the frequentist framework, the abilities are updated by ﬁxed parameters and historic data. In Bayesian framework, the abilities are updated by some random processes. The simplest case is the ﬁrst-order random

walk model: _









ahi(ti) = ahi(t⁽⁻¹⁾_i ) + uh(ti), bvi(ti) = bvi(t^′(−1)_i ) + uv(ti),

(3.6)

where u_h(t_i)’s î.i.d._e N (0, σ_h²), u_v(t_i)’s î.i.d._e N (0, σ_v²) and they are mutually inde- pendent for i = 1, . . . , m. Considering both factors simultaneously, we further propose models M1R, M2R, M2R-i and M2R-ii from M1, M2, M2-i and M2-ii respectively, in which the abilities follow a random process. In the case of first-order random walk, M1R and M2R are proposed as the follows respectively.









a_h_i(t_i) = λ₁γ_1h_iy_h(t⁽_i⁻¹⁾) + (1− λ1)a_h_i(t⁽_i⁻¹⁾) + u_h(t_i), b_v_i(t_i) = λ₂γ_2v_iy_v(t^′(−1)_i ) + (1− λ2)b_v_i(t^′(−1)_i ) + u_v(t_i).

(3.7)











a_h_i(t_i) = a_h_i+ γ_1h_iS_h_i(t_i) and b_v_i(t_i) = b_v_i+ γ_2v_iS_v_i(t_i), S_h_i(t_i) = (1− λ1)S_h_i(t⁽_i⁻¹⁾) + λ₁y_h(t⁽_i⁻¹⁾) + u_h(t_i),

S_v_i(t_i) = (1− λ2)S_v_i(t^′(−1)_i ) + λ₂y_v(t^′(−1)_i ) + u_v(t_i).

(3.8)

The underlying abilities a, b and parameters of dynamic effects γ₁, γ₂ can be ﬁxed unknown parameters or random parameters. From frequentist viewpoint, if the sample size is large enough, consistency of the estimation guarantees the estimated parameters will converge to the true parameters. Form Bayesian viewpoint, by assuming the parameters follow some prior distributions, it can reduce the number of parameters to be estimated, which is an advantage when the sample size is small. We cover above two viewpoints by considering the following 4 different cases:

Case 1 . a, b and γ₁, γ₂ are ﬁxed;

Case 2 . a, b are ﬁxed and γ1, γ2are random;

Case 3 . a, b are random and γ₁, γ₂are ﬁxed; and

(15)

Case 4 . a, b and γ₁, γ₂ are random.

In each of above cases, if a and b are random, we assume that ak’s î.i.d._e N (µa, σ²_a) and b_k’sî.i.d._e N (µ_b, σ_b²). If γ₁and γ₂are random, we assume that γ_1k’s î.i.d._e N (µ₁, σ₁²) and γ2k’s î.i.d._e N (µ2, σ₂²) and all the random parameters are mutually independent.

If there is any random parameters, the normality assumption ε_i’s ^i.i.d._e N (0, σ²₀) is required due to concerns about estimation.

3.2 Regression Model Formulation

All proposed models can be rewritten as the following form:

Y = Xβ + ε, (3.9)

where

β =







a b γ₁ γ₂ u_h u_v







with u_h =







u_h(t₁) ... uh(tm)







, and u_v =







u_v(t₁) ... uv(tm)







.

Model M1 is chosen as an example to show how to obtain the regression formula- tion. In an arbitrary match i (or at time ti), the home ability and visiting ability of team h_i and team v_iunder model M1 can be rewritten as the follows respectively:









a_h_i(t_i) = (1− λ1)^K¹a_h_i+^hλ₁^P^K_j=0¹⁻¹(1− λ1)^jy_h(t⁽_i^−j−1))ⁱγ_1h_i, b_v_i(t_i) = (1− λ2)^K²b_v_i+^hλ₂^P^K_j=0²⁻¹(1− λ2)^jy_v(t^{′(−j−1)}_i )ⁱγ_2v_i,

(3.10)

(16)

M2R, M2R-i and M2R-ii. If there are random effects in the model, mixed effects model formulation has more advantages in estimation.

Let β_R denotes the random parameters and β_F denotes the ﬁxed parameters, i.e.,

Case 2. β_R =



γ₁− µ11T

γ₂− µ21T



, β_F =







a b µ₁ µ₂







;

Case 3. β_R=



a− µa1T

b− µb1T



, β_F =







µa

µ_b γ1

γ₂







; and

Case 4. βR=







a− µa1T

b− µb1T

γ₁− µ11T

γ₂− µ21T







, β_F =







µ_a µ_b µ₁ µ₂







.

Due to the problem of identiﬁability, we assume that µb = 0 in model M2, M2-i, M2-ii, M2R, M2R-i and M2R-ii. By similar procedures, one can derive the mixed effects model formulation:

Y = X_Rβ_R+ X_Fβ_F + ε. (3.11)

(17)

Chapter 4 Estimation and Model Selection

To write down the estimation approach explicitly, we ﬁrst deﬁne some notations.

Let X_γ₁ and X_γ₂ be the covariate matrix of γ₁ and γ₂ respectively. Let γ = (γ₁^T, γ₂^T)^T, Xγ = (X_γ₁, X_γ₂), and Y^∗ = Y − XFβ_F.

4.1 Estimation

By the model formulation (3.11):

Y = X_Rβ_R+ X_Fβ_F + ε, XFβ_F + ε^∗, (4.1) where ε^∗ = ε + XRβRand ε^∗ ∼ Nm(0, σ²₀Im+ XRV ar(βR)X_R^T). The estimation approach of β_F and λ is proposed to minimize the sum of squares

SS(λ, β_F) = (Y − XFβ_F)^T(Y − XFβ_F). (4.2) Least square method can be applied in this minimization.

minλ,β SS(λ, β_F) = min

λ min

β SS(λ, β_F) = min

λ (Y − XFβˆ_F)^T(Y − XFβˆ_F),

(18)

4. Estimation and Model Selection 11

where Σ = σ₀²I_m+ σ₁²X_γ₁X_γ^T₁ + σ₂²X_γ₂X_γ^T₂ and the observed log-likelihood function is

−m

2 log(2π)−m

2 log σ²₀−1

2log|Im+ σ²₁

σ²₀X_γ₁X_γ^T

1 + σ²₂

σ²₀X_γ₂X_γ^T

2|

− 1

2σ₀²(Y − XFβ_F)^T(I_m+ σ²₁

σ²₀X_γ₁X_γ^T₁ +σ₂²

σ₀²X_γ₂X_γ^T₂)⁻¹(Y − XFβ_F).

(4.4)

The estimation approach for (σ₀², σ²₁, σ₂²) is proposed to maximize the observed log-likelihood function, i.e.,

(ˆσ₀², ˆσ²₁, ˆσ₂²) = argmax

(σ²₀,σ₁²,σ²₂)

l(σ²₀, σ₁², σ₂²|X, Y ).

To estimate the predictors ˆγ₁ and ˆγ₂ of γ₁ and γ₂, it is proposed to maximize the posterior log-likelihood. That is,



ˆγ₁ ˆ γ₂



= argmax

(γ1,γ2)

log L(γ₁, γ₂|ˆσ²₀, ˆσ₁², ˆσ₂², X, Y ), (4.5)

where

log L(γ₁, γ₂|σ0², σ₁², σ²₂, X, Y )∝ log fY(y|σ0², γ₁, γ₂)π(γ₁, γ₂|σ²1, σ₂²)

= −m

2 log(2π)−m

2 log σ₀²− 1

2σ₀²|(Y − XFβ_F − Xγ1γ₁+ X_γ₂γ₂)|² (−m) log(2π)−m

2(log σ²₁ + log σ²₂)− 1 2( 1

σ₁²|γ1|²+ 1 σ₂²|γ1|²)

∝ −(Y^∗− Xγγ)^T(Y^∗− Xγγ)− γ^TW γ,

and

W =







σ₀²

σ₁²I_T 0 0 σ₀²

σ₂²I_T





.

The posterior log-likelihood is maximized when



γˆ₁ ˆ γ₂



= (X_γ^TX_γ+ W )⁻¹X_γ^TY^∗ = E[γ|σ²1, σ₂², σ_v², X, Y ]. (4.6)

(19)

4.2 Model Selection

With all the models and cases, how to select the best model is an important issue. Good predictions of outcomes and difference in scores are important for practitioners. In the following paragraphs, we ﬁrst introduce two measures of predictive ability to measure the performance of a model on the prediction purpose.

Then several criteria are proposed based on goodness-of-ﬁt and predictive ability measured by regular season data.

The measures of predictive ability are proportion of correct predictions and prediction mean squared error, hereinafter denoted by PCP and PMSE respec- tively. Let (X⁰, Y⁰) be a future run.

PCP(M) = P (sign(Y⁰)· sign(X⁰βˆ_M) > 0) + 0.5P (X⁰βˆ_M = 0), (4.7) and

PMSE(M) = E(Y⁰− X⁰βˆ_M)², (4.8) where ˆβ_M denotes the estimate of β under modelM. To estimate the PCP and PMSE of modelM, the playoffs data are considered as future runs and the prob- ability is estimated by the empirical distribution of playoffs data.

The Bayesian Information Criterion in [7] is a common approach in model selection. With the observed log-likelihood function (4.4), the BIC value of a modelM is derived by

BIC(M) = −2 log L(M) + pMlog m, (4.9) where p_M denotes the number of parameters in the modelM. [7] suggested to choose the model with smallest BIC value.

(20)

by applying the estimated parameters to the testing data. To be more explicit, we introduce the following notations:

Y =



Y^tr Y^te



, X =



X^tr X^te



,

where (X^tr, Y^tr) and (X^te, Y^te) denote the training data and testing data respectively. Let ˆβ_M^tr be the estimate of β by the training data under modelM. The PCP and PMSE of modelM are estimated by

PCP(d M) = 1 S

XS i=1

h

I(sign(Y_i^te)· sign(Xi^teβˆ_M^tr) > 0) + 0.5I(X_i^teβˆ_M^tr = 0)

i

, (4.10) and

PMSE(\ M) = 1 S

XS i=1

(Y_i^te− X_i^teβˆ_M^tr)², (4.11) where S is the number of matches in testing data. The modelMPCP andMPMSE

with highestPCP and smallest \^d PMSE are chosen, i.e., MPCP = argmax

M

PCP(d M) (4.12)

and

MPMSE = argmin

M PMSE(\ M). (4.13)

(21)

Chapter 5 An Application to National Basketball Association

The proposed models are applied to 2009–2010 season to 2018–2019 season of National Basketball Association. The data are available via an API provided in [8].

The data consist of the index of every match sorting by calendar time, the home teams and visiting teams of every match, and scores of the home teams and away teams in every match. The regular season is used to ﬁt the proposed models and the playoffs data are treated as future runs to estimate the proportion of correct predictions and prediction mean squared error. Model M1 and M2 with λ ∈ Λ are also considered in this section, where Λ = {k(0.1, 0.1)^T : k = 0, . . . , 9}.

Hereinafter we denoteM· = {M · including the cases λ ∈ Λ}, for · = 1, 2, and MT =M1∪ M2.

We investigate the dynamic effects of abilities in model M1R, M2R, M2R-i and M2R-ii with ﬁrst-order random walk. The means of the ratios ˆσ_h²/ˆσ²₀ and ˆ

σ²/ˆσ² over ten seasons are shown in Table6.3. In these NBA data, the variances

(22)

5. An Application to National Basketball Association 15

The proportion of correct predictions of dynamic Bradley-Terry model (2.6) proposed in [6] (hereinafter referred to DBT model) and its modiﬁcation which replaces the outcome with score difference (hereinafter referred to DBTS model) are listed in table6.1compared to the proposed models. DBT model produces predictions that the home team will win for every match. DBTS model has lower PCP than the proposed models. One step of estimating the parameters is to minimize the sum of squares function (4.2). Figure6.2and6.3show that the non-convexity of sum of squares function under model M1 and M2 makes the minimization dif- ﬁcult. Thus we do not recommend DBT model and DBTS model.

Table6.4shows the mean of BIC values over ten seasons under different models and cases. BIC suggests that model M2-ii in the case that abilities are random should be selected, which means that there is no dynamic effect. The conclusion also holds if we look at the BIC values season by season.

We compare the PCP of MPCP, MPMSE and model MBIC (without dynamic effect) for ten seasons to see if there is an evidence of dynamic effect. In most of the seasons except 2015-2016 season, the models with dynamic effects can have higher PCP than model MBIC. This can be an evidence that there are dynamic effects in most of the seasons except 2015-2016 season. Moreover, in 2010–2011 season to 2012-2013 season we successfully select the models with dynamic effect with higher PCP than model MBIC. Over the ten seasons, the PCP of selected models are comparable to the highest PCP of all models except 2013-2014 season and 2017-2018 season.

Table 6.4 shows the mean of PCP’s over ten seasons under different models and cases. We can see that most of the models are comparable except model M1 under case 1, M1 under case 3, and model M2 under case 2.

(23)

Chapter 6 Conclusion and Discussion

In the application to NBA, table6.2shows that the fixed dynamic scheme inspired by [6] performs poorly in the sense of prediction with PCP 0.586 and DBT model tends to produce meaningless predictions that the home team always wins. Es- timation of the weight λ is also difficult (see Figure6.2) and such estimation is based on the sense of goodness-of-fit, which may not correspond to the predictive ability. The fact that u_h and u_v are inapparent shows that the first-order random walk assumption on the home ability and visiting ability (cf. [4] and [5]) has no contribution to the dynamic effects. To sum up, most of the existing approaches to estimate the dynamic effects are based on the goodness-of-fit with some specific models. By such approaches, either there is no evidence of dynamic effects or the dynamic effects may produce poor predictions.

In the aspect of regression, λ should play the role as designed covariate. As- signing given values to λ avoids the difﬁculties in estimation and the proposed model selection criteria can suggest the best λ in the sense of predictive ability.

In the applications to NBA, the proposed model selection criteria can select the

(24)

6. Conclusion and Discussion 17

better. In the applications to NBA, the format of playoffs and regular season are different, which may decrease the PCP and increase PMSE for our models. This problem is still needed to be solved in future research.

(25)

Reference

[1] Stefani, R. T. (1977). Football and basketball predictions using least squares.

IEEE Transactions on Systems, Man, and Cybernetics, 7(2), 117-121.

[2] Stefani, R. T. (1980). Improved least squares football, basketball, and soc- cer predictions. IEEE Transactions on Systems, Man, and Cybernetics, 10(2), 116-123.

[3] Clarke, S. R., and Norman, J. M. (1995). Home ground advantage of individ- ual clubs in english soccer. The Statistician, 44(4), 509.

[4] Harville, D. (1977). The use of linear-model methodology to rate high school or college football teams. Journal of the American Statistical Association, 72(358), 278-289.

[5] Fahrmeir, L., and Tutz, G. (1994). Dynamic stochastic models for time- dependent ordered paired comparison systems. Journal of the American Sta- tistical Association, 89(428), 1438-1449.

[6] Cattelan, M., Varin, C., and Firth, D. (2012). Dynamic Bradley-Terry mod- elling of sports tournaments. Journal of the Royal Statistical Society: Series

(26)

REFERENCE 19

[8] Bresler, A. (n.d.). R’s interface to NBA data. Retrieved July 29, 2020 from http://asbcllc.com/nbastatR/

[9] Thurstone, L. L. (1927). A law of comparative judgment. Psychological Re- view, 34(4), 273286.

[10] Zermelo, E. (1929). Die berechnung der turnier-Ergebnisse als ein maxi- mumproblem der wahrscheinlichkeitsrechnung. Math Z 29, 436460.

[11] Bradley, R. A., and; Terry, M. E. (1952). Rank analysis of incomplete block designs: I. the method of paired comparisons. Biometrika, 39(3/4), 324.

[12] Harville, D. (1976). Extension of the Gauss-Markov Theorem to Include the Estimation of Random Effects. The Annals of Statistics, 4(2), 384-395.

[13] Batchelder, W. H., Bershad, N. J., and Simpson, R. S. (1992). Dynamic paired-comparison scaling. Journal of Mathematical Psychology, 36(2), 185- 212.

[14] Harville, D. A. (2003). The selection or seeding of college basketball or football teams for postseason competition. Journal of the American Statistical Association, 98(461), 17-27.

[15] Wang J. (2010). Consistent selection of the number of clusters via crossvali- dation. Biometrika, 97(4), 893904.

[16] Lim, A., Chiang, C. T., and Teng, J. C. (2018). Estimating robot strengths with application to selection of alliance members in FIRST robotics competi- tions. arXiv preprint arXiv:1810.05763.

(27)

Figure 6.1: Proportion of correct predictions ofM·PCP (red line), M·PMSE (blue line), andM_·BIC (green line). Black lines are the highest and lowest proportions of correct predictions amongM·for· = 1, 2, and T .

(a) M1

(b) M2

(28)

Table 6.1: The estimated proportion of correct predictions from the Dynamic Bradley-Terry models (cf. [6]) and the selected proposed models on playoffs data.

PPPModelPPPPPPPPPPP

Season

09–10 10–11 11–12 12–13 13–14

DBT 0.6707 0.6667 0.6786 0.6353 0.5618

DBTS 0.5000 0.5926 0.5000 0.5647 0.5618 M1PCP 0.6402 0.6420 0.7143 0.6588 0.5730

M1PMSE 0.6341 0.6420 0.6905 0.6941 0.5730

M1BIC 0.6463 0.6420 0.6667 0.6588 0.5955 M2PCP 0.6402 0.6543 0.7143 0.6824 0.5506

M2PMSE 0.6463 0.6667 0.7143 0.7059 0.6292

M2BIC 0.6463 0.6420 0.6667 0.6588 0.5955

MT PCP 0.6382 0.6543 0.7143 0.6588 0.5506

MT PMSE 0.6341 0.6420 0.6667 0.6941 0.5843

MT BIC 0.6463 0.6420 0.6667 0.6588 0.5955

PPPModelPPPPPPPPPPP

Season

14–15 15–16 16–17 17–18 18–19

DBT 0.5926 0.6744 0.5696 0.7073 0.5610

DBTS 0.5926 0.6744 0.5696 0.7073 0.5610 M1PCP 0.6296 0.7229 0.6835 0.6341 0.6585

M1PMSE 0.6296 0.7093 0.6709 0.6098 0.6951

M1BIC 0.6296 0.7209 0.6709 0.6098 0.6707 M2PCP 0.6296 0.7171 0.6835 0.5976 0.6951

M2PMSE 0.6173 0.7093 0.6709 0.5610 0.7073

M2BIC 0.6296 0.7209 0.6709 0.6098 0.6707

MT PCP 0.6296 0.7200 0.6835 0.6341 0.6768

MT PMSE 0.6049 0.6628 0.6582 0.5732 0.6341

MT BIC 0.6296 0.7209 0.6709 0.6098 0.6707

(29)

Table 6.2: Mean of proportion of correct predictions of Dynamic Bradley-Terry model (cf. [6]) and proposed models over the ten seasons.

DBT DBTS M1PCP M1PMSE M1BIC

0.6318 0.5824 0.6557 0.6548 0.6511

M2PCP M2PMSE M2BIC MPCP MPMSE MBIC

0.6565 0.6628 0.6511 0.6560 0.6596 0.6511

Figure 6.2: Selected plots of minimized sum of squares value to λ

Figure 6.3: Selected plots of minimized sum of squares value to λ

(30)

Table 6.3: Ratio of ˆσ²_hto ˆσ²₀ and ˆσ_h² to ˆσ₀²(rounded to 3 decimal places) Ratio of ˆσ²_hto ˆσ²₀

Model M1R M2R M2R-i M2R-ii Case 1 0.013 0.000 0.000 0.000 Case 2 0.018 0.000 0.000

Case 3 0.769 0.000 0.000 0.002 Case 4 0.011 0.002 0.002

Ratio of ˆσ_v²to ˆσ²₀

Model M1R M2R M2R-i M2R-ii Case 1 0.000 0.000 0.000 0.000 Case 2 0.022 0.000 0.000

Case 3 0.000 0.000 0.000 0.001 Case 4 0.020 0.001 0.001

(31)

Table 6.4: Mean of PCP and BIC over ten seasons PCP

Model M1 M2 M2-i M2-ii

Case 1 0.586 0.631 0.654 0.661

Case 2 0.655 0.661 0.657

Case 3 0.632 0.639 0.664 0.646

Case 4 0.652 0.652 0.655

BIC

Model M1 M2 M2-i M2-ii

Case 1 10105.43 10109.69 10125.14 9765.25 Case 2 9854.61 9838.85 9807.57

Case 3 9838.49 9792.86 9793.35 9540.60 Case 4 9569.06 9559.94 9559.02

運動賽事期間隊伍能力的變化分析

國立臺灣大學理學院應用數學科學研究所 碩士論文

Institute of Applied Mathematical Sciences College of Science

National Taiwan University Master Thesis

運動賽事期間隊伍能力的變化分析

Analyzing Dynamic Abilities of Teams in Sports Events

管敏仁 Min-Ren Guan

指導教授：江金倉 博士

Advisor: Chin-Tsang Chiang, Ph.D.

國立臺灣大學碩士學位論文

口試委員會審定書

-� 動賽事期間隊伍能力的變化分析

nalyzing Dynamic Abilities of Teams in Sports Event

5L仝定 （簽名）

4 勺巨

、h�., ,,

中文摘要

Abstract

Contents

List of Figures

List of Tables

Chapter 1 Introduction

Chapter 2

Existing Paired Comparison Models

Chapter 3

Proposed Models for Dynamic Effects

3.1 Background

3.2 Regression Model Formulation

Chapter 4

Estimation and Model Selection

4.1 Estimation

4.2 Model Selection

Chapter 5

An Application to National Basketball Association

Chapter 6

Conclusion and Discussion

Reference

國立臺灣大學理學院應用數學科學研究所碩士論文

指導教授：江金倉博士

5L仝定（簽名）