CHAPTER FOUR RESULTS AND DISCUSSION
In this chapter, the results of statistical analyses are presented and discussed in the first section. The three phases of data computation comprise analyses of reliability and validity, test equating and regression analyses. For reliability analysis, Cronbach’s coefficients were performed on the DST and the RCT. For validity, the analyses tackled the RCT, insomuch as it was experimentally designed to meet purposes of the exploratory study. Compared with the DST, a standardized, high-stakes test, the RCT demanded a meticulous examination. Test equating and regression analyses were performed on both tests, so as to shed more light on the interrelationship between these gap-filling tests under the hypothesized construct of cohesion. In the second section, discussion and explanation of research findings are emphasized on the
establishment of reliability and validity, test inequivalence and the predictive power of the RCT subtests.
Overall Results
In the section, the general descriptive statistics of individual test measures are presented. As shown in Table 4 to 7, except the 11th grade, the means of the RCT are slightly higher than those of the DST across the remaining grade levels. The
differences were essentially small: .55, .87, .33, and 1.08 for Grades 10–12, Grade 10, Grade 11 and Grade 12, respectively
1. Overall, the RCT was shown to be an easier test format than the DST.
8 The results of statistical significance of differences in means are presented in the section of test
Table 4. Descriptive Statistics for Grades 10–12
RCT DST
n
k M SD α SEM
354 30 21.09 4.84 .80 .257
354 30 20.54 5.45 .84 .290
Note: k represents the total number of test items.
Table 5. Descriptive Statistics for Grade 10
RCT DST
n
k M SD α SEM
124 30 19.42 4.79 .80 .43
124 30 18.55 5.52 .83 .50
Table 6. Descriptive Statistics for Grade 11
RCT DST
n
k M SD α SEM
115 30 20.50 5.02 .81 .47
115 30 20.83 5.40 .84 .50
Table 7. Descriptive Statistics for Grade 12
RCT DST
n
k M SD α SEM
115 30 23.48 3.65 .70 .34
115 30 22.40 4.70 .80
.44
Table 8 presents the mean scores of the three RCT subtests. As can be seen in the table, among the three, Cloze B (lexical cohesion) tends to be the most difficult subtest, while Cloze A (reference) appears the easiest. This pattern remained across different grade levels except Grade 12, in which Cloze C (conjunction) was shown to be the easiest subtest.
Table 8. Mean Scores of the RCT Subtests
Cloze A Cloze B Cloze C
Grade 10
Grade 11 Grade 12 Grades 10–12
7.27 7.32 7.97 7.51
5.61 6.11 7.43 6.37
6.54 7.07 8.06 7.21
To show the difference of means between the subtests, a repeated-measure ANOVA and multiple comparisons were performed. The results are shown in Table 9.
The comparisons exhibited statistically significant differences of means between each
pair of the three subtests. Therefore, based on differences in mean scores, the
Table 9. A Repeated-Measure ANOVA and Multiple Comparisons for the RCT Subtests
Mean Square df F p Group 124.73 2 62.67*** .000 Residual 1.99 706
Note. ***p .001
Cloze A Cloze B Cloze C
Cloze A – 112.55***
Cloze B – Cloze C
9.12**
60.42***
–
Note. ***p .001, **p .01
Concerning the overall data, Table 10 presents the Pearson product-moment correlation coefficients between the test components. With the alpha level preset at p
< .001, all of the tests were found to correlate with each other significantly. The three RCT subtests were found to correlate at a low-intermediate level, suggesting that the three may tap separate constructs of cohesion. This finding may lend further support to results previously shown that the three subtests were statistically different tests (see Table 8 and Table 9).
The terms grades correlated most highly with the RCT, while the coefficients
with the remaining tests were lower than .50. The result should be interpreted with
cautions. As previously stated, the term grades were not assigned to each participant
based on the same measure. Therefore, Table 11 to 13 may provide a clearer trajectory
about the correlation between the term grades and the other tests within specific grade
levels. In the tables, the term grades were still shown to correlate most highly with the RCT. In actuality, the term grades comprised various test components (e.g., monthly exams, listening, writing, grammar, etc.) at the school. Higher correlations between the term grades and the RCT may imply the multi-dimensional, integrative nature of the RCT.
Table 10. An Intercorrelation Matrix for the Tests (Grades 10 –12)
1 2 3 4 5 6
1. Cloze A –
2. Cloze B 3. Cloze C 4. RCT 5. DST
6. Term Grades
.51 –
.44 .55 –
.77 .87 .81 –
.51 .59 .51 .66 –
.49 .44 .39 .53 .45 –
Note. All correlations are significant at ***p < .001.
Table 11. An Intercorrelation Matrix for the Tests (Grade 10)
1 2 3 4 5 6
1. Cloze A –
2. Cloze B 3. Cloze C 4. RCT 5. DST
6. Term Grades
.47 –
.31 .48 –
.72 .86 .76 –
.50 .65 .52 .72 –
.49 .66 .48 .70 .62 –
Note. All correlations are significant at ***p < .001.
Table 12. An Intercorrelation Matrix for the Tests (Grade 11)
1 2 3 4 5 6
1. Cloze A –
2. Cloze B 3. Cloze C 4. RCT 5. DST
6. Term Grades
.53 –
.54 .53 –
.82 .84 .83 –
.51 .53 .50 .62 –
.57 .54 .53 .66 .49 –
Note. All correlations are significant at ***p < .001.
Table 13. An Intercorrelation Matrix for the Tests (Grade 12)
1 2 3 4 5 6
1. Cloze A –
2. Cloze B 3. Cloze C 4. RCT 5. DST
6. Term Grades
.44 –
.36 .45 –
.73 .85 .76 –
.45 .43 .31 .51 –
.54 .42 .41 .57 .54 –
Note. All correlations are significant at ***p < .001.
Analyses of Reliability and Validity
In the subsection, the results of the internal reliability coefficients (Cronbach’s alpha) for the DST and the RCT are presented. An analysis of validity is also reported.
Analyses of Reliability of the Discourse Structure Test and the Rational Cloze Test Table 4 shows Cronbach’s alpha coefficients of the DST and RCT. Considering the whole subject pool, the DST was more reliable than the RCT, with a slight
difference of .04 (.84 – .80). In Manning’s (1986) study, the multiple-choice cloze test
registered a reliability coefficient of .80. The obtained coefficient for the
multiple-choice RCT in the present study may be considered satisfactory. Table 5 to 7 present Cronbach’s alpha coefficients of both tests at individual grade levels. For Grades 10 and 11, the reliability coefficient of the DST remained slightly higher than that of the RCT, with a difference of .03 (.83 – .80). For Grade 12, however, a larger difference of .10 (.80 – .70) was identified.
To briefly sum up, concerning the data as a whole and at specific levels, the spread of Cronbach’s alpha coefficients was shown to be more consistent for the DST, ranging from .80 to .84, while a wider distribution was observed for the RCT,
spanning from .70 to .81. Administered to senior high school students of all grade levels, the DST could be considered a more stable, reliable test format.
Analyses of Validity of the Rational Cloze Test
Evidence for validity of the RCT was garnered in light of content validity and concurrent validity. For content validity, a qualitative approach
2was performed by concerned parties in the field of language studies and in-service English teachers at the school where the present study was conducted. Concurrent, criterion-related validity was sought for by the correlation coefficient between the DST and the RCT, both of which were hypothesized to tap the construct of cohesion.
Analysis of content validity. Through collective judgments from three graduate students (including the researcher) and three experienced teachers whose students participated in the study, a consensus was reached that the RCT measured the test-takers’ knowledge of cohesion in discourse. Based on the a priori discourse analysis, subcategories of the three major types of cohesion were as proportionally sampled as possible, which would to some extent be representative of the
hypothesized constructs of cohesion.
9The rationale for not using a quantitative approach (i.e., factor analysis) to the analysis of construct
Analysis of concurrent validity. Evidence for concurrent validity was supported by the intercorrelation between the DST and the RCT, inasmuch as both were
hypothesized to measure knowledge of intersentential cohesion and were administered within a short period of time (i.e., within two weeks). In addition, because the DST has been an established, high-stakes test incorporated in the DRET, it may serve as a qualified external criterion. A moderate-high correlation of .66 was shown for the two gap-filling tests (see Table 10). Therefore, the presence of
concurrent validity of the RCT as an exploratory test format could be claimed.
To briefly sum up, validity of the RCT could be maintained to a satisfactory extent based on analyses of content validity and concurrent validity. However, more evidence for construct validity will be absolutely necessitated to broaden the horizon of the cloze research for the RCT to be a measure of cohesion.
Test Equating
The equating of the DST and the RCT was performed by classical equating methods. The methods were applied to examine the equivalence of different test forms in means, variances, and inter-form covariance. The results are presented as follows.
Testing the Equivalence of Means
As a whole, the mean of the RCT was slightly higher than that of the DST, with a difference of .55 (20.54 – 21.09). A paired-sample t-test was used to confirm
statistical significance of the difference. The result indicated that a difference of .55, though small, remained statistically significant with the alpha level preset at p < .05
t
= 2.43, df = 353, p = .02). Therefore, in terms of means, the DST and the RCT were not equivalent test forms.
While the global pattern was shown that both tests were not equivalent, local
differences were observed. For individual grade levels, the mean scores of the DST
and the RCT were fairly close. In Grade 10 and Grade 12, the means of both tests
were still shown to be significantly different with α preset at p < .05, t (123) = 2.47 and t (114) = 2.73, respectively. However, an interesting pattern was identified in Grade 11. The difference in means was shown to be statistically non-significant, t (114) = -.76. In other words, at this level the DST and the RCT were found to be equivalent in light of the equivalence of means.
Testing the Equivalence of Variances
In the second phase of classical equating methods, the equivalence of variance distributions was examined. As can be seen in Table 14, for the whole sample the difference of variances between the DST and the RCT is 6.28 (29.69 – 23.41). The homogeneity of variances was examined by the F-max test based on the equation: t =
−
−
− 2 4 1
2 2 2 2 1
2 2 2 1
n r s
s s
s , df = n – 2
where S
12and S
22refer to sample variances and r
2refers to the square of the sample correlation coefficients. With the alpha level preset at p < .05, the difference of
variances was found to be statistically significant. The two-tailed t-ratio of -2.98 (df = 352) far exceeded the critical t-ratio of -1.96, rejecting the null hypothesis that there was no statistically significant difference in variances.
However, still an interesting pattern was identified in Grade 11. As shown in
Table 14, except Grade 11, significant differences in the spread of variances between
both tests across grade levels were confirmed by the F-max tests. On the contrary, for
Grade 11 the result showed statistical non-significance of the difference in variances,
with the t-ratio of -0.98 anchoring within the critical t-ratio of -1.96.
Table 14. Testing the Equivalence of Variance
Variance r t
RCT DST
Grade 10
Grade 11 Grade 12 Grades 10–12
22.95 25.23 13.36 23.41
30.46 29.15 22.05 29.69
.72 .62 .51 .66
-2.26*
-0.98 -3.12*
-2.98*
Note. t-distribution is significant at *p < .05
Testing the Equivalence of Inter-Form Covariance
After the examination of equivalence in means and variances, the results of analyses of inter-form covariance, the final stage of classical equating methods, is presented in this subsection.
First of all, a significant correlation between the DST and the RCT required confirmation. As previously shown in Table 10, the correlation coefficient between the tests was found to be .66 for Grades 10–12, which was statistically significantly at p < .001. Correlations of the DST and the RCT with the term grades were found
statistically significant at .45 and .53, respectively. Only when significance of correlations were confirmed could the equating of inter-form covariance be legitimately performed.
Evidence for equivalence in inter-form covariance would show the difference in the correlation between the RCT and the term grades (r = .53) and the correlation between the DST and the term grades (r = .45). Hotelling’s (1940) t-test was utilized, written in the following equation:
t = ( ) ( ) ( )
( r r r r r r r )
r
r
xy xz yz xy xz yz xyxz yz
n
2 1
2
1 3
2 2
2
− − +
−
+
− − df = n – 3
A difference of .08 (.53 – .45) was found to reach the significant level at p < .05, with a t-ratio of -2.15. All subject pool considered, the result suggests that the DST and the RCT were not statistically equivalent tests, inasmuch as equivalence in inter-form covariance failed to be satisfied.
The inquivalence for both tests in inter-form covariance may be examined solely in terms of the correlation coefficient. That is, the correlation between the DST and the RCT may shed some light on the likelihood of equivalence in inter-form
covariance without incorporating the term grades into data computation. In Beglar and Hunt’s (1999) study, two revised test forms of the 2000 Word Level Test (Nation, 1990) were shown to be equivalent by classical equating methods. These two forms correlated at a high of r = .90. In the same study, two revised test forms of the University Word Level Test (Nation, 1990) were also found to be equivalent, highly correlated at r = .84. Compared to such high correlations, the coefficient of .66 for the DST and the RCT was essentially not as high. Seen from this perspective, the failure in inter-form covariance could be explained.
Although the equivalence of inter-form covariance was shown absent for the whole subject pool, some local patterns were observed in Grade 10 and Grade 12.
Table 15 contains individual correlation coefficients between the DST, the RCT and the term grades at each grade level. While significant differences in inter-form covariance remained for Grade 11 and Grades 10 – 12, non-significance was
confirmed for Grade 10 and Grade 12, as the t-ratios lay within the critical t of 1.96.
Statistically speaking, for Grade 10 and Grade 12 the DST and the RCT were
equivalent in light of inter-form covariance.
Table 15. Testing the Equivalence of Inter-Form Covariance
r
xyr
xzr
yzHotelling’s t
Grade 10 Grade 11 Grade 12 Grades 10–12
.72 .62 .51 .66
.70 .66 .57 .53
.62 .49 .54 .45
-1.75 -2.76*
-0.36 -2.15*
Note.
1. t-distribution is significant at *p < .05
2. rxy represents the correlation coefficient of the RCT (x) and the DST (y).
3. rxz represents the correlation coefficient of the RCT (x) and the Term Grade (z).
4. ryz represents the correlation coefficient of the DST (y) and the Term Grade (z).
5. Correlation is significant at the .01 level (2-tailed).
Regression Analyses
In this subsection, the results of regression analyses are presented, following the order: a) linearity of the regression model; b) a simple linear regression analysis; and c) a linear multiple regression analysis. The extent to which the RCT as a whole and its major subtests, i.e., Cloze A (reference), Cloze B (lexical cohesion) and Cloze C (conjunction), exerted influence on the prediction of the DST, are exemplified in detail.
Linearity of a Regression Model
To confirm the linearity of a regression model, patterns of regression and
residuals were first examined with a scatterplot and a Probability-Probability (P-P)
plot. Figure 1 shows a scatterplot for the RCT on the X axis and the DST on and the Y
axis. As the plot shows, a positive relationship between both tests can be revealed and
under the least squares principle a linear regression line can be visualized as serving
the best-fitting model for the regression of the DST on the RCT.
Figure 1. A Scatterplot for the RCT and the DST (Grades 10 – 12)
After the initial examination of the regression of the DST, the distribution of residuals was further sought for by means of a P-P plot. The P-P plot can be
performed to test the assumption of normality: the sampling distribution of residuals against the hypothesized, expected normal distribution (Cook & Weisberg, 1994, p.
209). First, the observed residuals were reordered and ranked from the smallest to largest as r
i(i = 1, 2, 3…, n). The residuals were then standardized and transformed into probability values. The ordinal values of r
i’s were also converted into probability values as percentile ranks (p), based on the assumption of normal distribution. With slight differences, a variety of functions for p have been proposed as follows (Blom, 1958; Bowerman & O’Connel, 1990; Montgomery & Peck, 1982):
8 1 3
+
= − n
i
p or
1 3
1 3
+
= − n
p i
or
n
p i 2
− 1
=
If the observed standardized residuals meet the hypothesized residuals, a perfect normal distribution can be conceptualized as a 45-degree straight line, along which the observed and the expected residuals fall. As shown in Figure 2, the standardized residuals of the DST on the RCT generally cluster closely around the perfect diagonal line, a 45-degree straight line with a slope of 1. In line of the distribution of residuals,
“a distinct straight-line appearance” was approximately validated (Bowerman &
O’Connel, 1990, p. 247). This suggests that a linear regression model, instead of a curvilinear one, could be claimed as the best-fitting equation accounting for the RCT as a predictor and the DST as a criterion.
Figure 2. A Normal P-P Plot
Simple Linear Regression Analysis
Holding the RCT as the sole predictor, a simple linear regression analysis was
first computed for the constant and regression coefficient. As shown in Table 16, a
simple regression equation can be derived and written as Y = 4.84 + .74 X where Y
represents the predicted score or regression of the DST, while X refers to the RCT
score. For instance, if a student has a score of 25 on the RCT, his or her DST is
predicted to be around 23 (4.84 + .74*25 = 23.34). The equation can be expressed in terms of the standardized beta coefficient or “beta weight” (Younger, 1985, p. 164): Y
= .66X. This means that an increase of one standard deviation in X (i.e., the RCT) is predicted to cause an increase of .66 standard deviation above the mean in Y (i.e., the DST). On average, the simple regression equation, either expressed in raw or
standardized scores, would serve as the best-fitting line under the method of least squares, which minimizes the residual sum of squares.
Table 16. Regression Coefficients for the RCT as a Predictor (Grades 10–12) Model Unstandardized Stadndardized t Sig.
Coefficients (B) Coefficients (Beta)
Constant
RCT
4.84
.74 .66
4.96 16.53
.000*
.000*
Note. t-distribution is significant at *p < .05
Linear Multiple Regression Analysis
After considering the RCT as a sole predictor, patterns of Cloze A, B and C in predicting performance on the DST were analyzed. Because the three were entered as individual predictors, instead of a simple one-predictor regression model, a
multiple-predictor model was applied. Table 17 shows the results. Adopting the standardized beta coefficients β ), the equation can be written as Y = .23X
1+ .36X
2+ .22 X
3where Y represents the standardized score of the DST and X
1, X
2and X
3refer to the standardized scores of Cloze A, B, and C, respectively. With the other
predictors held constant, Cloze B (lexical cohesion) was loaded with the highest beta
with Cloze A ( β = .23) and C ( β = .22), respectively. In other words, of the three predictors, Cloze B was the most explanatory variable, imposing stronger predictive power and influence on the regression of the DST.
Table 17. Regression Coefficients for the RCT Subtests as Predictors (Grades 10–12) Variable Unstandardized Stadndardized t Sig.
Coefficients (B) Coefficients (Beta)
Constant 5.12
Cloze A .75 .23 4.85 .000*
Cloze B .85 .36 6.94 .000*
Cloze C .61 .22 4.41 .000*
Note. t-distribution is significant at *p < .05
After the computation of beta weights for each subtest, a linear multiple
regression model was performed for R-square (R
2), the coefficient of determination.
Table 18 exhibits the results using the Enter method by simultaneously submitting the predictors to the model. As shown in the table, Cloze C, the predictor loaded with the lowest beta coefficient, was first fitted into the model followed by Cloze A, the predictor loaded with the second highest beta coefficient. Cloze B was the last to be added. Based on the adjusted R
2, it was shown that .26 of the shared variability in the performance on the DST was accounted for by Cloze C alone. To increase the
accuracy of prediction, Cloze A was then added to the first model. It is shown in the same table that adding Cloze A resulted in an increase of R
2by .10 (.36 – .26), which was statically significantly according to F change. When Cloze B was finally entered, the aggregate adjusted R
2amounted to.43, registering an increase of .07 (.43 – .36).
That is, the predictive power of Cloze B could be construed as contributing .07 of the
explained variance over and above Cloze C and Cloze A combined. The adjusted
aggregate R
2of .43 indicated that the three predictors accounted for 43 percent of the shared variance of observed performance on the DST.
Table 18. A Linear Multiple Regression Analysis (Grades 10–12)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change
1 Cloze C 2 Cloze C Cloze A 3 Cloze C Cloze A Cloze B
.51 .60
.66
.26 .36
.44
.26 .36
.43
125.59 53.96
48.19
.000*
.000*
.000*
Note. *F change is significant at *p < .05
To further examine the unique predictive power of Cloze B against Cloze A and C, the hierarchical regression method was utilized to build another five fresh models.
This method involved a stepwise procedure that manipulated a designated order of predictors (see Qian, 1999, 2002). The changes in the magnitude of the coefficient of determination (R
2) indicated the predictive power of a given predictor over and above any combination of the other previously entered predictors, which served the purpose in the phase. As shown in Table 19 to 23, both Cloze A and Cloze C contributed .03 (.43 – .40) of the shared variance over and above any combinations of first two predictors. An additional variance of .03 was essentially smaller than that of .07 contributed by Cloze B. Therefore, Cloze B was confirmed to show the most
influential power for the prediction of performance on the DST. Moreover, the results
of hierarchical regression analyses indicated that despite showing approximate beta
coefficients, Cloze A and Cloze C still served as valid predictors and functioned
and above any combination of the other predictors. Such results may justify the use of the hierarchical approach.
Table 19. A Hierarchical Regression Analysis (Order: Cloze C–B–A)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change
1 Cloze C 2 Cloze C Cloze B 3 Cloze C Cloze B Cloze A
.51 .63
.66
.26 .40
.44
.26 .40
.43
125.59 80.75
23.48
.000*
.000*
.000*
Note. *F change is significant at *p < .05
Table 20. A Hierarchical Regression Analysis (Order: Cloze B–C–A)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change
1 Cloze B 2 Cloze B Cloze C 3 Cloze B Cloze C Cloze A
.59 .63
.66
.35 .40
.44
.35 .40
.43
188.85 30.24
23.48
.000*
.000*
.000*
Note. *F change is significant at *p < .05
Table 21. A Hierarchical Regression Analysis (Order: Cloze B–A–C)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change 1 Cloze B .59 .35 .35 188.85 .000*
2 Cloze B .64 .41 .40 34.43 .000*
Cloze A
3 Cloze B .66 .44 .43 19.43 .000*
Cloze A Cloze C
Note. *F change is significant at *p < .05
Table 22. A Hierarchical Regression Analysis (Order: Cloze A–B–C)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change
2 Cloze A 2 Cloze A Cloze B 3 Cloze A Cloze B Cloze C
.51 .64
.66
.26 .41
.44
.26 .40
.43
122.60 88.23
19.43
.000*
.000*
.000*
Note. *F change is significant at *p < .05
Table 23. A Hierarchical Regression Analysis (Order: Cloze A–C–B)
Model Predictor R R
2Adjusted R
2F Change Sig. F Change
1 Cloze A 2 Cloze A Cloze C 3 Cloze A Cloze C Cloze B
.51 .60
.66
.26 .36
.44
.26 .36
.43
122.60 56.51
48.19
.000*
.000*
.000*
As shown in Table 20 and Table 21, Cloze B, the most influential predictor, was fitted first into the regression models, followed by the other less powerful predictors.
This approach might trigger “a problem of fitting” predictors as Weisberg (1985, p. 51) maintains. What if the order had been reshuffled by fitting Cloze C first, followed by Cloze A and B accordingly? As Weisberg (1985) indicates: “In multiple regression, if the predictors are correlated the sign of a coefficient may change depending on the other predictors in the model” (p. 65), when the first predictor has been entered, the second will be adjusted, and so will the third. This makes sense because the three correlated significantly at .51, .44 and .55 for Cloze A and Cloze B, Cloze A and Cloze C, and Cloze B and Cloze C, respectively. Since the three predictors were fitted altogether as an aggregate in the final phase, R
2would remain the same no matter which predictor was entered first.
Entering the best predictor first was intended to test the redundancy of adding another predictor. If entering Cloze A had not significantly increased the accuracy of prediction and “added more relevant unique information” (Glass & Hopkins, 1996, p.
176), only using Cloze B would have sufficed for the model. This justified the utility of a linear multiple regression model using three predictors in lieu of utilizing a simple model with Cloze B as the sole predictor.
On top of the regression analyses for the whole data set, models for individual grades were also constructed. As Table 24 to 26 show, in Grades 10 and 11 all of the three predictors significantly explained and predicted the shared variances in the DST.
In contrast, in Grade 12 Cloze C failed to contribute significantly explanatory power to the model. In other words, for the Grade 12 data, only Cloze A and Cloze B would suffice for the prediction of performance on the DST.
Table 24. A Linear Multiple Regression Analysis (Grade 10)
Model Predictor β
R R
2Adjusted R
2F Change Sig. F Change
1 Cloze B 2 Cloze B Cloze C 3 Cloze B Cloze C Cloze A
.43 .25 .22
.65 .69
.72
.42 .48
.52
.42 .47
.51
89.43 13.51
9.44
.000*
.000*
.003*
Note. *F change significant at *p < .05
Table 25. A Linear Multiple Regression Analysis (Grade 11)
Model Predictor β R R
2Adjusted R
2F Change Sig. F Change
1 Cloze B 2 Cloze B Cloze A 3 Cloze B Cloze A Cloze C
.28 .25 .22
.53 .60
.62
.28 .36
.39 .27 .35
.37
43.74 13.40
5.61
.000*
.000*
.020*
Note. *F change significant at *p < .05
Table 26. A Linear Multiple Regression Analysis (Grade 12)
Model Predictor β
R R
2Adjusted R
2F Change Sig. F Change
1 2
3
Cloze A Cloze A Cloze B Cloze A Cloze B Cloze C
.30 .26 .09
.45 .52
.52 .20 .27.
.27 .19 .25
.25
28.02 10.51
.87
.000*
.002*
.352
Note. *F change significant at *p < .05