Mining Formative Evaluation Rules Using Web-Based Learning Portfolios for Web-Based Learning Systems

(1)

Chen, C.-M., Hong, C.-M., Chen, S.-Y. & Liu, C.-Y. (2006). Mining Formative Evaluation Rules Using Web-based Learning Portfolios for Web-based Learning Systems. Educational Technology & Society, 9 (3), 69-87.

69 ISSN 1436-4522 (online) and 1176-3647 (print). © International Forum of Educational Technology & Society (IFETS). The authors and the forum jointly retain the

copyright of the articles. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear the full citation on the first page. Copyrights for components of this work owned by others than IFETS must be honoured. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, or to redistribute to lists, requires prior

Mining Formative Evaluation Rules Using Web-based Learning Portfolios

for Web-based Learning Systems

Chih-Ming Chen

Institute of Learning Tech, National Hualien Univ of Education, Hualien 970, Taiwan, R.O.C. [email protected]

Chin-Ming Hong

Institute of Applied Electronics Technology, National Taiwan Normal University, Taipei 106, Taiwan, R.O.C. [email protected]

Shyuan-Yi Chen

Institute of Industrial Education, National Taiwan Normal University, Taipei 106, Taiwan, R.O.C. [email protected]

Chao-Yu Liu

Institute of Learning Technology, National Hualien University of Education, Hualien 970, Taiwan, R.O.C. [email protected]

ABSTRACT

Learning performance assessment aims to evaluate what knowledge learners have acquired from teaching activities. Objective technical measures of learning performance are difficult to develop, but are extremely important for both teachers and learners. Learning performance assessment using learning portfolios or web server log data is becoming an essential research issue in web-based learning, owing to the rapid growth of e-learning systems and real application in teaching scenes. The traditional summative evaluation by performing examinations or feedback forms is usually employed to evaluate the learning performance for both the traditional classroom learning and the web-based learning. However, summative evaluation only considers final learning outcomes without considering learning processes of learners. This study presents a learning performance assessment scheme by combining four computational intelligence theories, i.e., the proposed refined K-means algorithm, the neuro-fuzzy classifier, the proposed feature reduction scheme, and fuzzy inference, to identify the learning performance assessment rules using the web-based learning portfolios of an individual learner. Experimental results indicate that the evaluation results of the proposed scheme are very close to those of summative assessment results of grade levels. In other words, this scheme can help teachers to assess individual learners precisely utilizing only the learning portfolios in a web-based learning environment. Additionally, teachers can devote themselves to teaching and designing courseware since they save a lot of time in evaluating learning. This idea can be beneficially applied to immediately examine the learning progress of learners, and to perform interactively control learning for e-learning systems. More significantly, teachers could understand the factors influencing e-learning performance in a web-based learning environment according to the obtained interpretable learning performance assessment rules.

Keywords:

Learning Performance Assessment, Web-based Learning, Web-based Learning Portfolio, Data Mining

Introduction

Gagnés’ research on the internal process of learning has indicated that the complete learning process should assess learning performance (Gagnés, 1997). Learning performance evaluation instruments can generally be classified as summative assessment and formative assessment (Torrance & Pryor, 1998; Campbell et al, 2000). While summative evaluation is generally performed after finishing an instruction unit or class, formative assessment emphasizes the learning process. Teachers can assess the overall learning performance using summative assessments. Conversely, the use of formative assessments helps teachers obtain feedback about how well students are learning and particular difficulties they might be having. The standard summative assessment scheme at course level is to perform either an examination or an assessment form. However, these methods cannot collect all learning process information. Since the formative assessment strongly emphasizes integrating “what people learned” with “how people learned”, the formative assessment occurs when teachers feed information back to students in ways that enable the student to learn better, or when students can engage in a self-reflective process. Therefore, while assessing learning performance using the portfolio has become increasingly popular, technology is facilitating its evolution and management (Torrance & Pryor, 1998;

(2)

Campbell et al, 2000; Lankes, 1995). Moreover, learning performance assessment based on the learning portfolio is a very useful concept for some fields of learning assessment, such as psychological testing, knowing how subjects take the test can be more important than their answers.

Traditional portfolio assessment relies on man-made data collection and a writing-centered learning process. Difficulties in data storage, search and management after long-term implementation have become problematic in developing and implementing portfolio assessment (Chang, 2002). In contrast, a web-based learning portfolio can be collected, stored and managed automatically by computers when learners interact with an e-learning platform. Therefore, learning performance assessment using a web-based learning portfolio has received significant attention recently (Lankes, 1995; Rahkila & Karjalainen, 1999; Wu & Leung, 2002). Rasmussen et al. (Rasmussen et al. 1997) suggested that Internet-based instruction could also allow student progress to be evaluated through participation in group discussions and portfolio development. Lankes (Lankes, 1995) stated that implementing computer-based student assessment portfolios is an innovative educational innovation owing to its ability not only to offer an authentic demonstration of accomplishments, but also enable students to take responsibility for their completed tasks. Rahkila (Rahkila & Karjalainen, 1999) also proposed using the user profiles with log data for the learning performance assessment, finding that it can be applied to interactively control learning effectively. Rahkila emphasized that the learning portfolio assessment is supported by the cognitive–constructive theory of learning (Rahkila & Karjalainen, 1999; Bruner, 1996). Zaiane (Zaiane & Luo, 2001) proposed applying web access logs and advanced data mining techniques to extract useful patterns that can help educators evaluate on-line course activities. Wang et al. (Wang et al. 2003) found that learning behavior information, commonly referred to as a learning portfolio, can help teachers understand why a learner obtained a high or low grade. Carrieira (Carrira & Crato, 2004) proposed monitoring users’ reading behaviors to infer their interest in particular articles.

However, developing a precise learning performance assessment scheme using web-based learning portfolio is a challenging task for web-based learning systems. Data mining, or knowledge discovery, attempts to obtain valuable knowledge from data stored in large repositories. Data mining has been considered as an appropriate method of knowledge discovery to excavate the implicit information (Dunham, 2002). This study presents a data mining approach that integrates four computational intelligence schemes, i.e., the refined K-means clustering algorithm, the neuro-fuzzy classifier (Nauck & Kruse, 1999), the proposed feature reduction scheme, and the fuzzy inference (Lin & Lee, 1996) , to evaluate on-line learning behavior and learning performance. The four computational intelligence schemes are employed to logically determine fuzzy membership functions for the neuro-fuzzy classifier, discover the fuzzy rules relating to the learning performance assessment, reduce the feature dimension of the discovered fuzzy rules, and infer the learning performance using the discovered fuzzy rules based on the learning portfolio of an individual learner, respectively.

The proposed learning performance assessment scheme was implemented on the personalized e-learning system, and its effectiveness was demonstrated in a real-world teaching scenario. Experimental results show that the proposed learning assessment scheme can correctly measure learners’ learning performance according to their learning portfolios as well as help teachers save a lot of time for the learning evaluation during the learning process. Significantly, the learning evaluation results were applied to help teachers examine the learning progress of learners and interactively control learning for the personalized e-learning system. The remainder of this study is organized as follow. Section 2 describes the problem description based on the gathered learning portfolios for the proposed learning performance assessment scheme. Section 3 explains the proposed learning performance assessment scheme. Section 4 presents the detailed experiments. Conclusions are drawn in Section 5.

Problem Description

Personalized E-learning System (PELS) with Learning Performance Assessment Mechanism Based on Learning Portfolios

The personalized e-learning system (PELS) based on the Item Response Theory, which includes an off-line courseware modeling process, four intelligent agents and four databases, is presented in our previous study for adaptive courseware recommendation (Chen,, Lee & Chen, 2005), (Chen, Liu & Chang, 2006). However, the PELS mainly focuses on performing adaptive learning based on the difficulty parameters of courseware and learner ability of individual learner, the learning performance assessment is lacked feature in this system. In this paper, the functionalities of the PELS system are extended to include the learning performance assessment agent

(3)

in order to perform the evaluation of learning performance using the gathered learning portfolios of individual learners. The extended system architecture is shown as Figure 1.

User Profile Database Learner Learning Interface Agent Courseware Recommendation Agent Test Agent Courseware Modeling Process Courseware Management

Agent Teacher AccountDatabase XML-based Courseware database Learning Performance Assessment Agent User Account Database Personalized Learning Module Courseware Construction Module Testing Items Database

Figure 1. The system architecture

(4)

Figure 2 shows the entire layout of the learning interface on PELS. In the left frame, system shows the course categories, course units and the list of all courseware in the courseware database using a hierarchical tree topology structure. While a learner clicks a courseware for learning, the content of selected courseware will be exhibited in the upper-right window. Besides, the feedback interface is arranged in the bottom-right window. The proposed system can get learner’s feedback response from the interface of feedback agent through learner replies one randomly testing question related to the conveyed learning content. The answer of testing question helps system to get the learner’s comprehension percentage for the recommended courseware. System passes the feedback response to the courseware recommendation agent to infer the learner’s ability using the item response theory (Baker & Frank, 1992) detailed in our previous study (Chen,, Lee & Chen, 2005; Chen, Liu & Chang, 2006). After a learner presses the button of “submit”, this system will reveal a list of the recommended courseware based on his current ability. After the learner selects the next courseware according to the suggestion of the courseware recommendation agent for further learning, the learner can continue to learn the selected courseware. The PELS will continue to run the learning cycle until the evaluated learner ability satisfies the stop criterion. Next, the learning procedure will enter the final testing stage to perform a summative assessment through replying 25 randomly selected testing questions. The results of final testing will be used to verify the evaluation quality of the proposed learning performance assessment approach in the analysis of experimental results. Besides, the learner can also log in the discussion board on PELS to publish the learning questions of the individual self or contribute helpful ideas for the learned course unit. The PELS will automatically gather the useful learning portfolios of individual learners for the learning performance assessment during learning processes.

Considered Learning Portfolio in the User Profile Database

The IMS Global Learning Consortium defined that ePortfolios are collections of personal information about a learner that represent accomplishments, goals, experiences, and other personalized records that a learner can present to schools, employers, or other entities (http://support.imsglobal.org/ePortfolio/). The ePortfolio specification developed by IMS Global Learning Consortium enables the use of the Internet for personalized, competency-based learning and assessment, enhanced career planning, and improved employment opportunities. In the meanwhile, the proposed ePortfolio specification supports many kinds of portfolios, including assessment ePortfolios, presentation ePortfolios, learning ePortfolios, personal development ePortfolios, and working ePortfolios. The IMS Global Learning Consortium also summarized that ePortfolio can contain personal information about the owners, achievements, interests and values, test and examination results, and so on. Next, the learning portfolio information collected by PELS for the proposed learning performance assessment approach is presented based on the IMS ePortfolio specification. This learning portfolio information was saved into the user profile database according to the learner’s interaction with PELS. Hence, the learning performance was graded according to their PELS learning portfolios. The eight gathered learning factors are described in detail as follows:

1. Reading rate of course materials (RR)

After a learner has logged onto the PELS system, the PELS automatically counts and accumulates the reading amount of the learned course materials for each learner. The learning parameter is progressively increased for each section learning process. Thus, the reading rate of course materials is defined as the rate of studying course materials in a course unit, and the notation RR is employed to represent the learning factor. Table 1 presents an example of calculating the reading rate for both the learners A and B in the learning course unit.

Table 1. A calculating example for the learning factor of reading rate

Learner Learning path of course materials

The

number of studying course materials

The total number of course materials in the learned course unit Reading rate (RR) A 1Ö5Ö8Ö10Ö11Ö12Ö13Ö14Ö15Ö16Ö17 11 11 1 B 1Ö5Ö4Ö6Ö7Ö8Ö9Ö10Ö(9)Ö(10)Ö11 9 11 0.82

(5)

2. Total accumulated reading time of all learned course materials (RT)

The total accumulated reading time of each learner is calculated by summing up the reading time of all learned course materials on the PELS system, and the notation RT is used to represent the learning factor. Table 2 presents an example of calculating the total accumulated reading time for both the learners A and B in the learning course unit.

Table 2. A calculating example for the learning factor of total accumulated reading time

Learner Learning path of course materials Total accumulated _{reading time(RT)}

A 1(25)Ö5(32)Ö8(33)Ö10(30)Ö11(36)Ö12(43)Ö13(47)Ö14(43)

Ö15(45)Ö16(40)Ö17(46) 420(secs)

B 1(42)Ö5(56)Ö4(50)Ö6(62)Ö7(67)Ö8(72)Ö9(79)Ö10(93) _{Ö9(70)Ö10(81)Ö11(118)} 790(secs) ( ) indicates the reading time for the learned course material

3. Learner ability evaluated by PELS (LA)

After studying the recommended courseware, the PELS (Chen, Lee & Chen, 2005; Chen, Liu & Chang, 2006) can dynamically estimate a learner’s ability according to the item response theory (Baker & Frank, 1992) by collecting the replied responses of the learner to the randomly selected testing questions in the learned course unit. The value denotes the learner’s ability in the learned course unit measured by the PELS system during the learning process, and the range of learner’s ability are limited from -3 (i.e. lowest ability) to +3 (i.e. highest ability). As a learner logs in this system, if the user account database does not have any history records in the selected course unit for this learner, then his initial ability will be regarded as 0. That is, the system assumes beginner’s ability is moderate level. As a learner clicked the recommended courseware for learning, his/her ability in this course unit will be re-evaluated according to his/her feedback responses and the corresponding difficulty parameter of the learned courseware. The notation LA is used to represent the learning factor of learner ability in this paper.

In the PELS, the quadrature form proposed by Bock and Mislevy (Baker & Frank, 1992) was employed to approximately estimate learner’s ability as follows:

∑

θ θ θ θ θ = θ q k n k k q k k k n , k ) ( A ) | u ,..., u , u ( L ) ( A ) | u ,..., u u ( L ˆ 2 1 2 1

(1)

where _{θˆ denotes the learner’s ability of estimation,}L(u₁,u₂,_L,u_n|_θ_k) is the value of likelihood function at a level below their ability level θ and learner’s responses are k u1,u2,...,un, θ is the k kth split value of ability in the standard normal distribution, and A(_{θ represents the quadrature weight at a level below}_k) their ability level θ . k

In Eq. (1), the likelihood function L(u₁,u₂,_L,u_n|_θ_k) can be further described as follows:

∏

= − = n j u k j u k j k n P jQ j u u u L 1 1 2 1, , , | ) ( ) ( ) ( L θ θ θ

(2)

where ( ) ) ( 1 ) ( j k j k b D b D k j e e P ₋ − + = _θ θ θ

, Qj(θk)=1−Pj(θk),Pj(θ) denotes the probability that learners can understand the

j

th courseware at a level below their ability level θ , _k Q_j(_{θ represents the probability}_k)

(6)

that learners cannot understand the jth courseware at a level below their ability level θ , _k bj is the difficulty parameter of the

j

th courseware, and D is a constant 1.702, and U is the understanding or not _j

understanding answer obtained from learner feedback to the jth courseware, i.e. if the answer is understanding then Uj =1; otherwise, Uj =0.

4. Correct response rate of randomly selecting testing questions (CR)

After a learner has studied the recommended course material, the PELS tests the learner on his understanding by randomly selecting a relevant question from the testing item database. The rate of correct responses to test questions helps determine the learner’s degree of understanding for all learned courseware, and the notation CR is used to represent the learning factor in this paper. For example, if a learner gave seven correct responses and three incorrect responses for ten randomly selecting testing questions from the testing item database, then the correct response rate is 0.7.

5. Posted amount of articles on the forum board (PN)

The PELS system can automatically count the total number of posted articles and answered questions on the forum board for each learner during the learning process. The value usually shows a learner level of participation in group discussion, and the notation PN is used to represent the learning factor in this paper. For example, if a learner posted three articles and answered five questions on the forum board, then the posted amount of articles on the forum board is eight pieces.

6. Accumulated score on the forum board (AS)

The PELS system gives various scores for the various interactive behaviors among learners on the forum board. In this study, the PELS will automatically increment a learner’s score of one point when he/she logs onto the forum board, and accumulate a learner’s score of two points if he/she posts an article on the forum board. The accumulated score on the forum board represents the level of useful feedback given by a learner, and the notation AS is used to represent the learning factor in this paper. For example, if a learner logged onto the forum board two times and posted three articles, then the accumulated score on the forum board will be eight points.

7. Effort level of studying course Materials (EL)

Since each course material in the PELS system is assigned a required minimum reading time by course experts based on the courseware content, the effort level is defined as the actual reading time compared with the required minimum reading time for the learned courseware, and the notation EL is used to represent the learning factor in this paper. Suppose the actual reading time of learners A and B in the learned course unit is 420 seconds and 790 seconds, respectively, and listed as Table 2. Table 3 presents an example of calculating the effort level for the learners A and B in the learned course unit. In this study, the range of the effort level is limited from 0 to 1. Restated, the value of the effort level will be assigned as 1 if the calculated value is over 1.

Table 3.A calculating examplefor the learning factor of effort level

Learner Learning path of course materials Effort level (EL)

A 1(30)Ö5(60)Ö8(75)Ö10(60)Ö11(75)Ö12(75)Ö13(60)Ö14(60)Ö15(75)

Ö16(75)Ö17(90)

420 /

735 ≈

0 .

57

B 1(30)Ö5(60)Ö4(45)Ö6(60)Ö7(60)Ö8(75)Ö9(60)Ö10(60)Ö9(60)

Ö10(60)Ö11(75)

790/585

→

1

(7)

8. Final test grade (G)

This study measures the final test grade through the summative assessment scheme of fixed-length testing examination after the entire learning process is completed, and the notation G is used to represent the learning factor in this paper.

Mining Learning Performance Assessment Rules Based on Learning Portfolios

Flowchart of Mining Learning Performance Assessment Rules

Figure 3 shows the entire flowchart of the proposed learning performance assessment scheme. The proposed refined K-means algorithm is initially employed to logically determine the fuzzy membership functions of each learning factor based on real data distribution of learning portfolios for mining learning performance assessment rules by the neuro-fuzzy classifier (Nauck & Kruse, 1999). After the fuzzy rules for learning performance assessment are identified by the neuro-fuzzy classifier, the feature reduction scheme is employed to simplify the linguistic terms of the discovered learning performance assessment rules under keeping a satisfied accuracy rate of learning performance assessment. This procedure aims to obtain simplified learning performance assessment rules for promoting the inferring efficiency in web-based learning systems as well as discover the main learning factors that affect the learning outcomes. Finally, the fuzzy inference is employed to grade the learning performance by the simplified fuzzy rules for individual learners. The following sections give details for the proposed learning performance assessment scheme for individual learners.

Teacher Learning Performance START Fuzzy Inferring 0.5 1 Feature Reduction Algorithm Refined fuzzy rules Primal fuzzy rules Neuro-Fuzzy Classifier

Learner _{Learning behavior} The gathered learning

portfolios Refined K-means Algorithm Clustering C1 C2 C3 Membership functions determined

Figure 3. The flowchart of learning performance assessment

Mining Learning Performance Rules Using Web-based Learning Portfolios

Determining Fuzzy Membership Functions by the Refined K-means Algorithm for the Neuro-Fuzzy Classifier

This study employed the neuro-fuzzy classifier to discover the fuzzy knowledge rules from the learning portfolios for learning performance assessment. To identify the fuzzy knowledge rules by the neuron-fuzzy classifier, the membership functions used in the neuro-fuzzy classifier must be logically determined according to the data distribution of learning portfolios in advance. In this work, the proposed refined K-means clustering algorithm improved from the original K-means clustering algorithm (Xu & Wunsch, 2005) was employed to determine the centers of the triangle fuzzy membership functions automatically according to the data distribution

(8)

of each learning factor in learning portfolios for the neuro-fuzzy classifier herein. To obtain appropriate and interpretable learning performance assessment rules by the neuro-fuzzy classifier, this study sets the number of clusters in the refined K-means clustering algorithm as three based on the requirements of learning performance assessment in real teaching scenario. In other words, each considered learning factor contains three linguistic terms, i.e. low, moderate, and high, to describe a fuzzy rule. After that, the membership functions of the triangle fuzzy sets were automatically determined according to three cluster centers of each learning factor. Suppose the centers of three linguistic terms determined by the refined K-means clustering algorithm are respectively

c

₁,

2

c

, and

c

₃, the membership functions for the linguistic terms “low”, “moderate”, and “high” can be formulated as follows:

i. The membership function for the linguistic term “low”

⎪

⎩

⎪⎪

⎨

⎧

≥

<

−

≤

=

2 2 1 1 2 2 1

0

1 )

(

c

x

if

c

x

c

if

c

x

c

x

if

x

Low

μ

(3)

ii. The membership function for the linguistic term “moderate”

⎪

⎩

⎪⎪

⎪

⎨

⎧

≥

<

−

=

<

−

≤

=

3 3 2 2 3 3 2 2 1 1 2 1 1 Moderate

0

1

0 )

(

c

x

if

c

x

c

if

c

x

c

x

if

c

x

c

if

c

x

c

x

if

x

μ

(4)

iii. The membership function for the linguistic term “high”

⎪

⎩

⎪⎪

⎨

⎧

≥

<

−

≤

=

3 3 2 2 3 2 2 High

1

0 )

(

c

x

if

c

x

c

if

c

x

c

x

if

x

μ

(5)

cluster centers, cl is the cluster center with the nearest Euclidean distance to the left boundary, and c_r is the cluster center with the nearest Euclidean distance to the right boundary.

In the proposed refined K-means clustering algorithm, the cluster centers c_l and c_r determined by the original K-means clustering algorithm are tuned by the refined value

δ

; moreover, the other cluster centers are unchangeable. The refined value is computed as follows:

2 )

min(

)

max(

S

δ

=

+

(7) where

max

( )

S

and

min

( )

S

represent the maximum and minimum feature values of patterns in each learning factor, respectively.

Therefore, the new cluster centers determined by the refined K-means clustering algorithm can be represented as follows:

{

l r k

}

new

s c c c c

C( )∈ 1,..., −δ,..., +δ,..., (8) where C(new) is the new set of the

k

cluster centers.

(9)

μ

12

Figure 4. Tuning membership functions by the refined value

δ

, where the circle and square notations with black and grey colors respectively represent the cluster centers determined by the K-means clustering algorithm and refined K-means clustering algorithm, and the dotted and solid lines respectively represent the membership

functions determined by the K-means and refined K-means clustering algorithms

Neuro-Fuzzy Classifier for Mining Learning Performance Assessment Rules

After the triangle fuzzy membership functions were determined by the refined K-means clustering algorithm, the neuro-fuzzy classifier proposed by Nauck (Nauck & Kruse, 1999) was employed to infer the fuzzy rules for the learning performance assessment based on learning portfolios. Figure 5 shows the learning architecture of the used neuro-fuzzy classifier which can be represented as four-layer feed-forward networks. The first layer is the input layer which consists of input neurons and each neuron corresponds to an input feature. The second layer is the fuzzification layer which consists of fuzzy set neurons and each neuron corresponds to a linguistic term, such as low, moderate, and high, and so on. The third layer is the rule layer which consists of rule neurons and each neuron corresponds to a fuzzy rule. The fourth layer is the class layer which consists of class neurons and each neuron corresponds to a class label. To explain the detailed operation procedures of the neuro-fuzzy classifier, the notations used in Fig. 5 are first explained as follows:

x

_k is the

k

th input feature,

μ

_kl denotes the

l

th

nm

1

Input Input k Input n

1

x

_k

x

_n

1 Class

Figure. 5 Four-layer learning architecture of the employed neuro-fuzzy classifier

In this study, the employed neuro-fuzzy classifier follows six steps to generate fuzzy rules for learning performance assessment, and summarized as follows:

Step 1. Computing the activation strength of each fuzzy neuron in the rule layer by the minimum operator, and

formulated as follows:

{ }

( ) ,..., 2 , 1

min

j k n k j

u

ar

=

(9) where

ar

_j is the activation strength of the antecedent part of the

j

th rule neuron,

u

(_kj) represents the membership degree of the

j

th rule fired by the

k

th input feature, which can be computed as

{

k k km

}

j

k

u

( )

₌

max

₁

,

₂

,...,

_{, and}

_m

_{is the number of the defined linguistic terms of each input feature.} Step 2. Computing the activation strength of each class neuron in the class layer by the maximum operator, and

formulated as follows:

{ }

() ,..., 2 , 1

max

i j z j i

{

ar

₁

,

ar

₂

,...,

ar

_j

,...,

ar

_r

}

, and

z

_i is the number of the fired fuzzy rules of the

i

th class neuron, and

z

_i

≤

r

.

(11)

2

1 ,

2

1 where

,

1

r

,...,

,

j

c

,...,

,

i

)

X

(

ac

)

j

(

t

y p p ) j ( i i

=

∑

=

(11)

where

t

_i

(

j

)

is the class register value of the

i

th class for the

j

th rule,

ac

_i(j)

(

X

_p

)

is the activation strength of the

i

th class neuron fired by the

p

th training pattern for the

j

th rule, and

y

is the total number of training patterns.

Step 4.Finding the class label with the largest class register value for each fuzzy rule, and formulated as follows:

)}

j

(

t

{

max

arg

)

j

(

_i c ,..., , i=12

=

τ

(12)

where

τ

(

j

)

represents the class label with largest class register value for the

j

th rule.

Step 5. Evaluating the performance for each fuzzy rule, and formulated as follows:

∑

τ ≠ = τ

−

=

i, c ,..., , i i ) j (

(

j

)

t

(

j

)

t

Step 6. Collecting the fuzzy rules with the best performance in each class as the learning performance

assessment rules.

Feature Reduction Scheme for Simplifying the Learning Performance Assessment Rules

To generate greater efficient and even accurate fuzzy rules, the original fuzzy rules discovered by the neuro-fuzzy classifier can be refined by the proposed feature reduction scheme. In the feature reduction process, it is assumed that there are sufficient relevant features in the original feature set to discriminate clearly between categories, and that some irrelevant features can be eliminated to improve efficiency and even accuracy. Generally, elimination of redundancy will improve efficiency without losing accuracy, and elimination of noise will improve both efficiency and accuracy (Chakraborty & Pal, 2004). In particular, the antecedent parts of fuzzy rules discovered by the neuro-fuzzy classifier always contain all features so that the main relevant features related to the classification accuracy cannot be clearly revealed as well as the inference efficiency is descended. More significantly, the antecedent parts of the fuzzy rules discovered by the neuro-fuzzy classifier containing all features will lead to over strict fired conditions so that many patterns cannot be inferred, and called as unknown patterns herein. The number of unknown patterns will obviously reduce the classification accuracy rate. Moreover, the simplified fuzzy rules for learning performance assessment are more easily interpreted by teachers, thus helping teachers to tune their teaching strategies. Based on these reasons, this study proposes a feature reduction scheme to simplify the discovered fuzzy rules. To obtain the simplified fuzzy rules, the proposed feature reduction scheme only performs the feature reduction scheme to the fuzzy rules with the best performance in each class. After that, the feature reduction is performed based on the proposed performance evaluation method for all feature combinations of the antecedent parts containing in each discovered fuzzy rule. Figure 6 shows that the linguistic terms used in a fuzzy rule usually have overlapped intervals between two neighborhood linguistic terms, such as the intervals

I

₁

,

I

₂

,

I

₃

,

and

−

=

n k , k l jk l j l j

i

u

i

u

i

u

1

(14)

where

u

l_j

th feature of the

l

i

th training pattern,

n

represents the number of the defined linguistic terms in each fuzzy rule,

λ

indicates the linguistic term with the largest membership degree fired by the

i

th training pattern.

After the match degree is obtained, the performance of the discovered fuzzy rule with the considered feature combination can be measured as follows:

( )

ϕ

∑∑

= =

=

j 1 1 t i l j l q

i

u

v

(15)

where

v

_ql represents the performance of the

l

th fuzzy rule under the

q

th feature combination,

t

is the number of training patterns,

ϕ

is the number of features,

u

_jl

(

i

)

j

th feature of the

l

th

fuzzy rule fired by the

i

th training pattern.

Moreover, the total number of feature combinations for a discovered fuzzy rule can be computed as follows:

(

)

∑

=

−

=

p j

j

!

p

j

!

p

g

1

(16) where

g

is the total number of feature combinations for a discovered fuzzy rule,

p

is the total number of feature dimensions,

j

is the number of the considered features.

Figure 6. The overlapped regions between various linguistic terms, where

u

_jkl

(

i

)

k

th linguistic term of the

j

th feature of the

l

i

th training pattern, and

x

l_j

(

i

)

is

the feature value of the

i

th training pattern corresponding to the

j

th feature of the

l

th fuzzy rule

Fuzzy Inference by the Discovered Fuzzy Rules for the Learning Performance Assessment

This section explains how to infer the learning performance according to the discovered fuzzy rules by fuzzy inference. The discovered fuzzy production rules are formed by IF-THEN rules represented as follows:

B Y THEN A X and A X IF = = =

(13)

where

X

_i and

Y

denote linguistic variables, and

A

_i and

B

represent linguistic terms.

A defuzzification strategy aims to convert the outcome of fuzzy inference into a crisp class label. In this study, the maximum operator (Lin & Lee, 1996) is employed as the defuzzification scheme, to infer the crisp class label from the respective fired degrees of membership functions. Restated, the result of the learning performance assessment is assigned as the class label with the largest membership degree fired by a training pattern among the discovered fuzzy rules.

Experiments

The personalized e-learning system (PELS) was published on the web site http://192.192.6.86/irt4 to provide personalized e-learning services and assess the learning performance of individual learners by web-based learning portfolios. To verify the evaluation quality of the discovered fuzzy rules for the learning performance assessment, some third-year students of Jee-May Elementary School (http://www.jmes.tpc.edu.tw/), who had majored in the “Fractions” course unit in elementary school mathematics, were invited to test this system. The experimental results are described as follows.

The Format of Learning Portfolio Gathered by PELS

The “Fractions” unit currently includes a total of 17 course materials with various difficulty levels, each conveying similar concepts. The system was used by 583 learners from 18 different classes of Taipei County Jee-May Elementary School (http://www.jmes.tpc.edu.tw/). Among the eight gathered learning factors, the final testing score G was obtained through the summative assessment scheme of fixed-length testing examination after the entire learning process is completed. Table 4 shows the format of the partial learning portfolios in the user profile database gathered by PELS.

Table 4. The format of the learning portfolio in the user profile database

Learning Factor Learning Record

G

(0~100) (0~1) RR (sec) RT (-3~+3) LA (0~1) CR (piece) PN (point) AS (0~1) EL

T1 _83.24_0.630_{2212 1.68}_{0.815 0 41 1.00}

T2 _74.17_0.600_{1413 0.26}_{0.533 0 42 1.00}

︴︴︴︴︴︴︴︴︴

T583 84.81 0.436 3747 1.36 0.846 1 27 0.95

Evaluating Accuracy Rate of Learning Performance Assessment Evaluation Method

To measure the accuracy rate of learning performance assessment for the proposed method, the score level method was used to evaluate the accuracy rate of the predicted learning performance. In this evaluation method, each learner is assessed according to one of three score levels based on the mapping membership degrees of learning factor G. For example, the score level of a learner with a final test score of 86.16 was set to moderate level if the linguistic term of moderate level has the largest mapping membership degree among all linguistic terms of the learning factor G.

Experimental Results

In our experiments, half the learning portfolios among 583 learners selected randomly were used as training data and the remaining learning portfolios were used as testing data. Table 5 illustrates the number of learning records in each grade level. To obtain simple and interpretable fuzzy rules for learning performance assessment, the number of cluster centers in the K-means algorithm is set to 3. That is, each learning factor contains three

(14)

various linguistic terms to describe a fuzzy rule, and named as “low”, “moderate”, and “high” herein. Table 6 shows the determined centers of the linguistic terms for eight learning factors by the K-means algorithm. To obtain fuzzy rules for learning performance assessment more accurately, the proposed refined K-means algorithm was employed to tune the centers of the linguistic terms listed in Table 6. Tables 7 and 8 display the revised amount of cluster centers for each learning factor and the revised centers of the linguistic terms of eight learning factors by the refined K-means algorithm, respectively. Figure 7 reveals the membership functions of triangle fuzzy sets for eight learning factors determined by the revised centers of the linguistic terms in the refined K-means algorithm. Basically, the left and right widths of triangle membership functions of the linguistic terms “moderate” for each learning factor are determined by the differences between the self-center with the left and right neighborhood centers, respectively. Moreover, the left and right widths of triangle membership functions of the linguistic terms “low” and “high” for each learning factor are determined by the difference between the self-center with the left or right neighborhood center as well as the boundary of each learning factor.

Table 5. The number of learning records in each grade level

Training data set Testing data set Whole data set

Data set

Item Low Moderate High Low Moderate High Low Moderate High

Number of

patterns 51 194 47 65 188 38 116 382 85

Total number of

patterns 292 291 583

Table 6. The determined centers of the linguistic terms for eight learning factors by the K-means algorithm

Learning factor

Linguistic term G RR RT LA CR PN AS EL

The centers of the linguistic term

“low” 81.95 0.25 739.24 1.25 0.77 1.52 28.38 0.74

“moderate” 87.25 0.44 1585.02 1.58 0.87 2.61 53.88 1.05

“high” 90.44 0.63 3284.04 1.62 0.91 5.08 90.12 1.08

Table 7. The revised amount of cluster centers for each learning factor

Learning factor

Revised amount G RR RT LA CR PN AS EL

The revised amount

δ

12.5 0.5 3154 1.15 0.5 13 175 1.87

Table 8. The revised centers of the linguistic terms of eight learning factors by the refined K-means algorithm

Learning factor

Linguistic term G RR RT LA CR PN AS EL

The revised center of the linguistic

term “low” 69.45 -0.26 -2414.76 0.10 0.27 -11.48 -146.62 -1.13

term “moderate” 87.25 0.44 1585.02 1.58 0.87 2.61 53.88 1.05

term “high” 102.94 1.13 6438.04 2.77 1.41 18.80 256.12 2.95

Next, the neuro-fuzzy neural network with the refined K-means algorithm for logically determining membership functions was employed to discover the formative evaluation rules based on the learning portfolios of the training data set. There are totally 24 fuzzy rules discovered by the employed neuro-fuzzy neural network in our

(15)

experiment. Among all discovered fuzzy rules, there are 10, 9, and 5 rules discovered for assessing the low, moderate, and high grade levels, respectively. Table 9 shows three fuzzy rules with the best performance for assessing three different grade levels based on gathered learning portfolios. In our experiment, ten independent runs were performed to yield an average performance for the training and testing data sets. Table 10 displays the averaged accuracy rate of learning performance assessment evaluated by the discovered rules listed in Table 9. The averaged accuracy rates of learning performance assessment for the training and testing data sets are 74.3% and 71.1%, respectively. 69.45 87.25 102.94 0 0.5 1 G -0.26 0.44 1.13 0 0.5 1 RR -2414.760 1585.02 6438.04 0.5 1 RT 0.1 1.58 2.77 0 0.5 1 LA 0.27 0.87 1.41 0 0.5 1 CR -11.48 2.61 18.8 0 0.5 1 PN -146.62 53.88 256.12 0 0.5 1 AS -1.13 1.05 2.95 0 0.5 1 EL

Figure 7. The membership functions of learning factors determined by the refined K-means algorithm

Table 9. The learning performance assessment fuzzy rules discovered by the neuro-fuzzy network with the

refined K-means algorithm for determining fuzzy membership functions

Antecedent part Consequent _part

Learning factor

The discovered rule RR RT LA CR PN AS EL G

The rule 1 M M L L M M M Low

The rule 2 M M M M M M M Moderate

The rule 3 H M M M M M M High

Table 10. The accuracy rate of learning performance assessment evaluated by the discovered rules listed in

Table 9 Data set

Item Training data set Testing data set

Number of patterns 292 291

Number of the discovered rules 3 3

Number of the used fuzzy sets in the discovered rules 10 10

Number of patterns with correct learning performance

assessment 217 207

Number of patterns with incorrect learning performance

(16)

Number of patterns with unknown learning performance

assessment 9 11

Averaged accuracy rate of learning performance assessment 0.743 0.711

₁. The result indicates that the third learning factor of the discovered fuzzy rule 1, i.e. learner ability (LA), is the main learning factor for assessing the learners with low score level. Similarly, the results also indicate that the second and first learning factors are the main learning factors for the discovered fuzzy rules 2 and 3, respectively. Table 13 shows the simplified learning performance assessment rules after performing the proposed feature reduction scheme. Table 14 displays the accuracy rate of learning performance assessment by these simplified fuzzy rules.

(17)

Compared with the results of learning performance assessment listed in Table 10, we find that the accuracy rate of the learning performance assessment using the simplified fuzzy rules is only slightly poor than the discovered fuzzy rules without performing feature reduction scheme. However, the simplified fuzzy rules provide benefits to teachers in terms of easily understanding main learning factors that affect learning performance for learners with various score levels (Paiva & Dourado, 2004; Mikut, Jakel & Groll, 2005). Moreover, this idea can be beneficially applied to immediately examine the learning progress of learners, and to perform interactively control learning for web-based learning systems. In the meanwhile, the simplified fuzzy rules will obviously promote the inference efficiency of learning performance assessment for web-based learning systems (Paiva & Dourado, 2004; Mikut, Jakel & Groll, 2005).

0 20 40 60 80 100 120 140 -10 -5 0 5 10 15 20

P er for m anc e

Figure 9. The performance evaluation plot of the discovered rule 2 for assessing the moderate score level

0 20 40 60 80 100 120 140 -10 -5 0 5 10 15 20 25 30

P er for m anc e

Figure 10. The performance evaluation plot of the discovered rule 3 for assessing the high score level

Table 12. The top six combined learning factors with excellent performance of learning performance assessment

for three discovered rules

The discovered rule 1 The discovered rule 2 The discovered rule 3

Training cycle The combined learning factor Performance

index Training cycle

The combined

learning factor

₁,

x

₂,

x

₇ 18.25

(18)

Table 13. The discovered learning performance assessment rules after performing feature reduction

Antecedent part Consequent _part Learning factor

The discovered rule RR RT LA G

The rule 1 L Low

The rule 2 M Moderate

The rule 3 H High

Table 14. The accuracy rate of learning performance assessment by the simplified fuzzy rules listed in Table 13

Data set

Item _{Training data set} _{Testing data set}

Number of patterns 292 291

Number of the discovered rules 3 3

Number of the used fuzzy sets in the

discovered rules 3 3

Number of patterns with correct learning

performance assessment 212 204

Number of patterns with incorrect learning

Number of patterns with unknown learning

Accuracy rate of learning performance

assessment 0.726 0.701

Conclusion

This study presents an effective learning performance assessment approach which contains the neuro-fuzzy network with the refined K-means algorithm for logically determining membership functions and the proposed feature reduction scheme to discover simplified and small amount fuzzy rules for evaluating learning performance of individual learners based on the gathered learning portfolios. The proposed method can help teachers to perform precise formative assessments according to the web-based learning portfolios of individual learners in a web-based learning environment. The inferred learning performance can be applied as a reference guide for teachers and as learning feedback for learners. Such a feedback mechanism enables learners to understand their current learning status and make suitable learning adjustments. Additionally, teachers can determine the main factors affecting learning performance in a web-based learning environment according to the interpretable learning performance assessment rules obtained. These factors can be used by teachers to tune their teaching strategies. Meanwhile, teachers can devote themselves to teaching job, since they save significant time in performing learning evaluation.

Acknowledgment

The authors would like to thank the National Science Council of the Republic of China, Taiwan for financially supporting this research under Contract No. NSC94-2520-S-026-002.

References

Baker, B. B., Baker, F. B., & Frank B. B. (1996), Item Response Theory: Parameter Estimation Techniques,

New York: Marcel Dekker.

(19)

Campbell, D., Nettles. D., & Melenyzer, B. (2000). Portfolio and Performance Assessment in Teacher Education, Boston: Allyn and Bacon.

Carrira, R., & Crato, J. M. (2004). Evaluating Adaptive User Profiles for News Classification. International Conference on Intelligent User Interfaces, January 13-16, 2004, 206-212.

Chakraborty, D., & Pal, N. R. (2004). A Neuro-Fuzzy Scheme for Simultaneous Feature Selection and Fuzzy

Rule-Based Classification. IEEE Transactions on Neural Networks, 15, 110–123.

Chang, C.-C. (2002). Building A Web-Based Learning Portfolio for Authentic Assessment. Proceedings of the International Conference on Computers in Education (Vol. 1), 129-133.

Chen, C.-M., Lee, H.-M., & Chen, Y.-H. (2005). Personalized E-Learning System Using Item Response Theory.

Computers & Education, 44 (3), 237-255.

Chen, C.-M., Liu, C.-Y., & Chang, M.-H. (2006). Personalized Curriculum Sequencing Using Modified Item Response Theory for Web-based Instruction. Expert Systems with Applications, 30 (2), 378-396.

Dunham, M. H. (2002). Data Mining: Introductory and Advanced Topics, Prentice Hall.

Gagnés, R, (1997). The Conditions of Learning and Theory of Instruction, New York: Holt, Reinehart &

Winston.

Lankes, A. M. D. (1995). Electronic Portfolios: A New Idea in Assessment, (ERIC Digest EDO-IR-95-9),

Available April 11, 2006 at http://searcheric.org/digests/ed390377.html.

Lin, C.-T., & Lee, C. S. G. (1996). Neural Fuzzy Systems: A Neuro-Fuzzy Synergism to Intelligent Systems,

Prentice-Hall.

Nauck, D., & Kruse, R. (1999). Obtaining Interpretable Fuzzy Classification Rules from Medical Data. Artificial Intelligence in Medicine, 16, 149-169.

Mikut, R., Jakel, J., & Groll, L. (2005).Interpretability Issues in Data-Based Learning of Fuzzy Systems.Fuzzy Sets and Systems, 150 (2), 179–197.

Paiva, R. P., & Dourado, A. (2004). Interpretability and Learning in Neuro-Fuzzy Systems. Fuzzy Sets and Systems, 147 (1), 17-38.

Rahkila, M., & Karjalainen, M. (1999). Evaluation of Learning in Computer based Education Using Log Systems. 29th_{ASEE/IEEE Frontiers in Education Conference, 1, 12A3/16-12A3/21.}

Rasmussen, K., Northrup, P., & Lee, R. (1997). Implementing Web-based Instruction. In Khan, B. H. (Ed.),

Web-Based Instruction, Englewood Cliffs, NJ: Educational Technology, 341-346.

Torrance, H., & Pryor, J. (1998). Investigating Formative Assessment: Teaching, Learning and Assessment in the Classroom, Buckingham: Open University Press.

Wang, W., Weng, J.-F., Su, J.-M., & Tseng, S.-S. (2003). Learning Portfolio Analysis and Mining in SCORM Compliant Environment. The 34th ASEE/IEEE Frontiers in Education Conference, TC2-17-TC2-34.

Wu, A. K. W., & Leung, C. H. (2002). Evaluating Learning Behavior of Web-Based Training using Web Log. In Proceedings of the International Conference on Computers in Education,736-737.

Xu, R., & Wunsch, D. II (2005). Survey of Clustering Algorithms. IEEE Transactions on Neural Networks, 16

(3), 645-678.

Zaiane, O. R., & Luo, J. (2001). Towards Evaluating Learners' Behavior in a Web-based Distance Learning Environment. IEEE International Conference on Advanced Learning Technologies, 357-360.