應用 RB 與 CBR建構一個解答擷取系統

(1)

國

立

交

通

大

學

電機資訊學院資訊學程

碩

士

論

文

應用 RB 與 CBR

建構一個解答擷取系統

Building a Solution-Retrieval System

Based on RB and CBR Approaches

研究生：李宗平

指導教授：曾憲雄博士

(2)

應用 RB 與 CBR 建構一個解答擷取系統

Building a Solution-Retrieval System Based on RB and CBR

Approaches

研究生：李宗平 Student：Tsung-Ping Lee

指導教授：曾憲雄 Advisor：Shian-Shyong Tseng

國立交通大學

電機學院與資訊學院專班資訊學程

碩士論文

A Thesis

Submitted to Degree Program of Electrical Engineering and Computer Science College of Computer Science

National Chiao Tung University in Partial Fulfillment of the Requirements

for the Degree of Master of Science

in

Computer Science June 2006

Hsinchu, Taiwan, Republic of China

(3)

應用 RB 與 CBR 建構

一個解答擷取系統

研究生：李宗平指導教授：曾憲雄博士國立交通大學電機資訊學院資訊學程

摘要

由於網路快速發展，服務多人上線的系統，已從傳統的兩層式 (client-server) 架構轉型成三層式 (client-application server-database)架構，因此，問題的偵測跟解問題的方法，對於這樣的領域來說，其複雜度就不斷增加。就我們所知，很多公司需要花很多人力物力去維護這樣的系統，然而公司的專家不可能一直在公司解問題，既使用文件搜尋來找答案，也可能找不到解答，所以提出根據專家系統的方法，來建構一個解答擷取系統。在本篇論文中，我們提出 Solution-Retrieval System (SRS) 架構來輔助使用者解決問題。在 SRS 中，結合 Rule Base (RB)跟 Case Based Reasoning (CBR)方法，我們使用 RB 推論來縮小問題的範圍，用 CBR 來找問題的解答，這樣處理的方式是模仿專家的處理模式，依著系統的特性，用推論的方式來找出錯誤類型，再將推論出來的錯誤類型，用專家的經驗法則，比較相似度，將相似度高的解決方案應用到問題上去解系統問題。再者，為了提高解問題的及時性，我們的系統也可以在 PDA 等小型裝置上及時的運用來解問題。未來，這樣的架構可以應用在相關的領域上，例如: IC 設計，整合供應鍊等。

(4)

Building a Solution-Retrieval System Based on

RB and CBR Approaches

Student: Tsung-Ping Lee Advisor: Dr. Shian-Shyong Tseng Department of Master Program of Computer Science

National Chiao Tung University

Abstract

Due to the fast development of web services, most of the service activities have been moved from 2 tiers (client-server) to 3 tiers (client-application server-database); hence, the importance of problem diagnosis and solution retrieving in integrated domains becomes more complicated. As we know, many of companies devote lots of time and effect to deal with this problem. However, since experts are not always available, using traditional approach or knowledge management center to search solutions may still fail. Hence, the idea of developing a solution-retrieval system based upon expert system approach is proposed. In the thesis, we propose an architecture based on rule base (RB) and case-based reasoning (CBR) and build a Solution-Retrieval System (SRS) to help users to solve the problems, in which RB is used to reduce error scope and CBR is used to find the corresponding solutions. Similar to the expert’s diagnosis approach, we use the SRS to diagnose the error type by RB inference, and retrieve solutions by CBR. Finally, the retrieved high similarity solution cases can be used to solve problems. Furthermore, the PDA and hand-held devices could be used in our system for solving problem promptly. In the near future, this architecture will be adapted on other related domains, e.g., IC design and Supply Chain.

(5)

誌謝

能夠完成這篇論文，要感謝的人很多，首要感謝我的指導教授曾憲雄教授，教授做事的態度，研究的方法與待人接物方面，對我的影響既深且遠，能夠在教授的指導下完成學業，是一種福氣，十分感謝。同時也要感謝我的口試委員林妙聰教授，洪宗貝教授與李允中教授，他們對於這篇論文提供了不少寶貴的建議。再來我要感謝實驗室的學長，翁瑞鋒學長、蘇俊銘學長與王慶堯學長，除了在專家系統上專業知識的指點，對於論文也提供了不少寶貴的建議。尤其是瑞鋒學長對於論文的協助，不宜餘力，與我度過無數個長夜，嘔心瀝血完成這篇論文。這一切都是深受曾憲雄教授平時的教誨，所養成的良好實驗室風氣。另外我要感謝，我的同學：碧如、亮中、秀怡、薰尹、書源、廼仁、鳳琴、仁杰，平常時對於我的協助，以及在功課方面的幫忙，讓我可以順利完成這篇論文，非常感謝。最後我要感謝我的家人，我的女友秋詩，在我意志薄弱的時候，給我關懷，給我鼓勵，讓我有信心可以站起來，往前走，順利完成這篇論文。要感謝的人很多，無法一一描述，謹在此獻上我最誠摯的感謝。

(6)

Content

Abstract (In Chinese)...i

Abstract...ii Acknowledgement ...iii Content ...iv List of Definitions... v List of Examples... v List of Figures...vi Chapter 1. Introduction ... 1

Chapter 2. Related Work... 3

2.1. Problem Diagnosis... 3

2.2. Solution Retrieving... 6

Chapter 3. The Design of Solution-Retrieval System ... 8

3.1. Knowledge Representation... 8

3.2. The SRS System Architecture ... 11

Chapter 4. Problem Diagnosis by Rule-Based Inference... 15

4.1. Knowledge Acquisition ... 15

4.2. Embedded Rules ... 19

Chapter 5. Solution Retrieving by CBR... 22

5.1. Case Retain ... 22

5.2. Case Retrieve... 23

Chapter 6. Implementation and Evaluations ... 32

6.1. Implementation of SRS ... 33

6.2. Evaluation of SRS ... 36

Chapter 7. Conclusion and Future Work ... 41

Bibliography ... 42

Appendix A: AT of Rule class Daemon home ... 45

Appendix B: AT of Rule class DB Daemon ... 48

Appendix C: Oracle database 10g architecture ... 50

(7)

List of Definitions

Definition 4.1. Relation Acquire in Rule Classes... 15

Definition 4.2. Relation Trigger in Rule Classes. ... 16

Definition 4.3. Acquisition Table (AT). ... 16

Definition 4.4. Attribute Ordering Table (AOT)... 16

Definition 4.5. EMCUD... 19

Definition 5.1. Similarity Equation between Query and Case. ... 23

Definition 5.2. Similarity Equation of Subject... 24

Definition 5.3. Similarity Equation of Application. ... 26

Definition 5.4. Similarity Function of Platform... 27

Definition 5.5. Similarity Function of Version. ... 29

List of Examples

Example 3.1. The Error Type of System Pending... 13

Example 4.1. Acquire System Status. ... 15

Example 4.2. Trigger Sub-Module... 16

Example 4.3. AT of Rule Class “Oracle DB”... 16

Example 4.4. Rule Generation by EMCUD. ... 17

Example 4.5. EMCUD... 20

Example 5.1. Subject Similarity. ... 25

Example 5.2. Application Similarity... 27

Example 5.3. Platform Similarity... 28

Example 5.4. Version Similarity. ... 30

(8)

List of Figures

Figure 2.1. Problem diagnosis by on-duty employees... 4

Figure 2.2. Oracle Application Server [17] ... 5

Figure 2.3. Oracle Database Server [18] ... 6

Figure 3.1. Database Module Tree ... 9

Figure 3.2. Knowledge Representation ... 10

Figure 3.3. Rule base knowledge representation on Oracle database ... 11

Figure 3.4. SRS Architecture Overview ... 12

Figure 4.1. Rule Base Structure for System Problem Diagnosis... 15

Figure 4.2. Rule Generation from Rule Class Oracle DB ... 17

Figure 4.3. Rules example in rule class Oracle DB... 18

Figure 4.4. Rule Generation from Rule Class Database... 19

Figure 4.5. EMCUD ... 21

Figure 5.1. Adaptive query in CBR... 22

Figure 5.2. Case Attributes representation in case base ... 22

Figure 5.3. Case example ... 23

Figure 5.4. Example of query and case comparison... 25

Figure 5.5. Application ontology... 26

Figure 5.6. Weight Platform Tree... 28

Figure 5.7. Weighted database version tree... 30

Figure 6.1. SRS User Interface... 32

Figure 6.2. SRS User Interface in Query... 34

Figure 6.3. Solution List in SRS... 35

Figure 6.4. Solution Description in SRS System ... 35

Figure 6.5. Problem Solving Accuracy... 39

(9)

Chapter 1. Introduction

Due to the fast development of web services, most of the service activities have been moved from 2 tiers (client-server) to 3 tiers (client-application server-database), and the importance of problem diagnosis and solution retrieving in application server and database integration domain becomes important increasingly. As we know, many of companies devote lots of time and effect to deal with this problem in recent years. However, since the cost is too high and experts are not always available, using traditional knowledge management (KM) center search approaches may fail. However, the mission is critical and is difficult for novice to handle this job; once the abnormal situation happens, on-duty employees have to fix the problem as soon as possible. Therefore, the idea of developing a solution-retrieval system based on expert system approaches is proposed in this thesis.

To deal with these problems, the novices usually find the solutions from KM center, but somehow they have to face two main challenges listed below.

(1) Problem diagnosis issue: Since it is difficult to diagnose system for novices,

diagnosis system problems have to refer other related logs and system status to diagnose precisely. Besides, it is difficult to determine which part of the log content is meaningful.

(2) Solution retrieving issue: Although there are traditional approaches, e.g., KM

Center, novices may not know how to input the right keywords for solution retrieving.

As on-duty employees may fail to fix problems, usually they call experts for help. Then experts will reason the solutions of the problem based on their experience. Based on the problem diagnosis and solution retrieving mechanisms, the Solution-Retrieval System (SRS) was produced. We use rule based inference to infer the error type for reducing the error scope,

(10)

which imitates experts and use case based reasoning (CBR) to reason the similarity between error type and cases in case base to find out the corresponding solutions.

The system SRS includes three main components: preprocessor, inference module [22], and reasoning module. When the monitored system (Application Server and Database) is abnormal, the preprocessor retrieve error logs and system information. Then inference module infers the error type immediately. And then users have to input related keywords to trigger reasoning module to reason the solutions immediately. Therefore, the main contribution of this thesis includes, providing the hybrid architecture to solve system problems, using RB to reduce error scope, and using CBR to reason the solutions.

In this thesis, we face the challenges listed as follows.

(1) Knowledge Acquisition (KA): Using rule-based inference in KA, there are Oracle consultants and database administrators (DBA) in our company; hence KA related work and the perfection of the rules could be easily done.

(2) Similarity Calculation: Since it is difficult to define attributes in query and the existing cases, to calculate the similarity between query and cases is difficult. This rest of thesis is organized as follows. Chapter 2 introduces the related works. Chapter 3 describes domain analysis, knowledge representation, SRS system architecture overview, and preprocessor in SRS. Chapter 4 describes System Diagnosis by Rule Base. Chapter 5 describes Solution Retrieving by CBR. Chapter 6 is the experiment processing and evaluation including system design and implementation. Chapter 7 gives future work and conclusion of the thesis.

(11)

Chapter 2. Related Work

As we know, more and more complex systems (e.g., Oracle Application Server and Oracle database) are revised in product version; it results in more and more difficult problems in trouble shooting occurred. In our study, many traditional approaches, e.g., search engine or Knowledge Management (KM) center for trouble shooting may fail; hence, we want to propose new approaches to do problem diagnosis and solution retrieving.

In order to solve the issues mentioned above, we design an architecture SRS, which can not only find the root-cause efficiently but also reuse and retain expert’s experience to help users to deal with problems promptly.

2.1. Problem Diagnosis

As for problem diagnosis aspect, using probabilistic reasoning techniques can be combined with information-theoretic approach to focus on distributed system problem diagnosis [14]. This approach may be efficient in problem diagnosis by adapting multi-layers system architecture and probabilistic reasoning techniques, but lacks the corresponding solutions. Using a new data-mining algorithm with ontology-based approach to fault diagnosis [7] still lacks the solutions to aid users deal with problems. On the other related approaches, using neural network and spectral analysis to detect the operating machine fault [6] can only provide the diagnosis result for one specific domain only. A hybrid fault diagnosis expert system based on knowledge and neural network in a steel factory [16] did not provide the solutions. Some other related researches use multi-CBR based recommendation engine for e-commerce may not be efficient [9], due to lack of decision making mechanism. Hence, we propose an inference approach (Rule-Based Inference) to reduce time in problems

(12)

diagnosis and make the SRS more efficient.

As we know, problem diagnosis is difficult for IT employees due to the lack of expert’s inference skills, as shown in Figure 2.1.

Figure 2.1. Problem diagnosis by on-duty employees

The overall architecture for trouble shooting is given, where Oracle Application Server and Oracle database server are particularly presented in Figure 2.2 and Figure 2.3, respectively. In Figure 2.2, users can use mobile, PDA, notebook and desktop to get the services from Oracle Application Server (OAS), and there are four main services, including communication services, business logic services, presentation services and caching services [17].

(13)

Figure 2.2. Oracle Application Server [17]

Communication services are responsible for the web resources dispatching between end-users, Business logic services are responsible for the interpreting the business logic to web server, Presentation services are responsible for the interpreting JSP, PSP, etc. languages to web server, and Caching service are responsible for the various caching mechanism including database caching, web cache for performance issue. In Figure 2.2, we can observe that it is difficult to diagnose the complicated module inside the OAS. In Figure 2.3, Oracle Database Server (ODS) is divided into three main components, including Memory, File and Daemon [18]. Memory is responsible for memory control for performance issue. File is responsible for information records. Daemon is responsible for database operation control. In Figure 2.3, we can observe that it is difficult to diagnose the complicated relationship between modules inside the database.

(14)

Memory

File

Daemon

Figure 2.3. Oracle Database Server [18]

Traditionally, experts infer the system to find out the problems according to system characteristics; therefore we imitate experts to infer the problems by rule-based inference and use EMCUD [22] to elicit the embedded meaning while the system information can not be easily obtained.

2.2. Solution Retrieving

(15)

proper decision making mechanism. Those approaches only focus on similarity calculation precisely, and propose auto knowledge acquisition. Some papers propose the conversational case-based reasoning (CCBR) approach [4] to reduce comparison effect in attributes for cases retrieving; however once facing complex system in time limited domain, it may still be unable to bring functions into full play. Some papers propose the new technology in order to describe the scenario in multi-CBR architecture to handle complicated relationship between system modules [9], but still may fail to solve problem efficiently.

As we know, once employees who lack enough experience to solve problems, they usually face a lot of troubles and need to retrieve the corresponding solution; hence the approach case-based reasoning, which uses case retrieve, case reuse, case revise and case retain approaches to help employees to find the suitable solutions is proposed. However, usually each problem has at least one solution, so it is difficult to find all solutions.

Traditionally, when experts want to find the desired solutions for a given problem, they usually find out the pre-stored answer of similar problems. Assume the problems can be described by their attributes; using CBR as our approach to retrieve cases, reuse similar cases, retain the related information into case base, and revise attributes or attribute values in case base seems to be a good approach.

(16)

Chapter 3. The Design of Solution-Retrieval System

As we know, many traditional approaches, e.g., search engine or Knowledge Management (KM) center may be used for trouble shooting. However, those approaches may fail due to resulting too much information to help users to solve problems efficiently. In a complicated and time limited situation, such as critical system recovery, the accuracy is important.

In order to ensure the problems can be handled in limited time, we propose Solution-Retrieval System (SRS) to help on-duty employees.

3.1. Knowledge Representation

Trouble shooting is difficult for novices in some specific domains, e.g., Supply Chain related domains, which used Application server and DB as applications to support such a system. There are tree structures in modules of Application Server and DB. Therefore, based on system characteristics, we design the SRS by ontology approach.

As we know, the Oracle DB is composed of modules, modules have predecessor, successor and sibling relationships, e.g., the root module “Oracle DB” has two successors, “instance” and “database”, their common predecessor are “Oracle DB”, “instance” and “database” are sibling relationship, details are shown in Appendix C. The Oracle database modules are shown in Figure 3.1.

(17)

Figure 3.1. Database Module Tree

Since database modules have tree structure in nature, and the error type diagnosis is based upon the hierarchical structure to narrow down the error type. Therefore, as system is abnormal, experts can diagnose problems based on system characteristics to infer the error type.

Although knowledge acquisition (KA) is a bottleneck in expert system, Oracle consultants and DBA in our company make KA to be easily done. Besides, we use EMCUD [22] to elicit embedded meaning of the original knowledge when system information can not be easily obtained.

When problem occurs, experts usually use the following two steps to solve problems. Firstly, experts refer the error type from system. Secondly, they compare similarity between the problem and solution cases.

As we know, experts use their domain knowledge to narrow down the error scope, then compare the similarity between the narrowed down error scope (called the error type) and solved solutions according to some attributes, finally select high similarity cases as solutions to solve problems. Hence, we imitate experts to adapt hybrid knowledge base approaches to solve problems, as shown in Figure 3.2. In Phase one, experts infer the error type based on system characteristics, system status and related logs, e.g., inference from module “Oracle DB”, “Database”, “Daemon” then “SMON”, along with the root module “Oracle DB” to leaf

(18)

module “SMON” to diagnose the error type. Furthermore, experts reason the high similarity solutions to solve problems depending on the experience. The knowledge representation in trouble shooting is shown in Figure 3.2.

Figure 3.2. Knowledge Representation

In Phase one, we use “If <condition> Then <action>” condition statement as inference approach. Due to the varied system problems, we use NORM (New Object-oriented Rule Model), a forwarding chaining rule-based inference. The knowledge base (KB) can be divided into different knowledge classes depending on system characteristics, and the relations between classes include Trigger, Acquire, Reference, and Extension-of. Figure 3.3 displays the rule based classes for diagnosing Oracle DB.

(19)

Figure 3.3. Rule base knowledge representation on Oracle database

In our approach, solution cases could be saved as data row consisting of several attributes in DB. In Phase two, we compare similarity between problems and cases using case-based reasoning for solution retrieving, e.g., problem subject, product version, product installed platform, error type, are for similarity calculation, and document type and revised date are for solutions ranking .

3.2. The SRS System Architecture

There are two phases in SRS. Phase one is the problem diagnosis by rule based

inference with system logs, error logs and system status information as the input, and the

error type as the output. Phase two solution retrieving by case based reasoning with query (the error type and user input) as the input and the related solutions as the output. The SRS is composed of three main components: preprocessor, rule-based inference module [22] and CBR module. Preprocessor is responsible for translating logs into facts, rule-based Inference module is responsible for inferring the error type, and CBR module is responsible for reasoning the related solutions. In Figure 3.4, totally there are ten steps to describe overall system operating flow by following scenario.

(20)

Figure 3.4. SRS Architecture Overview

Step 1. System Monitor: Once the system is abnormal, the system monitoring tool

(called System Monitor) will detect the abnormal conditions and deliver related logs and system status to preprocessor for further process. The step is in on-line process.

Step 2. Preprocessor: The preprocessor will parse related logs and translate system

status information into weight and facts, which can be used in adaptive queries and can be inferred by the inference engine, respectively. The step is in on-line process.

Step 3. Rule-based Inference: Via logs preprocessing, the inference engine will infer

the facts and output the error type. The step is in on-line process.

Step 4. Query: Query is generated according to the obtained error type and users input.

(21)

Step 5. Case Retrieve: CBR module will retrieve similar solutions from case base. The step is in on-line process.

Step 6. Case Reuse: The existing solutions can be reused when cases similarity is higher

than a threshold value. The step is in on-line process.

Step 7. Retrieved Solutions: Feedback solutions to users while the solutions are found.

The step is in on-line process.

Step 8. No Solution be Found: If there are no proper solutions could be found, users

can try another keywords again. The step is in off-line process.

Step 9. Case Revise: Once no solutions could be found confirmedly, on-duty employees

need to call experts for help, and the attributes or attribute values in case base need to be revised to fulfill the new problem requirements. The step is in off-line process.

Step 10. Case Retain: Retain revised value or information into case base to fit new

requirements. The step is in on-line process.

In our design, preprocessor is responsible for transferring related logs and system status into facts. According to the different logs and system status, the preprocessor parses logs into different facts depending upon index table.

Example 3.1. The Error Type of System Pending.

1. As system is in a pending situation, the preprocessor retrieves system logs, e.g., “ORA-01575: timeout waiting for space management resource enquene”.

2. Based on keywords index table, preprocessor parses the logs contains keywords (e.g., “ORA”, “space management”, and “enquene”)

3. The facts are generated depending on parsed keywords, e.g., keyword ORA becomes fact “system=DB”, keyword space management becomes fact “module stuff=physical”, and “One Stuff” and keyword enqueue reveal the weight of “database version” is high.

(22)

Not only system logs but also system status which includes CPU, memory, and disk utilities, etc, are critical for problem reference to assist the diagnosis precisely.

(23)

Chapter 4. Problem Diagnosis by Rule-Based Inference

As we know, rules could be generated according to system modules, system characteristics, system logs ,and system status by knowledge acquiring from experts. We use NORM as knowledge representation and adapt forward chaining as our approach based on system characteristics.

4.1. Knowledge Acquisition

Based upon NORM concept, the rule base structure for system problem diagnosis is proposed. In Figure 4.1, the root rule class System consists of two rule classes Application

server and Oracle DB, where A means acquire relation, T means trigger relation in rule

classes.

Figure 4.1. Rule Base Structure for System Problem Diagnosis Definition 4.1. Relation Acquire in Rule Classes.

Sub-problem may be solved by acquiring another rule class, Once the acquired knowledge object ends inference process, it backs to original inference process.

(24)

Rule class DB Daemon acquires rule class Daemon Home, then backs to rule class

DB Daemon and carries on inference job.

Definition 4.2. Relation Trigger in Rule Classes.

Some rule class is triggered when some specific conditions are satisfied. It means that a problem may be transformed into another problem.

Example 4.2. Trigger Sub-Module.

Rule class Database triggers rule class DB Daemon if the condition is satisfied.

Definition 4.3. Acquisition Table (AT).

AT is a repertory grid of multiple data types. Boolean ：true or false

Single value：an integer, a real, or a symbol

Set of values：a set of integers, real numbers or symbols. Range of values：a set of integers or real numbers.

The table of approach EMCUD [22], columns composes of objects, rows composes of facts. The corresponding object and fact have a value to identify the value of the object feature.

Example 4.3. AT of Rule Class “Oracle DB”.

Due to rule class “Oracle DB” containing objects “DB” and “Instance”, those objects are put in column title of AT. Facts are put in row title of AT to separate different objects, as shown in Figure 4.2.

Definition 4.4. Attribute Ordering Table (AOT).

(25)

composes of objects, rows composes of facts. The corresponding object and fact have a value to identify the relationship between the object and the fact. D: means dominate the relationship, X: means no relationship, integer means the strength of relationship (from 1 to 5, 5 is the strongest relationship). According to database ontology, the inferring sequence is from top (the root-module) to bottom (the leaf-module).

Rule generation is composed of three steps described as follows. Step (1). Ontology: The system owns trigger and acquire relationships between rule classes depending on rule class features. Step (2) AT and AOT of EMCUD [22]: The objects are put in columns, and facts are put in rows to build objects values and relationships. Step (3) Rule generation: The rules are generated by AT and AOT, as shown in Figure 4.2.

Figure 4.2. Rule Generation from Rule Class Oracle DB

Example 4.4. Rule Generation by EMCUD.

In Figure 4.2. Rules are Generated from Rule Class Oracle DB

(1) Rule class Oracle DB is the root class of the ontology.

(2) Based on the features of Oracle DB, objects in column contain DB and Instance, facts

(26)

no, and yes in AT are listed from top to bottom, left to right in sequence to describe the

feature values between each object and fact. Input values are D, D, 4, 3 in AOT to describe the relationships strength between each object and fact. The objects DB and

Instance are described as follows:

DB: The module of database which is persistent. Instance: The module of database which is virtual. The facts are listed as follows:

System status: The system status is the fact which describes system status.

System rename: The system rename is the fact which means whether the system could be renamed or not.

(3) The rules are generated from AT and AOT, as shown in Figure 4.3.

Figure 4.3. Rules example in rule class Oracle DB

After inferring root class Oracle DB, the inference engine will determine next rule class. Based on the features of the next rule class Database which was inferred by Oracle DB, both the AT and AOT are produced and rules are generated by those tables, example is shown in Figure 4.4. The object daemon and file are described as below:

Daemon: The daemon is the DB module which keeps the database alive and do periodically system check.

File: The file is the system files which contains system files to control and record system status.

The facts are listed as below:

(27)

Module behavior: The module behavior is the behavior which describes modules working behavior.

Module number: The module number is the number which describes the module number.

Figure 4.4. Rule Generation from Rule Class Database

Finally the rule class DB daemon is triggered, and the rule class Daemon home will be acquired on next step, and go back to rule class DB daemon. The rule class Daemon home describes system information. Appendix A shows AT and AOT of rule class Daemon home. The rule class DB daemon describes database daemon. Appendix B shows the details of AT, AOT of the rule class DB daemon.

4.2. Embedded Rules

Since system information are not always obtained; using EMCUD [22] to elicit embed meaning seems useful and the error type could be inferred.

Definition 4.5. EMCUD.

Embedded Meaning Capturing and Uncertainty Deciding (EMCUD). A New Model for Eliciting. Knowledge Representation combine AT (or Conventional Repertory grid ) and

(28)

AOT.

Example 4.5. EMCUD.

Figure 4.5 shows the EMCUD inference processes, which are composed of two parts (Error type diagnosis and Embedded rule generation). The error type diagnosis is composed of AT and AOT in rule class “Daemon Home” as shown in Figure 4.4. In Figure 4.5, there are five different inference conditions.

(A). Transaction abort.

In Oracle database, describe the database transaction situation. Transaction abort is caused by the disconnected connections between client side and server side.

(B). Not commit transaction.

In Oracle database, when a unconfirmed transaction has been modified (insert, update or delete).

(C). Transaction Number.

In Oracle database, transaction number is the count of online processes. It will cause an abnormal situation in DB when transaction number is overloading.

(D). CPU loading.

Central Processing Unit (CPU) is the hardware device in server for calculation and control other related units. When CPU utility is overloading, It will cause an abnormal condition.

(E). Memory Loading.

Memory is the hardware device for data high speed calculation and caching buffer. Usually, memory loading is the important reference fact during problem diagnosis.

Object Definition. TROLL. Transaction Rollback.

(29)

due to related transaction abort or error.

In normal condition, inference engine refers the error type, which contains higher certainty factor (CF) by using EMCUD. e.g., rule IF A & B & C & D & E Then T with high CF, T is the inference result. As system information can not be easily obtained in problem diagnosis, we use EMCUD [22] to elicit the embedded meaning. e.g., rule IF A & B Then T with lower CF, T is the inference result.

(30)

Chapter 5. Solution Retrieving by CBR

After problem diagnosis in phase one, the query is generated according to the obtained error type and users input. In Phase two, our goal is using CBR approach to find similar cases, and use the retrieved solutions to solve problems.

The output in phase consists of the error type and weights, the error type could be used to increase performance in similarity calculation, and weights could be used to adjust weights in case attributes to increase precision, as shown in Figure 5.1.

Figure 5.1. Adaptive query in CBR

5.1. Case Retain

As we know, the error type contains multi-attributes, e.g., problems subjects, module name (application), installed platform (platform) and product version (version). Therefore, 12 attributes are chosen as the representative of CASE_ID, NAME, SUBJECT, VERSION, PLATFORM, APP, DOC_TYPE, REVISION_DATE, PRODUCT, WORD_VECTOR, SOLUTIONS, SOLUTION1, as shown in Figure 5.2.

(31)

Furthermore we use two attributes : “document type” and “revision date” to rank the obtained solutions. Figure 5.3 shows an example of case in case base.

Figure 5.3. Case example

In Figure 5.3, red rectangle dotted line contains 4 attributes (subject, app, version, platform) which contained in both are query and case, where each case has a corresponding solution (blue rectangle dotted line) on different attributes (e.g., subject, app, version and platform could be integrated as an unique combination to identify different solutions.). Since those attributes are comparable between query and cases, we use equations to calculate their similarity the weights of the attributes can be adjusted due to different logs and system information in Oracle database and Oracle Application Server.

5.2. Case Retrieve

Definition 5.1. Similarity Equation between Query and Case.

{

S q c( , )=Ws×S q cs( , )+Wa×S q ca( , )+Wp×S q cp( , )+Wv×S q cv( , )

1

s a p v

W +W +W +W =

(1) The attributes Subject, Application, Platform and Version are used to identify the

similarity between query and case in case base, different logs and system status will form different weights in each attribute designed by experts. The calculation method is shown in Definition 5.1. Similarity Equation between Query and Case.

(32)

(2) In the equation, where

the similarity between query and case

( , )

S q c

the similarity between query and case in subject attribute ( , )

s

S q c

the similarity between query and case in application attribute ( , )

a

S q c

( , )

p

S q c _{the similarity between query and case in platform attribute}

( , )

v

S q c _{the similarity between query and case in version attribute}

s W subject weight a W _{application weight} p W _{platform weight} version weight v W

(3) As we know, both error type and case have 4 attributes to compute similarity.

Definition 5.2. Similarity Equation of Subject.

{

,i=1

Min number of Keywords of query and case min

N

max

N Max number of Keywords of query and case m

W Keyword mapping weight 0 or 1 i

N the number query keywords i

W Keyword weight c

N _{The number of case keywords}

where

{

(1) Subject is the short description to describe the error situation. Firstly, the attribute

“subject” could be translated as keywords based on keywords table. The similarity is different based upon different keywords and the keywords ranking order in sequence, as shown Figure 5.4. ( , ) s S q c = ,i>1 { } { } { } min max min max min 1 max , , , , ( ) , 0,1 1 m i i c i i n m i i m i i i W N W N N N N N N N N W N W N W N N N = × × = = − × × = −

∑

c

(33)

Figure 5.4. Example of query and case comparison

(2) In Figure 5.4, query and first case are similar, since they have the same keywords

(e.g., ORA-1575 and Timeout) and the same ranking order. From this idea, the calculation equation was derived, as shown in Definition 5.2. Similarity Equation of

Subject, where S q c_s( , ) is the similarity between query and case in subject attribute.

Example 5.1. Subject Similarity.

Similarity example was shown as below.

2 1 3 2 1 1 ( , ) 0.6 2 0 3 2 0 1 ( , 6 3 2 2 3 ) 0 3 2 2 2 2 2 3 2 s s S q S q c c = × × + × × × + × × = = × =

Similarity : _{Query and Case pairs in Blue}

Query and Case pairs in Gray

In Example 5.1, query and first case are similar, where the similarity is 0.66 (the highest value is 1 in similarity value).

(34)

Definition 5.3. Similarity Equation of Application.

{

tan tan max max ( , ) ( , ) ( , ) ( , ) ( , ) ( , )

ad dis ce cons ce dicrection

ad a S q c D q c C q c D q c D S q c S q c D = + × − = max D tan ( , ) dis ce D q c tan ( , ) cons ce C q c ( , ) dicrection D q c ( , ) ad S q c

D irection change tim es

C onstance betw een query and case D istance betw een query and case Sim ilarity distance

M ax distance

{

w here

(1) Application is the inferred module name in problem diagnosis phase, which is the

root cause of the error occurred, e.g., SMON (system monitoring). Since database modules are composed of tree structure, the attribute “application” could be translated as ontology model, called application ontology, as shown in Figure 5.5.

Figure 5.5. Application ontology

(2) In Figure 5.5, using application ontology to determine similar pairs from modules in

database, e.g., the distance of modules CKPT and ARCN is closer than modules ARCN and SMON; hence the similarity of modules CKPT and ARCN is higher that modules ARCN and SMON.

(35)

(3) In the equation, where is the similarity between query and case in app attribute.

( , )

a

S q c

Example 5.2. Application Similarity.

Application similarity example is shown as below.

Similarity distance: ( , ) 6 0.5 1 6.5 ad q S c = + × ( , ) 2 0.5 1 2.5 ad S q c × = = = + 7.5 6.5 ( , ) 0.13 7.5 7.5 a q c S = − = Similarity: 2.5 ( , ) 0.66 7.5 a S q c = − =

In Example 5.2, firstly, calculate distance in modules, secondly, using similarity equation to determine similarity. The similarity between two modules (SMON and ARCN) is 0.66, which is higher than the similarity value 0.13 between modules CKPT and ARCN.

Definition 5.4. Similarity Function of Platform.

( , ) | | | | | ( , ) | ( , ) pd platform platform w w pd p S q c C Q C Q D S q c S q c D α β = − + − − =

{

Cplatform platform Q

Platform w eight of case Platform w eight of query w

C V ersion w eight of case w

Q V ersion w eight of query α β D C onstant C onstant C onstant Platform distance Platform sim ilarity ( , ) pd S q c ( , ) p S q c where

(36)

Server. The platform is the attribute to compute similarity in solution cases. For this reason, the platform in our example are divided into Windows NT, Red hat Linux, Sun Solaris and IBM AIX, the sub revision in versions could be separated in each different platform. According to this concept, the platform representation could be composed of a tree structure, named Weight Platform Tree, as shown in Figure 5.6.

Platform

NT Linux Solaris

2000 2003 F1 F3 S8 S10

AIX

5.2 5.3

Weighted Platform Tree

i W j W Wk l W 1 i W in W Wj1 Wjn Wk1 Wkn Wl1 ln W

Figure 5.6. Weight Platform Tree

(2) In the equation, where are platform weights, are the NT platform sub revision weights, are the Linux platform small revision weight, are the Solaris platform small revision weight, and are the AIX platform small revision weight. The weights are greater than other small version weights, which adjust by experts.

, , , i j k W W W Wl in jn kn ln i j k l 1~ i W W 1~ j W W 1~ k W W W_l₁~W , , , W W W W

(3) The equation is shown in Definition 5.4. Similarity function of platform, where

is the platform similarity between query and case. ( , )

p

S q c

Example 5.3. Platform Similarity. The example is shown as follows.

(37)

Similar between queryand is high 1, 1 / 2 α = β =

{

_D ₌₁ where case | 0.215 | 0. ( , ) ( , ) 7 58 p p S q c S q c D D − = = ( , ) | 0.7 0.5 | | 0.05 0.02 | 0.215 pd S q c =α − +β − = Similarity distance : Similarity :

, The value were adjusted by experts

1, 1 / 2 α = β =

{

α =1,β =1 / 2

{

where | 0.005 | 0.995 D D− = = ( , ) | 0.5 0.5 | | 0.02 0.01| 0.005 pd S q c =α − +β − = 1 D = 1 D = Similar between queryand is high case | 0.215 | 0. ( , ) ( , ) 7 58 p p S q c S q c D D − = = ( , ) | 0.7 0.5 | | 0.05 0.02 | 0.215 pd S q c =α − +β − = Similarity distance : Similarity :

, The value were adjusted by experts | 0.005 | 0.995 D D− = = ( , ) | 0.5 0.5 | | 0.02 0.01| 0.005 pd S q c =α − +β − =

In Example 5.3, firstly, calculate similarity distance based on platform version tree structure, secondly, using similarity equation to determine similarity. The similarity of platforms, (e.g., NT 2003 and Solaris 8) value is 0.785 , which is lower than similarity value 0.995 in platform versions ( e.g., NT 2000 and NT 2003).

Definition 5.5. Similarity Function of Version.

( , ) | | | | | ( , ) | ( , ) vd version version vw vw v vd v v S q c C Q C Q D S q c S q c D γ λ = − + − − =

{

(1) The version definition is the installed version of Oracle DB and Application Server. Except for subject, application, and platform, the product version is still an attribute. Oracle DB versions have tree structure stuff; as shown in Figure 5.7.

M ain version w eight of case M ain version of query

S ub version w eight of case S ub version w eight of query

C o nstant C o nstant C onstant

V ersio n distan ce V ersion sim ilarity

version C version Q vw vw ( , ) vd S q c ( , ) v S q c v where D C Q γ λ

(38)

Oracle DB

Oracle 8 Oracle 9 Oracle 10

8.0.5 8.1.7 _9.0.1 _9.2.0.7

10.0.1 10.1.3.0.1

Weighted Database Version Tree

' i W ' j W ' k W ' 1 i W _' in W Wj'1 ' jn W ' 1 k W Wkn'

Figure 5.7. Weighted database version tree

(2) Nowadays, the most common used version are Oracle 8, 9 and Oracle 10g in Oracle database. Each version has its own sub version, e.g., Oracle 9 have 9.0.1 ~ 9.2.0.7 sub versions. The main version e.g., the weight of Oracle 8, is higher than the sub versions, e.g., 8.0.5 , weights are designed by experts.

' i W ' 1 i W

Example 5.4. Version Similarity.

Version similarity was shown as below.

( , ) | 0.9 0.8 | | 0.03 0.02 | 0.11 vd S q c γ λ ( , ) | 0.8 0.8 | | 0.02 0.01| 0.01 vd S q c γ λ = − + − = = − + − = Similarity distance: Similarity: 1 0.11 ( , ) 0.89 1 v q c S = − = 1 0.01 ( , ) 0.99 1 v S q c = − =

In Example 5.4, firstly, calculate similarity distance since database versions have tree structure in nature, secondly, using similarity equation to determine similarity. The similarity between two versions (Oracle database 9.0.1 and 8.1.7) is 0.89, which is lower than similarity value 0.99 in versions Oracle database 8.0.5 and 8.1.7.

(39)

As we know, different log will trigger different weight, e.g., log pattern “enquence” means “version weight” need to adjust higher value by experts. e.g., = 0.3 *

+ 0.25* + 0.05 * + 0.4 * , where 0.4 is the version weight. ( , )

S q c

( , )

s

S q c S q c_a( , ) S q c_p( , ) S q c_v( , )

Example 5.5. Case Similarity.

Case 1: S q c = 0.3x0.66+0.25x0.66+0.05x0.995+0.4x0.99 = 0.808( , ) = 0.3x0.00+0.25x0.13+0.05x0.785+0.4x0.89 = 0.427 ( , )

S q c

Case 2:

In Example 5.5, the similarity values are 0.808 and 0.427 in case1 and case2, respectively. From experiment, case 1 is similar than case 2 in the problem “system pending”.

(40)

Chapter 6. Implementation and Evaluations

In our experiment, the main operating system deployed is Microsoft 2003 and Solaris 8; the expert system tool is DRAMA enterprise 2.5 [22]; the application server is Oracle Application Server 10g (9.0.4) [19]; the database server is Oracle database (9.2.0.1) [20]; the implementation of SRS is in Oracle Jdeveloper 10g [21]. The Small device platform simulator is Windows CE 5.0 with 320 by 240 dimensions.

In the following, we will focus on the system problem diagnosis module (Rule-Based Inference) and solution retrieving module (Case-Based Reasoning). Figure 6.1 displays the user interface of SRS.

(41)

6.1. Implementation of SRS

When the on-duty employees receive the error warning from short message server, traditionally they use KM Center or Search Engine as their approaches to solve problems, e.g., “Metalink” (https://metalink.oracle.com) or Google (http://www.google.com). Although they found the solution eventually, the processing of system diagnosis and solution retrieving are still not efficient due to lack of domain knowledge and experience. Therefore, we propose the SRS to assist on-duty employees to deal with those problems efficiently. SRS is composed of two main modules, problem diagnosis module and solution retrieving module.

In problem diagnosis module, we use rule-based inference approach which imitates expert’s inference model [22]. In other words, SRS provides a system problem diagnosis module. As system is abnormal, monitoring daemons will collect error logs and system information and trigger preprocessor to translate related logs into the facts, inference module [22] continues to infer the possible error types.

In solution retrieving module, we use CBR approach to calculate similarity between query and cases in case base. In other worlds, SRS provides a solution retrieving module that assist users to find out the proper solutions.

When the error type is referred by problem diagnosis module, then the solution retrieving module will retrieve the cases, reuse the solution cases and feedback solution to users. When no proper solution is found, on-duty employees will call expert for help. The experts will be on site to solve problem, and revise the attribute values or attributes in case base to full fill the needs of the new problem.

Example 6.1. System Problem Diagnosis Example of Database Pending.

We use database pending as an example and capture the symptoms of system pending as inference information. In SRS, users will receive the diagnosed error type [1][15] by small

(42)

devices, then user can input keywords to search the solutions, as shown in Figure 6.1 and Figure 6.2.

Figure 6.2. SRS User Interface in Query

After submitting the query, the solution lists will be displayed [13]. Each solution contains subject, application, version and platform, and the probabilistic similarity will be displayed by descending list, as shown in Figure 6.3. Therefore, users can choose solutions depending on probabilistic similarity [2] [11].

(43)

Figure 6.3. Solution List in SRS

Eventually, the solution will show the know-how and assist users step by step to solve system problems efficiently, as shown in Figure 6.4 [3].

(44)

6.2. Evaluation of SRS

In this thesis, we use questionnaire to evaluate our system. Firstly, we find the expert in our company to diagnose problems, and we find five users in our company to test the SRS. Finally, we use 6 kinds of predefined problems to test the accuracy and performance between SRS and KM center. Hence, both the users satisfaction and SRS accuracy could be evaluated. In this experiment, we use about 10800 rules to infer the error type, and use 36 real cases in case base that contains six kinds of error type and the corresponding solutions. We use 54 keywords in index table to assist preprocess precisely. SRS retrieves top 10 solutions list to assist employees trouble shooting; hence we compare probabilistic similarity toward 6 kinds of problem. Details are described in Table 6.1

(45)

Table 6.1

Error Types of System Diagnosis

Error Type Description

db crash Db crash means database crashed and unable to startup, this problem could cause other related issues, e.g., missing data integrity.

missing

redo log

Redo log is a buffer that save data in memory for data caching, once missing redo log, this database will crash immediately.

archive log Archive log is for data compression in database, once archive log is missing or crashing will cause data compression issue.

Data file Data file is the file for saving data records those are committed in database. Once data file is missing or crashing, it will cause data lost and database crash.

control file Control file is for controlling data file profile and related database parameters setting. Once control file is missing or crashing, it will cause database crash

system

monitoring

System monitoring (SMON) performs instance recovery following an instance crash, combine free spaces in database, and managed spaces used for sorting. Once SMON is crashing, it will cause database crash immediately.

Experiment Case. Solutions list probability on similarity towards 6 kinds of error type

SRS training cases are listed in Table 6.2 which shows solutions list probability on similarity between solution cases towards 6 kinds of error type.

(46)

Table 6.2

SRS system toward 6 kinds of error type on similarity Error Types

Top3 Lists db crash Missing redo log archive log data file control file system monitoring top1 64.50% 87.00% 87% 70.00% 92.00% 93.00% top2 59.00% 61.00% 15.00% 50.50% 35.00% 78.50% top3 59.00% 15.00% 9.50% 32.00% 9.50% 58.00% The distributed probability is shown in Table 6.2. From the experiment result, top 3 solutions list in error type “db crash” is closer on accuracy. In error type “control file” and “system monitoring” of top1 solutions probability are more accurate than others. On average, SRS system solutions probability is over 60 percent.

Evaluation 1. Problem solving on accuracy between SRS and expert

Since occurred keywords are changeable in real cases; hence we use Table 6.3 to

describe experiment the number of keywords, test times, KM Center hit the problem times in solution top 10 list, SRS hit the problem times in solution top 10 list to compare hit ratio of problem solving accuracy between SRS and expert. Besides, test times are based on keywords to set different numbers of keyword.

Table 6.3

Problem Solving on Precision

Error Types Evaluation Entities db crash missing redo log archive log

data file control file system monitoring Test times 5 4 6 4 4 5 KM Center hit 3 3 4 2 2 3 SOS hit 4 4 5 3 3 4 KM Center accuracy 60% 75% 66% 50% 50% 60% SOS accuracy 80% 100% 83% 75% 75% 80% The diagram of precision evaluation table (Table 6.3) is shown in Figure 6.5.

(47)

0% 20% 40% 60% 80% 100% 120% db cras h mis sing redo log archiv e lo g data file cont rol file system m onito rin g error types solutions accuracy KM Center accruacy SOS accuracy

Figure 6.5. Problem Solving Precision

From experiment of Figure 6.5, SRS is more precise than KM Center. The solutions occurs in top 10 list towards six problems.

Evaluation 3. system diagnosis and solution retrieving in time aspect.

In time evaluation, SRS system listed in Table 6.4 is more quick than expert. Table 6.4

Time evaluation between SRS and KM Center

Solution Retrieving Time (minutes)

Evaluation Entities db crash missing redo log archive log

data file control file system monitoring SOS (min) 2.00 2.00 1.00 2.00 2.00 3.00 KM (min) 8.00 6.00 6.00 7.00 6.00 7.00

The diagram of time evaluation table (Table 6.4) is shown in Figure 6.6. Figure 6.6 shows the comparison result between SRS and KM Center towards system diagnosis and solution retrieving in time aspect.

(48)

0 1 2 3 4 5 6 7 8 9 db cr ash miss ing re do lo g arch ive l og data file cont rol f ile syste m m onito ring error types time (minutes) SOS (min) KM (min)

Figure 6.6. Time evaluation between SRS and expert

In time evaluation, SRS is quicker than KM Center in problem diagnosis and retrieving the corresponding solutions towards six kinds of error type. From experiment, the error type “db crash” is the complicated problem, and using KM approach needs eight minutes to do problem diagnosis and solution retrieving.

(49)

Chapter 7. Conclusion and Future Work

System diagnosis and solution retrieving are very important for IT employees, until now, most of them are unable to find a better way to solve problem, even though they have document center and search engine to assist employees to solve problems, but the cost is still too high and not efficient. Unlike traditional mechanism, SRS system is a hybrid system, using rule-based inference and case-based reasoning. It is proper for applying system diagnosis and solution retrieving. Thereby we can refine rule base and revise case base quickly, while new version created and new problem occurs. Besides, SRS system integrates with monitoring tool and mobility devices that make problem solving more efficient.

In this thesis, we design and implement Solution Offering System (SRS) to assist employee to solve problem and discover problem easily. Our main contributions are: (1) Provide hybrid architecture to solve system problem, use case-based inference to infer the error type and increase performance, and use case-based reasoning retrieving solved case to assist employee deal with the abnormal situations. (2) Define NORM structure of Oracle Application Server and Oracle database to enhance inference. (3) Define attributes in query and case to increase accuracy for finding solutions.

In future work, we wish to apply this hybrid architecture in different domain to help more employees, e.g., IC design and supply chain. Due to they have tree structure stuff on modules and the attributes could be identified and compared. Depending on tree structure, each rule class has its own objects and facts, the object name, fact name and fact value need to be modified to fulfill new related domains. On the other hand, attributes need to be modified in query and case to satisfy the similarity comparison in new domain. Basically, the architecture of SRS system could be kept for new related domains because this architecture imitates the thinking model from experts.

(50)

Bibliography

[1] Y. I. Chang and W. H. Hsieh, “An efficient scheduling method for query-set-based

broadcasting in mobile environments,” Distributed Computing Systems Workshops, 2004. Proceedings. 24th International Conference, Page(s):478 – 483, 2004.

[2] Y. F. Chen, H. Huang, R. Jana, S. John, S. Jora, A. Reibman and B. Wei, “Personalized

multimedia services using a mobile service platform,” Wireless Communications and Networking Conference, 2002. WCNC2002. 2002 IEEE, Page(s):918 - 925 vol.2, 17-21 March 2002.

[3] Y. Chen, W. Y. Ma and H. J. Zhang, “Detecting web page structure for adaptive viewing

on small form factor devices,” ACM TXN, 2003.

[4] M. Gu, X. Tong and A. Agnar, “Comparing similarity calculation methods in

conversational CBR, “Information reuse and integration, Conf, 2005. IRI -2005 IEEE International Conference, Page(s):427 – 432, 15-17 Aug. 2005.

[5] M. J. Hajar and S. P. Lee, PhD, “Applying machine learning using case-based

reasoning (CBR) and rule-based reasoning (RBR) approaches to object-oriented application framework documentation,＂ Information Technology and Applications, 2005. ICITA 2005. Third International Conference Volume 1, Page(s):52 - 57 vol.1, 4-7 July 2005. [6] S. Hayashi, T. Asakura and S. Zhang, “Study of machine fault diagnosis system using

neural networks,” Neural Networks, 2002. IJCNN '02. Proceedings of the 2002 International Joint Conference on Volume 1, Page(s):956 – 961, 12-17 May 2002.

[7] X. Hou, J. Gu, X. Shen and W. Yan, “Application of data mining in fault diagnosis

based on ontology”, Proceedings of the Third International Conference on Information Technology and Applications (ICITA’05), 2005.

(51)

[8] S. Krishnaswamy, S. W. Loke and A. Zaslasvky “A hybrid model for improving

response time in distributed data mining,” Systems, Man and Cybernetics, Part B, IEEE Transactions Volume 34, Issue 6, Page(s):2466 – 2479, Dec. 2004.

[9] F. R. Kumar, S. Gopalan and V. Sridhar, “ Context enabled Multi-CBR based

Recommendation Engine for E-commerce,＂ e-Business Engineering, 2005. ICEBE 2005. IEEE International Conference, Page(s):237 – 244, 12-18 Oct. 2005.

[10] G. Lambert-Torres, H. G. Martins, R. Rossi and L. E. B. da Silva, “Using similarity

assessment in case-based reasoning to solve power system substation problems,” Electrical and Computer Engineering, 2003. IEEE CCECE 2003. Canadian Conference

Volume 1, Page(s):343 - 346 vol.1, 4-7 May 2003.

[11] T. Lemlouma and N. Layaida, “Adapted content delivery for different contexts,” Applications and the Internet, 2003. Proceedings. 2003 Symposium, Page(s):190 - 197, 27-31 Jan. 2003.

[12] W. Y. Lum and F. C. M. Lau, “User-centric adaptation of structured Web documents

for small devices,” Advanced Information Networking and Applications, 2005. AINA 2005. 19th International Conference, Volume 1, Page(s):507 - 512 vol.1, 28-30 March 2005.

[13] W. C. Peng and M. S. Chen, “Developing data allocation schemes by incremental

mining of user moving patterns in a mobile computing system,” IEEE Transactions on Knowledge and Data Engineering, Volume 15, Issue 1, Page(s):70 – 85, Jan.-Feb. 2003.

[14] I. Rish, M. Brodie, S. Ma, N. Odintsova, A. Beygelzimer, G. Grabarnik and K. Hernandez, “Adaptive diagnosis in distributed systems ,” IEEE Transactions on neural networkds, VOL. 16, NO. 5, SEPTEMBER 2005.

(52)

[15] Y. L. Wai and F. C. M. Lau, ” User-centric content negotiation for effective adaptation

service in mobile computing,” IEEE Transactions on Software Engineering, Page(s):1100 – 1111, Volume 29, Issue 12, Dec. 2003.

[16] D. Zhang, S. Dai, Y. Zheng, R. Zhang and P. Mu, “Researches and application of a

hybrid fault diagnosis expert system,” Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on Volume 1, Page(s):215 - 219 vol.1, 28 June-2 July 2000.

[17] Oracle Admin, “Oracle9i Application Server Architecture and Components,”

http://www.huihoo.com/oracle/application_server/9i.html.

[18] Oracle Admin, “Oracle to answer Regis MSCD 640 Oracle Admin on Windows 2000 Vocabulary Assignment Answers,” http://www.wilsonmar.com/1oraarch.htm.

[19] Oracle, OTN, http://www.oracle.com/technology/software/products/ias/index.html. [20] Oracle, OTN,

http://www.oracle.com/technology/software/products/database/oracle10g/index.html. [21] Oracle, OTN, http://www.oracle.com/technology/software/products/jdev/index.html. [22] 曾憲雄等, 人工智慧與專家系統─理論、實務、應用 ,2005, 旗標。

(53)

Appendix A: AT of Rule class Daemon home

TROLL TREC CIP CDFH WLTD CFDF

Memory loading 80% 85% 90% 85% 90% 95% CPU loading 80% 85% 90% 85% 90% 95% Transaction number x>=1000 x>=1200 1<=x<=100 x>=1500 x>=500 x>=600 Not commit transaction

Yes No Slightly Slightly Slightly Sometimes

Transaction abort

Yes No Slightly Slightly Sometimes Sometimes

CKPT not trigger yet

No Yes Slightly Slightly Sometimes Slightly

System pending

Very Often Very Often Sometimes Often Often Often

Idle process number

1<= x <=10 1<= x <=10

x>=100 1<=x<=50 1<= x <=10 1<= x <=10

Disk Full No No No Slightly Yes Yes

Write back to control file

Slightly Slightly No Slightly Sometimes Yes

Write back to date file

Slightly Slightly No No Sometimes yes

Write back to log file

Slightly Slightly sometimes No Yes sometimes

Where objects are listed as follows: TROLL: Transaction Rollback TREC: Transaction Recovery

CIP: Check Idle ProcessesCDFH: Check data file header WLTD: Write log to db

CFDF: Update control file and data file Facts are listed as follows:

(54)

Memory loading: The loading status of hardware memory is in DB. CPU loading: The loading status of CPU utility is in DB.

Transaction number: The number of transaction processes in DB.

Not Commit Transaction: The transactions those are modified but are not commit yet. Transaction abort: The transaction those are error or other related reason to trigger

transaction abort.

CKPT not trigger yet: The DB module “check point” is not been triggered yet. System Pending: The whole system (DB and Application server) are pending. Idle Process number: The processes number those are idle locate in current system. Disk Full: The volume of hardware disk that is full.

Write back to control file: When system parameters are modified, the DB module will write modified information into control file.

Write back to data file: When system data are modified, the DB module will write data into data files which are located in DB.

Write back to log file: When system logs are modified and exceed the redo log buffer, the DB module will write logs into log files which are located in DB.

(55)

AOT of Rule class Daemon home

TROLL TREC CIP CDFH WLTD CFDF

Memory loading 3 3 3 3 4 3 CPU loading 3 2 3 3 3 2 Transaction number 3 3 3 2 3 3 Not commit transaction D X 1 2 1 1 Transaction abort D X 1 2 2 1 CKPT not trigger yet 2 D 2 1 1 1 System pending D 3 1 2 2 3 Idle process number 1 2 D 2 1 2 Disk Full 3 3 3 3 3 4 Write back to control file 1 1 1 1 2 D Write back to date file 1 1 1 1 2 D Write back to log file 1 1 1 2 D 3 Where,

D: dominate the relationship X: no relation

(56)

Appendix B: AT of Rule class DB Daemon

System monitoring

Process monitoring

Check point Log writer Transaction rollback no yes slightly partial Transaction recovery no yes no slightly Clear idle process partial Yes slightly no Clear data file header no No slightly no Write log into disk sometimes sometimes sometimes yes Update control file and data file no sometimes yes yes Instance recovery yes no no no Temp segment recovery yes no no no Connect fragment yes no no no

Sub processes 1 1 1 1-10

Where objects are System monitoring, Process monitoring, Check point, and Log writer, please refer the Appendix C to reach the details.

The Facts are listed as follows:

Transaction rollback: When errors occurs in data saving, the related transactions will be roll back to their previous status.

Transaction recovery: Transaction Recovery is an application recovery whereby the effects of specific transactions during a specified timeframe are removed from the database.

Clear idle process: When idle processes exceed the numbers of system predefined setting, the DB module will clear idle processes.

Clear data file header: When writing data file errors, the data file header will be clear by DB module.

Write log into disk: When system produces some information, the DB module will write logs into disks.

Update control file and data file: When system parameters or data have been modified, the DB module will update control file and data file which are located in DB.

Instance recovery: Occurs when a software or hardware problem prevents an instance from continuing work.

Temp segment recovery: Occurs when a segment space is not enough or software error. Connect fragment: When system fragments are exceeding a value, then the DB module

will connect the fragment block into a larger block for DB reuse.

(57)

AOT of Rule class DB Daemon

System monitoring

Process monitoring

Check point Log writer Transaction rollback 2 D 1 3

Transaction recovery 2 D X 1

Clear idle process 3 D 2 X

Clear data file header X X 1 X Write log into disk 2 2 2 D Update control file and datafile 1 2 D D

Instance recovery D 0 2 1

Temp segment recovery D 0 1 1

Connect fragment D 0 0 0

Sub processes 3 3 2 2

Where,

D: dominate the relationship X: no relation

(58)

Appendix C: Oracle database 10g architecture

Components:

SGA are made up three require components and three optional components:

Shared Pool – Cache the most recently used SQL statements that have been issued by database users.

Database buffer Cache—Cache the data that has been recently accessed by database users.

Redo log buffer—Store transaction information for recovery purpose.

Java pool—Caches the most recently used Java objects and application code when Oracle’s JVM option is used.

Large pool—Cache data for large operations such as Recovery Manager backup and restore activity and Shared Server components.

Stream pool—Cache the data associated with queued message requests when Oracle’s Advanced Queuing option is used.

應用 RB 與 CBR建構一個解答擷取系統

國

立

交

通

大

學

電機資訊學院 資訊學程

碩

士

論

文

應用 RB 與 CBR

建構一個解答擷取系統

Building a Solution-Retrieval System

Based on RB and CBR Approaches

研 究 生：李宗平

指導教授：曾憲雄 博士

應用 RB 與 CBR 建構一個解答擷取系統

Building a Solution-Retrieval System Based on RB and CBR

Approaches

研 究 生：李宗平 Student：Tsung-Ping Lee

指導教授：曾憲雄 Advisor：Shian-Shyong Tseng

國 立 交 通 大 學

電機學院與資訊學院專班 資訊學程

碩 士 論 文

應用 RB 與 CBR 建構

一個解答擷取系統

摘要

Building a Solution-Retrieval System Based on

RB and CBR Approaches

Abstract

誌謝

Content

List of Definitions

List of Examples

List of Figures

Chapter 1. Introduction

Chapter 2. Related Work

2.1. Problem Diagnosis

Memory

File

Daemon

2.2. Solution Retrieving

Chapter 3. The Design of Solution-Retrieval System

3.1. Knowledge Representation

3.2. The SRS System Architecture

Chapter 4. Problem Diagnosis by Rule-Based Inference

4.1. Knowledge Acquisition

4.2. Embedded Rules

Chapter 5. Solution Retrieving by CBR

5.1. Case Retain

5.2. Case Retrieve

{

{

{

∑

{

{

{

{

Weighted Platform Tree

{

{

{

{

{

Weighted Database Version Tree

Chapter 6. Implementation and Evaluations

6.1. Implementation of SRS

6.2. Evaluation of SRS

Solution Retrieving Time (minutes)

Chapter 7. Conclusion and Future Work

Bibliography

Appendix A: AT of Rule class Daemon home

AOT of Rule class Daemon home

Appendix B: AT of Rule class DB Daemon

AOT of Rule class DB Daemon

Appendix C: Oracle database 10g architecture

電機資訊學院資訊學程

研究生：李宗平

指導教授：曾憲雄博士

研究生：李宗平 Student：Tsung-Ping Lee

國立交通大學

電機學院與資訊學院專班資訊學程

碩士論文