Knowledge Representation

(1)

Knowledge Representation

Represent and manipulate the domain

knowledge .

(2)

Outlines

From Knowledge to Knowledge Representation

Knowledge Representation Methods

Effective knowledge representation methods using in AI

Representing Uncertainty

Knowledge Interchange Format (KIF)

Building a Knowledge Base (KB)

(3)

From Knowledge to Knowledge Representation

Knowledge Definition

Knowledge Storing

Knowledge Representation

(4)

Knowledge Definition

 “The fact or condition of knowing something with

familiarity gained through experience or association.”

(Webster’s Dictionary, 1988)(Knowing something via seeing, hearing, touching, feeling, and tasting.)

 “The fact or condition of being aware of something” . (Ex. Sun is hot, balls are round, sky is blue,…)

(5)

Knowledge Storing

Natural language for people

Symbols for computer: a number or

character string that represents an object or idea (Internal representation of the

knowledge).

The core concepts: mapping from facts to an internal computer representation and also to a form that people can understand.

(6)

Knowledge Representation

Simple facts or complex relationships

Mathematical formulas or rules for natural language syntax

Associations between related concepts

Inheritance hierarchies between classes of objects

Knowledge is not a “one-size-fits-all”

proposition.

(7)

Knowledge Representation Methods

Effective knowledge representation methods:

 Easy to use.

 Easily modified and extended (changing the

knowledge manually or through automatic machine learning techniques).

(8)

Knowledge Representation Methods

 Procedural method

 Declarative method

 Relational method

 Hierarchical method

 Complex network graph

(9)

Procedural representation

 Encode facts and define the sequence of operations step by step. (Hardcoded logic)

 The weakness: the knowledge and the

manipulation of that knowledge are inextricably linked.

(10)

Declarative method

Overcoming the weakness of procedural representation

The states, facts, rules, and relationships are separated declared.

The separation of knowledge from the algorithm used to manipulate or reason with that knowledge provides

advantages over procedural codes.

Because the knowledge is explicitly represented, it can be more easily modified.

Separating the control logic and reasoning algorithms from the knowledge allows us to write optimized and reusable inferencing procedures.

(11)

Hierarchical representation

Used to represent inheritable knowledge.

 Inheritable knowledge: it centers on

relationships and shared attributes between kinds or classes of objects.

 Hierarchical knowledge is best used to

represent “isa” relationships, where a general or abstract type (ex, ball) is linked to more specific types (rubber, golf, baseball, football) which

inherit the basic properties of the general type.

(12)

Hierarchical representation

 The strength of object inheritance allows for

compact representation of knowledge and allows reasoning algorithm to process at different levels of abstraction or granularity.

 The use of categories or types gives structure to the world by grouping similar objects together.

 Using categories or clusters simplifies reasoning by limiting the number of distinct things we have to deal with. (reduce complexity)

(13)

Hierarchical representation - Capturing knowledge

 Knowing what to expect based on the elapsed time from one event to another is often the

hallmark of intelligent behavior.

 Time concepts such as before, after, and during are crucial to common-sense reasoning and

planning. Temporal logic is usually used to represent and reason about time.

(14)

Predicate Logic

The use of formal logic as a primary knowledge

representation harkens back to the beginnings of AI research.

Mathematical deduction based on logic, was a well- known method of generating new knowledge from existing knowledge.

(15)

Predicate Logic

Formal logic is a language with its own syntax, which defines how to make sentences, and corresponding

semantics, which describe the meaning of the sentences.

Sentences can be constructed using proposition

symbols (P, Q, R) and Boolean connectives, such as conjunction (And), disjunction (Or), implication (P implies Q).

Ex: if P and Q then R, the preceding rule, P and Q, is called the premise or antecedent, and R is the

conclusion or consequent.

(16)

Predicate Logic

Modus Ponens: where given a rule, A implies B, if A is true, we can infer that B is also true.

Predicate logic introduces the concept of

quantifiers, which allow us to refer to sets of

objects. Using objects, attributes, and relations, we can represent almost any type of knowledge.

Two quantifiers: : universal (all objects of this type have this attribute), : existential (there exists some object that has the specified attribute)

 

(17)

Predicate Logic

Ex: “Minnesota is cold in the winter” can be represented in three single parameter:

Place (Minnesota), Temperature(cold) and Season(winter).

Or it can be represented a single relation:

Cold (Minnesota, winter), Winter(Minnesota, cold).

(18)

Resolution (

反證法

₎

Resolution is an algorithm for proving facts

true or false by virtue of contradiction. If we

want to prove a theorem X is true, we have

to show that the negation of X is not true.

(19)

Example of ^Resolution

Suppose that we know the following two facts:

1.not feathers(Tweety) or bird(Tweety) 2.feathers(Tweety)

Sentence 1 states that either Tweety does not have feathers or else Tweety is a bird. Sentence 2 states that Tweety has feathers. To prove that

Tweety is a bird, we first add an assumption that is the negation of that predicate, giving sentence 3:

3.not bird(tweety)

(20)

Example of ^Resolution

In sentence 1 and 2, not feathers(Tweety) and

feathers(Tweety) cancel each other out. Resolving sentences 1 and 2 produces the resolvant, sentence 4, which is added to our fact set:

4. Bird(Tweety)

It is clear that sentences 3 and 4 cannot both be true, either Tweety is a bird ot it is not. Thus, we have a

contradition. WE have just proved that our first assumption, not bird(Tweety), is false, and the alternative, bird(Tweety), must be true (Winston, 1993).

(21)

Resolution

If the clauses to be resolved are selected in

systematic ways, then resolution is guaranteed to find a contradiction if one exists, although it may take a long time to find.

(22)

Unification (

結合法

₎

Unification is a technique for taking two sentences in predicate logic and finding a substitution that makes them look the same.

A variable can be replaced by a constant.

A variable can be replaced by another variable.

A variable can be replaced with a predicate, as long as the predicate does not contain that

variable.

(23)

Example of unification

Given the following set of predicates, let’s explore how they can be unified:

hates(X,Y)

hates(George, broccoli)

hates(Alex, spinach)

We could unify sentence 2 with 1 by binding George to variable X, and broccoli to variable Y.

Similarly, we could bind Alex to X and spinach to Y. Note that if the predicate names were

different, we could not unify these predicates.

(24)

Example of unification

If we introduce a few more predicates, we can explore more complex unifications:

4.hates(X, vegetable(Y))

5.hates(George, vegetable(Y)) 6.hates(Z, broccoli)

We could unify sentence 6 with sentence 1 by replacing variable X with variable Z and

variable Y with the constant broccoli. Sentence 4 and 5 could be unified with George bound to

(25)

Unification using in Prolog

A generalized version of the unification algorithm, called match, is used in Prolog (Programming in logic).

Facts are represented in Prolog by clauses, which look like standard predicates, and declare things

which are unconditionally true.

Rules are clauses where the conclusion may be

true, provided that all of the clauses in the condition part are true.

(26)

Unification using in Prolog

Prolog provides a built-in inferencing procedure, based on resolution, for processing rules and

answering questions posed as goal clauses (Bratko, 1986).

Nowadays, most commercial implementations of rule-based systems are written in c and c++.

(27)

Frames

A frame is a collection of attributes, which defines the state of an object and its relationship to other frames

(objects). But a frame is much more than just a record or data structure containing data.

In AI, frames are called slot-filler data representations.

The slots are the data values, and the fillers are attached procedures which are called before, during (to compute the value of), or after the slot’s value is changed.

Frames are often linked into a hierarchy to represent has-part ans isa relationships.

(28)

Example of Frames

Vehicle Wheels: Yes

Automobile Doors: Yes Motor: Yes

Wheels: 4

Cycle Wheels: <4

Spots Car Doors: 2 Size: Small isa isa

isa

instance

Bicycle Motor: No

Wheels: 2 isa

(29)

Frame v.s. to OOP

A Frame sounds very much like an object, whose data members are the slots, and whose methods are the attached procedures or daemons. In some sense, any Java program is a frame-based mechanism for knowledge representation. It makes use of

inheritance for isa relationships, and containment or references for has-part relationships.

(30)

Semantic Nets

Semantic nets are used to define the meaning of a concept by its relationships to other concepts. A

graph data structure is used, with nodes used to hold concepts, and links with natural language labels used to show the relatjonships.

A modern implementation of a semantic net is the Knowledge Utility (KnU) developed by IBM. ((

http://www.ibm.aqui.com http://www.ibm.aqui.com) )

(31)

Example of Semantic Net

Vehicle Wheels

Automobile

Motor

4 2 Sports Car

Small

Size Doors

has-part

num-wheels

num-doors instance

isa isa

(32)

Representing Uncertainty

In most cases, there will be some information available, but the rest will have to be inferred or collected as the inferencing proceeds. We have uncertainty.

Normally, we have a statistical theory which works well under conditions of uncertainty, call Bayes’ rule or Bayes’ theorem.

P(H|E): the probability of hypothesis H given that we have observed evidence E.

(33)

Representing Uncertainty

Bayes’ theorem says that we can compute the conditional probability that event Y will occur

given that event X already occurred, providing we know the prior probabilities that X and Y could

happen, and the conditional probability that X will occur when we know that Y has already occurred.

P(Y|X) = P(X|Y) P(Y) / P(X)

(34)

Knowledge Interchange Format (KIF)

A common language use for bridging the gap of knowledge format representation.

KIF is a language that was designed for the interchange of knowledge between agents (Gensereth and Fikes,

1992). KIF supports the definition of objects, functions, relations, rules, and metaknowledge (knowledge about knowledge).

The KIF syntax is reminiscent of Lisp. KIF supports two types of variables: individual variables which begin with the ? character, and sequence variables which begin with an @ character.

(35)

Knowledge Interchange Format (KIF)

The four types of operators: including term

operators, rule operators, sentence operators, and definition operators. If a token is not a variable or an operator, then it must be a constant.

A form in KIF is either a sentence, a rule, or a definition. And finally, a KIF knowledge base is a finite set of forms. The order of the forms in a

knowledge base is not important.

(36)

Example of KIF

An example rule in KIF :

(=> (EventName “Agent: Starting”)

(SetIdentifiedIntervalAlarm “NETSCAPE” 20 “minutes”))

Which translates to “if we receive an

AGENT:STARTING event then start a named interval alarm (a timer) identified by the string

“NETSCAPE” to go off every 20 minutes.”

(37)

Example of KIF

Example of KIF rule:

(GetStockPrice ?price)

(IntegerCompare ?price “>” 150)

means “get the stock price and place it in

variable ?price, then test to see if the price is greater than 150.”

(38)

Summarization for KIF

The following formats can be used to represent the facts, rules, relationships, and complex

transformations about knowledge:

Procedural code Predicate logic If-then-else rules Semantic nets Frames

When combined with a reasoning system or

inference engine, a knowledge base becomes a part

(39)

Building a Knowledge Base (KB)

KB is designed for grouping the various knowledge together in one place. The KB is the central repository of information containing the facts we know about objects and their relationships.

Knowledge engineering (knowledge acquisition):

mapping the set of knowledge in a particular problem domain and converting it into a knowledge base.

Domain expert: who through years of experience, has gathered the knowledge about how things work and relate to one another, and know how to solve problems in his or her

(40)

Building a Knowledge Base (KB)

Knowledge engineer: who can take that domain knowledge and represent it in a form for use by the

reasoning system. As an intermediary between the human expert and the expert system, the knowledge engineer must have good people skills as well as good technical skills.

A combination of questionnaires, interviews, and first- hand observations are used to give the knowledge engineer the deep understanding required to transform the expert’s knowledge into facts and rules for the knowledge base.

(41)

Building a Knowledge Base (KB)

Neural networks could be trained to perform

classification and prediction tasks without going through the expensive knowledge acquisition process.

Neural networks may not be easily converted to a

symbolic form, they most definitely are a knowledge base, because they encode the knowledge implicit in the training data.

Knowledge Representation

Knowledge Representation

Represent and manipulate the domain

knowledge .

Outlines

From Knowledge to Knowledge Representation

Knowledge Definition

Knowledge Storing

Knowledge Representation

Knowledge Representation Methods

Knowledge Representation Methods

Procedural representation

Declarative method

Hierarchical representation

Hierarchical representation

Hierarchical representation - Capturing knowledge

Predicate Logic

Predicate Logic

Predicate Logic

 

Predicate Logic

Resolution (

)

Resolution is an algorithm for proving facts

true or false by virtue of contradiction. If we

want to prove a theorem X is true, we have

to show that the negation of X is not true.

Example of Resolution

Example of Resolution

Resolution

Unification (

)

Unification is a technique for taking two sentences in predicate logic and finding a substitution that makes them look the same.

Example of unification

Example of unification

Unification using in Prolog

Unification using in Prolog

Frames

Example of Frames

Frame v.s. to OOP

Semantic Nets

Example of Semantic Net

Representing Uncertainty

Representing Uncertainty

Knowledge Interchange Format (KIF)

Knowledge Interchange Format (KIF)

Example of KIF

Example of KIF

Summarization for KIF

Building a Knowledge Base (KB)

Building a Knowledge Base (KB)

Building a Knowledge Base (KB)

₎

Example of ^Resolution

Example of ^Resolution

₎