Belief Augmented Frames 14 June 2004

Belief Augmented Frames 14 June 2004 Colin Tan ctank@comp.nus.edu.sg http://www.comp.nus.edu.sg/~ctank

Motivation • Primary Objective: • To study how uncertain and defeasible knowledge may be integrated into a knowledge base. • Main Deliverable: • A system of theories and techniques that allow us to integrate new knowledge we have gained, and to use this knowledge to make better inferences

Proposed Solution • A frame-based reasoning system augmented with belief measures. • Frame-based system to structure knowledge and relations between entities. • Belief measures provide uncertain reasoning on existence of entities and the relationships between them.

Why Belief Measures? • Statistical Measures • Standard tool for modeling uncertainty. • Essentially, if the probability that a proposition E is true is p, then the probability that E is false is 1-p. • P(E) = p • P(not E) = 1-p

Why Belief Measures? • This relationship between P(E) and P(not E) introduces a problem: • This relationship essentially leaves no room for ignorance. Either the proposition is true with a probability of p, or it is false with a probability of 1-p. • This can be counter-intuitive at times.

Why Belief Measures? • [Shortliffe75] cites a study in which, given a set of symptoms, doctors were willing to declare with certainty x that a patient was suffering from a disease D, yet were unwilling to declare with certainty 1-x that the patient was not suffering from D.

Why Belief Measures? • To allow for ignorance, our research focuses on belief measures. • The ability to model ignorance is inherent in belief systems. • E.g. in Dempster-Shafer Theory [Dempster67], if our beliefs in E1 and E2 are 0.1 and 0.3 respectively, then the ignorance is 1 - (0.1 + 0.3) = 0.6.

Why Frames? • Frames are a powerful form of representation. • Intuitively represents relationships between objects using slot-filler pairs. • Simple to perform reasoning based on relationships. • Hierarchical • Can perform generalizations to create general models derived from a set of frames.

Why Frames? • Frames are a powerful form of representation: • Daemons • Small programs that are invoked when a frame is instantiated or when a slot is filled.

Combining Frames with Uncertainty Measures • Augmenting slot-value pairs with uncertainty values. • Enhance expressiveness of relationships. • Can now do reasoning using the uncertainty values. • A Belief Augmented Frame (BAF) is a frame structure augmented with belief measures.

Example BAF

Belief Representation in Belief Augmented Frames • Beliefs are represented by two masses: • φT: Belief mass supporting a proposition. • φF: Belief mass refuting a proposition. • In general φT + φF ≠ 1 • Room to model ignorance of the facts. • Separate belief masses allow us to: • Draw φT and φF from different sources. • Have different chains of reasoning for φT and φF.

Belief Representation in Belief Augmented Frames • This ability to derive the refuting masses from different sources and chains of reasoning is unique to BAF. • In Probabilistic Argumentation Systems (the closest competitor to BAF), for example, p(not E) = 1 - p(E). • It is possible, though, to achieve this in Dempster-Shafer Theory through the underlying mechanisms that generate m(E) and m(not E).

Belief Representation in Belief Augmented Frames • BAFs, however, give a formal framework for deriving T and F: • BAF-Logic, a complete reasoning system for BAFs. • BAFs provide a formal framework for Frame operations. • E.g. how to generalize from a given set of frames. • BAF and Dempster-Shafer Theory (DST) can in fact be complementary: • BAF as a basis for generating masses in DST.

Degree of Inclination • The Degree of Inclination is defined as: • DI = T - F • DI is in the range of [-1, 1]. • One possible interpretation of DI:

Utility Value • The Degree of Inclination DI can be re-mapped to the range [0, 1] through the Utility function: • U = (DI + 1) / 2 • By normalizing U across all relevant propositions it becomes possible to use U as a statistical measure.

Plausibility, Ignorance, Evidential Interval • Plausibility pl is defined as: pl = 1 - F • Ignorance ig is defined as: ig = pl - T = 1 - (T + F) • The Evidential Interval EI is defined to be the range EI = [T, pl]
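
The derived measures defined so far (DI, U, pl, ig, EI) can be sketched in a few lines of Python. This is a minimal illustration rather than the author's implementation: the `Masses` class, the function names, and the sample values are assumptions made for this sketch, and dividing by the sum is only one possible reading of "normalizing U".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Masses:
    """Supporting mass T and refuting mass F of one proposition."""
    t: float
    f: float


def degree_of_inclination(m: Masses) -> float:
    return m.t - m.f  # DI = T - F, in [-1, 1]


def utility(m: Masses) -> float:
    return (degree_of_inclination(m) + 1) / 2  # U = (DI + 1) / 2, in [0, 1]


def plausibility(m: Masses) -> float:
    return 1 - m.f  # pl = 1 - F


def ignorance(m: Masses) -> float:
    return plausibility(m) - m.t  # ig = pl - T = 1 - (T + F)


def evidential_interval(m: Masses) -> tuple:
    return (m.t, plausibility(m))  # EI = [T, pl]


def normalized_utilities(props: dict) -> dict:
    """Assumed normalization: divide each U by the sum of all U values."""
    total = sum(utility(m) for m in props.values())
    return {name: utility(m) / total for name, m in props.items()}


# Illustrative masses (an assumption, not taken from the slides).
example = Masses(t=0.7, f=0.2)
print(round(degree_of_inclination(example), 3), round(utility(example), 3))
print(round(plausibility(example), 3), round(ignorance(example), 3))
```

The normalized utilities sum to 1 across the supplied propositions, which is what lets them be treated like a statistical measure.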

Interpreting the Evidential Interval

Reasoning with BAFs • Belief Augmented Frame Logic, or BAF-Logic, is used for reasoning with BAFs. • Throughout the remainder of this presentation, we will consider two propositions A and B, with supporting and refuting masses T_A, F_A, T_B, and F_B.

Reasoning with BAFs: AND, OR, NOT • A ∧ B: • T_{A∧B} = min(T_A, T_B) • F_{A∧B} = max(F_A, F_B) • A ∨ B: • T_{A∨B} = max(T_A, T_B) • F_{A∨B} = min(F_A, F_B) • ¬A: • T_{¬A} = F_A • F_{¬A} = T_A
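
The three connectives translate directly into code. The sketch below represents each proposition as a `(T, F)` tuple; the function names and sample masses are assumptions chosen for illustration.

```python
def baf_and(a, b):
    """A AND B: support is the weaker support, refutation the stronger one."""
    return (min(a[0], b[0]), max(a[1], b[1]))


def baf_or(a, b):
    """A OR B: support is the stronger support, refutation the weaker one."""
    return (max(a[0], b[0]), min(a[1], b[1]))


def baf_not(a):
    """NOT A: supporting and refuting masses swap roles."""
    return (a[1], a[0])


# Illustrative (T, F) pairs, assumed for this sketch.
A = (0.7, 0.2)
B = (0.9, 0.1)

print(baf_and(A, B))  # (0.7, 0.2)
print(baf_or(A, B))   # (0.9, 0.1)
print(baf_not(A))     # (0.2, 0.7)

# De Morgan's Theorem holds under these definitions.
print(baf_not(baf_and(A, B)) == baf_or(baf_not(A), baf_not(B)))  # True
```

Because only `min`, `max`, and swapping are involved, no floating-point arithmetic takes place and the results can be compared exactly.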

Default Reasoning in BAF • When the truth of a proposition is unknown, we set the supporting and refuting masses to T_DEF and F_DEF respectively. • Conventionally, T_DEF = F_DEF = 0 • Two special default values: • T_ONE = 1, F_ONE = 0 • T_ZERO = 0, F_ZERO = 1 • Used for defining contradiction and tautology.

Default Reasoning in BAF • Other default reasoning models are possible too. • E.g. categorical defaults: • (A, T_A, F_A) : (B, T_B, F_B) / (B, T_B, F_B) • Semantics: • Given a knowledge base KB. • If KB :- A and KB :-/- ¬B, infer B with supporting and refuting masses T_B and F_B. • A detailed study of this topic is still to be made.

BAF and Propositional Logic • BAF-Logic properties that are identical to Propositional Logic: • Associativity, Commutativity, Distributivity, Idempotency, Absorption, De-Morgan’s Theorem, - elimination.

BAF and Propositional Logic • Other properties of Propositional Logic work slightly differently in BAF-Logic. • In particular, some of the properties hold true only if the constituent propositions are at least "probably true" or "probably false". • I.e. |DI_P| ≥ 0.5

BAF and Propositional Logic • For example, P and P → Q must both be at least probably true for Q to not be false. • If DI_P and DI_{P→Q} are less than 0.5, DI_Q might end up < 0. • For ∨-elimination, P ∨ Q must be probably true, and P must be probably false, before we can infer that Q is not false.

BAF and Propositional Logic • This can lead to unexpected reasoning results. • E.g. P and P → Q are not false, yet DI_Q < 0. • A possible solution is to set {T_Q = T_DEF, F_Q = F_DEF} when DI_P and DI_{P→Q} are less than 0.5. • In actual fact, the magnitudes of DI_P and DI_{P→Q} don't both have to be ≥ 0.5. Only their average magnitude must be ≥ 0.5.
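
The proposed safeguard can be sketched as a guarded inference step. The sketch below is an assumption-laden illustration: the function name `guarded_modus_ponens` is hypothetical, the sample masses are invented, and combining the premises with min/max mirrors the Sam and Bob derivation later in this presentation rather than a rule stated on this slide.

```python
T_DEF, F_DEF = 0.0, 0.0  # conventional default masses


def di(masses):
    """Degree of Inclination of a (T, F) pair."""
    return masses[0] - masses[1]


def guarded_modus_ponens(p, rule, threshold=0.5):
    """Infer masses for Q from P and P -> Q.

    Falls back to the default masses when the average magnitude of the
    two Degrees of Inclination is below the threshold.
    """
    average_magnitude = (abs(di(p)) + abs(di(rule))) / 2
    if average_magnitude < threshold:
        return (T_DEF, F_DEF)
    # Assumed combination: weakest support, strongest refutation.
    return (min(p[0], rule[0]), max(p[1], rule[1]))


# Illustrative (T, F) pairs, assumed for this sketch.
print(guarded_modus_ponens((0.9, 0.1), (0.8, 0.0)))  # strong premises
print(guarded_modus_ponens((0.5, 0.3), (0.6, 0.3)))  # weak premises -> defaults
```

With the weak premises the function returns the default masses, so Q stays at total ignorance instead of drifting toward "false".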

Belief Revision • Beliefs are not static. We need a mechanism to update beliefs [Pollock00]. • To track the revision of belief masses, we add a subscript t to time-stamp the masses. • E.g. T_{P,0} is the value of T_P at time 0, T_{P,1} at time 1, etc. • At time t, given a proposition P with masses T_{P,t} and F_{P,t}, suppose we derive masses T_{P,*} and F_{P,*}; then the new belief masses at time t+1 are: • T_{P,t+1} = α T_{P,t} + (1 - α) T_{P,*} • F_{P,t+1} = α F_{P,t} + (1 - α) F_{P,*}

Belief Revision • Intuitively, this means that we give a credibility factor α to the existing masses, and (1 - α) to the derived masses. • α therefore controls the rate at which beliefs are revised, given new evidence.
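
The revision rule is a weighted blend of the existing and the newly derived masses. A minimal sketch follows; the function name `revise`, the restriction of the credibility factor to [0, 1], and the sample values are assumptions made for illustration.

```python
def revise(current, derived, alpha):
    """Blend existing (T, F) masses with newly derived ones.

    alpha is the credibility given to the existing masses;
    (1 - alpha) goes to the derived masses.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha is assumed to lie in [0, 1]")
    t = alpha * current[0] + (1 - alpha) * derived[0]
    f = alpha * current[1] + (1 - alpha) * derived[1]
    return (t, f)


# Illustrative masses, assumed for this sketch.
existing = (0.5, 0.3)
evidence = (0.9, 0.1)

for alpha in (0.0, 0.5, 1.0):
    t, f = revise(existing, evidence, alpha)
    print(alpha, round(t, 3), round(f, 3))
```

At a credibility factor of 0 the new evidence replaces the old masses outright, and at 1 the old masses are never revised; values in between set how quickly beliefs move.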

An Example • Given the following propositions in your knowledge base: • KB = {(A, 0.7, 0.2), (B, 0.9, 0.1), (C, 0.2, 0.7), (A ∧ B → R, T_ONE, F_ONE), (A ∧ ¬B → ¬R, T_ONE, F_ONE)} • We want to derive T_{R,1} and F_{R,1}.

An Example • Combining our clauses regarding R, we obtain: • R = (A ∧ B) ∧ ¬(A ∧ ¬B) • = A ∧ B ∧ (¬A ∨ B) • With De Morgan's Theorem we can derive ¬R: • ¬R = ¬A ∨ ¬B ∨ (A ∧ ¬B)

An Example • T_{R,*} = min(T_A, T_B, max(F_A, T_B)) = min(0.7, 0.9, max(0.2, 0.9)) = min(0.7, 0.9, 0.9) = 0.7 • F_{R,*} = max(F_A, F_B, min(T_A, F_B)) = max(0.2, 0.1, min(0.7, 0.1)) = max(0.2, 0.1, 0.1) = 0.2
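
The same derivation can be checked mechanically by evaluating the formula for R over the knowledge base. The sketch below uses a small recursive evaluator; the nested-tuple encoding of formulas and the name `evaluate` are assumptions of this sketch, not part of BAF-Logic itself.

```python
def evaluate(expr, kb):
    """Return the (T, F) masses of a formula.

    A formula is a proposition name or a tuple ("not" | "and" | "or", ...).
    """
    if isinstance(expr, str):
        return kb[expr]
    op, *args = expr
    values = [evaluate(arg, kb) for arg in args]
    if op == "not":
        t, f = values[0]
        return (f, t)
    if op == "and":
        return (min(t for t, _ in values), max(f for _, f in values))
    if op == "or":
        return (max(t for t, _ in values), min(f for _, f in values))
    raise ValueError(f"unknown operator: {op}")


kb = {"A": (0.7, 0.2), "B": (0.9, 0.1), "C": (0.2, 0.7)}

# R = A AND B AND (NOT A OR B)
r = ("and", "A", "B", ("or", ("not", "A"), "B"))
# NOT R = NOT A OR NOT B OR (A AND NOT B)
not_r = ("or", ("not", "A"), ("not", "B"), ("and", "A", ("not", "B")))

print(evaluate(r, kb))      # (0.7, 0.2)
print(evaluate(not_r, kb))  # (0.2, 0.7)
```

The masses of ¬R come out as the masses of R with T and F swapped, which agrees with the definition of NOT.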

An Example • We begin with default values for R: • T_{R,0} = T_DEF = 0.0 • F_{R,0} = F_DEF = 0.0 • This gives us the following attributes:

An Example

An Example • Deriving the new belief values with α = 0.4: • T_{R,1} = 0.4 * 0.0 + (1.0 - 0.4) * 0.7 = 0.42 • F_{R,1} = 0.4 * 0.0 + (1.0 - 0.4) * 0.2 = 0.12 • This gives us:

An Example

An Example • We see that with our new information about R, our ignorance falls from 1.0 (total ignorance) to 0.46. With more knowledge available about whether R is true, we also see the plausibility falling from 1.0 to 0.88. • Further, suppose it is now known that: • B ∧ C → R

An Example • Combining our clauses regarding R, we obtain: • R = (A ∧ B) ∧ (B ∧ C) ∧ ¬(A ∧ ¬B) = A ∧ B ∧ C ∧ (¬A ∨ B) • With De Morgan's Theorem we can derive ¬R: • ¬R = ¬A ∨ ¬B ∨ ¬C ∨ (A ∧ ¬B)

An Example • T_{R,*} = min(T_A, T_B, T_C, max(F_A, T_B)) = min(0.7, 0.9, 0.2, max(0.2, 0.9)) = min(0.7, 0.9, 0.2, 0.9) = 0.2 • F_{R,*} = max(F_A, F_B, F_C, min(T_A, F_B)) = max(0.2, 0.1, 0.7, min(0.7, 0.1)) = max(0.2, 0.1, 0.7, 0.1) = 0.7

An Example • Updating the beliefs: • T_{R,2} = 0.4 * 0.42 + (1.0 - 0.4) * 0.2 = 0.288 • F_{R,2} = 0.4 * 0.12 + (1.0 - 0.4) * 0.7 = 0.468 • This gives us:
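
The whole worked example, from the default masses at time 0 through the two revisions, can be replayed in a short script that also prints DI, pl, and ig at each step. The helper names `revise` and `attributes` are assumptions of this sketch; the masses and the credibility factor of 0.4 come from the slides.

```python
ALPHA = 0.4  # credibility given to the existing masses, as in the example

# (T, F) pairs from the knowledge base.
A, B, C = (0.7, 0.2), (0.9, 0.1), (0.2, 0.7)


def revise(current, derived, alpha=ALPHA):
    """Blend existing masses with newly derived masses."""
    return tuple(alpha * c + (1 - alpha) * d for c, d in zip(current, derived))


def attributes(masses):
    """Degree of Inclination, plausibility, and ignorance of a (T, F) pair."""
    t, f = masses
    return {"DI": t - f, "pl": 1 - f, "ig": 1 - (t + f)}


# First derivation: R = A AND B AND (NOT A OR B)
derived_1 = (min(A[0], B[0], max(A[1], B[0])),
             max(A[1], B[1], min(A[0], B[1])))
# After adding B AND C -> R: R = A AND B AND C AND (NOT A OR B)
derived_2 = (min(A[0], B[0], C[0], max(A[1], B[0])),
             max(A[1], B[1], C[1], min(A[0], B[1])))

r0 = (0.0, 0.0)            # default masses T_DEF, F_DEF
r1 = revise(r0, derived_1)
r2 = revise(r1, derived_2)

for step, masses in enumerate((r0, r1, r2)):
    rounded = {name: round(value, 3) for name, value in attributes(masses).items()}
    print(step, tuple(round(m, 3) for m in masses), rounded)
```

Values are rounded only for display, since the weighted sums are computed in floating point.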

An Example

An Example • Here the new evidence that B ∧ C → R fails to support R, because C is not true (DI_C = -0.5). • Hence the plausibility of R falls from 0.88 to 0.532, while the truth value DI_{R,2} enters the negative range.

Integrating Belief Measures with Frames • Belief measures to quantify: • The existence of the object/concept represented by the frame. • The existence of relations between frames

Frames with Belief Measures

Integrating Belief Measures with Frames • Deriving Belief Values • BAF-Logic statements can be used to derive belief measures. • For example, suppose we propose that: • Sam is Bob's son if Sam is male and Bob has a child. • Within our knowledge base, we have {(Sam is male, 0.6, 0.2), (Bob has child, 0.8, 0.1), (Sam is male ∧ Bob has child → Sam is Bob's son, 0.7, 0.1)}

Integrating Belief Measures with Frames • Assuming that α = 0, we can derive: • T_{sam,son,bob} = min(0.6, 0.8, 0.7) = 0.6 • F_{sam,son,bob} = max(0.2, 0.1, 0.1) = 0.2 • DI_{sam,son,bob} = 0.4 • Pl_{sam,son,bob} = 0.8 • Ig_{sam,son,bob} = 0.2
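
The derivation for the (Sam, son, Bob) relation can be reproduced as follows. This is a sketch only: the dictionary keys and variable names are assumptions, while the masses are those given in the knowledge base above. With a credibility factor of 0, the revised masses equal the derived ones.

```python
# (T, F) pairs from the knowledge base; the keys are hypothetical labels.
kb = {
    "sam_is_male": (0.6, 0.2),
    "bob_has_child": (0.8, 0.1),
    "male_and_child_implies_son": (0.7, 0.1),
}

ALPHA = 0.0            # no credibility for the previous masses
previous = (0.0, 0.0)  # default masses before any evidence

# Weakest support and strongest refutation across the premises and the rule.
derived_t = min(t for t, _ in kb.values())
derived_f = max(f for _, f in kb.values())

t_son = ALPHA * previous[0] + (1 - ALPHA) * derived_t
f_son = ALPHA * previous[1] + (1 - ALPHA) * derived_f

di = t_son - f_son
pl = 1 - f_son
ig = 1 - (t_son + f_son)

print(t_son, f_son)
print(round(di, 3), round(pl, 3), round(ig, 3))
```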

Integrating Belief Measures with Frames • Daemons • Can be activated based on belief masses, DI, EI, Ig and Pl values. • Can act on DI, EI, Ig, Pl values for further processing. • E.g. if it is likely that Sam is Bob's son, and if the ignorance is less than 0.2, create a new frame School, and set the (Sam, Student, School) relationship.
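
A daemon of this kind might look like the sketch below. Several details are assumptions made for illustration: "likely" is read as DI ≥ 0.5, frames and relations are plain dictionaries, the new relation starts with the default masses, and the sample masses passed in are invented.

```python
T_DEF, F_DEF = 0.0, 0.0  # default masses for a relation with no evidence yet
LIKELY_DI = 0.5          # assumed reading of "likely"
MAX_IGNORANCE = 0.2      # ignorance bound from the example


def school_daemon(frames, relations, son_masses):
    """Create the School frame when (Sam, son, Bob) is likely and well known.

    Returns True if the daemon fired.
    """
    t, f = son_masses
    di = t - f
    ig = 1 - (t + f)
    if di >= LIKELY_DI and ig < MAX_IGNORANCE:
        frames.setdefault("School", {})
        relations[("Sam", "Student", "School")] = (T_DEF, F_DEF)
        return True
    return False


frames, relations = {"Sam": {}, "Bob": {}}, {}

print(school_daemon(frames, relations, (0.6, 0.2)))    # False: DI is below 0.5
print(school_daemon(frames, relations, (0.85, 0.05)))  # True: likely, low ignorance
print(sorted(frames), relations)
```

With the masses derived in the Sam and Bob example (0.6, 0.2), this daemon does not fire under the assumed reading of "likely", because the Degree of Inclination is only 0.4.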

Frame Operations • Basic operations: add_frame, del_frame, add_rel, etc. • More interesting operations include abstract: • Given a set of frames: • Create a super-frame that is the parent of the set of frames. • Copy relations that occur in at least a threshold percentage of the set of frames to the super-frame. • Set the belief masses to be a composition of all the belief masses in the set for that relation.
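
One way to read the abstract operation is sketched below. The frame layout (a dictionary of relation names to `(T, F)` pairs), the threshold expressed as a fraction, the use of the mean as the composition of masses, and the bird frames are all assumptions of this sketch; the slides do not specify them.

```python
from collections import defaultdict


def mean_masses(masses):
    """Assumed composition rule: average the T values and the F values."""
    n = len(masses)
    return (sum(t for t, _ in masses) / n, sum(f for _, f in masses) / n)


def abstract(frames, threshold, compose=mean_masses):
    """Build the slots of a super-frame from a set of child frames.

    A relation is copied when it occurs in at least `threshold`
    (a fraction between 0 and 1) of the child frames.
    """
    by_relation = defaultdict(list)
    for slots in frames.values():
        for relation, masses in slots.items():
            by_relation[relation].append(masses)
    parent = {}
    for relation, found in by_relation.items():
        if len(found) / len(frames) >= threshold:
            parent[relation] = compose(found)
    return parent


# Hypothetical child frames, invented for illustration.
birds = {
    "sparrow": {"can_fly": (0.9, 0.05), "has_wings": (1.0, 0.0)},
    "eagle": {"can_fly": (0.95, 0.0), "has_wings": (1.0, 0.0)},
    "penguin": {"has_wings": (1.0, 0.0), "can_swim": (0.9, 0.0)},
}

print(abstract(birds, threshold=0.6))
```

Passing a different `compose` function (for example one based on min and max) changes how the masses are combined without touching the selection logic.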

Application Examples: Discourse Understanding • Discourse can be translated to a machine-understandable form before being cast as BAFs. • Discourse Representation Structures (DRS) are particularly useful. • The algorithm to convert from DRS to BAF is trivial [Tan03].

Application Examples: Discourse Understanding • Setting Belief Masses • Initial belief masses may be set using fuzzy sets. • E.g. to model a person being helpful: • S_helpful = {1.0/"invaluable", 0.75/"very helpful", 0.5/"helpful", 0.25/"unhelpful", 0.0/"uncooperative"} • If we say that Kenny is very helpful, we can set: • T_{kenny_helpful} = 0.75 • F_{kenny_helpful} = 1.0 - 0.75 = 0.25
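
Setting initial masses from a fuzzy set amounts to a table lookup. In the sketch below the membership grades are those of S_helpful above; the names `S_HELPFUL` and `masses_from_label` are assumptions of this illustration.

```python
# Fuzzy set S_helpful: linguistic label -> membership grade.
S_HELPFUL = {
    "invaluable": 1.0,
    "very helpful": 0.75,
    "helpful": 0.5,
    "unhelpful": 0.25,
    "uncooperative": 0.0,
}


def masses_from_label(fuzzy_set, label):
    """Return (T, F): T is the membership grade, F is its complement."""
    grade = fuzzy_set[label]
    return (grade, 1.0 - grade)


t_kenny, f_kenny = masses_from_label(S_HELPFUL, "very helpful")
print(t_kenny, f_kenny)  # 0.75 0.25
```

Masses initialized this way always sum to 1, so they carry no ignorance until later reasoning or revision changes them.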

Application Examples: Discourse Understanding • Further propositions and rules may be inserted into the knowledge base to perform reasoning on the initial belief masses. • Propositions and rules are modeled as Prolog clauses.
