
Bayesian classifiers


Presentation Transcript


  1. Bayesian classifiers

  2. Bayesian Classification: Why?
  • Probabilistic learning: calculate explicit probabilities for hypotheses; among the most practical approaches to certain types of learning problems
  • Incremental: each training example can incrementally increase or decrease the probability that a hypothesis is correct, and prior knowledge can be combined with observed data
  • Probabilistic prediction: predict multiple hypotheses, weighted by their probabilities
  • Standard: even when Bayesian methods are computationally intractable, they provide a standard of optimal decision making against which other methods can be measured

  3. Bayesian Theorem
  • Given training data D, the posterior probability of a hypothesis h, P(h|D), follows from Bayes' theorem
  • MAP (maximum a posteriori) hypothesis
  • Practical difficulty: requires initial knowledge of many probabilities, and carries significant computational cost
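The formulas this slide refers to (likely shown as images in the original deck) are the standard ones; written out in the slide's notation:

```latex
% Bayes' theorem for a hypothesis h given training data D
P(h \mid D) = \frac{P(D \mid h)\, P(h)}{P(D)}

% MAP hypothesis: P(D) does not depend on h, so it can be dropped
h_{MAP} = \arg\max_{h \in H} P(h \mid D) = \arg\max_{h \in H} P(D \mid h)\, P(h)
```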

  4. Naïve Bayesian Classification
  • If the i-th attribute is categorical: P(di|C) is estimated as the relative frequency of samples having value di for the i-th attribute in class C
  • If the i-th attribute is continuous: P(di|C) is estimated through a Gaussian density function
  • Computationally easy in both cases
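A sketch of the estimates the slide describes, using the usual Gaussian form for the continuous case (the slide's own formula image is not reproduced in the transcript); here μ_C and σ_C denote the mean and standard deviation of the i-th attribute within class C:

```latex
% Naive Bayes: attributes assumed conditionally independent given the class
P(C \mid d_1, \dots, d_n) \propto P(C) \prod_{i=1}^{n} P(d_i \mid C)

% Continuous attribute: per-class Gaussian density
P(d_i \mid C) = \frac{1}{\sqrt{2\pi}\,\sigma_C}
                \exp\!\left(-\frac{(d_i - \mu_C)^2}{2\sigma_C^2}\right)
```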

  5. Play-tennis example: estimating P(xi|C)
  Class priors: P(p) = 9/14, P(n) = 5/14
  outlook: P(sunny|p) = 2/9, P(sunny|n) = 3/5; P(overcast|p) = 4/9, P(overcast|n) = 0; P(rain|p) = 3/9, P(rain|n) = 2/5
  temperature: P(hot|p) = 2/9, P(hot|n) = 2/5; P(mild|p) = 4/9, P(mild|n) = 2/5; P(cool|p) = 3/9, P(cool|n) = 1/5
  humidity: P(high|p) = 3/9, P(high|n) = 4/5; P(normal|p) = 6/9, P(normal|n) = 2/5
  windy: P(true|p) = 3/9, P(true|n) = 3/5; P(false|p) = 6/9, P(false|n) = 2/5

  6. Play-tennis example: classifying X
  • An unseen sample X = <rain, hot, high, false>
  • P(X|p)·P(p) = P(rain|p)·P(hot|p)·P(high|p)·P(false|p)·P(p) = 3/9 · 2/9 · 3/9 · 6/9 · 9/14 = 0.010582
  • P(X|n)·P(n) = P(rain|n)·P(hot|n)·P(high|n)·P(false|n)·P(n) = 2/5 · 2/5 · 4/5 · 2/5 · 5/14 = 0.018286
  • Sample X is classified in class n (don't play)
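A minimal Python sketch of this calculation, using the counts estimated on the previous slide (Fraction keeps the arithmetic exact before converting to a float):

```python
from fractions import Fraction as F

# Conditional probabilities P(value | class) from the previous slide
cond = {
    "p": {"rain": F(3, 9), "hot": F(2, 9), "high": F(3, 9), "false": F(6, 9)},
    "n": {"rain": F(2, 5), "hot": F(2, 5), "high": F(4, 5), "false": F(2, 5)},
}
prior = {"p": F(9, 14), "n": F(5, 14)}

x = ["rain", "hot", "high", "false"]  # unseen sample X

for c in ("p", "n"):
    score = prior[c]
    for value in x:
        score *= cond[c][value]   # naive conditional-independence assumption
    print(c, float(score))        # p: 0.010582..., n: 0.018286...

# n has the larger score, so X is classified as "don't play"
```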

  7. The independence hypothesis…
  • … makes computation possible
  • … yields optimal classifiers when satisfied
  • … but is seldom satisfied in practice, as attributes (variables) are often correlated
  • Attempts to overcome this limitation: Bayesian networks, which combine Bayesian reasoning with causal relationships between attributes

  8. Bayesian Belief Networks (I)
  Example network over the variables Age, FamilyH (family history), Diabetes, Mass, Insulin and Glucose. The conditional probability table for the variable Mass, given FamilyH (FH) and Age (A):

        (FH, A)   (FH, ~A)   (~FH, A)   (~FH, ~A)
  M      0.7       0.8        0.5        0.1
  ~M     0.3       0.2        0.5        0.9

  9. Applying Bayesian nets
  • When all but one variable is known, the remaining one can be queried, e.g. P(D|A,F,M,G,I)
  (From Jiawei Han's slides)
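Spelled out (assuming D here stands for Diabetes and the other letters for the remaining network variables), the query reduces to two evaluations of the joint distribution:

```latex
P(D \mid A,F,M,G,I) \;=\; \frac{P(D,A,F,M,G,I)}{\sum_{d \in \{D,\, \neg D\}} P(d,A,F,M,G,I)}
```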

  10. Bayesian belief network
  • Find the joint probability over a set of variables, making use of conditional independence whenever it is known
  [Figure: example network over variables a, b, c, d, e with conditional probability tables (values 0.1/0.2/0.3/0.4 and 0.3/0.2/0.1/0.5); variable e is independent of d given b]
  (From Jiawei Han's slides)
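The general factorization this slide relies on is the standard one for Bayesian networks, where Parents(X_i) denotes the parents of node X_i in the graph:

```latex
P(x_1, \dots, x_n) \;=\; \prod_{i=1}^{n} P\bigl(x_i \mid \mathrm{Parents}(X_i)\bigr)
```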

  11. Bayesian Belief Networks (II)
  • A Bayesian belief network allows a subset of the variables to be conditionally independent
  • A graphical model of causal relationships
  • Several cases of learning Bayesian belief networks:
  • Given both the network structure and all the variables: easy
  • Given the network structure but only some of the variables: use gradient descent / EM algorithms
  • When the network structure is not known in advance: learning the structure of the network is harder
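A minimal Python sketch of how a factored joint probability can be evaluated, using the CPT for Mass from slide 8; the priors for FamilyH and Age are made-up values for illustration only, since the slides do not give them:

```python
# Hypothetical priors for the parent nodes (NOT given in the slides)
p_fh = {True: 0.3, False: 0.7}    # P(FamilyH)
p_age = {True: 0.4, False: 0.6}   # P(Age)

# CPT for Mass given (FamilyH, Age), from slide 8; P(~M | ...) is the complement
p_mass = {
    (True, True): 0.7, (True, False): 0.8,
    (False, True): 0.5, (False, False): 0.1,
}

def joint(fh, age, mass):
    """P(FH, A, Mass) = P(FH) * P(A) * P(Mass | FH, A)."""
    p_m = p_mass[(fh, age)]
    return p_fh[fh] * p_age[age] * (p_m if mass else 1 - p_m)

# Example: probability of family history, the given age value, and high mass
print(joint(True, True, True))  # 0.3 * 0.4 * 0.7 ≈ 0.084
```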

  12. The k-Nearest Neighbor Algorithm
  • All instances correspond to points in the n-dimensional space
  • The nearest neighbors are defined in terms of Euclidean distance
  • The target function can be discrete- or real-valued
  • For discrete-valued targets, k-NN returns the most common value among the k training examples nearest to xq (see the sketch below)
  • Voronoi diagram: the decision surface induced by 1-NN for a typical set of training examples
  [Figure: 2-D scatter of + and − training points around a query point xq]
  (From Jiawei Han's slides)
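A minimal sketch of k-NN classification for a discrete-valued target, assuming plain Euclidean distance and a majority vote; names such as knn_classify are illustrative, not from the slides:

```python
import math
from collections import Counter

def euclidean(a, b):
    """Euclidean distance between two points given as tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_classify(training, xq, k=3):
    """training: list of (point, label) pairs; xq: query point."""
    # Take the k training examples closest to the query point
    nearest = sorted(training, key=lambda ex: euclidean(ex[0], xq))[:k]
    # Return the most common label among them
    return Counter(label for _, label in nearest).most_common(1)[0][0]

data = [((1.0, 1.0), "+"), ((1.2, 0.8), "+"), ((4.0, 4.2), "-"), ((4.1, 3.9), "-")]
print(knn_classify(data, (1.1, 1.0), k=3))  # "+"
```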

  13. Discussion on the k-NN Algorithm
  • The k-NN algorithm for continuous-valued target functions: return the mean value of the k nearest neighbors
  • Distance-weighted nearest neighbor algorithm: weight the contribution of each of the k neighbors according to its distance to the query point xq, giving greater weight to closer neighbors; the same idea applies to real-valued target functions (see the sketch after this list)
  • Robust to noisy data, since it averages over the k nearest neighbors
  • Curse of dimensionality: the distance between neighbors can be dominated by irrelevant attributes
  • To overcome it, stretch the axes or eliminate the least relevant attributes
  (From Jiawei Han's slides)
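A sketch of the distance-weighted variant for a discrete target, reusing the euclidean helper from the previous sketch; the 1/d² weighting is a common choice but an assumption here, since the slide only says to give greater weight to closer neighbors:

```python
from collections import defaultdict

def weighted_knn_classify(training, xq, k=3):
    """training: list of (point, label); weights each neighbor by 1 / distance^2."""
    nearest = sorted(training, key=lambda ex: euclidean(ex[0], xq))[:k]
    votes = defaultdict(float)
    for point, label in nearest:
        d = euclidean(point, xq)   # euclidean() as defined in the k-NN sketch above
        if d == 0:
            return label           # exact match with a training point
        votes[label] += 1.0 / d**2 # closer neighbors contribute more
    return max(votes, key=votes.get)
```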
