1 / 23

Estimation of Distribution Algorithms (EDA)

Estimation of Distribution Algorithms (EDA). Siddhartha K. Shakya School of Computing. The Robert Gordon University Aberdeen, UK ss@comp.rgu.ac.uk. EDAs. A novel paradigm in Evolutionary Algorithm Also known as Probabilistic model building Genetic Algorithms or Iterated density

dunn
Download Presentation

Estimation of Distribution Algorithms (EDA)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Estimation of Distribution Algorithms (EDA) Siddhartha K. Shakya School of Computing. The Robert Gordon University Aberdeen, UK ss@comp.rgu.ac.uk

  2. EDAs • A novel paradigm in Evolutionary Algorithm • Also known as Probabilistic model building Genetic Algorithms or Iterated density • A probabilistic model based heuristic • Motivated from the GA evolution • More explicit evolution than the GA

  3. a d b e c f Basic Concept of Solution and Fitness Graph colouring Problem: An Example Given a set of colours, GCP is to try and assign Colour to each nodes in such the way that neighbouring nodes will not have same colour

  4. 1 1 0 0 0 1 1 1 0 1 1 0 fitness 1 a a a b c d e f d d b b e e 1 0 0 1 1 1 c c f f 1 0 1 0 1 0 6 Basic concept of a solution and Fitness Given 2 colour Black = 0 White = 1 Representation of a solution as a chromosome Solution

  5. Chromosome and Fitness in GCP • Chromosome: is a set of colours assigned to the nodes of graph. (there are other way of representing GCP in GA, such as order based representation). • Fitness: is the number of correctly coloured nodes.

  6. GA Iteration • Initialisation of a “parent” population • Evaluation • Crossover • Mutation • Replace parent with “child” population and go to step 2 until termination criteria satisfies

  7. Selected Solution Parent population fitness Crossover Selection 1 0 1 1 0 1 2 0 0 1 0 1 1 2 Repeatiteration 1 0 1 0 1 1 4 After mutation 1 0 fitness 0 0 1 0 1 0 1 0 1 1 a 0 1 0 0 1 1 3 0 1 0 0 1 1 1 b d Mutation e 0 1 0 0 1 2 1 c 0 0 1 1 0 1 1 0 0 1 0 0 1 0 1 1 0 1 1 1 1 0 1 0 0 0 1 1 0 1 0 0 1 1 1 0 0 0 1 1 6 1 1 1 1 1 1 1 1 Given 2 colours (0,1) f 4 GA Iteration Initialization Evaluation After Crossover

  8. GA evolution • Selection drives evolution towards better solutions by giving a high pressure to the selection of high-quality solutions • Crossover and mutation (Variation operator) together ensures the exploration of the possible space of the promising solutions. Maintains the variation in the population.

  9. Variation in GA Evolution • Has its limitation • Can recombine fit solution to produce more fit solution • Also can disrupt good solution and converge in local optimum

  10. Estimation of Distribution Algorithm (EDA) • To overcome the negative effective of the crossover and mutation approach of variation, a probabilistic approach of variation has been proposed. • Algorithm using such approach is known as EDA (or PMBGA)

  11. Initial Population Evaluation Selection Crossover Mutation Simple GA framework GA to EDA Initial Population Evaluation Selection Probabilistic Model Building Sampling Child Population EDA framework

  12. Chromosome is a set of value taken by set of random variables (Where each for bit representation) Solution is a univariate marginal distribution 0 1 0 0 1 1 1 0 1 1 0 1 General Notation • EDA represents a solution as a set of value taken by a set of random variable. is a conditional distribution is a joint probability distribution

  13. Solution Usually it is not possible to calculate the joint probability distribution, so it is estimated. For example, assuming all are independent of each other, the joint probability distribution becomes the product of simple univariate marginal distribution. 0 1 0 0 1 1 1 0 1 1 0 1 Estimation of Probability distribution

  14. Solution 0 1 0 0 1 1 1 0 1 1 0 1 Simple Univariate Estimation of Distribution Algorithm Initial Population Evaluation Selection Calculate univariate marginal probability and sample Child Population

  15. Selected Solution Parent population fitness 1 0 1 1 0 1 2 0 0 1 0 1 1 2 1 0 1 0 1 1 4 After mutation 1 fitness 0 0 0 1 1 1 0 1 0 1 1 a 0 1 0 0 1 1 3 0 1 0 0 1 1 1 b d e 0 1 0 0 1 2 1 c 0 1 1 0 0 0 1 1 1 1 0 1 1 0 1 0 1 0 0 1 6 1 1 1 1 Given 2 colours (0,1) f 4 Simple univariate EDA (UMDA) Initialization Evaluation Selection Repeatiteration Build model Estimationof Distribution Calculate Distribution Sampling

  16. Note • It is not guaranteed that the above algorithm will give optimum solution for the graph colouring problem. • The reason is obvious. • The chromosome representation of GCP has dependency. i.e. node 1 taking black colour depends upon the colour of node 2. • But univariate EDAs do not assume any dependency so it may fail. • However, one could try

  17. Complex Models • To tackle problems where there is dependency between variables we need to consider more complex models. • The extra model building step will be added to univariate EDA. • Different algorithms has been purposed using different models • They are categorised into three groups • Univariate EDA • Bivariate EDA • Multivariate EDA

  18. Univariate EDA Model x2 x1 x3 x5 x4 x7 x6 Graphical representation of probability model assuming no dependency among variables. (UMDA, PBIL, cGA)

  19. Bivariate EDA Model a. Chain model (MMIC) b. Tree model (COMIT) c. Forest model (BMDA) Graphical representation of probability model assuming dependency of order two among variables.

  20. a. Marginal product model (ECGA) b. Triangular model (FDA) c. (BOA, EBNA) Graphical representation of probability model considering multivariate dependency among variables. Multivariate EDA Model

  21. Finding a probabilistic model • Task of finding a good probabilistic model (finding the relationship between variable) is a optimization problem in itself. • Most of the algorithm use Bayesian network to represent the probabilistic relationship. • Two metric to measure the goodness of Bayesian Network. • Bayesian Information Criterion (BIC) metric: • Bayesian-Dirichlet (BD) metric: • Use greedy heuristic to find a good model.

  22. Summary • EDA is an active area of research for GA community • EDAs are reported to solve GA hard problems, and also hard optimization optimisation problems like MAX SAT. • Success and failure of EDAs depends upon the accuracy of the used Probabilistic model.

  23. Links • http://cswww.essex.ac.uk/staff/zhang/MoldeBasedWeb/RGroup.htm (Research Groups working on EDAs) • http://www.sc.ehu.es/ccwbayes/main.html (EDA homepage maintained by Intelligent system group). Books • Larrañaga P., and Lozano J. A. (2001) Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation. Kluwer Academic Publishers, 2001. • Pelikan, M., (2002). Bayesian optimization algorithm: From single level to hierarchy. Ph.D. thesis, University of Illinois at Urbana-Champaign, Urbana, IL. Also IlliGAL Report No. 2002023.

More Related