1 / 19

Application of Unstructured Learning in Computational Biology

Application of Unstructured Learning in Computational Biology. Tony C Smith Department of Computer Science University of Waikato tcs@cs.waikato.ac.nz. Computability. Before computers were built, mathematicians knew what they could do arithmetic (e.g. missile trajectories)

kedem
Download Presentation

Application of Unstructured Learning in Computational Biology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Application of Unstructured Learning in Computational Biology Tony C Smith Department of Computer Science University of Waikato tcs@cs.waikato.ac.nz

  2. Computability • Before computers were built, mathematicians knew what they could do • arithmetic (e.g. missile trajectories) • search (e.g. keys for secret codes) • sort (census information) • … anything with a mathematical algorithm Unstructured learning in computational biology Tony C Smith

  3. Artificial Intelligence • Computers do things only human brains can otherwise do expert expert Unstructured learning in computational biology Tony C Smith

  4. Artificial Intelligence • Computers do things only human brains can otherwise do expert system expert Unstructured learning in computational biology Tony C Smith

  5. Artificial Intelligence • Computers do things only human brains can otherwise do expert system learning system Unstructured learning in computational biology Tony C Smith

  6. Machine learning • creating computer programs that get better with experience • learn how to make expert judgments • discover previously hidden, potentially useful information (data mining) What is machine learning? How does it work? • user provides learning system with examples of concept to be learned • induction algorithm infers a characteristic model of the examples • model is used to predict whether or not future novel instances are also examples – and it does this very consistently, and very, very quickly! Unstructured learning in computational biology Tony C Smith

  7. weight heavy normal light dirt firmness good mild clean hard soft good poor good poor Structured learning Mushroom Data Weight Damage Dirt Firmness Quality heavy high mild hard poor heavy high mild soft poor normal high mild hard good light medium mild hard good Light clear clean hard good normal clear clean soft poor heavy medium mild hard poor . . . Unstructured learning in computational biology Tony C Smith

  8. Unstructured learning • data does not have fixed fields with specific values • examples: images, continuous signals, expression data, text • learning proceeds by correlating the presence or absence of any and all salient attributes Document Classification • given examples of documents covering some topic, learn a semantic model that can recognize whether or not other documents are relevant • prioritize them: i.e. quantify “how relevant” documents are to the topic • not limited to keywords (nor is it misled by them) • adapt to the user’s needs (ephemeral or long-term) Unstructured learning in computational biology Tony C Smith

  9. Document classification demo Unstructured learning in computational biology Tony C Smith

  10. bioinformatics • Finding genes • Determining gene roles • Determining protein functions • Empirical tests • Sequence similarity comparison • Literature Unstructured learning in computational biology Tony C Smith

  11. GO-KDS demo Unstructured learning in computational biology Tony C Smith

  12. Amino Acid R group Amide group Carboxyl group Unstructured learning in computational biology Tony C Smith

  13. Amino Acid tyrosine glycine Unstructured learning in computational biology Tony C Smith

  14. DNA encodes amino acids Unstructured learning in computational biology Tony C Smith

  15. Unstructured learning in computational biology Tony C Smith

  16. Unstructured learning in computational biology Tony C Smith

  17. Unstructured learning in computational biology Tony C Smith

  18. Rasmol demo Unstructured learning in computational biology Tony C Smith

  19. Biotechnology • Biologists know proteins, computer scientists know machine learning • Together, they can find out a lot of hidden information about genes and proteins • Biotechnology is a multi-billion dollar industry • Biotechnology is one of the best funded areas of scientific research Unstructured learning in computational biology Tony C Smith

More Related