1 / 38

Coding properties of DNA languages

Coding properties of DNA languages. Salah Hussini Lila Kari Stavros Konstantinidis Summarized by Yi Seung Joon. DNA properities. Consists of 4 bases Adenine, guanine, cytosine, thymine.(A,G,C,T) Single nucleotides are linked together end-to-end to form DNA strands.

dunne
Download Presentation

Coding properties of DNA languages

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Coding properties of DNA languages Salah Hussini Lila Kari Stavros Konstantinidis Summarized by Yi Seung Joon

  2. DNA properities • Consists of 4 bases • Adenine, guanine, cytosine, thymine.(A,G,C,T) • Single nucleotides are linked together end-to-end to form DNA strands. • A short single-stranded polynucleotide chain is called an oligonucleotide. • Polarity: 5’ end and 3’ end • A - T and G - C: complementary. • Two complementart single-stranded DNA sequences with opposite polarity is called Watson/Crick complements and will join together to form a double helix (hybridization)

  3. DNA-based algorithms • Initial DNA solution will contain some oligonucleotides which represent single ‘codewords’, and some oligonucleotides are strings of catenated codewords. • Two types of possible undesirable hybridizations • Forming a hairpin structure, which can happen if either end of the strand binds to another section of that same strand. • Binding to either another codeword or to the catenation of two codeword strands.

  4. Definitions and Notations • An alphabet X is a finite non-empty set of symbols. • A word u over the alphabet X is a sequence of letters and |u| will denote the length of u. • We donote by the Watson-Click complement of the wequence u. • If u=5’-AAAAGG-3’ then =5’-CCTTTT-3’ • X* is the set of all words over X. • X+ is the set of all non-empty words over X. • A language(over X) is any subset of X*. • , the DNA alphabet.

  5. Definitions and Notations • For a set S, we denote by |S| the cardinality of S, the number of elements in S. • Let X* be the free monoid generated by the finite alphabet X. A mapping α:X*->X* is called a morphism(anti-morphism) of X* if α(uv)= α(u) α(v) (respectively α(uv)= α(v) α(u)) for all u,v in X* • An invoultion θ:S->S of S is a mapping that θ^2 equals the identity mapping. • IF Δ* is the free monoid generated by the DNA-alphabet Δ then two involutions can be defined on Δ*: the mirror involution μ which is an anti-morphism, and the complement involution γ which is a morphism.

  6. Definitions and Notations • The complement involution γ: Δ->Δ • defined by γ(A)=T, γ(T)=A, γ(C)=G, γ(G)=C • can be extended in the usual way to a morphism of Δ* that is also an involution of Δ*. • The mirror involution μ: Δ*->Δ* • μ (u)=v defined by u=a1a2…akkk, v=akk…a2a1, ai∈ Δ • The DNA involution τ=γμ

  7. Definitions and Notations • If the involution θ is the DNA involution, then a language L being strictly θ-compliant(strictly prefix θ-compliant, strictly suffix θ-compliant) amounts to the fact the situations of the type depicted in Figure 3(respectively Figure 1, Fifure 2) do not occur.

  8. Definitions and Notations • A code • A code K is a subset of X+ satisfying the property that, for every word w in K+, there is a unique sequence (v1,v2…vn) of words in K such that w=v1v2…vn. • A bitfix code K is a prefix and suffix code; that is, K∩KX+ =K ∩X+K =0. Every bifix code is a code. • An infix code, K, has the property that no word of K is properly contained in another word of K, that is K∩(X+KX*∪X*KX+)=0. Every infix code is a bifix code. • A comma-free code K is a language with the property K^2∩X+KX+=0. Every coma-free code is an infix code.

  9. Involution-freedom and involution-compliance

  10. Involution-freedom and involution-compliance • A language L is called dense if every word is a subword of some word of L; that is , L∩X*wX*≠0 for every w∈X+. The Language L is complete if L* is dense. • For language L⊆X+ denote by the language of non-empty proper prefixes of L and the language of non-empty proper suffixes of L • Lpref={x∈X+|xy ∈L for some y ∈X+} • Lsuff={y∈X+|xy ∈L for some y ∈X+}

  11. Involution freedom and involution-compliance

  12. Involution freedom and involution-compliance

  13. Involution freedom and involution-compliance

  14. Involution freedom and involution-compliance

  15. Involution freedom and involution-compliance

  16. Involution freedom and involution-compliance

  17. Involution freedom and involution-compliance • Every complement-free language is anti complement-reflective • Not every anti complement-reflective language is complement-free.

  18. Involution freedom and involution-compliance

  19. Involution freedom and involution-compliance

  20. Involution freedom and involution-compliance

  21. Involution freedom and involution-compliance

  22. Involution freedom and involution-compliance

  23. Involution freedom and involution-compliance

  24. Decidablity issues • L(E) denotes the language represented by E.

  25. Decidablity issues

  26. Decidablity issues

  27. Decidablity issues

  28. Decidablity issues

  29. Splicing systems preserving good encodings • We are to characterize initial sets of coderords having the feature that the good encoding properties are preserved during any computation (We chose splicing) starting out from the initial set.

  30. Splicing systems preserving good encodings

  31. Splicing systems preserving good encodings

  32. Splicing systems preserving good encodings

  33. Splicing systems preserving good encodings

  34. Splicing systems preserving good encodings

  35. Splicing systems preserving good encodings

  36. Splicing systems preserving good encodings • Information re of a finite code K over some alphabet X is

  37. Splicing systems preserving good encodings

  38. Splicing systems preserving good encodings

More Related