Genome Organization and Evolution
This assignment explores the organization and evolution of genomes, focusing on the definition of genes, gene structure, and the central dogma. It highlights features such as coding regions, regulatory elements, and gene transmission. The role of genetic maps and markers like VNTRs and STRPs is examined. Additionally, it discusses prokaryotic and eukaryotic genomes with examples like E. coli and C. elegans, and covers the evolution of genomes in relation to species adaptation. Students are tasked with surveying two bioinformatics projects, delving into their history, funding, and purpose.
Genome Organization and Evolution
E N D
Presentation Transcript
For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112Weblems 2.4, 2.7, pp 112-113 Assignment
For 3/02/04 Pick any two bioinformatics projects or resources, such as those in the previous lecture. For each, write a brief survey (~1000 words), giving such information as: the history of the project; the participants; the funding; its purpose and scope. Sources: web site, mailing lists, faqs, published papers. Assignment
Genes • Definition: A gene is a segment of DNA which codes for a protein • Caveats: • DNA which codes for functional RNA? • Control regions?
Gene organization • A gene may occur on either strand of DNA • Genes are continuous stretches (almost always) in prokaryotes • Genes are (often) discontinuous stretches (exons) in eukaryotes. The intervening regions are called introns • Upstream is a binding site • Location of regulatory region is less predictable
The Central Dogma • One gene, one protein • Like most dogmas, not entirely true • Alternative splicing permits the manufacture of many products from a single gene • The protein products are sometimes called the proteome • With current technology, more gene information is available than protein information
Transmission of information • The continuity of life is a reflection of the (nearly) faithful transmission of genetic information • The adaptation of life (evolution) is a result of imperfect transmission of information, and natural selection
Genetic maps • Variable number tandem repeats (VNTRs – minisatellites), 10-100 bp, are a sort of genetic fingerprint • Short tandem repeat polymorphisms (STRPs – microsatellites), 2-5 bp, are another kind of marker • A sequence tagged site (STS), 200-600 bp, is a known unique location in the genome
Identifying genes • A long ORF is probably a gene (but what about eukaryotes? AG and GT splice signals) • A gene promoter site has identifiable characteristics (TATA box) • If it looks like a known gene, it's a gene
Prokaryote genomes • Example: E. coli • 89% coding • 4,285 genes • 122 structural RNA genes • Prophage remains • Insertion sequence elements • Horizontal transfers
Eukaryotic genome • Example: C. elegans • 10 chromosomes • 19,099 genes • Coding region – 27% • Average of 5 introns/gene • Both long and short duplications
Evolution of genomes • Adaptation of species is coterminous with adaptation of genomes • Where do genes come from? (Answer: from other genes) • Homologs and paralogs • Lateral transfer • Molecular species each have their own family tree • Genes are widely shared
Close relatives • Yeast, fly, worm and human share at least 1308 groups of proteins • Unique to vertebrates: immune proteins (for example) • Unique molecules are adapted from ancient molecules of different purpose but similar design • Most new proteins come from domain rearrangement • Most new species come from control region variation