1 / 16

Nucleotide Databases: Genbank

Nucleotide Databases: Genbank. Dr Maha Al- Sulaimani Department of Biochemistry. Learning Objectives. Understand what a nucleotide database is. Distinguish the structure of eukaryotic and prokaryotic genes. Make sense of a GenBank entry.

dextra
Download Presentation

Nucleotide Databases: Genbank

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Nucleotide Databases:Genbank Dr Maha Al-Sulaimani Department of Biochemistry

  2. Learning Objectives • Understand what a nucleotide database is. • Distinguish the structure of eukaryotic and prokaryotic genes. • Make sense of a GenBank entry. • Understand the difference between GenBank and a gene-centric resource.

  3. Nucleotide Databases • The Nucleotide database is a collection of sequences from several sources, including GenBank and RefSeq. • Genome, gene and transcript sequence data provide the foundation for biomedical research and discovery.

  4. Examples of Nucleotide Databases • Genbank. • INSDC (International Nucleotide Sequence Database Collaboration). • NDB (Nucleic Acid Database). • NCBI (National Center for Biotechnology Information). • EMBL (European Molecular Biology Laboratory). • EBI (European Bioinformatics Institute).

  5. Looking for DNA Sequences • There are many types of DNA sequences. • The most common are: • Regulatory regions, often before genes. • Untranslated regions, often around genes (UTRs). • Protein-coding regions (exons) • Intergenic regions (between genes, introns). • All these sequences can be found in GenBank

  6. Prokaryotes vs. Eukaryotes • Prokaryotes • Genome=one large circular chromosome + a few small circular chromosomes (plasmides). • 0.5 to 8 Mb/chromosome. • Genes in one piece. • 70% of the genome is coding. • 1 gene/Kb. • Eukaryotes • Genome= many large linear chromosomes. • 10 to 700 Mb/chromosome. • Genes split. • 5% of the genome is coding. • 1 gene/100 Kb (Human).

  7. Typical Eukaryotic Protein-Coding Gene • The coding sequences are made of coding exons separated by introns. • Introns are spliced out and exons glued together to make the ORF. • One gene can code for several alternative proteins: alternative splicing.

  8. GenBank • Housed by the National Center for Biotechnology. (NCBI). • GenBank is the memory of biological science. • Contains everyDNA sequence ever published.

  9. GenBank • GenBank is the original information source for most biological databases. • GenBank is more complicated to use than gene-centric databases.

  10. Limitations of GenBank • GenBank entries can contain: • Entire genes. • Portions of genes. • Many genes. • GenBank entries can be of uneven quality: • Can be duplicates and/or inaccurate. • The database is not a selection center. • All data is treated equally.

  11. Limitations of GenBank • GenBank entries are not the final word on particular genes: • They have no authoritative biological meaning. • They merely keep track of what was done. • Gene-centric databases are needed to compile everything that is known on a given gene and to correct potential errors.

  12. Using Gene-centric Databases:Entrez Gene • Entrez Gene can be accessed from NCBI. • In GenBank, each entry is one sequence from one publication. • In Entrez Gene, each entry is one gene. • Entrez Gene is built with GenBank data.

More Related