0 likes | 25 Views
Genome assembly is a technique to assemble a short DNA fragments into organized data.
E N D
Sakshi Chavan Genome Assembly
Contents C Introduction Types Differences Factors affecting genome assembly results Application References Genome Assembly
Introduction In bioinformatics, genome assembly represents the process of putting a large number of short DNA sequences back together to recreate the original chromosomes from which the DNA originated. Sequence assembly is one of the basic steps after performing next generation sequencing, PacBio SMRT sequencing, or Nanopore sequencing. The established genome assembly can be submitted to databases such as European Nucleotide Archive, NCBI Assembly, and Ensembl Genomes. You can also browse these databases for genomic sequences done by other researchers. Presentation title
Types of Genome Assembly De novo genome assembly is a strategy for genome assembly, representing the genome assembly of a novel genome from scratch without the aid of reference genomic data. De novo genome assemblies assume no prior knowledge of the source DNA sequence length, layout or composition. Reference-based genome assembly maps reads to a reference genome by identifying reads with similar nucleotides to the reference. It is a digital nucleic acid sequence database, assembled by scientists as a representative example of the set of genes in one idealized individual organism of a species. Genome assembly
Differences Genome assembly
Factors Affecting Genome Assembly Results Properties of the genome Genome size. The bigger the genome is, the more data is needed. Therefore, before ordering sequence data, you need to estimate the genome size, which may be inferred by investigating the genome size of closely related species. Repeats. Amount and distribution of repeated sequences in a genome largely influence the genome assembly results. This can lead to misassemblies and an incorrect estimate of the size of the repeats. Ploidy level. If possible, it is better to sequence haploid tissue, avoiding problems caused by heterozygosity. Genome Assembly
Factors Affecting Genome Assembly Results Nucleic Acid Extraction For the DNA isolation or RNA isolation, here are a couple of things need to be aware of: DNA/RNA integrity, DNA/RNA purification, sufficient DNA/RNA amount, etc. Compared with resequencing, de novo sequencing requires superior nucleic acid. The most important nucleic acid quality parameters for NGS are chemical purity and structural integrity. Genome Assembly
Factors Affecting Genome Assembly Results Raw data processing Although there are assembly tools that prefer dealing with the raw data, including potential adapter sequences, we highly recommend that researchers study the manual to determine whether the program requires quality-trimmed data or not. If data trimming is required, it would be necessary to omit poor quality data by trimming low quality read ends and filtering of low quality reads. Multiple tools are available for this purpose, such as PRINSEQ32 and Trimmomatic33. Genome assembly
Application Presentation title
References Wajid B, Serpedin E. Do it yourself guide to genome assembly. Briefings in functional genomics, 2014, 15(1): 1-9. Victoria D D A, Erik H, Lieven S, et al. Ten steps to get started in Genome Assembly and Annotation. F1000Research, 2018, 7. Genome assembly