Developed by James Estill, Dept. of Plant Biology, University of Georgia

Presentation Transcript


  1. Developed by James Estill, Dept. of Plant Biology, University of Georgia

  2. TriAnnot (France) • IOB Cluster: UGA • Pipeline: Annotate Wheat Sequences • PERL • GAME XML

  3. Annotation Pipeline • Input: >HEX0014K09 GCAATACT CGGCACTT • BLAST searches: BLAST -m 8 -d MIPS; BLAST -m 8 -d RB_pln; BLAST -m 8 -d TIGRGram; BLAST -m 8 -d TREP9nr • Gene Annotation: Homology (BLAST, BLAT, SIM4); De Novo (GENSCAN, GENID, FGENESH) • TE Annotation: Homology (HMMER, RepeatMasker, TE Nest, BLAST); De Novo (Findmite, LTR_Struc, LTR_Seq, Find_LTR, LTR_Finder)

  4. Individual Program Procedure: Configuration File + Directory of FASTA Files → Run Program → Raw Results → GFF Formatted

  5. Developed by James Estill, Dept. of Plant Biology, University of Georgia

  6. !! THIS DOCUMENT IS UNDER CURRENT DEVELOPMENT !! This program manual and the scripts that make up the DAWG-PAWS package are under active development. Everything is subject to change without notice at this point. This software comes as is, without any express or implied warranty. Use at your own risk.

  7. File requirements: • Each FASTA file contains a single record • BAC scaffolds need to be merged into a single sequence • The FASTA header must be short
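
The first and third requirements above can be checked automatically before a run. The following is a minimal sketch, not part of DAWG-PAWS itself: the function name `check_fasta` is hypothetical, and the 20-character limit is an assumed stand-in for "short", which the slides do not define.

```python
def check_fasta(text, max_header_len=20):
    """Return a list of problems found in the text of one FASTA file.

    max_header_len is an assumed threshold; the slides only say "short header".
    """
    # Every record in a FASTA file starts with a '>' header line.
    headers = [line[1:].strip() for line in text.splitlines() if line.startswith(">")]
    problems = []
    if len(headers) != 1:
        problems.append(f"expected 1 record, found {len(headers)}")
    for header in headers:
        if len(header) > max_header_len:
            problems.append(f"header longer than {max_header_len} characters: {header!r}")
    return problems
```

An empty list means the file passes both checks. Whether BAC scaffolds were merged correctly cannot be judged from the file alone, so that requirement is left to the user.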

  8. Repeat masking with RepeatMasker and TREP • Softmask (using RepeatMasker) • Convert the softmasked sequence to a hardmasked one, because many gene prediction programs are not softmask-aware
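
The softmask-to-hardmask conversion can be sketched in a few lines, assuming the usual convention that softmasked bases are lowercase and hardmasked bases are replaced by `N`. The function name `hardmask_fasta` is hypothetical and this is not the script shipped with the package.

```python
def hardmask_fasta(text, mask_char="N"):
    """Replace lowercase (softmasked) bases with mask_char, leaving headers untouched."""
    out = []
    for line in text.splitlines():
        if line.startswith(">"):
            # Header lines are copied as-is so sequence IDs are preserved.
            out.append(line)
        else:
            out.append("".join(mask_char if base.islower() else base for base in line))
    return "\n".join(out) + "\n"
```

For example, `hardmask_fasta(">seq1\nACgtacGT\n")` returns `">seq1\nACNNNNGT\n"`.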

  9. Structural feature annotation: Currently includes only the annotation of gaps
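
One way to annotate gaps is to report every run of `N` in the sequence as a GFF feature. The sketch below assumes that gaps are encoded as `N` runs and that it is given the unmasked sequence (after hardmasking, repeats would also appear as `N`). The function name, the `gap_finder` source label, and the `min_len` parameter are illustrative, not taken from the package.

```python
import re

def gaps_to_gff(seq_id, seq, min_len=1, source="gap_finder"):
    """Return one tab-delimited GFF line per run of N that is at least min_len long."""
    lines = []
    for match in re.finditer(r"[Nn]+", seq):
        if match.end() - match.start() < min_len:
            continue
        # GFF coordinates are 1-based and inclusive; Python spans are 0-based, end-exclusive.
        start, end = match.start() + 1, match.end()
        lines.append("\t".join([seq_id, source, "gap", str(start), str(end), ".", ".", ".", "."]))
    return lines
```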

  10. Gene annotation: • Conduct gene prediction using TriAnnot pipeline • Run individual gene prediction programs

  11. GeneMark.hmm: can be run locally (free license required) • GENSCAN: run on web server and convert output to .gff file • FGENESH: run on web server and convert output to .gff file

  12. NCBI-Blast: Most time-consuming step in the pipeline
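
The pipeline runs BLAST with `-m 8`, which produces 12 tab-delimited columns per hit (query, subject, percent identity, alignment length, mismatches, gap opens, query start, query end, subject start, subject end, e-value, bit score). A rough sketch of turning that output into GFF lines follows. The function name, the `blastn` and `match` labels, and the `1e-5` e-value cutoff are assumptions for illustration, and the strand logic assumes nucleotide-versus-nucleotide searches.

```python
def blast_m8_to_gff(m8_text, source="blastn", feature="match", max_evalue=1e-5):
    """Convert BLAST -m 8 tabular hits to GFF lines placed on the query sequence."""
    gff = []
    for line in m8_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = [field.strip() for field in line.split("\t")]
        query, subject = fields[0], fields[1]
        q_start, q_end, s_start, s_end = (int(value) for value in fields[6:10])
        evalue, bit_score = float(fields[10]), fields[11]
        if evalue > max_evalue:
            continue  # assumed cutoff; tune for the database being searched
        # The hit is on the minus strand when query and subject run in opposite directions.
        strand = "+" if (q_start <= q_end) == (s_start <= s_end) else "-"
        start, end = min(q_start, q_end), max(q_start, q_end)
        gff.append("\t".join([query, source, feature, str(start), str(end),
                              bit_score, strand, ".", subject]))
    return gff
```

Filtering on e-value at this stage keeps the resulting .gff files small, which matters when the BLAST step already dominates the run time.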

  13. Transposable element annotation: • By homology: RepeatMasker, NCBI-Blast • By structural criteria: LTR_Finder

  14. De Novo LTR Annotation Software [figure: programs compared on Computation and Annotation; rating labels: Good, Neutral, Bad, Crap, Best]

  15. Preparing the computational results for Apollo • Audit the computational results • Concatenate the .gff files
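
The two steps above could be combined in a small helper like the one below. This is only a sketch: the slides do not say what the audit consists of, so the check here (8 or 9 tab-delimited columns with integer coordinates where start does not exceed end) is an assumption, and `concatenate_gff` is a hypothetical name.

```python
from pathlib import Path

def concatenate_gff(gff_paths, out_path):
    """Write well-formed feature lines from gff_paths to out_path.

    Returns the number of malformed lines that were skipped.
    """
    skipped = 0
    with open(out_path, "w") as out:
        for path in gff_paths:
            for line in Path(path).read_text().splitlines():
                if not line.strip() or line.startswith("#"):
                    continue  # drop blank lines and comments
                fields = line.split("\t")
                well_formed = (
                    len(fields) in (8, 9)
                    and fields[3].isdigit()
                    and fields[4].isdigit()
                    and int(fields[3]) <= int(fields[4])
                )
                if not well_formed:
                    skipped += 1
                    continue
                out.write(line + "\n")
    return skipped
```

A non-zero return value is a signal to inspect the individual .gff files by hand before loading the combined file into Apollo.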
