Which ORF?



  1. Which ORF? Jeltje – September 7 2005

  2. Where to start?
     gatgtc atg cgatgttattg    M R C Y
     g atg tcatgcgatgttattg    M S C D V I
     gatgtcatgcg atg ttattg    M L L
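The three readings on this slide can be reproduced with a short script. This is a minimal sketch, not the author's code: the function names `translate_from` and `candidate_starts` are made up for illustration, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from(seq, start):
    """Translate seq from index `start` until a stop codon or the end of seq."""
    seq = seq.upper().replace("U", "T")
    peptide = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        peptide.append(aa)
    return "".join(peptide)


def candidate_starts(seq):
    """Return (index, peptide) for every ATG in the sequence, 5' to 3'."""
    seq = seq.upper().replace("U", "T")
    return [(i, translate_from(seq, i))
            for i in range(len(seq) - 2) if seq[i:i + 3] == "ATG"]


for index, peptide in candidate_starts("gatgtcatgcgatgttattg"):
    print(index, index % 3, peptide)
```

Each of the three ATGs in the slide's sequence falls in a different reading frame, which is why the same 20 bases give three unrelated peptides.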

  3. Eukaryotes: …A/GNN AUG G…… Methylated cap; small ribosomal subunit

  4. Eukaryotes: …A/GNN AUG G…… Methylated cap

  5. Eukaryotes: …A/GNN AUG G……

  6. Eukaryotes: …A/GNN AUG G…… Large ribosomal subunit

  7. Eukaryotes: …A/GNN AUG G…… M
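The consensus on these slides, a purine (A or G) three bases upstream of the AUG and a G immediately after it, can be tested per start codon. The sketch below assumes exactly that reading of the pattern. The function name and the two test sequences are hypothetical, not taken from the presentation.

```python
def has_start_consensus(seq, atg_index):
    """True if the ATG at atg_index sits in an A/GNNATGG context."""
    seq = seq.upper().replace("U", "T")
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at index %d" % atg_index)
    # Both flanking positions must exist inside the sequence.
    if atg_index < 3 or atg_index + 3 >= len(seq):
        return False
    purine_at_minus3 = seq[atg_index - 3] in "AG"
    g_at_plus4 = seq[atg_index + 3] == "G"
    return purine_at_minus3 and g_at_plus4


# Hypothetical examples: one ATG in consensus context, one not.
print(has_start_consensus("GCCACCATGGCG", 6))
print(has_start_consensus("CTTATGTGC", 3))
```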

  8. Leaky scanning HIC …CNNAUGTGCGTTAUGG……
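In the slide's sequence the first AUG has a C at position -3 and a T after it, while the second AUG matches the A/GNNAUGG pattern. A deliberately simplified scanning model, sketched below, makes that explicit. It assumes a ribosome prefers the first AUG in consensus context and falls back to the first AUG otherwise; real initiation is probabilistic, and the function names are illustrative only.

```python
def scan_starts(seq):
    """List (index, in_consensus) for every AUG, scanning 5' to 3'."""
    seq = seq.upper().replace("U", "T")
    hits = []
    for i in range(len(seq) - 2):
        if seq[i:i + 3] != "ATG":
            continue
        in_consensus = (i >= 3 and i + 3 < len(seq)
                        and seq[i - 3] in "AG" and seq[i + 3] == "G")
        hits.append((i, in_consensus))
    return hits


def preferred_start(seq):
    """First AUG in consensus context, else the first AUG, else None."""
    hits = scan_starts(seq)
    for index, in_consensus in hits:
        if in_consensus:
            return index
    return hits[0][0] if hits else None


fragment = "CNNAUGTGCGTTAUGG"  # the visible part of the slide's sequence
print(scan_starts(fragment))
print(preferred_start(fragment))
```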

  9. Skipping AUG
     In some cases translation is initiated but terminated upon encountering the second AUG.
     Internal Ribosome Entry Site (IRES): not sequence specific; viral (only?)

  10. MGC genes
      • Tested 1000 MGC genes (skipped genes with same ORF)
      • Looked at longest ORF, first ORF, and longest first ORF (picked longest from three frames)
      • ORFs must be >5 aa
      • Compared to ‘called’ ORF in GenBank
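The selection described on this slide might look roughly like the sketch below. Several points are assumptions rather than details from the presentation: an ORF is taken to run from an ATG to the next in-frame stop codon, ">5 aa" is read as at least 6 residues including the initial M, only the three forward frames are scanned, and "longest first ORF" is read as the longest among the first ORFs of each frame. All names and the test sequence are hypothetical.

```python
from collections import namedtuple

STOP_CODONS = {"TAA", "TAG", "TGA"}
Orf = namedtuple("Orf", "start end frame n_aa")


def find_orfs(seq, min_aa=6):
    """ATG-to-stop ORFs in the three forward frames, sorted by start."""
    seq = seq.upper().replace("U", "T")
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open an ORF at the first ATG after a stop
            elif start is not None and codon in STOP_CODONS:
                n_aa = (i - start) // 3  # residues, stop codon excluded
                if n_aa >= min_aa:
                    orfs.append(Orf(start, i + 3, frame, n_aa))
                start = None
    return sorted(orfs)


def first_orf(orfs):
    return min(orfs, key=lambda o: o.start, default=None)


def longest_orf(orfs):
    return max(orfs, key=lambda o: o.n_aa, default=None)


def longest_first_orf(orfs):
    """Take the first ORF of each frame, then the longest of those."""
    firsts = {}
    for orf in sorted(orfs):
        firsts.setdefault(orf.frame, orf)
    return longest_orf(firsts.values())


# Hypothetical transcript: a 6-aa ORF in frame 0, then an 8-aa ORF in frame 1.
toy = "ATGAAACCCGGGTTTAAATAA" + "C" + "ATG" + "GCT" * 7 + "TAG"
orfs = find_orfs(toy)
print(first_orf(orfs), longest_orf(orfs), longest_first_orf(orfs))
```

In this toy sequence the first ORF and the longest ORF are different, which is the situation the later slides count separately.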

  11. MGC genes
      • Of 1000 genes
        • For 887, the first large ORF is the largest ORF
        • Of those, only 388 have the A/GNNATGG consensus
      • MGC ORFs:
        • 845 are the same as first/largest ORF
        • 35 are a subset of the first/largest (all skip first M)
        • 6 pick another ORF (1 not found)
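A quick tally helps when reading these counts. The check below assumes the single "not found" ORF is counted separately from the 6 that pick another ORF, since that is the only reading under which the categories add up to 887. The variable names are illustrative.

```python
total_genes = 1000
first_is_largest = 887
with_consensus = 388

# Categories for the annotated MGC ORF within the 887 genes.
same, subset, other, not_found = 845, 35, 6, 1

# The four categories account for all 887 genes.
assert same + subset + other + not_found == first_is_largest

remaining = total_genes - first_is_largest
print(remaining)                                           # 113
print(round(100 * with_consensus / first_is_largest, 1))   # 43.7
print(round(100 * same / first_is_largest, 1))             # 95.3
```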

  12. MGC genes
      • Of 1000 genes (the remaining 113)
        • In 102 cases, the annotated ORF is the longest, not the first
        • In 3 cases, the annotated ORF is a subset of the longest ORF
        • In 6 cases, the annotated ORF is the first, not the longest
        • 1 annotated ORF cannot be found
        • 1 annotated ORF is neither the first nor the longest
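The categories on this slide can be expressed as a small classifier over peptide strings. This is a sketch under one assumption: "subset" means the annotated protein starts at a downstream M in the same frame, so it is a suffix of the longer peptide. The function name and the toy peptides are hypothetical.

```python
def classify_annotation(annotated, first, longest):
    """Label an annotated ORF relative to the first and longest ORFs."""
    if not annotated:
        return "not found"
    if annotated == longest and annotated == first:
        return "first and longest"
    if annotated == longest:
        return "longest, not first"
    if annotated == first:
        return "first, not longest"
    # A later in-frame start gives a peptide that is a suffix of the full ORF.
    if longest.endswith(annotated):
        return "subset of longest"
    if first.endswith(annotated):
        return "subset of first"
    return "neither first nor longest"


# Hypothetical peptides, for illustration only.
first_pep = "MPRGSHE"
longest_pep = "MKTLLMSAVQW"
print(classify_annotation("MSAVQW", first_pep, longest_pep))
print(classify_annotation("MPRGSHE", first_pep, longest_pep))
print(classify_annotation(None, first_pep, longest_pep))
```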

  13. Examples: GenBank ORF is first
      >longest
      MSLSLVFRAASYFKLVPFHSSSSNQFLQPPGWVVLTQTLVLLHFERFSYQNVPKSAQGKGNLQPETNIHLFHFLTFPKQISRNLFNSLLCLMCLTYF
      >first
      MTNVYSLDGILVFGLLFVCTCAYFKKVPRLKTWLLSEKKGVWGVFYKAAVIGTRLHAAVAIACVVMAFYVLFIK
      (Longest not found in mouse)

  14. GenBank neither first nor longest
      >longest
      MESDPRICTMGNQEWPGWVPPPGPASSPPNCPHPMDEAGGTFGAKPACLPAPCLTRASFQLALPPAGPWAWPGPTGGYGLGSPSPLRGWRATSLGCYNLTPDSIGPLPLPRAPRSAALRLNMSARPCQCCGTPVRASDCVCRRDAGTRGCVCMCVCVRAACPPVCMVCGLGPHPWPEHFILWGRGADLVGGAPL
      >first
      MGGGRAPPERLGGCR
      >GBprot
      MRCLSSKKAGSTSVVKYIKTWRPRYFLLKSDGSFIGYKERPEAPDQTLPPLNNFSVAECQLMKTERPRPNTFVIRCLQWTTVIERTFHVDSPDEREEWMRAIQMVANSLQPHLCAQTRIWKTPPPAQAWAVGRLEIQVLIHTSPSEG

  15. GenBank ORF is subset of longest
      >longest
      MSKRRMSVGQQTWALLCKNCLKKWRMKRQTLLEWLFSFLLVLFLYLFFSNLHQVHDTPQMSSMDLGRVDSFNDTNYVIAFAPESKTTQEIMNKVASAPFLKGRTIMGWPDEKSMDELDLNYSIDAVRVIFTDTFSYHLKFSWGHRIPMMKEHRDHSAHCQAVNEKMKCEGSEFWEKGFVAFQAAINAAIIEIATNHSVMEQLMSVTGVHMKILPFVAQGGVATDFFIFFCIISFSTFIYYVSVNVTQERQYITSLMTMMGLRESAFW
      >first
      MGSSLQELSQKMENEKTDLVGMALFISSGTVSVPIFLQFTSSS
      >GBprot
      MGWPDEKSMDELDLNYSIDAVRVIFTDTFSYHLKFSWGHRIPMMKEHRDHSAHCQAVNEKMKCEGSEFWEKGFVAFQAAINAAIIEIATNHSVMEQLMSVTGVHMKILPFVAQGGVATDFFIFFCIISFSTFIYYVSVNVTQERQYITSLMTMMGLRESAFW
      (Longest found in mouse)
