1 / 50

Many to 1 Gene Associations

Many to 1 Gene Associations. The following slides show a few examples of gene predictions by one annotation group that overlap one or more genes from another group. Some of the examples that follow also illustrate issues related to

maxime
Download Presentation

Many to 1 Gene Associations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Many to 1 Gene Associations The following slides show a few examples of gene predictions by one annotation group that overlap one or more genes from another group. Some of the examples that follow also illustrate issues related to - differences in annotation type (e.g., pseudogene versus gene),and -in confusing nomenclature (e.g., different genes assigned the same official gene name).

  2. One gene or two? 2:110788585..110968584 Orientation issue for OTT15152?

  3. 8:4238129..4254528 One gene or two?

  4. One gene or two? 3:105659594..105759593

  5. 11:69491920..69516919 One gene or two or three?

  6. 5:106920574..107155573 One gene or two or three?

  7. One gene or two? 6:145313224..145563223 The VEGA gene model seems to unite two separate gene models in NCBI

  8. One gene or two? 9:15109186..15189185

  9. 7:127057560..127247315 One gene or two?

  10. 7:52670474..52680473 One gene or two?

  11. 4:146600055..146731054 One gene or two?

  12. n:m ENSMUSG00000050714 and ENSMUSG00000066798 overlap OTTMUSG00000012648 and OTTMUSG00000012652 2:37243166..37343165

  13. n:m ENSMUSG00000074643 overlaps ENSMUSG00000038171 OTTMUSG00000016087, OTTMUSG00000016088, and OTTMUSG00000019746 2:155895575..155939706

  14. 6:113326848..113366847 n:m OTTMUSG00000017554 overlaps OTTMUSG00000016376 and EG68089 and EG101100.

  15. 8:47538900..47638899 Are EG667337 and EG14081 different genes?

  16. 7:126992968..127042967 Are EG233805 and EG1000043396 different genes?

  17. 6:122655579..122665578 Are EG71950 and EG100038891 different genes?

  18. 16:84828048..84836547 Are EG11957 and EG100039950 different genes?

  19. 7:87385985..87410984 Are EG61000042379 and EG269954 different genes?

  20. 9:43622106..43900000 Are EG61000042548 and EG21838 different genes?

  21. 13:22073239..22080498 Are OTT00466 and OTT13227 different genes?

  22. 4:122937497..122988738 Are OTT08975 and OTT08978 different genes?

  23. 3:94933437..94938148 Are OTT22306 and OTT19657 different genes?

  24. 3:107728458..107736457 Are OTT25890 and OTT07101 the same gene?

  25. 1:172123537..172148536 Are OTT21542 and OTT21543 different genes?

  26. 1:173164903..173177002 Are OTT21571 and OTT21573 different genes?

  27. 2:90744183..90753182 Are OTT14319 and OTT14315 different genes?

  28. 4:42236997..42261996 Are ENS78738 and ENS78736 different genes? Are the genes predicted new members of the chemokine (C-C motif) ligand family? In Ensembl multiple gene predictions are assigned to the same gene symbol/MGI id.

  29. 15:79611961..79691960 One gene or two or three? Are Nptxr and Cbx6 Overlapping?

  30. 2:120535197..120698446 One gene or two? Are Cdan1 and Ttbk2 Overlapping?

  31. One gene or two? X:9598695..9848694 Srpx and Rpgr Overlapping?

  32. One gene or two? 2:181092767..181132366 Zgpat and Lime1 Overlapping?

  33. One gene or two? 5:31435474..31485473 Mpv17 and Gtf3c2 overlapping?

  34. Next slide 16:96582252..96792251 One gene or two? Are Pcp4 and Igsf5 two different genes?

  35. In Ensembl currently it looks as though Pcp4 and Igsf5 are considered synonyms for the same gene?

  36. 6:87895874..87954921 One gene or two? NCBI gene is a pseudogene, Ensembl gene is a protein coding gene. Pseudogene Protein coding gene

  37. 13:75781991..75782990 Protein coding gene Pseudogene Protein coding gene

  38. 14:3046445..3080444 Pseudogene Protein coding gene

  39. 6:128882645..128993644 Retrotransposed vs pseudogene Pseudogene Retrotransposed Pseudogene

  40. Gene Family Challenges Gene families present many challenges to determining equivalency among gene predictions and for nomenclature. Examples from two gene families are shown in the following slides…. killer cell lectin-like receptor (Klra) family UDP glucuronosyltransferase 1 family cysteine-rich perinuclear theca C-type lectin domain family 2

  41. Next slide 6:129837719..130337718 killer cell lectin-like receptor (Klra) family

  42. Next slide 6:130198815..130298814 killer cell lectin-like receptor (Klra) family Gene identity crisis! Protein coding gene Protein coding gene Pseudogene

  43. 6:130275414..130375413 • Overlapping NCBI annotation • Overlapping features of different types 2. Pseudogene 1. Protein coding gene

  44. Next slide 1:89943192..90125441 UDP glucuronosyltransferase 1 family Ensembl maintains a single gene id for all of the members of the family.

  45. 9:24428665..24431164 cysteine-rich perinuclear theca Gene identity crisis!

  46. 6:128882645..128993644 C-type lectin domain family 2 Ensembl and VEGA predict only a single gene with multiple transcripts rather than two genes Clec2g and Clec2f.

  47. Unique to MGI MGI does not have a high-throughput computational genome annotation pipeline. However, we integrated the results of high throughput cDNA sequencing projects into the database prior to the availability of the mouse genome. Many of these genes have remained unique to MGI. The following slides illustrate several cases where MGI has a gene that has not been predicted by one of the three major annotation groups. Many (most) of these MGI-unique genes are from the RIKEN cDNA sequencing initiative. Many of them likely represent non-protein coding genes.

  48. 11:79796866..79857365

  49. 9:106742778..106752777 Unique to MGI

  50. 11:69491920..69516919

More Related