320 likes | 500 Views
2009 NIGMS Workshop Enabling Technologies for Structural Biology March 4 th -6 th , 2009 Extra-Cellular Mammalian Proteins As Structural Genomics Targets Steve Anderson. Quote from NESG PSI-2 Center Grant Application, Nov. 2004:
E N D
2009 NIGMS Workshop Enabling Technologies for Structural Biology March 4th-6th, 2009 Extra-Cellular Mammalian Proteins As Structural Genomics Targets Steve Anderson
Quote from NESG PSI-2 Center Grant Application, Nov. 2004: “…Table II-1 indicates the relative success rate, in terms of solubility and structure depositions, of targets from key organisms when expressed in E. coli. The data indicate that eukaryotic targets are some three-fold less likely to be expressed and soluble in our E. coli expression systems. Moreover, although structures of some of these eukaryotic targets will be determined in the coming months, the soluble eukaryotic proteins (particularly human proteins) are some three-fold less likely to result in NMR or crystal structures. Despite the tremendous overall success of bacterial expression systems, success rates with eukaryotic proteins in these systems are limited, and eukaryotic protein sample production remains a major challenge for structural genomics.”
Major Roadblock: Many interesting proteins (e.g., eukaryotic proteins) -- especially eukaryotic secreted proteins -- do not express well in commonly-used E. coli systems. We are especially interested in eukaryotic secreted proteins.
What about E. coli secretion vectors? Suitable media for enrichment with 15N and 13C (or labeling with SeMet) is the key issue because minimal medium appears to strongly inhibit secretion.
Protein A secretion vector for producing samples for structural biology Would this enable isotope enrichment, too? Lu et al (1997) J. Mol. Biol.266, 441.
Test expression of ZZ-OM3D-P1Lys Expression in Celtone media is better than expression in MJ media Osmotic shockate IgG-purified Purif 1 2 3 4 M 1 2 3 4 1 - Celtone + 0.2% glucose 2 - Spectra + 0.2% glucose 3 - MJ media (standard media for labeling protein) 4 - 2XTY + 0.2% glucose (optimal rich media, but can’t be used for labeling)
Cloning sites of new pEZZ318 ZZ-fusion vector Version 1.0 (TEV cleavage): • Synthetic genes with N- or C-terminal His tags inserted as Xho I / Hind III fragments C-term of Z domain coding TEV cleavage site A P K V D G L E G S G E N L Y F Q S L E * …GGCGCCGAAAGTAGAC GGTCTAGAA GGTAGCGGT GAAAACCTGTATTTTCAGAGC CTCGAG TAA GGGCCCAAGCTTGAATTC… …CCGCGGCTTTCATCTG CCAGATCTT CCATCGCCA CTTTTGGACATAAAAGTCTCG GAGCTC ATT CCCGGGTTCGAACTTAAG… Nar IXba I Xho I HindIIIEcoRI Version 2.0 (His-tag / TEV cleavage): • Synthetic genes without His tags inserted as Sac I / Hind III fragments Octa-His tag TEV cleavage site L E G S G H H H H H H H H G S G E N L Y F Q S S * …TCTAGAA GGCTCTGGT CATCACCATCACCATCACCATCAC GGCAGCGGT GAAAACCTGTATTTTCAGAGCTCTTAA GGGCCCAAGCTT… …AGATCTT CCGAGACCT GTAGTGGTAGTGGTAGTGGTAGTG CCGTCGCCA CTTTTGGACATAAAAGTCTCGAGAATT CCCGGGTTCGAA… Xba I Sac IHind III
Our source of targets: Human Cancer Protein Interaction Network (HCPIN) Systematically complete structural coverage of pathways and interaction networks Study structures of complexes Pathway-Interaction Subnet Huang, Montelione, et al (2008) Molec. Cell. Proteomics7, 2048.
Human Cancer Pathway Interaction Network* • Cell cycle progression • Apoptosis • Toll-like receptor pathway • Interferon alpha/beta • JAK-STAT pathway • TGF-beta pathway • PI3K pathway • MAPK pathway Huang, Montelione, et al (2008) Molec. Cell. Proteomics7, 2048. *For further information see Janet Huang (posters 49 & 50).
Target Selection 658 ~1100 human proteins/domains are selected as NESG targets 2971 2328 506 http://nmr.cabm.rutgers.edu:9090/PLIMS/ 136 1160
Target Selection 658 ~1100 human proteins/domains are selected as NESG targets 2971 2328 506 http://nmr.cabm.rutgers.edu:9090/PLIMS/ 136 Approximately 1/3 of the HCPIN targets not selected are predicted to be secreted or membrane-bound proteins. 1160
Predicted Extra-cellular Domains For secreted HCPIN proteins exhibiting evidence of multiple, reiterated domain modules bounded by phase one intron insertion positions [Patthy (1999) Gene238, 103], multiple sequence alignments of the intervening exons were prepared.* *See Chiang et poster (#43) for further information.
Osmotic shockates of targets 401, 601, 801/803, & 901/902 Expressed in ZZ vector with TEV-cleavable linker (38 hr. culture in 15N-Celtone) • • • • • lysozyme
TEV cleavage of 15N-enriched targets 601 801 901 P C P C P C TEV protease ZZ P - purified C - TEV cleaved
Progress Report: Some case studies with individual domains
Example 1: Human follistatin-like protein 3_domain 1 (exon 3) • TGF antagonist • binds activin A • implicated in glucose & fat homeostasis - Paolo Rossi
Example 2: Sushi domain from human complement factor H-related 1 protein. >>> examination of spectra of 15N-labeled material led to the conclusion that this domain was relatively unstructured. Is this due to the fact that some domains may need to be packed against adjacent domains for stability’s sake?
There is some evidence that sushi domains are close-packed in holoproteins -- see structure of a pair of sushi domains from human complement factor H (left). Herbert et al (2006) J. Biol. Chem.281, 16512
Purifed ZZ fusion of recombinant Sushi domain from human complement factor H-related protein 1 run on reducing and non-reducing gels Reducing Non-reducing M - + + + :incubation with TEV buffer* Conclusion 1: Multimeric disulfide cross-linked concatamers can form from recombinant proteins in the periplasmic space. Conclusion 2: Thiol-disulfide exchange is promoted by the redox character* of the TEV protease cleavage buffer, allowing breakage of inter-molecular disulfides and refolding to the monomer species. *includes 3 mM GSH and 0.3 mM GSSG full-length fusion
Example 3: Assignments for human fibulin-4 (FBLN4) domain 6 (exon 9) Connectivity map showing completeness of assignments - Swapna Gurla • predicted to be a Ca++-binding EGF-like domain • binds to extracellular matrix proteins • dysregulated in colon cancer • involved in embryonic development & remodelling
Potential disulfide scrambling issues with FBLN4_domain 6 motivated us to improve the purification protocol by adding an ion exchange step.
We then checked for incorrect disulfide bond formation by purifying FBLN4_domain 6 in the presence of oxidized glutathione (GSSG), which should reversibly cap any exposed thiols, and then treated the purified sample with iodoacetamide (IAM) in the presence of 6M Gdn-HCl, which should irreversibly alkylate any buried thiols. Result: Based on MALDI-TOF MS, >90% of the protein appeared to be of the correct molecular weight and fully disulfide bonded.
MALDI-TOF of FBLN4_domain 6 (no IAM) - Haiyan Zheng
MALDI-TOF of FBLN4_domain 6 (+ IAM) - Haiyan Zheng
Human fibulin-4 (FBLN4) domain 6 (exon 9) C Preliminary (Further structure calculations are in progress….) N - Swapna Gurla
Summary of results so far (still in research phase): 6 - targets cloned 3 - expressed 2 - 3D structural information (one expressed domain was soluble but disordered) >>> The numbers are small but promising!
Conclusion: • Facile expression of extracellular human proteins as structural genomics targets looks promising. This effort may even result in lower levels of attrition (cloned --> 3D structure) than have “classic” expression approaches. • Prospective domain parsing of larger extracellular human proteins is possible using the phase 1 intron rule.
Mission Statement. The long-range goal of the Protein Structure Initiative is to make the three-dimensional atomic-level structures of most proteins easily obtainable from knowledge of their corresponding DNA sequences.
“Holy Grail” of structural genomics (cf. Mission Statement of PSI): Complete structural coverage of some domain families in an organism? Huang, Montelione, et al (2008) Molec. Cell. Proteomics7, 2048. For example, the EGF domain family
Yi-Wen Chiang Davis Anderson Jung B. Seo Yushen Qian Paolo Rossi Swapna Gurla Guy Montelione Haiyan Zheng Peter Lobel