This article describes an improved pooled open reading frame (ORF) expression technology (POET) that uses recombinational cloning and solution-based tandem mass spectrometry (MS/MS) to identify ORFs that yield high levels of soluble, purified protein when expressed in Escherichia coli. Using this method, three identical pools of 512 human ORFs were subcloned, purified, and transfected into three separate E. coli cultures. After bulk expression and purification, the proteins from the three separate pools were digested into tryptic peptides. Each of these samples was subsequently analyzed in triplicate using reversed-phase high-performance liquid chromatography (LC) coupled directly online with MS/MS. The abundance of each protein was determined by calculating the average exponentially modified protein abundance index (emPAI) of each protein across the three protein pools. Human proteins that consistently gave high emPAI values were subjected to small-scale expression and purification. These clones showed high levels of expression of soluble protein. Conversely, proteins that were not observed by LC-MS/MS did not show any detectable soluble expression in small-scale validation studies. Using this improved POET method allows the expression characteristics of hundreds of proteins to be quickly determined in a single experiment.
Chromatography, high pressure liquid, cloning, molecular, escherichia coli, peptides, proteins, proteomics, recombination, genetic, solubility, tandem mass spectrometry, transfection, trypsin
Waybright, Timothy; Gillette, William; Esposito, Dominic; Stephens, Robert; Lucas, David; Hartley, James; and Veenstra, Timothy D., "Identification of Highly Expressed, Soluble Proteins Using an Improved, High-Throughput Pooled ORF Expression Technology" (2008). Pharmaceutical Sciences Faculty Publications. 308.