A fast, automated, polynomial-based cosmic ray spike-removal method for the high-throughput processing of Raman spectra.
Raman spectra often contain undesirable,randomly positioned,intense,narrow-bandwidth,positive,unidirectional spectral features generated when cosmic rays strike charge-coupled device cameras. These must be removed prior to analysis,but doing so manually is not feasible for large data sets. We developed a quick,simple,effective,semi-automated procedure to remove cosmic ray spikes from spectral data sets that contain large numbers of relatively homogenous spectra. Although some inhomogeneous spectral data sets can be accommodated—it requires replacing excessively modified spectra with the originals and removing their spikes with a median filter instead—caution is advised when processing such data sets. In addition,the technique is suitable for interpolating missing spectra or replacing aberrant spectra with good spectral estimates. The method is applied to baseline-flattened spectra and relies on fitting a third-order (or higher) polynomial through all the spectra at every wavenumber. Pixel intensities in excess of a threshold of 3× the noise standard deviation above the fit are reduced to the threshold level. Because only two parameters (with readily specified default values) might require further adjustment,the method is easily implemented for semi-automated processing of large spectral sets.
View Publication
B. Zhu et al. (Nov 2025)
Scientific Reports 15 8
A fixation-compatible protocol for intracellular and surface marker-based detection of circulating tumor cells in hepatocellular carcinoma
Circulating tumor cell (CTC) detection in hepatocellular carcinoma (HCC) is limited not only by the rarity of CTCs but also by a heavy reliance on cell surface markers such as EpCAM,which are variably expressed or lost during tumor progression. Detecting intracellular markers,such as cytokeratin offers an important complementary and comprehensive strategy but remains technically limited in flow cytometry due to the need for fixation and permeabilization,which often lead to cell loss and surface epitope damage. In this study,we systematically evaluated the feasibility of using fixed samples for flow cytometry,using HepG2 cells,PBMCs,and CTCs from patients with HCC. Our results demonstrate that fixation enabled intracellular staining without compromising cell surface marker detection,even after short-term storage at 4 °C and long-term storage at -80 °C. Fixed samples,particularly fixed unfrozen,exhibited comparable staining performance to fresh samples with only a 7–10% reduction in cell recovery. Clinical validation in HCC patients confirmed successful CTC detection,and tumor-specific CTNNB1 mutations were identified in CTC-derived DNA but not in matched plasma cfDNA. These findings support fixed CTC sample workflows as a reliable and practical approach for CTC analysis in HCC.
View Publication
(Jun 2025)
Nucleic Acids Research 53 11
A flexible, high-throughput system for studying live mRNA translation with HiBiT technology
HiBiT is an engineered luciferase’s 11-amino-acid component that can be introduced as a tag at either terminus of a protein of interest. When the LgBiT component and a substrate are present,HiBiT and LgBiT dimerize forming a functional luciferase. The HiBiT technology has been extensively used for high-throughput protein turnover studies in cells. Here,we have adapted the use of the HiBiT technology to quantify messenger RNA (mRNA) translation temporally in vitro in the rabbit reticulocyte system and in cellulo in HEK293 cells constitutively expressing LgBiT. The assay system can uniquely detect differences in cap,5?UTR,modified nucleotide composition,coding sequence optimization and poly(A) length,and their effects on mRNA translation over time. Importantly,using these assays we established the optimal mRNA composition varied depending on the encoded protein of interest,highlighting the importance of screening methods tailored to the protein of interest,and not reliant on reporter proteins. Our findings demonstrated that HiBiT can be easily and readily adapted to monitor real-time mRNA translation in live cells and offers a novel and highly favourable method for the development of mRNA-based therapeutics.
View Publication
Sommer A et al. (APR 1987)
Biochemical and biophysical research communications 144 2 543--50
A form of human basic fibroblast growth factor with an extended amino terminus.
The amino acid sequence of a human placental bFGF was determined by a combination of protein and cDNA sequencing. The placental bFGF consists of 157 amino acid residues with a calculated molecular weight of 17,464 and is highly homologous to bovine pituitary bFGF. The human protein contains an amino terminal extension when compared to the sequence established for bovine bFGF (Esch et al.,1985) and to the sequence of the predicted translation product based on human bFGF cDNA clones (Abraham et al.,1986).
View Publication
Lei Y and Schaffer DV (DEC 2013)
Proceedings of the National Academy of Sciences of the United States of America 110 52 E5039----E5048
A fully defined and scalable 3D culture system for human pluripotent stem cell expansion and differentiation
Human pluripotent stem cells (hPSCs),including human embryonic stem cells and induced pluripotent stem cells,are promising for numerous biomedical applications,such as cell replacement therapies,tissue and whole-organ engineering,and high-throughput pharmacology and toxicology screening. Each of these applications requires large numbers of cells of high quality; however,the scalable expansion and differentiation of hPSCs,especially for clinical utilization,remains a challenge. We report a simple,defined,efficient,scalable,and good manufacturing practice-compatible 3D culture system for hPSC expansion and differentiation. It employs a thermoresponsive hydrogel that combines easy manipulation and completely defined conditions,free of any human- or animal-derived factors,and entailing only recombinant protein factors. Under an optimized protocol,the 3D system enables long-term,serial expansion of multiple hPSCs lines with a high expansion rate (∼20-fold per 5-d passage,for a 1072-fold expansion over 280 d),yield (∼2.0 × 107 cells per mL of hydrogel),and purity (∼95% Oct4+),even with single-cell inoculation,all of which offer considerable advantages relative to current approaches. Moreover,the system enabled 3D directed differentiation of hPSCs into multiple lineages,including dopaminergic neuron progenitors with a yield of ∼8 × 107 dopaminergic progenitors per mL of hydrogel and ∼80-fold expansion by the end of a 15-d derivation. This versatile system may be useful at numerous scales,from basic biological investigation to clinical development.
View Publication
Lu LL et al. (SEP 2016)
Cell
A Functional Role for Antibodies in Tuberculosis.
While a third of the world carries the burden of tuberculosis,disease control has been hindered by a lack of tools,including a rapid,point-of-care diagnostic and a protective vaccine. In many infectious diseases,antibodies (Abs) are powerful biomarkers and important immune mediators. However,in Mycobacterium tuberculosis (Mtb) infection,a discriminatory or protective role for humoral immunity remains unclear. Using an unbiased antibody profiling approach,we show that individuals with latent tuberculosis infection (Ltb) and active tuberculosis disease (Atb) have distinct Mtb-specific humoral responses,such that Ltb infection is associated with unique Ab Fc functional profiles,selective binding to FcγRIII,and distinct Ab glycosylation patterns. Moreover,compared to Abs from Atb,Abs from Ltb drove enhanced phagolysosomal maturation,inflammasome activation,and,most importantly,macrophage killing of intracellular Mtb. Combined,these data point to a potential role for Fc-mediated Ab effector functions,tuned via differential glycosylation,in Mtb control.
View Publication
D. R. McHugh et al. ( 2018)
PloS one 13 6 e0199573
A G542X cystic fibrosis mouse model for examining nonsense mutation directed therapies.
Nonsense mutations are present in 10{\%} of patients with CF,produce a premature termination codon in CFTR mRNA causing early termination of translation,and lead to lack of CFTR function. There are no currently available animal models which contain a nonsense mutation in the endogenous Cftr locus that can be utilized to test nonsense mutation therapies. In this study,we create a CF mouse model carrying the G542X nonsense mutation in Cftr using CRISPR/Cas9 gene editing. The G542X mouse model has reduced Cftr mRNA levels,demonstrates absence of CFTR function,and displays characteristic manifestations of CF mice such as reduced growth and intestinal obstruction. Importantly,CFTR restoration is observed in G542X intestinal organoids treated with G418,an aminoglycoside with translational readthrough capabilities. The G542X mouse model provides an invaluable resource for the identification of potential therapies of CF nonsense mutations as well as the assessment of in vivo effectiveness of these potential therapies targeting nonsense mutations.
View Publication
Miller TW et al. (APR 2011)
Clinical cancer research : an official journal of the American Association for Cancer Research 17 7 2024--34
A gene expression signature from human breast cancer cells with acquired hormone independence identifies MYC as a mediator of antiestrogen resistance.
PURPOSE: Although most patients with estrogen receptor α (ER)-positive breast cancer initially respond to endocrine therapy,many ultimately develop resistance to antiestrogens. However,mechanisms of antiestrogen resistance and biomarkers predictive of such resistance are underdeveloped. EXPERIMENTAL DESIGN: We adapted four ER(+) human breast cancer cell lines to grow in an estrogen-depleted medium. A gene signature of estrogen independence was developed by comparing expression profiles of long-term estrogen-deprived (LTED) cells to their parental counterparts. We evaluated the ability of the LTED signature to predict tumor response to neoadjuvant therapy with an aromatase inhibitor and disease outcome following adjuvant tamoxifen. We utilized Gene Set Analysis (GSA) of LTED cell gene expression profiles and a loss-of-function approach to identify pathways causally associated with resistance to endocrine therapy. RESULTS: The LTED gene expression signature was predictive of high tumor cell proliferation following neoadjuvant therapy with anastrozole and letrozole,each in different patient cohorts. This signature was also predictive of poor recurrence-free survival in two studies of patients treated with adjuvant tamoxifen. Bioinformatic interrogation of expression profiles in LTED cells revealed a signature of MYC activation. The MYC activation signature and high MYC protein levels were both predictive of poor outcome following tamoxifen therapy. Finally,knockdown of MYC inhibited LTED cell growth. CONCLUSIONS: A gene expression signature derived from ER(+) breast cancer cells with acquired hormone independence predicted tumor response to aromatase inhibitors and associated with clinical markers of resistance to tamoxifen. Activation of the MYC pathway was associated with this resistance.
View Publication
Y. Wu et al. (Jun 2024)
Molecular Therapy Oncology 32 3
A general pHLA-CD80 scaffold fusion protein to promote efficient antigen-specific T cell-based immunotherapy
Inadequate antigen-specific T cells activation hampers immunotherapy due to complex antigen presentation. In addition,therapeutic in vivo T cell expansion is constrained by slow expansion rates and limited functionality. Herein,we introduce a model fusion protein termed antigen-presenting cell-mimic fusion protein (APC-mimic),designed to greatly mimicking the natural antigen presentation pattern of antigen-presenting cells and directly expand T cells both in vitro and in vivo . The APC-mimic comprises the cognate peptide-human leukocyte antigen (pHLA) complex and the co-stimulatory marker CD80,which are natural ligands on APCs. Following a single stimulation,APC-mimic leads to an approximately 400-fold increase in the polyclonal expansion of antigen-specific T cells compared with the untreated group in vitro without the requirement for specialized antigen-presenting cells. Through the combination of single-cell TCR sequencing (scTCR-seq) and single-cell RNA sequencing (scRNA-seq),we identify an approximately 600-fold monoclonal expansion clonotype among these polyclonal clonotypes. It also exhibits suitability for in vivo applications confirmed in the OT-1 mouse model. Furthermore,T cells expanded by APC-mimic effectively inhibits tumor growth in adoptive cell transfer (ACT) murine models. These findings pave the way for the versatile APC-mimic platform for personalized therapeutics,enabling direct expansion of polyfunctional antigen-specific T cell subsets in vitro and in vivo .
View Publication
Maricque BB et al. (FEB 2017)
Nucleic acids research 45 4 e16
A genome-integrated massively parallel reporter assay reveals DNA sequence determinants of cis-regulatory activity in neural cells.
Recent large-scale genomics efforts to characterize the cis-regulatory sequences that orchestrate genome-wide expression patterns have produced impressive catalogues of putative regulatory elements. Most of these sequences have not been functionally tested,and our limited understanding of the non-coding genome prevents us from predicting which sequences are bona fide cis-regulatory elements. Recently,massively parallel reporter assays (MPRAs) have been deployed to measure the activity of putative cis-regulatory sequences in several biological contexts,each with specific advantages and distinct limitations. We developed LV-MPRA,a novel lentiviral-based,massively parallel reporter gene assay,to study the function of genome-integrated regulatory elements in any mammalian cell type; thus,making it possible to apply MPRAs in more biologically relevant contexts. We measured the activity of 2,600 sequences in U87 glioblastoma cells and human neural progenitor cells (hNPCs) and explored how regulatory activity is encoded in DNA sequence. We demonstrate that LV-MPRA can be applied to estimate the effects of local DNA sequence and regional chromatin on regulatory activity. Our data reveal that primary DNA sequence features,such as GC content and dinucleotide composition,accurately distinguish sequences with high activity from sequences with low activity in a full chromosomal context,and may also function in combination with different transcription factor binding sites to determine cell type specificity. We conclude that LV-MPRA will be an important tool for identifying cis-regulatory elements and stimulating new understanding about how the non-coding genome encodes information.
View Publication
M. Legut et al. (mar 2022)
Nature 603 7902 728--735
A genome-scale screen for synthetic drivers of T cell proliferation.
The engineering of autologous patient T cells for adoptive cell therapies has revolutionized the treatment of several types of cancer1. However,further improvements are needed to increase response and cure rates. CRISPR-based loss-of-function screens have been limited to negative regulators of T cell functions2-4 and raise safety concerns owing to the permanent modification of the genome. Here we identify positive regulators of T cell functions through overexpression of around 12,000 barcoded human open reading frames (ORFs). The top-ranked genes increased the proliferation and activation of primary human CD4+ and CD8+ T cells and their secretion of key cytokines such as interleukin-2 and interferon-$\gamma$. In addition,we developed the single-cell genomics method OverCITE-seq for high-throughput quantification of the transcriptome and surface antigens in ORF-engineered T cells. The top-ranked ORF-lymphotoxin-$\beta$ receptor (LTBR)-is typically expressed in myeloid cells but absent in lymphocytes. When overexpressed in T cells,LTBR induced profound transcriptional and epigenomic remodelling,leading to increased T cell effector functions and resistance to exhaustion in chronic stimulation settings through constitutive activation of the canonical NF-$\kappa$B pathway. LTBR and other highly ranked genes improved the antigen-specific responses of chimeric antigen receptor T cells and ?? T cells,highlighting their potential for future cancer-agnostic therapies5. Our results provide several strategies for improving next-generation T cell therapies by the induction of synthetic cell programmes.
View Publication
Park RJ et al. (DEC 2016)
Nature genetics
A genome-wide CRISPR screen identifies a restricted set of HIV host dependency factors.
Host proteins are essential for HIV entry and replication and can be important nonviral therapeutic targets. Large-scale RNA interference (RNAi)-based screens have identified nearly a thousand candidate host factors,but there is little agreement among studies and few factors have been validated. Here we demonstrate that a genome-wide CRISPR-based screen identifies host factors in a physiologically relevant cell system. We identify five factors,including the HIV co-receptors CD4 and CCR5,that are required for HIV infection yet are dispensable for cellular proliferation and viability. Tyrosylprotein sulfotransferase 2 (TPST2) and solute carrier family 35 member B2 (SLC35B2) function in a common pathway to sulfate CCR5 on extracellular tyrosine residues,facilitating CCR5 recognition by the HIV envelope. Activated leukocyte cell adhesion molecule (ALCAM) mediates cell aggregation,which is required for cell-to-cell HIV transmission. We validated these pathways in primary human CD4(+) T cells through Cas9-mediated knockout and antibody blockade. Our findings indicate that HIV infection and replication rely on a limited set of host-dispensable genes and suggest that these pathways can be studied for therapeutic intervention.
View Publication