Several recent studies have claimed that cancer cells can be reprogrammed into induced pluripotent stem cells (iPSCs). However, in most cases, cancer cells seem to be resistant to cellular reprogramming. Furthermore, the underlying mechanisms of limited reprogramming in cancer cells are largely unknown. Here, we identified the candidate barrier genes and their target genes at the early stage of reprogramming for investigating cancer reprogramming.
We tried induction of pluripotency in normal human fibroblasts (BJ) and both human benign (MCF10A) and malignant (MCF7) breast cancer cell lines using a classical retroviral reprogramming method. We conducted RNA-sequencing analysis to compare the transcriptome of the three cell lines at early stage of reprogramming.
We could generate iPSCs from BJ, whereas we were unable to obtain iPSCs from cancer cell lines. To address the underlying mechanism of limited reprogramming in cancer cells, we identified 29 the candidate barrier genes based on RNA-sequencing data. In addition, we found 40 their target genes using Cytoscape software.
Our data suggest that these genes might one of the roadblock for cancer cell reprogramming. Furthermore, we provide new insights into application of iPSCs technology in cancer cell field for therapeutic purposes.
Cancer is caused by environmental risk factors as well as genetic and epigenetic alterations, leading to uncontrolled cell proliferation and tumor formation (1). To explore the underlying mechanism of cancers, many researchers have attempted to investigate the cause of cancer (2–4).
In 2007, it was reported that overexpression of Yamanaka’s cocktails (
The normalization of cancer cells remains an important challenge due to the alteration of epigenetic modifications and expression of cancer-related genes (9). Understanding of the reprogramming of cancer cells would provide powerful tools for investigating the dynamic changes in the epigenetic and gene expression states of cancer cells.
Several groups have reported the reprogramming of cancer cells including melanoma, pancreatic cancer, gastrointestinal cancer, bladder cancer, lung carcinoma, and breast cancer (10–15). The cancer-derived iPSCs were distinct from parental cancer cells in their acquired sensitivity to chemotherapeutic agents and in tumorigenic activities (11, 13, 15–17). In another study, the authors hypothesized that human cancer cells could be converted into iPSCs and then differentiated again into cancer cells, which might be at an early developmental stage (18). This approach could provide a human model to study the early stages of cancer. However, reprogramming of cancer cells is less efficient and more time-consuming than that of normal somatic cells (19). The features important for their pluripotency, including morphology, gene expression, clonal expansion, immunocytochemistry, and teratoma formation, are not fully characterized (12, 15, 17, 20, 21).
Recent study implies that induction of pluripotency from malignant cancer cells was challenging compared to benign cancer cells (22). However, the exact reason for the differences regarding to reprogramming between malignant and benign cancer cells is unknown. To find out the exact mechanism that causes this difference might help understand malignancy in tumors. In general, primary cancer cells sourced from tumor tissues allow cancer study in
In this study, we attempted to reprogram normal human fibroblasts (BJ), MCF7, and MCF10A using the transcription factors
BJ and PLAT-A cells were purchased from American Type Culture Collection (ATCC, VA, USA). Mouse embryonic fibroblasts (MEFs) were derived from C57BL6/N (KOATECH, Pyeongtaek, Korea) mouse strain embryos at embryonic day 13.5 after removing the head and all internal organs. MCF7 and MCF10A were obtained from Seoul National University, Korea, and Dongduk Women’s University, Korea. MEFs, BJ, and MCF7 were maintained in Dulbecco’s Modified Eagle’s Medium (DMEM; Welgene, Gyeongsan-si, Korea) supplemented with 10% fetal bovine serum (Corning, NY, USA), 0.1 mM non-essential amino acids (NEAA; Gibco, MA, USA), and 1% penicillin/streptomycin (P/S; Welgene). MCF10A cells were cultured in MEGM kit medium (ATCC, CC-3150).
The pMX reprogramming vectors were purchased from Addgene (Cat. 17964, 17965, 17966, and 17967, respectively). The retroviral vectors were produced by transfection of these vectors (15
BJ, MCF7, and MCF10A cells seeded in a 6-well plate (SPL Life Science, Pocheon, Korea) at 2×105 cells per well were incubated at 37°C with equal volumes of the virus-containing supernatants supplemented with 4
Total RNA was purified using the RNeasy Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions; 1
Cells were fixed in 4% paraformaldehyde for 15 min at room temperature and stained for alkaline phosphatase (AP) with the Leukocyte Alkaline Phosphatase kit (Stemgent, MA, USA) following the manufacturer’s instructions.
Cells were fixed in 4% paraformaldehyde for 20 min at room temperature and washed 3 times with Dulbecco’s Phosphate Buffered Saline (DPBS; Hyclone Laboratories, UT, USA). The cells were then permeabilized and blocked with DPBS containing 0.5% Triton X-100 (Sigma-Aldrich) and 1% bovine serum albumin (Sigma-Aldrich) for 1 h at room temperature. Cells were incubated with primary antibodies (Pluripotency Antibody kit, Cell signaling, MA, USA) overnight at 4°C, washed 3 times with DPBS, and incubated with secondary antibodies (against mouse IgG, mouse IgM, or rabbit IgG; all from Cell signaling) for 1 h at room temperature in the dark. Nuclei were stained with Hoechst33342 (Invitrogen, CA, USA).
The amount and quality of total RNA were assessed using an Agilent 2100 Bioanalyzer RNA kit (Agilent Technologies, CA, USA). Total RNA was used to prepare mRNA sequencing libraries with the Illumina TruSeq Stranded mRNA Sample Preparation kit (Illumina, CA, USA) according to manufacturer’s instructions. We used the Agilent 2100 Bioanalyzer DNA kit (Agilent Technologies) to evaluate the quality and size of the libraries. All libraries were quantified with qPCR using the CFX96 Real Time System (Bio-Rad, CA, USA) and sequenced with a paired-end 75-bp run plus a single 8-bp index read run on a NextSeq 500 sequencer (Illumina).
Potential sequencing adapters and low-quality bases in the raw reads were trimmed by Skewer (23). The options –x AGATCGGAAGAGCACACGTCTGAACTCCAGTCA and –y AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT were used for the common sequence of the Illumina TruSeq adapters and the option -q 0 -l 25 -k 3 -r 0.1 -d 0.1 was used for trimming low-quality 5′ and 3′ ends of the raw reads. The trimmed high-quality reads were mapped to the human reference genome by STAR software (24). A strand-specific sequencing library was prepared according to the protocol in the Illumina’s strand-specific library preparation kit; the strand-specific library option, --library-type=fr-firststrand was used for mapping.
To convert the numbers of mapped reads into the gene expression values, the Cufflinks package (25) with the strand-specific library option, --library-type=fr-firststrand and other default options was used. The gene annotation of the human reference genome mm10 was accessed through the UCSC genomes browser (https://genome.ucsc.edu) and used in GTF format, and the expression values were calculated as fragments per kilobase of transcript per million fragments mapped. The Cuffdiff software in Cufflinks was used to analyze differentially expressed genes between the two selected biological conditions (26). To compare the expression profiles between samples, the normalized expression values of the selected few hundred differentially expressed genes were clustered by unsupervised in-house R scripts.
Overlap between the analyzed differentially expressed genes and functionally categorized genes, including biological processes of Gene ontology (GO) and KEGG pathways, was tested by using g:Profiler (http://compbio.mit.edu/cummeRbund/). Cytoscape was used to identify gene interaction network and to visualize networks constructed using selected candidate barrier genes and their target genes as nodes (27).
The values in results of qPCR analysis are represented as mean±SEM. All statistical analysis data were carried out using GraphPad Prism 5.0 software. Significance was analyzed using one-way ANOVA (analysis of variance) followed by Dunnett’s test (n=3 per each sample).
To test whether the retroviral vector system could efficiently deliver
We performed qPCR to compare the expression levels of endogenous pluripotency markers in BJ, MCF7, and MCF10A iPSC-like colonies at day 25. BJ-iPSCs were used as a positive control. The representative of pluripotency genes,
These data suggest that MCF7 and MCF10A show relatively poor induction of pluripotency due to the lack of expression the core pluripotency genes such as
Although previous studies described the reprogramming kinetics in human fibroblasts in detail, that of cancer cells remains unknown (29). To elucidate the gene expression patterns during reprogramming, we followed the time-course of gene expression by qPCR. We first checked the transduction efficiency by evaluating the induction level of transgenes by qPCR and found it to be similar in BJ, MCF7, and MCF10A at day 5 after transduction (Fig. 2A). These data indicate that our retroviral system efficiently delivers the reprogramming factors into all three types of cells. Next, we checked the endogenous expression levels of pluripotency genes,
Our data revealed that the endogenous
Unexpectedly, we could not induce pluripotency in 4F-MCF7 and 4F-MCF10A by using reprogramming factors. To reveal the mechanism that limits reprogramming in cancer cells, we performed RNA-seq analysis in BJ, MCF7, and MCF10A during reprogramming. To this end, we selected the genes, which are essential for the early stage of reprogramming (29). Among them, using heatmap analysis, we found 17 candidate barrier genes that are up-regulated in BJ D5, but not in MCF7 D5 or MCF10A D5 (Fig. 3A). Furthermore, we identified 12 candidate barrier genes that are down-regulated in BJ D5 but not in MCF7 D5 or MCF10A D5 by heatmap analysis (Fig. 3B).
These data suggest that a total of 29 candidate barrier genes (17 genes, which are up-regulated gene in BJ, but not in MCF7 and MCF10A and 12 genes, which are down-regulated genes in BJ, but not in MCF7 and MCF10A) might act as a roadblock to achieve pluripotency in an early reprogramming stage in cancer cells.
To gain a deeper insight into the mechanism of action of the candidate barrier genes, we constructed a genetic interaction network (GIN) based on their target genes as well as candidate barrier genes. First of all, we tried to identify the target genes of 29 candidate barrier genes we found. Based on the RNA-seq data, we identified 40 target genes using Cytoscape software and constructed a GIN for investigating interaction between candidate barrier genes and their target genes.
Among a total 40 putative target genes identified, 20 genes (
Overall, these results suggest that the complex connection of barrier genes to their target genes could be the major obstacle for reprogramming.
The iPSC technology is a powerful tool for transformation of the differentiated cells into the pluripotent cells (30). Many reports have been demonstrated that iPSCs can be generated by ectopic expression of the Yamanaka’s cocktail from various cell types such as human fibroblasts (5, 6), blood cells (31), keratinocytes (32), and melanocytes (33). Although previous studies attempted to generate iPSCs from breast cancer cell, MCF7 and MCF10A (15, 17), the reprogrammed cells were not fully characterized; they had unusual morphology, lack of pluripotency markers, and limited differentiation potential
To investigate transcriptome profile changes during reprogramming, we performed RNA-seq analysis in BJ, MCF7, and MCF10A at day 5 after transduction. By comparing their transcriptomes, we screened for candidate barrier genes involved in the early stage of reprogramming. We found 29 candidate barrier genes and categorized them into two groups: 17 candidate barrier genes that are not up-regulated in MCF7 and MCF10A but not in BJ at day 5. In addition, 12 candidate barrier genes, which are not down-regulated in MCF7 and MCF10A but not in BJ at day 5.
For a deeper understanding of the initial reprogramming process, we sought to identify the target genes of the 29 candidate barrier genes using Cytoscape, and found 40 target genes. There was a complex connection between the candidate barrier genes and target genes, and this complex connection may be strongly affected cancer cells as an important obstacle in the early stage of reprogramming. Our GO analysis of the 40 genes revealed significant enrichment of the co-expression, physical interaction, co-localization, genetic interactions, and shared protein domain. These complicated signaling pathways might be critical for cell survival and maintenance of cancer properties.
Our RNA-seq data (Supplementary Table S2 and S3) suggested that, among the target genes,
Previous studies demonstrated that
The potential of iPSC technology in cancer research has an infinite possibility. However, it has been begun to be explored since few years ago. Our
This work was supported by grants from the National Research Foundation of Korea (NRF) funded by the Korea government (MSIT) [grant number 2018R1A2B6001072] and by the Technology Innovation Program [grant number 10063301] funded by the Ministry of Trade, Industry & Energy (MOTIE, Korea).
The authors have no conflicting financial interest.
Supplementary data including three tables and one figure can be found with this article online at http://pdf.med-rang.co.kr/paper/pdf/IJSC/IJSC-12-s19067.pdf.
Generation of iPSCs from BJ, MCF7, and MCF10A cells. (A) Procedure used for generation of iPSCs. (B) The morphology of iPSC-like colonies formed from BJ cultured on MEFs, but no iPSC-like colonies were formed from MCF7 and MCF10A (
Analysis of exogenous expression of reprogramming factors (
Transcriptome profiling analysis by RNA-seq. (A) Heatmap representing the expression patterns of 17 candidate barrier genes up-regulated in BJ D5 but not in MCF7 D5 and MCF10A D5. (B) Heatmap representing the expression patterns of 12 candidate barrier genes down-regulated in BJ D5 but not in MCF7 D5 and MCF10A D5. The heatmaps show the log10 scale fold changes (FPKM values). Red and green colors represent higher and lower gene expression levels, respectively.
Genetic interaction network (GIN) between candidate barrier genes and target genes during reprogramming. (A) Construction of GIN based on up-regulated candidate barrier genes, target genes, and core pluripotency genes. (B) Construction of GIN down-regulated candidate barrier genes, target genes, and core pluripotency genes. Black circles, candidate barrier genes; gray circles, target genes.
