- Open Access
Metabolic clusters of breast cancer in relation to gene- and protein expression subtypes
Cancer & Metabolism volume 4, Article number: 12 (2016)
The heterogeneous biology of breast cancer leads to high diversity in prognosis and response to treatment, even for patients with similar clinical diagnosis, histology, and stage of disease. Identifying mechanisms contributing to this heterogeneity may reveal new cancer targets or clinically relevant subgroups for treatment stratification. In this study, we have merged metabolite, protein, and gene expression data from breast cancer patients to examine the heterogeneity at a molecular level.
The study included primary tumor samples from 228 non-treated breast cancer patients. High-resolution magic-angle spinning magnetic resonance spectroscopy (HR MAS MRS) was performed to extract the tumors metabolic profiles further used for hierarchical cluster analysis resulting in three significantly different metabolic clusters (Mc1, Mc2, and Mc3). The clusters were further combined with gene and protein expression data.
Our result revealed distinct differences in the metabolic profile of the three metabolic clusters. Among the most interesting differences, Mc1 had the highest levels of glycerophosphocholine (GPC) and phosphocholine (PCho), Mc2 had the highest levels of glucose, and Mc3 had the highest levels of lactate and alanine. Integrated pathway analysis of metabolite and gene expression data uncovered differences in glycolysis/gluconeogenesis and glycerophospholipid metabolism between the clusters. All three clusters had significant differences in the distribution of protein subtypes classified by the expression of breast cancer-related proteins. Genes related to collagens and extracellular matrix were downregulated in Mc1 and consequently upregulated in Mc2 and Mc3, underpinning the differences in protein subtypes within the metabolic clusters. Genetic subtypes were evenly distributed among the three metabolic clusters and could therefore contribute to additional explanation of breast cancer heterogeneity.
Three naturally occurring metabolic clusters of breast cancer were detected among primary tumors from non-treated breast cancer patients. The clusters expressed differences in breast cancer-related protein as well as genes related to extracellular matrix and metabolic pathways known to be aberrant in cancer. Analyses of metabolic activity combined with gene and protein expression provide new information about the heterogeneity of breast tumors and, importantly, the metabolic differences infer that the clusters may be susceptible to different metabolically targeted drugs.
Breast cancer accounts for 25 % of newly diagnosed cancers and 15 % of cancer deaths among women worldwide . It is a heterogeneous disease  with high diversity in prognosis and response to treatment. Identification of underlying mechanisms contributing to this heterogeneity may reveal new cancer targets and clinically relevant subgroups and has thus been the focus of many recent studies [3–5].
Searching for genetic features causing the variation in breast cancers, Perou et al. used gene expression analyses followed by hierarchical clustering and defined naturally occurring molecular subtypes [4, 6]. These subtypes are named basal-like, luminal A, luminal B, Erb-B2+ (Her2 enriched), and normal-like, and are found to be associated with tumor characteristics and clinical outcome; patients with basal-like tumors having the shortest and luminal A the longest relapse-free survival . A centroid-based method called prediction analysis of microarrays 50 (PAM50), which uses the expression of 50 genes to classify breast cancer into these five intrinsic subtypes was later established and is now broadly implemented .
Proteins are the ultimate cellular effectors of pathways and networks within cells, tissues, and organisms. Although protein levels are dependent on mRNA expression, not all mRNA will be translated into protein and further protein levels are also influenced by protein stability. In a study by Myhre et al. only 22 of 52 quantified breast cancer-related proteins were found to correlate with mRNA expression levels  and similar low levels of correlation have been seen in large scale studies [9, 10]. Protein expression subtypes of breast cancer could give further understanding of underlying mechanisms causing heterogeneity . Based on the expression of 171 breast cancer-associated proteins detected by reverse phase protein array (RPPA), six breast cancer subtypes, called RPPA subtypes, have been defined . Four of these subgroups were in high accordance with the gene expression profiles of the PAM50 subtypes and named accordingly; Basal, Her2, luminal A, and luminal A/B. In addition, two new subgroups were defined; reactive I and reactive II, based on expression of proteins possibly produced by the surrounding microenvironment.
The chemical processes controlled by proteins involve metabolites as intermediates or end-products. In metabolomics, metabolite levels are measured to gather the final downstream information of ongoing cellular processes. Which processes are active at a specific time point is strongly influenced by environmental factors like diet and drugs as well as disease state. Well-established metabolic differences have been observed when comparing cancer cells to normal cells. Cancer cell energy production frequently depends on increased glycolysis and production of lactate from glucose regardless of access to oxygen, in contrast to normal cells which produce pyruvate and lactate in aerobic conditions . Also, to produce macromolecules/biomass, mitochondrial metabolism is reprogrammed . Altered metabolism has therefore been included as one of the emerging hallmarks of cancer . In breast cancer, metabolic differences between cancer tissue and normal adjacent tissue have been studied by the magnetic resonance spectroscopy (MRS) method high-resolution magic-angle spinning (HRMAS) MRS . Using this technique, metabolic profiles and biomarkers predicting long-term survival for locally advanced breast cancer , node involvement of patients with infiltrating ductal carcinoma , and 5-year survival for ER positive patients  have been identified.
Merging transcriptomics and metabolomics led to the discovery of three luminal A subgroups with distinct metabolic profiles and significant differences within gene set expression in a study by Borgan et al. . The aim of the current study was to establish clusters of breast cancer based on the metabolic expression using an approach similar to Borgan et al., but in a larger cohort of patients including all PAM50 subgroups. This approach reveals the main metabolic differences between untreated breast tumors. In addition, the combination of the metabolic clusters with transcriptomics and protein expression data provide an opportunity for information gain from each -omics technology, giving further characterization of the defined metabolic clusters.
Patients and tissue samples
Primary breast carcinoma samples from 228 patients at the Oslo University Hospital (Radium Hospital and Ullevål Hospital) were collected in the time period 2006–2009 as part of the Oslo2 study. The samples were fresh frozen after surgery and stored at −80 °C. The tumors were divided into smaller pieces depending on their size, and one of them was selected for this study. The samples were cut into three sections where the edges of the two outer pieces were used for histological evaluation (including estrogen receptor (ER) status and tumor cell percentage), and an adequate part of the mid pieces were used for HR MAS MRS experiments to obtain metabolic profiles. The remnants of all three pieces were pooled and cut into smaller pieces with scalpel, depending on the size of the tumor and divided into fractions used for extraction of DNA, RNA, and protein. Due to high lipid content, HR MAS MRS was performed on a second piece from the same tumor for 13 of the samples. A total of 228 samples were analyzed by MR spectroscopy, of which 201 and 217 were analyzed for gene expression by arrays and protein expression using RPPA, respectively, leaving a total of 191 samples analyzed by all three methods. Patient and tumor characteristics are shown in Table 1.
HR MAS MRS spectra
HR MAS MRS spectra were acquired from tissue samples (mean sample weight 7.3 mg ± 2.6 mg) on a Bruker Avance III 600 MHz/54 mm US (Bruker, Biospin GmbH, Germany) equipped with a 1H/13C MAS probe with gradient aligned with the magic angle (Bruker, Biospin GmbH, Germany). Spin-echo spectra were recorded using a Carr-Purcell-Meiboom-Gill (cpmg) pulse sequence (cpmgpr1d; Bruker). For experimental details and information about data processing, see Additional file 1.
Forty-three samples were excluded from the original sample cohort of 271 samples due to large lipid content. The spectral region between 1.40 and 4.70 ppm was chosen for further analysis excluding lipid peaks at 4.36–4.27, 2.88–2.70, 2.30–2.20, 2.09–1.93, and 1.67–1.50 ppm. After removal of the lipid residuals, the spectra were mean normalized by dividing each spectral variable to the average spectral intensity. This is done to account for differences in tumor cell percentage and sample weight, as it can be assumed that most of the lipid signals from breast samples do not originate from cancer cells.
Protein experiments and protein expression subtyping
Protein levels were determined using reverse phase protein array (RPPA), a platform where single protein levels can be measured across a series of samples simultaneously . One hundred fifty primary antibodies were used to detect breast cancer-related proteins (Additional file 2: Table S1). For analytical details, see Additional file 1.
The samples underwent consensus clustering with an option for four or five groups. The best fit on consensus clustering identified five groups, luminal, HER2, basal, and reactive I and II subsets as defined in The Cancer Genome Atlas Network data set .
mRNA expression profiling and gene expression subtyping
Total RNA was isolated with TRIzol (Invitrogen, Carlsbad, CA, USA). Expression of mRNA was measured using SurePrint G3 Human GE 8x60K (Aglient Technologies) according to the manufactory’s protocol (one-color microarray-based gene expression analysis, low input Quick Amp Labeling, v.6.5, May 2010) and 100 ng RNA was used as input for labeling. Arrays were log2-transformed, quantile normalized, and hospital adjusted . Values corresponding to probes with identical Entrez ID were averaged to form a single expression value per gene.
Subgrouping with cluster analysis of metabolic data
Hierarchical cluster analysis (HCA) was performed with Euclidean distance as the distance parameter and Ward’s method (furthest inner square distance) as the clustering distance (Statistical toolbox, Matlab R2013b, The Mathworks, Inc., USA) on the preprocessed metabolic spectra. Similar spectra based on the distance measures cluster together. The dendrogram was cut to give three clusters. To evaluate the robustness of the three HCA clusters, partial least square discriminant analysis (PLS-DA) model, using the cluster group for classification was carried out and classification accuracy was evaluated. For details, see Additional file 1.
Analysis of metabolic profiles
Metabolite assignments were performed based on literature values . Relative metabolite quantification was performed by peak integration of fixed regions corresponding to the metabolite of interest. In total, the level of 18 metabolites were calculated. Kruskal-Wallis test was performed to compare metabolite levels between clusters. Calculated p values were corrected for multiple testing by The Benjamini Hochberg false discovery rate (FDR) in Matlab, and the differences were considered statistically significant for adjusted p ≤ 0.05.
Analysis of subtype and clinical distributions
Differences in the distributions of RPPA and PAM50 subtype as well as that of other clinical characteristics of the tumors between the different metabolic clusters were tested for significance using Fisher’s exact test for count data (R 2.15.2). Calculated p values were corrected for multiple testing by The Benjamini Hochberg FDR, and the differences were considered statistically significant for adjusted p < 0.05.
Analysis of gene expression data
Significance analysis of microarrays (SAM) was used to identify differentially expressed gene between the metabolic clusters . SAM analysis was performed using 21851 genes from 42405 mRNA probes. The expression analysis was performed in R 2.15.2  with the cluster group as the dependent variable and a total of 100 permutations. T statistics/Wilcoxon statistics were calculated using multiclass comparisons and two-class unpaired tests while comparing two clusters. The differences were considered statistically significant for adjusted p < 0.01.
DAVID, an online network analysis tool , was used to search for biological functions within gene sets. DAVID was performed on the gene list over for each of the class comparisons produced by SAM. Official gene symbol was selected as gene identifier. The functional annotation clustering report of this software reports similar annotations together, where the member of a cluster have similar biological meaning due to sharing of similar gene members.
Gene set enrichment analysis (GSEA) was used to identifying sets of genes that were enriched in the metabolic clusters [27, 28]. During each cluster comparison, genes were ranked depending on calculated absolute signal-to-noise ratio (Eq. 1), where μ and σ are the mean and standard deviation, respectively.
High absolute signal-to-noise ratio will represent genes that are more likely to be “class markers” in the comparison because of high difference in expression.
The gene set C5 (gene ontology (GO) gene sets) available from the Molecular Signatures Database (MSigDB)  from The Broad Institute was chosen for evaluation of enrichment. One thousand four (of 1454) gene sets from this data base passed the filtering of lacking any gene from the expression data followed by minimum and maximum size of 15 and 500 genes, respectively. For each comparison, 1000 permutations on phenotypes were performed and FDR cutoff was set to 25 % (recommended in the manual).
Integrated pathway analysis
To combine transcriptomics and metabolic data the “Integrated pathway analysis” tool in MetaboAnalyst 3.0 software was used . Genes with adjusted p < 0.05 from SAM analysis and metabolites differently expressed between the clusters were used as input. Pathways with p values ≤0.05 were interpreted as significant.
Three main metabolic clusters of breast cancer
From the spectral data of 228 breast tumors, hierarchical clustering gave a dendrogram divided in three metabolic clusters (Mc) (Fig. 1a) Mc1, Mc2, and Mc3. The mean spectra of the clusters are illustrated in Fig. 1b.
The prediction of the metabolic clusters by PLS-DA resulted in a model with two valid latent variables LVs (Fig. 2a). The clusters Mc1 and Mc2 were well separated in the score plot of LV1 and LV2, while most Mc3 samples had low values of LV2. Classification accuracy was found to be 91.1, 88.7, and 69.9 %, respectively, for the three clusters. Permutation testing showed that all three clusters had significantly different metabolic profiles (p < 0.001). The regression vectors for each of the clusters (Fig. 2b) indicate each metabolite’s influence on the cluster prediction. The regression vector for Mc1 showed that high levels of glycerophosphocholine (GPC) and phosphocholine (PCho) and low levels of lactate (Lac), taurine (Tau), and alanine (Ala) were important for the class prediction result. For Mc2, high levels of β-glucose (β-Glc) were important as well as low levels of Lac, creatine (Cr), glycine (Gly), Tau, GPC, PCho, and Ala. Mc3 had a regression profile with low β-Glc, GPC, and PCho levels, and high Lac, Gly, Tau, Cr, and Ala levels. Univariate comparison of metabolite levels (Additional file 2: Table S2) between the three clusters revealed that 15 out of 18 metabolites analyzed were found to be significantly different (adjusted p < 0.05) between at least two of the clusters (Table 2). A combination of metabolic cluster labels and heatmap of metabolite fold change further illustrate this (Fig. 3).
Clinical parameters (tumor size, histology, grade, node status, hormone receptor status) were analyzed for differences in distribution among the metabolic clusters. Only histology was found to be significantly different between the clusters (adjusted p = 0.0144), where 11 of 21 lobular tumors and all ductal carcinoma in situ (DCIS) (n = 4) were classified as Mc2 (Table 1).
Protein expression subtype (RPPA) distribution differs between the three metabolic clusters
The metabolic clusters were investigated for differences in distribution of PAM50 and RPPA subtypes. While PAM50 subtypes did not show increased frequency of occurrence in any of the metabolic clusters, (Fig. 3c, adjusted p = 0.138), RPPA distribution was significantly different (Fig. 3d, adjusted p = 1.43E-04) with only 9 % of the RPPA reactive I and II samples being classified as Mc1, and 44 % of Mc2 samples subtyped as reactive I. The complete distribution of PAM50 and RPPA subtypes is listed in Table 3.
SAM reveals only one metabolic cluster to have differences in gene expression
SAM was performed to identify expression differences between the metabolic clusters. Of the 21,851 genes, multiclass SAM showed that 696 were differently expressed between the metabolic clusters with adjusted p < 0.01 (Fig. 3e, Additional file 2: Table S3). Further investigation through two-class SAM revealed that Mc2 and Mc3 did not have significant differences in mRNA expression, while they had 413 and 617 genes upregulated, respectively, compared to Mc1 (Additional file 2: Tables S4 and S5, respectively). Out of these, 277 genes were found in both comparisons and upregulated compared to Mc1. DAVID software was used to investigate the biological interactions between genes that were found to be significantly differentially expressed between the metabolic clusters.
A total of 404 of the 413 significant genes from SAM between Mc1 and Mc2 were identified by DAVID. Functional Annotation Clustering resulted in 117 clusters (Top 10 in Additional file 2: Table S6), where the clusters with the highest enrichment scores were linked to signaling, extracellular region, and cell adhesion.
A total of 653 of the 671 significant genes from SAM between Mc1 and Mc3 were identified by DAVID. Functional Annotation Clustering resulted in 236 clusters (Top 10 in Additional file 2: Table S7), where the clusters with the highest enrichment scores were linked to extracellular matrix (ECM), cell adhesion, and basement membrane.
Enrichment analysis shows gene expression differences to be related to ECM activity
Since Mc1 was found to have a gene expression pattern different from both Mc2 and Mc3 and these two clusters lacked statistically significant gene expression differences, Mc1 was compared to Mc2 and Mc3 combined in GSEA. This resulted in 146 of the gene ontology gene sets altered in Mc1 compared to Mc2 and Mc3 (Additional file 2: Table S8). Gene sets with the highest significance were classified with functions within collagen, ECM, and integrin binding. None of the gene ontology sets were significantly different when comparing Mc2 to Mc1 combined with Mc3, but 44 gene sets were significantly enriched when comparing Mc2 to Mc1 alone, with gene ontology terms relevant to ECM dominating the result (Additional file 2: Table S9). Eleven gene sets were significantly altered between Mc3 and Mc1 combined with Mc2 (Additional file 2: Table S10) and also here ECM-related findings were reported. One hundred fourteen gene sets were significantly different between Mc1 and Mc3, while none were significant between Mc2 and Mc3 (results not shown).
Joint analysis of gene and metabolite expression shows differences in metabolic pathways
Integrated pathway analysis resulted in 12 significantly different metabolic pathways (p value <0.05) between Mc1 and Mc2 (Additional file 2: Table S11). The most significant pathway was “tyrosine metabolism’ with eight hits of genes and metabolites, but also “d-glutamine and d-glutamate metabolism,” “glycolysis/gluconeogenesis” (Fig. 4a), and “glycerophospholipid metabolism” (Fig. 4b) were among the significant pathways. Integrated pathway analysis resulted in four significantly different metabolic pathways (p value <0.05) between Mc1 and Mc3 (Additional file 2: Table S12). The most significant pathway was glycerophospholipid metabolism with nine hits, succeeded by d-glutamine and d-glutamate metabolism.
In the present work, metabolite, protein, and gene expression data from 228 breast tumors were combined to search for new insight into the heterogeneity of breast cancer. MR metabolite data was used to derive naturally occurring metabolic clusters, which were further combined with data from the proteomics and transcriptomics levels. We identified three significantly different metabolic clusters, Mc1, Mc2, and Mc3, with significant differences in gene expression and protein expression profiles, but not within PAM50 subgroups. The metabolic clusters could therefore contribute with additional information beyond the intrinsic gene sets for understanding breast cancer heterogeneity.
Of the three metabolic clusters, Mc1 was on a separate branch in the dendrogram indicating that the metabolic profile of this cluster was the most different. This cluster is defined by significantly higher levels of GPC and PCho, two choline-containing metabolites involved in the synthesis and degradation of phosphatidylcholine (PtdCho), a major component of cell membranes . Altered choline metabolism has been considered an emerging hallmark for malignant transformations and has been detected in several cancer types including breast cancer . PCho in particular has been suggested a biomarker of breast cancer . Both GPC and PCho are confirmed elevated in tumor tissue compared to adjacent non-involved tissue from breast cancer patients , and a higher GPC/PCho-ratio has been reported in ER negative tumors [34, 35]. The latter was also observed for our cohort (results not shown); however, there was no significant difference in ER status between the three metabolic clusters. Thus, the high level of GPC and PCho is not resulting from differences in the distribution of estrogen receptor (ER) status. Interestingly, integrated pathway analysis showed that glycerophospholipid metabolism was the most significant pathway, when comparing Mc1 to Mc2. This metabolic pathway had eight hits including the metabolites GPC and PCho and genes LCAT, LPCAT2, PPAP2A, PPAP2B, PLD1, and AGPAT4. Downregulation of the expression of these genes in Mc1 indicate a less active degradation of PtdCho causing an accumulation of GPC and PCho, thus explaining the higher levels of GPC and PCho in Mc1. Furthermore, LPCAT2 is involved in the reaction where the GPC precursor (acyl-GPC) is converted into PtdCho. Lower expression of this gene may explain why the GPC precursor is directed to the production of GPC instead of PtdCho. The same hits were obtained when Mc1 was compared to Mc3. In addition, PLA2G5, one of the enzymes degrading PtdCho to acyl-GPC, is downregulated in Mc1 compared to Mc3, further supporting that Mc1 has an altered PtdCho metabolism.
The levels of PCho and GPC were higher in Mc1 compared to the two other clusters, but no significant difference in the expression of choline kinase alpha (CHKA) could be detected in the SAM analysis. However, univariate analysis confirmed that CHKA expression was significantly higher in Mc1. This is in agreement with previous findings revealing a positive correlation between levels of PCho and GPC and expression of CHKA [34, 36].
For Mc1 compared to Mc2 through integrated pathway analysis, d-glutamine and d-glutamate metabolism has only two hits, but comes out as significant because of the small number of genes and metabolites within this pathway. Interestingly, the gene GLS which catalyzes the conversion of glutamine to glutamate is downregulated in Mc1, the cluster with lowest levels of glutamate. Glutamine metabolism is considered a therapeutic target as some cancer cells exhibit high uptake and addiction to this nonessential amino acid . Since there were no differences in glutamine levels of Mc1 and Mc2, less glutamate in Mc1 could indicate that more glutamine is directed towards other metabolic pathways necessary for proliferation, glutathione needed for reducing power or further that glutamate is rapidly metabolized in cells through the TCA cycle or other mechanisms.
The distribution of protein subtypes (RPPA) was significantly different between the metabolic clusters, whereas no significant differences in the distribution of PAM50 subtypes were found. Thus, the metabolic difference between Mc1, Mc2, and Mc3 is not a result of intrinsic subtypes and might therefore contain additional information for understanding breast cancer heterogeneity. Among the tumors clustered in Mc1, 12 % were classified as RPPA-reactive (either I or II) while 49 % were classified as RPPA-luminal. The reactive RPPA subtypes have a characteristic protein expression pattern probably produced by the microenvironment , indicating less microenvironmental activity within Mc1. Mc1 also had downregulation of several genes involved in processes within the ECM of the stroma compared to both Mc2 and Mc3. As ECM changes can drive cancer behavior , these genetic differences between Mc1 and Mc2 might be of prognostic relevance. In fact, differences in expression of ECM-related genes have been used to stratify breast carcinomas into four groups, where the subgroup ECM1 have the worst prognosis . ECM classification was not performed on this cohort. However, 34 of 43 genes that clustered with a tendency of being downregulated in ECM1 and ECM2 were also found to be downregulated in Mc1. In addition, only 5 of 46 genes reported to be downregulated in ECM2 compared to ECM1 were downregulated in Mc1 (results extracted from SAM analyses, Additional file 2: Table S6–S7). These results support the contention that Mc1 tumors have an ECM signature similar to the reported ECM2 tumors. ECM2 did not show significant difference in disease outcome compared to ECM3 and ECM4, but had better prognosis than ECM1 tumors .
Mc2 has a metabolic profile with significant higher glucose level and at the same time lower levels of most of the other metabolites compared to one or both of the remaining clusters. High glucose level could reflect lower glucose consumption, inferring a lower demand for energy within these tumors. Glycolysis/gluconeogenesis came out as a significant pathway when Mc1 was compared to Mc2 during integrated pathway analysis with two metabolite hits and five gene hits. For the most significant metabolite, glucose, the levels are higher in Mc2 compared to Mc1. Glucose is the main source of energy for mammalian cells, either through aerobic glycolysis (production of lactate even in the presence of oxygen) or tricarboxylic acid (TCA) cycle and oxidative phosphorylation. For normal proliferating cells and cancer cells, which both have an increased energy demand, a glycolytic switch is often observed (higher glycolytic rate) . The increased glycolysis is followed by fermentation of the pyruvate to lactate (Warburg effect), in contrast to the conversion of acetyl CoA through the TCA cycle that occurs in normal non-proliferating cells. Increased glucose consumption is commonly used in tumor detection by using a glucose analogue and positron emission tomography (PET)  and has shown to correlate with poor prognosis and tumor aggressiveness . However, not all breast cancers are detected by PET. Here, we expect lower sensitivity in detection of Mc2 tumors due to the possible difference in glycolytic rate. None of the genes with hits in glycolysis/gluconeogenesis for the comparison of Mc1 and Mc2 could directly explain the high glucose levels of Mc2 tumors, but altered expression of the genes indicates pyruvate being guided towards the TCA cycle rather than lactate production. Two of the alternative fates of pyruvate showed significantly higher levels (alanine) or levels approaching significance (lactate, adjusted p = 0.056), supporting a higher glycolytic rate in Mc1 and that the pyruvate produced is not directed to metabolism in the TCA cycle. The significantly lower acetate levels in Mc1 compared to Mc2 could be linked to ALDH1A3 and ALDH2 downregulation, since the enzymatic product of these genes catalyzes the reversible reaction where acetaldehyde is converted to acetate.
Both DAVID and GSEA showed that many of the genes found to be downregulated in Mc1 and consequently upregulated in Mc2 were related to ECM activity. Mc2 had the highest percentage of RPPA-reactive I with 44 % of Mc2 tumors classified as this protein subtype, also related to stromal changes. Together with the metabolic finding, this implies that Mc2 tumors have cancer cells with low proliferating rate and at the same time ongoing changes within the ECM of the stroma. Mc2 tumors also had a higher frequency of lobular and ductal carcinoma in situ, indicating metabolic differences between histological subtypes of breast cancer which should be further investigated.
Mc3 has the highest lactate levels of all three clusters and higher glycine level than Mc2. These metabolites have been related to poor prognosis in ER positive patients , and higher levels of glycine is also associated with poor prognosis in a study irrespective of ER status . Although the ER-positive patients are equally distributed among our reported metabolic clusters, Mc3 expressed higher levels of both of these metabolites compared to Mc2. Moestue et al. detected differences in the expression of genes involved in choline degradation that could explain higher glycine concentrations in the poor-prognosis basal-like breast cancer xenograft model compared to luminal-like . Five of the genes described by Moestue et al. were significantly upregulated in Mc3 compared to Mc1; AGPAT4, PPAP2B, PPAP2A, LCAT, and PLD1. Of these, LCAT and PLD1 are directly involved in choline metabolism. LCAT catalyze the conversion of PtdCho to acyl-GPC while PLD1 catalyzes the conversion of PtdCho to choline. Higher GPC levels, but no difference in choline levels in Mc3 compared to Mc1 indicates that a higher amount of GPC is converted to choline in Mc3, and further contributing to higher glycine levels through choline degradation.
Mc3 shares similarities with a previously reported metabolic subgroup of luminal A tumors with significantly lower levels of glucose, higher levels of alanine, and nearly significantly higher lactate levels . In Mc3, we also see a significant higher level of lactate. Since one of the main sources of alanine is pyruvate, which also is the source for lactate, it appears that Mc3 is a cluster with a switch in glycolytic activity.
The majority of Mc3 tumors were classified as RPPA-luminal, similar to Mc1. In contrast to Mc1, Mc3 had a higher percentage of RPPA–reactive II tumors, probably linked to changes in stromal content. Also, gene expression wise, this was observed by significantly different gene expressions linked to ECM activity and the gene expression profile of Mc3 was found similar to the previously reported ECM3 or ECM4 subtypes .
In this study, information flow between the transcriptomics, proteomics, and metabolomics levels is illustrated; at the transcriptomics level, only one of the metabolic clusters shows difference in gene expression compared to the two others, while at the proteomics level, there is difference between all three clusters. Combining these findings, Mc1 is expected to have the worst prognosis due to the distinct gene expression profile and the alterations in both glycerophospholipid metabolism and evidence of increased glycolytic rate. However, this has to be validated when 5-year follow-up of this cohort is available. The main metabolic characteristics, especially of Mc1 and Mc3, have been proposed as treatment targets that could improve the therapeutic effect . Cancer therapy targeting CHKA, the enzyme responsible for PCho production from choline, causes tumor growth arrest and apoptosis in preclinical models , while treatment targeting glycolytic enzymes in combination with chemotherapy has been shown to re-sensitize cancer cells that had become resistant to treatment . Metabolic classification as illustrated here could therefore be relevant for developing a more targeted treatment plan. Importantly, the prognostic value of the clusters should be evaluated once 5-year follow-up is available.
We have here identified three metabolic clusters of breast cancer, also characterized with differences at the proteomic and transcriptomic level. The metabolic clusters are not reflecting the intrinsic genetic subtypes and may give important additional information for understanding breast cancer heterogeneity. Gene enrichment analysis revealed diverse ECM characteristics among these clusters in accordance with RPPA-subtyping. The approach of combining information from several -omics levels in the same tumor shows promise in improving the understanding of breast cancer heterogeneity potentially leading to more patient specific treatment.
Ala, alanine; Cr, creatine; DCIS, ductal carcinoma in situ; ECM, extracellular matrix; ER, estrogen receptor; FDR, false discovery rate; Gly, glycine; GPC, glycerophosphocholine; GSEA, gene set enrichment analysis; HCA, hierarchical cluster analysis; HR MAS MRS, high resolution magic angle spinning magnetic resonance spectroscopy; Lac, lactate; Mc, metabolic cluster; MRS, magnetic resonance spectroscopy; MSigDB, molecular signatures database; PAM50, prediction analysis of microarrays 50; PCho, phosphocholine; PET, positron emission tomography; PLS-DA, partial least square discriminant analysis; PtdCho, phosphatidylcholine; RPPA, reverse phase protein array; SAM, significance analysis of microarrays; Tau, taurine; TCA, tricarboxylic acid; β-Glc, β-glucose
Ferlay J, Soerjomataram I, Ervik M, Dikshit R, Eser S, Mathers C, et al. GLOBOCAN 2012 v1.0: Cancer incidence and mortality worldwide: IARC CancerBase No. 11. 2013. http://globocan.iarc.fr. Accessed: 04 Jan 2015.
Polyak K. Heterogeneity in breast cancer. J Clin Invest. 2011;121(10):3786–8.
Wang Y, Crampin E. Biclustering reveals breast cancer tumour subgroups with common clinical features and improves prediction of disease recurrence. BMC Genomics. 2013;14(1):102.
Perou C, Sørlie T, Eisen M, van de Rijn M, Jeffrey S, Rees C, et al. Molecular portraits of human breast tumours. Nature. 2000;406(6797):747–52.
The Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490(7418):61–70.
Sørlie T, Perou C, Tibshirani R, Aas T, Geisler S, Johnsen H, et al. Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001;98(19):10869–74.
Parker J, Mullins M, Cheang M, Leung S, Voduc D, Vickery T, et al. Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol. 2009;27(8):1160–7.
Myhre S, Lingjærde O-C, Hennessy B, Aure M, Carey M, Alsner J, et al. Influence of DNA copy number and mRNA levels on the expression of breast cancer related proteins. Mol Oncol. 2013;7(3):704–18.
Akbani R, Ng P, Werner H, Shahmoradgoli M, Zhang F, Ju Z, et al. A pan-cancer proteomic perspective on The Cancer Genome Atlas. Nat Commun. 2014;5.
Zhang B, Wang J, Wang X, Zhu J, Liu Q, Shi Z, et al. Proteogenomic characterization of human colon and rectal cancer. Nature. 2014;513(7518):382–7.
Hennessy B, Lu Y, Gonzalez-Angulo A, Carey M, Myhre S, Ju Z, et al. A technical assessment of the utility of reverse phase protein arrays for the study of the functional proteome in non-microdissected human breast cancers. Clin Proteomics. 2010;6(4):129–51.
Gatenby R, Gillies R. Why do cancers have high aerobic glycolysis? Nat Rev Cancer. 2004;4(11):891–9.
Ward P, Thompson C. Metabolic reprogramming: a cancer hallmark even warburg did not anticipate. Cancer Cell. 2012;21(3):297–308.
Hanahan D, Weinberg R. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74.
Bathen T, Geurts B, Sitter B, Fjøsne H, Lundgren S, Buydens L, et al. Feasibility of MR metabolomics for immediate analysis of resection margins during breast cancer surgery. PLoS One. 2013;8(4):e61578.
Cao M, Sitter B, Bathen T, Bofin A, Lønning P, Lundgren S, et al. Predicting long-term survival and treatment response in breast cancer patients receiving neoadjuvant chemotherapy by MR metabolic profiling. NMR Biomed. 2012;25(2):369–78.
Sitter B, Lundgren S, Bathen T, Halgunset J, Fjosne H, Gribbestad I. Comparison of HR MAS MR spectroscopic profiles of breast cancer tissue with clinical parameters. NMR Biomed. 2006;19(1):30–40.
Giskeødegård G, Lundgren S, Sitter B, Fjøsne HE, Postma G, Buydens L, et al. Lactate and glycine—potential MR biomarkers of prognosis in estrogen receptor-positive breast cancers. NMR Biomed. 2012;25(11):1271–9.
Borgan E, Sitter B, Lingjærde O, Johnsen H, Lundgren S, Bathen T, et al. Merging transcriptomics and metabolomics-advances in breast cancer profiling. BMC Cancer. 2010;10(1):628.
Tibes R, Qiu Y, Lu Y, Hennessy B, Andreeff M, Mills G, et al. Reverse phase protein array: validation of a novel proteomic technology and utility for analysis of primary leukemia specimens and hematopoietic stem cells. Mol Cancer Ther. 2006;5(10):2512–21.
Aure M, Jernström S, Krohn M, Vollan H, Due E, Rødland E, et al. Integrated analysis reveals microRNA networks coordinately expressed with key proteins in breast cancer. Genome Med. 2015;7(1):1-17.
Nilsen G, Vollan H, Pladsen A, Borgan Ø, Liestøl K, Vitelli V, et al. Dissecting genome aberration patterns in tumors, Submitted. 2015.
Sitter B, Sonnewald U, Spraul M, Fjösne H, Gribbestad I. High-resolution magic angle spinning MRS of breast cancer tissue. NMR Biomed. 2002;15(5):327–37.
Tusher V, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A. 2001;98(9):5116–21.
R Core Team. R: a language and environment for statistical computing. Viena: R Foundation for Statistical Computing; 2012.
da Huang W, Sherman B, Lempicki R. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2008;4(1):44–57.
Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50.
Mootha V, Lindgren C, Eriksson K-F, Subramanian A, Sihag S, Lehar J, et al. PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–73.
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, and Mesirov J. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011; http://www.broadinstitute.org/gsea/msigdb/index.jsp. Accessed: 08 Oct 14.
Xia J, Mandal R, Sinelnikov I, Broadhurst D, Wishart D. MetaboAnalyst 2.0—a comprehensive server for metabolomic data analysis. Nucleic Acids Res. 2012;40(Web Server issue):W127–33.
Gibellini F, Smith T. The Kennedy pathway—de novo synthesis of phosphatidylethanolamine and phosphatidylcholine. IUBMB Life. 2010;62(6):414–28.
Glunde K, Bhujwalla Z, Ronen S. Choline metabolism in malignant transformation. Nat Rev Cancer. 2011;11(12):835–48.
Eliyahu G, Kreizman T, Degani H. Phosphocholine as a biomarker of breast cancer: molecular and biochemical studies. Int J Cancer. 2007;120(8):1721–30.
Grinde M, Skrbo N, Moestue S, Rødland E, Borgan E, Kristian A, et al. Interplay of choline metabolites and genes in patient-derived breast cancer xenografts. Breast Cancer Res. 2014;16:1-16.
Giskeødegård G, Grinde M, Sitter B, Axelson D, Lundgren S, Fjøsne H, et al. Multivariate modeling and prediction of breast cancer prognostic factors using MR metabolomics. J Proteome Res. 2010;9(2):972–9.
Cao MD, Döpkens M, Krishnamachary B, Vesuna F, Gadiya M, Lønning P, et al. Glycerophosphodiester phosphodiesterase domain containing 5 (GDPD5) expression correlates with malignant choline phospholipid metabolite profiles in human breast cancer. NMR Biomed. 2012;25(9):1033–42.
Wise D, Thompson C. Glutamine addiction: a new therapeutic target in cancer. Trends Biochem Sci. 2010;35(8):427–33.
Tlsty T, Hein P. Know thy neighbor: stromal cells can contribute oncogenic signals. Curr Opin Genet Dev. 2001;11(1):54–9.
Bergamaschi A, Tagliabue E, Sørlie T, Naume B, Triulzi T, Orlandi R, et al. Extracellular matrix signature identifies breast cancer subgroups with different clinical outcome. J Pathol. 2008;214(3):357–67.
Mankoff D, Eary J, Link J, Muzi M, Rajendran J, Spence A, et al. Tumor-specific positron emission tomography imaging in patients:[18F] fluorodeoxyglucose and beyond. Clin Cancer Res. 2007;13(12):3460–9.
Sitter B, Bathen T, Singstad T, Fjøsne H, Lundgren S, Halgunset J, et al. Quantification of metabolites in breast cancer patients with different clinical prognosis using HR MAS MR spectroscopy. NMR Biomed. 2010;23(4):424–31.
Moestue S, Borgan E, Huuse E, Lindholm E, Sitter B, Børresen-Dale A-L, et al. Distinct choline metabolic profiles are associated with differences in gene expression for basal-like and luminal-like breast cancer xenograft models. BMC Cancer. 2010;10(1):433.
Zhao Y, Butler E, Tan M. Targeting cellular metabolism to improve cancer therapeutics. Cell Death Dis. 2013;4(3):e532.
Glunde K, Jiang L, Moestue S, Gribbestad I. MRS and MRSI guidance in molecular medicine: targeting and monitoring of choline and glucose metabolism in cancer. NMR Biomed. 2011;24(6):673–90.
The HR MAS MRS analysis was performed at the MR Core Facility, Norwegian University of Science and Technology (NTNU). MR core facility is funded by the Faculty of Medicine at NTNU and Central Norway Regional Health Authority.
The Oslo Breast Cancer Research Consortium (OSBREAC).
Vessela N Kristensen1,2,3, Torill Sauer4,5, Elin Borgen6, Olav Engebråten7,8,9, Øystein Fodstad7,9, Rolf Kåresen9,10, Bjørn Naume2,8, Gunhild Mari Mælandsmo2,7,11, Hege G Russnes1,2,12, Therese Sørlie1,2, Helle Kristine Skjerven13, Britt Fritzman14.
1Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway. 2K.G. Jebsen Centre for Breast Cancer Research, Institute for Clinical Medicine, University of Oslo, Oslo, Norway. 3Department of Clinical Molecular Biology and Laboratory Science (EpiGen), Division of Medicine, Akershus University Hospital, Lørenskog, Norway. 4Department of Pathology, Akershus University Hospital, Lørenskog, Norway. 5Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway. 6Department of Pathology, Division of Diagnostics and Intervention, Oslo University Hospital, Oslo, Norway. 7Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway. 8Department of Oncology, Division of Surgery and Cancer and Transplantation Medicine, Oslo University Hospital, Oslo, Norway. 9Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway. 10Department of Breast- and Endocrine Surgery, Division of Surgery, Cancer and Transplantation, Oslo University Hospital, Oslo, Norway. 11Department of Pharmacy, Faculty of Health Sciences, University of Tromsø, Tromsø, Norway. 12Department of Pathology, Oslo University Hospital, Oslo, Norway. 13Breast and Endocrine Surgery, Department of Breast and Endocrine Surgery, Vestre Viken Hospital, Drammen, Norway. 14Østfold Hospital, Østfold, Norway.
Email: Vessela N Kristensen email@example.com, Torill Sauer Torill.firstname.lastname@example.org, Elin Borgen email@example.com, Olav Engebråten Olav.firstname.lastname@example.org, Øystein Fodstad Oystein.Fodstad@rr-research.no, Rolf Kåresen email@example.com, Bjørn Naume firstname.lastname@example.org, Gunhild Mari Mælandsmo Gunhild.Mari.Malandsmo@rr-research.no, Hege G Russnes Hege.email@example.com, Therese Sørlie firstname.lastname@example.org, Helle Kristine Skjerven Helle.email@example.com, Britt Fritzman Britt.Fritzman@so-hf.no
This work was supported by
(1) K. G. Jebsen Center for Breast Cancer Research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript
(2) Cancer Society of Norway
(3) South-Eastern Norway Regional Health Authority (Project no 2011042, OSBREAC: Towards personalized therapy for breast cancer)
(4) The Research Council of Norway, Imaging the breast cancer metabolome, Project no 221879
(5) Liaison Committee between the Central Norway Regional Health Authority and NTNU Project no 46056615
(6) NCI grants P01CA0099031 and P30 CA016672 to GBM
Availability of data and materials
The datasets supporting the conclusions of this article are included within the Additional files and included in the article; “Integrated analysis reveals microRNA networks coordinately expressed with key proteins in breast cancer” Aure MR, Jernström S, Krohn M, Vollan HK, Due EU, Rødland E, Kåresen R; Oslo Breast Cancer Research Consortium (OSBREAC), Ram P, Lu Y, Mills GB, Sahlberg KK, Børresen-Dale AL, Lingjærde OC, Kristensen VN. Genome Med. 2015 Feb 2; where mRNA expression data have been submitted to the Gene Expression Omnibus (GEO) database as a SuperSeries record with accession number GSE58215.
THH, LRE, GFG, OLC, ES, ØG, OSBREAC, KKS, ALBD, and TFB participated in the design of the study. KKS, ALBD, and TFB conceived the study. THH, LRE, GFG, KKS, ALBD, and TFB interpreted the data. SL and THH performed the HR MAS MRS acquisition. THH performed the statistical analysis and drafted the manuscript. MK, SJ, MRA, OCL, ES, ØG, EUD, OSBREAC, GBM, KKS, ALBD, and TFB participated in acquisition of the data. All authors have read and helped to revise the manuscript. The final manuscript is approved by all the authors.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
The study is approved by the Norwegian Regional Committee for Medical Research Ethics (Biobank approval 1.2006.1607), and all patients have given written consent for the use of material for research purposes.
Additional methods and references. Information of additional methods. (DOCX 22 kb)
Antibodies used for reverse phase protein array (RPPA). Table S2. Metabolite integrals for each of the samples included in the study. Table S3. Significantly different expressed genes between the three metabolic clusters. Table S4. Significantly different expressed genes between the metabolic clusters Mc1 and Mc2. Table S5. Significantly different expressed genes between the metabolic clusters Mc1 and Mc3. Table 6. Statistically over-represented annotation terms, according to DAVID, of differently expressed genes between metabolic cluster Mc1 and Mc2. Table S7. Statistically over-represented annotation terms, according to DAVID, of differently expressed genes between metabolic cluster Mc1 and Mc3. Table S8. Gene set enrichment analysis (GSEA) result for gene ontology (GO) gene sets. Metabolic cluster Mc1 was compared with Mc2 and Mc3. Table S9. Gene set enrichment analysis (GSEA) result for gene ontology (GO) gene sets. Metabolic cluster Mc1 was compared with Mc2. Table S10. Gene set enrichment analysis (GSEA) result for gene ontology (GO) gene sets. Metabolic cluster Mc3 was compared with Mc1 and Mc2. Table S11. Integrated pathway analysis result from the comparison of the metabolic clusters Mc1 compared to Mc2. Table S12. Integrated pathway analysis result from the comparison of the metabolic clusters Mc1 compared to Mc3. (XLSX 337 kb)
About this article
Cite this article
Haukaas, T.H., Euceda, L.R., Giskeødegård, G.F. et al. Metabolic clusters of breast cancer in relation to gene- and protein expression subtypes. Cancer Metab 4, 12 (2016). https://doi.org/10.1186/s40170-016-0152-x
- HR MAS MRS
- Breast cancer subgroups
- Metabolic cluster
- Extracellular matrix