Machine-learning facilitates selection of a novel diagnostic panel of metabolites for the detection of heart failure

Marcinkiewicz-Siemion, M.; Kaminski, M.; Ciborowski, M.; Ptaszynska-Kopczynska, K.; Szpakowicz, A.; Lisowska, A.; Jasiewicz, M.; Tarasiuk, E.; Kretowski, A.; Sobkowicz, B.; Kaminski, K. A.

doi:10.1038/s41598-019-56889-8

Download PDF

Article
Open access
Published: 10 January 2020

Machine-learning facilitates selection of a novel diagnostic panel of metabolites for the detection of heart failure

M. Marcinkiewicz-Siemion ORCID: orcid.org/0000-0002-6531-2705¹,
M. Kaminski¹,
M. Ciborowski ORCID: orcid.org/0000-0003-3729-0518²,
K. Ptaszynska-Kopczynska¹,
A. Szpakowicz¹,
A. Lisowska¹,
M. Jasiewicz¹,
E. Tarasiuk¹,
A. Kretowski²,
B. Sobkowicz¹ &
…
K. A. Kaminski ORCID: orcid.org/0000-0002-9465-2581^1,3

Scientific Reports volume 10, Article number: 130 (2020) Cite this article

2387 Accesses
9 Citations
4 Altmetric
Metrics details

Subjects

Heart failure

Abstract

The metabolic derangement is common in heart failure with reduced ejection fraction (HFrEF). The aim of the study was to check feasibility of the combined approach of untargeted metabolomics and machine learning to create a simple and potentially clinically useful diagnostic panel for HFrEF. The study included 67 chronic HFrEF patients (left ventricular ejection fraction-LVEF 24.3 ± 5.9%) and 39 controls without the disease. Fasting serum samples were fingerprinted by liquid chromatography-mass spectrometry. Feature selection based on random-forest models fitted to resampled data and followed by linear modelling, resulted in selection of eight metabolites (uric acid, two isomers of LPC 18:2, LPC 20:1, deoxycholic acid, docosahexaenoic acid and one unknown metabolite), demonstrating their predictive value in HFrEF. The accuracy of a model based on metabolites panel was comparable to BNP (0.85 vs 0.82), as verified on the test set. Selected metabolites correlated with clinical, echocardiographic and functional parameters. The combination of two innovative tools (metabolomics and machine-learning methods), both unrestrained by the gaps in the current knowledge, enables identification of a novel diagnostic panel. Its diagnostic value seems to be comparable to BNP. Large scale, multi-center studies using validated targeted methods are crucial to confirm clinical utility of proposed markers.

A novel preliminary metabolomic panel for IHD diagnostics and pathogenesis

Article Open access 01 February 2024

S. S. Markin, E. A. Ponomarenko, … S. A. Appolonova

Multinomial machine learning identifies independent biomarkers by integrated metabolic analysis of acute coronary syndrome

Article Open access 23 November 2023

Meijiao Fu, Ruhua He, … Jun He

Dynamic personalized risk prediction in chronic heart failure patients: a longitudinal, clinical investigation of 92 biomarkers (Bio-SHiFT study)

Article Open access 18 February 2022

Dominika Klimczak-Tomaniak, Marie de Bakker, … Isabella Kardys

Introduction

Despite significant improvement in pharmacological and invasive treatment of cardiovascular diseases (CVD) that led to an increase in life expectancy, the incidence of heart failure with reduced ejection fraction (HFrEF) continues to rise. As a result, HFrEF has become a major public health problem as it constitutes currently a leading cause of hospital admissions over the age of 65¹. In spite of rapidly expanding current knowledge, molecular basis of the HFrEF is still incompletely understood. Available biomarkers, though playing an important role in the HFrEF diagnosis, risk stratification and prognosis (eg.: B-type natriuretic peptide – BNP, N-terminal pro B-type natriuretic peptide - NTproBNP, suppression of tumorigenicity-2 - ST2 family, galectin-3 - Gal-3, troponin - Tn etc.), do not give a satisfactory answer about HFrEF pathophysiology. Moreover there are some significant limitations that may lower clinical utility of natriuretic peptides (NPs: BNP or NTproBNP) which are most established HF biomarkers in clinical practice (e.g. limitation in detection of early and asymptomatic stages of HFrEF, an increase in NPs concentration due to the several cardiac as well as non-cardiac conditions, NPs concentration rising with age, are higher in women, while lower in obese patients). None of currently available HF biomarker do provide additional therapeutic target possibilities nor enable biomarker guided therapy in chronic HFrEF². In fact, an assessment of a single, particular metabolite or protein cannot provide fully informative results. Each of the currently available HFrEF biomarkers being restricted to particular mechanism (e.g. NPs – pressure-volume overload, ST2 family – inflammation, Gal-3 – fibrosis and cardiac remodeling etc.) cannot reflect simultaneously all other mechanisms related to HFrEF pathophysiology. HFrEF being a multisystemic syndrome³ needs a holistic approach to study systemic changes occurring in the course of HFrEF development. Due to the aforementioned limitations there is still a need to search of newer HFrEF biomarkers with particular emphasis on a multibiomarker approach. Untargeted metabolomics analysis enables comprehensive characterization of low molecular weight metabolites reflecting the complete metabolic phenotype of the disease. Therefore the importance of this hypothesis free, complex metabolic evaluation has increased within the last years and metabolomics approach has been more frequently used in the search for new HFrEF biomarkers. However, the number of small-molecule metabolites detected by using untargeted metabolomics approach range from hundreds to thousands. As a result, a problem of massive amounts of data has appeared generating a need of specialized forms of data analysis. Studies that have been conducted so far were restricted mainly to classical regression-based models which constitute significant limitation especially in terms of pre-specification of a model structure (based on a theory and assumption), the number of variables included in the analysis and their interactions. To our knowledge, in none of the available HFrEF untargeted metabolomics studies machine-learning (ML) algorithms have been used to analyze metabolomics data. ML, by having different motivating philosophies (data-driven models) and by not being limited by current knowledge (no need for a pre-specification of a model structure), seems to be a powerful tool to improve diagnostic and prognostic processes in various diseases^4,5,6,7. Therefore, the aim of the study was to detect in peripheral blood possibly all low molecular weight metabolites which differentiate HFrEF patients from controls with further creation of the top performing diagnostic panel using ML algorithms.

Material and Methods

Study population

Study population, methodology for obtaining peripheral blood samples, clinical and metabolomics evaluation were as described previously⁸. In brief, 67 patients with chronic heart failure with reduced ejection fraction (HFrEF) and 39 age-, ischemic heart disease (IHD) occurrence- and body mass index (BMI)-matched controls were enrolled in the study. Patients’ (HFrEF and control group) inclusion to the study was conducted between 2012–2015 at Cardiology Department of the University Hospital in Bialystok, Poland. The investigation conforms with the principles outlined in the Declaration of Helsinki. The Bioethical Commission of the Medical University of Bialystok approved the research (R-I-002/67/2013). HFrEF study group consisted of ambulatory optimally treated patients (according to the 2012 European Society of Cardiology guidelines for the diagnosis and treatment of acute and chronic heart failure) with stable moderate chronic HFrEF (left ventricular ejection fraction – LVEF ≤ 35%) who did not have an episode of decompensation within the last month. All of the HFrEF patients had a minimum six-month history of the disease. Clinical, biochemical (BNP) and echocardiographic (LVEF ≥ 50%) assessment allowed to exclude HF from the control group which consisted of ambulatory treated patients with arterial hypertension, atrial fibrillation, ischemic heart disease and/or hypercholesterolemia. All of the study participants provided written informed consent. The exclusion criteria were the same for both groups: acute and chronic inflammatory diseases (rheumatoid arthritis and asthma), severe chronic obstructive pulmonary disease (forced expiratory volume in one second - FEV1 less than 50% of a predicted value), severe renal dysfunction (estimated glomerular filtration rate – eGFR < 30 ml/min/1.73 m²), diabetes mellitus, thyroid dysfunction requiring pharmacotherapy, a history of implantation of the cardiac resynchronization therapy device (CRT) or diagnosis of cancer in the past five years. Information about treatment, prior hospitalizations and concomitant diseases were gathered from the medical documentation. Clinical (body mass index - BMI, systolic and diastolic blood pressure – SBP/DBP, heart rate - HR, New York Heart Association - NYHA class), biochemical (complete blood count, iron level, parameters of renal function, urea, uric acid, fasting lipid profile, natriuretic peptide – BNP, C reactive protein – CRP), echocardiographic (left ventricular ejection fraction – LVEF, left ventricular end-diastolic diameter – LVEDd) and functional (six-minute walk test – 6MWT, cardiopulmonary exercise test – CPET with rest spirometry) assessment were carried out in all of the enrolled patients. Clinical examination, basic biochemical analyses, echocardiography and functional assessment were done at the day of inclusion to the study. Metabolomics assessment was performed in two batches in 2014 and 2015 after collecting complete sets of biological material within the HFrEF and control group. Each batch included both HFrEF and control group patients.

Clinical parameters

BMI was calculated as weight [kg]/height [m]². The modification of diet in renal disease (MDRD) formula GFR calculator was used to compute estimated glomerular filtration rate (eGFR). The incidence of ischaemic heart disease (IHD) was calculated based on medical documentation (HFrEF - invasively confirmed IHD as a cause of chronic HF; control group - previous diagnosis of IHD). For cardiopulmonary exercise test (CPET), a standardized continuous Ball State University/Bruce (BSU/Bruce) Ramp treadmill protocol was used with 20-second stages and continuous increase in workload per stage. Forty four (66%) patients with chronic heart failure carried out CPET. The decision about patient’s ability to perform the CPET was made after the 6MWT. Difficulties with walking due to non-cardiac causes (e.g. peripheral occlusive arterial disease, problems with lumbar spine), significant fatigue in the 6MWT, fear of exercise on a treadmill or mask intolerance were main reasons of not performing CPET.

Peak oxygen uptake (PVO2) is the rate of an oxygen consumption reflecting the difference between inspired and expired volume of oxygen. PVO2 is measured at peak exercise on a treadmill and expressed as milliliters of O₂ per kg per minute (mL/kg/min). PVO2 depends on the arteriovenous O₂ difference and cardiac output reserve. The slope of a minute ventilation to carbon dioxide output (VE/VCO2 slope) reflects the effectiveness of ventilation. An increase in VE/VCO2 slope in HFrEF patients is known as an indicator of poor outcome.

Metabolic fingerprinting by LC-QTOF-MS

In order to avoid variation due to circadian rhythm, the collection of peripheral venous blood samples was carried out in the morning (at the day of inclusion to the study) between 8.00 and 10.00 am after compulsory overnight fasting (at least 8 h). Samples were further centrifuged for 10 minutes (1300 x g, room temperature). The separated serum was stored in the Eppendorf tubes at −80 °C until further metabolomics analysis. All of the morning medications were taken as usual. Collected serum samples were further subjected to untargeted analysis by liquid chromatography - quadrupole time-of-flight - mass spectrometry (LC-QTOF-MS - model 6550, Agilent Technologies, Santa Clara, California, USA) system. The analytical process was controlled by the use of quality control (QC) samples⁹. As the LC-MS analyses were performed in two separate sets (1^st set: 36 HFrEF patients and 19 age-matched controls; 2^nd set: 31 HFrEF patients and 20 age-, gender- and concomitant disease-matched controls), a pool of equal volumes of serum from each of the 55 samples in derivation and 51 in validation sets were used to prepare the QCs. They were prepared independently following the same procedure as for the rest of the samples and injected at the beginning of the run and after every 6–7 real samples. Samples were analyzed by the HPLC system that consisted of a degasser, two binary pumps and thermostated auto sampler connected to a mass spectrometry detector using previously described method⁸. Samples from each set (derivation and validation) were analyzed in a randomized order in separate runs (first for positive and then for negative ion mode). Metabolomics analyses were conducted at Clinical Research Centre of Medical University of Bialystok, Poland.

Processing of LC-MS data was performed as described previously⁸. Briefly, the raw data collected by the analytical instrumentation was cleaned of background noise and unrelated ions by the Molecular Feature Extraction (MFE) tool in the Mass Hunter Qualitative Analysis Software (B.06.00, Agilent). Alignment, quality assurance⁹ and data filtering were performed using Mass Profiler Professional (MPP) 12.6.1 (Agilent).

Statistical analysis

Basic statistical analyses were performed as described previously⁸. In short, continuous variables are expressed as the mean ± standard deviation (SD) or median and interquartile range (IQR), depending on the type of distribution. Categorical variables are presented as raw values and percentages from the total. The Student’s t-test, Mann-Whitney U test or χ² test were used, as appropriate. Statistical significance was defined as p < 0.05.

As the data were obtained by subjecting samples to MS analysis in two separate batches, which imposed further manual matching, the variables were prefiltered to include 50% most variant metabolites of these present in at least 80% of samples. Following the filtering, manual matching and elimination of artifacts (signals present in blank analysis), 63 variables from both ESI ion modes have been subjected to computational modelling. Missing values were replaced by k-means nearest neighbour analysis according to the criteria of Armitage et al.¹⁰.

There were nine cases with missing BNP, which were omitted in models. No data imputation techniques were applied, since gathered data would not allow us to obtain a reliable approximation of unobserved BNP levels. Within each batch, the variables were transformed with Yeo-Johnson method and standardized by subtracting the mean and dividing by trimmed standard deviation. Cases were split into training and test set, with proportions of 3:2. Recursive feature elimination with repeated cross-validation (RFECV) based on random forests was performed on the training set to rank the predictors. Top 20 ranking metabolites were then subset and incrementally used as predictors of multiple GLMs fitted to training set, with their accuracy assessed on resampling. The top performing model, containing 8 predictors, was chosen and its validity was assessed on the test set. Additional multiple regression models, including HFrEF presence and another clinical covariate as predictors, one for each clinical covariate and metabolite combination, have been built to assess potential confounding on metabolite levels. Correlations between serum intensities of metabolites included in the panel and clinical parameters were performed calculating Pearson’s product-moment coefficients for pairs of normalized variables. The analysis was performed in R version 3.4.2.

Metabolites identification

Identification of significant metabolites was performed as described previously⁸. Identification of uric acid, deoxycholic acid and docosahexaenoic acid was confirmed by matching retention time, accurate mass and fragmentation pattern of authentic standards (Sigma Aldrich). Lysophospholipids were identified based on previously described fragmentation pattern⁹.

Equation 1

Model equation, for the logit-link binomial model, relating HFrEF occurrence (outcome) to metabolite panel (predictors). Predictors are 0-centered, Yeo-Johnson normalized intensities of metabolites.

$$p=\frac{1}{1+{{\rm{e}}}^{-y}}$$

y = 1.781 + −3.190 * [LPC 18:2sn2] + 0.851 * [UA] + 0.037 * [LPC 18:2sn1] + −0.035 * [UM] + 2.749 * [LPC 18:2sn2] + −1.099 * [LPC 20:1] + −0.673 * [DA] + −1.099 * [DHA].

Results

Clinical group characteristics

The basic clinical characteristics of study groups (HFrEF and controls) is shown in Table 1. There were no statistically significant differences between study groups in terms of age, blood pressure, BMI and IHD occurrence. The percent of women was higher within the control group (n = 11, 29% vs n = 7, 10%, respectively; p = 0.019). HFrEF group consisted of mildly/moderately symptomatic patients (NYHA class II – 43%; III − 57%) who had significantly impaired left ventricular ejection fraction (EF 24.3 ± 5.9%). Ischemic heart disease was an etiology of HFrEF in 57% of patients. HFrEF patients presented slower resting heart rate than controls (70.7 ± 9.9 vs 75.3 ± 12.5 beats per minute, p = 0.045). Among assessed laboratory parameters higher concentration of uric acid (6.8 mg/dL IQR 6.0–7.9 mg/dL vs 5.99 mg/dL IQR 5.1–7.0 mg/dL, p = 0.029) and lower total cholesterol (167.5 ± 42 mmo/L vs 197.2 ± 35 mg/dL, p = 0.001), HDL (46.5 ± 14.1 mmol/L vs 53.3 ± 14.3 mmol/L, p = 0.029), LDL (101.9 ± 32.7 mmol/L vs 127.0 ± 39.2 mmol/L, p = 0.001) levels were observed in HFrEF patients. Pharmacological treatment of HFrEF (ACEIs/ARBs, B-blockers, MRAs) was consistent with the current ESC recommendations. HFrEF patients were more frequently treated with the use of statins, ASA, ACEIs, B-blockers and MRAs in comparison with those without the disease. Whereas CCBs were used more often in the control group.

Table 1 Basic clinical characteristics of patients included in the study (chronic heart failure and controls without the disease).

Full size table

Selection of top ranked metabolites which created the diagnostic panel

Quality assurance protocol, manual matching and filtering of further artifacts were carried out on the data obtained from the two independent LC-QTOF-MS analyses. As a result, 63 variables have been subjected to computational modelling. Random forests nested inside RFECV algorithm lead to selection of 20 top ranked predictors, further sequentially used as covariates of GLMs, of which the most accurate final model was chosen, built on 8 metabolites (Fig. 1, Table 2). Further MS/MS spectrum analyses enabled to identify components of the model (uric acid, two isomers of lysophosphatidylcholine - LPC 18:2, LPC 20:1, deoxycholic acid, docosahexaenoic acid and one unknown metabolite), and build the final model equation (Equation 1). Apart from uric acid serum intensity of seven remaining metabolites was significantly lower in HFrEF (Table 3). Based on both accuracy profiling (Fig. 2), and ROC curve (Fig. 3), a prediction cut-off value equal to 0.5 was found to be optimally discriminate between controls and HFrEF cases. Our model compared to BNP as a sole predictor presents with better accuracy (0.85 vs 0.82; Fig. 1) and specificity (0.92 vs 0.83), but worse sensitivity (0.73 vs 0.80). Overall performance measured by AUC is insignificantly lower (0.85 vs 0.92, p = 0.29). To demonstrate the additive value of metabolite panel on top of BNP, we have built an additional model containing all these predictors (Supplementary Table 1). Formal testing confirmed better goodness-of-fit of full model than BNP alone (p = 0.001), while resampling profile indicates comparable accuracy (Fig. 4). Additional multiple linear regression models have been built, to investigate the potential effect of other clinically relevant covariates (such as ischaemic ethiology, statin or ACEI treatment) on metabolite levels (Supplementary Tables 2–4). Although in some cases these have shown some extent of potential confounding, they did not nullify the initial relationship with HFrEF.

Table 2 Estimates for parameters of the proposed model.

Full size table

Table 3 Mean serum intensities of top ranked metabolites included in the panel in chronic heart failure versus control group (without partitioning into training and test set).

Full size table

Serum intensities of all metabolites significantly correlated with left ventricular ejection fraction. Moreover, apart from UA and DHA, serum intensities of remaining metabolites moderately positively correlated with exercise duration on a treadmill, peak oxygen consumption, renal function and negatively correlated with VE/VCO2 slope. Serum intensities of lysophosphatidylcholines (LPCs) included in the metabolites’ panel correlated positively with LDL and total cholesterol serum level. Weak negative correlation was also observed between deoxycholic acid, LPC 18:2 sn1 and BNP (Table 4).

Table 4 Correlations between serum intensities of 8 metabolites included in the panel and clinical parameters (all patients - HFrEF and control group, n = 106).

Full size table

Discussion

As heart failure is not a single organ disease but a multisystemic syndrome, variety of systemic changes occur in the course of HFrEF development. Analysis of changes in blood metabolites profile seems to be an attractive approach to perform holistic assessment of complex adaptive responses and to discover valuable novel HFrEF biomarkers. An untargeted metabolomics analysis being unrestricted by current knowledge enables detection of possibly all low molecular weight metabolites present in a particular moment in the peripheral blood. That allows a simultaneous assessment of changes in many various metabolites, including even those that we have not been able to identify to date. Our study presents a completely new approach towards biomarkers – something to be expected in the era of personalized medicine when “one fits all” approach will be replaced by analysis of simultaneous changes of multiple substances reflecting different mechanisms. According to current research, collective biomarkers better reflect the complexity of changes in the organism and possibly will increase likelihood to define particular sub-phenotypes of diseases (in this case heart failure)^11,12,13. Previous research has already proven that metabolomics may constitute a powerful diagnostic and prognostic tool in chronic HF^14,15. However, a high diversity of analytical methods (e.g. nuclear magnetic resonance – NMR, mass spectrometry – MS), separation techniques (i.e. capillary electrophoresis or chromatography – gas or liquid) and approaches (untargeted, targeted) used to date to study metabolome as well as various studied populations are partly responsible for a diversity in the results obtained by different research groups. For instance, Cheng et al.¹⁴ have performed untargeted metabolomics analysis followed by targeted evaluation of obtained results and an identification of a combination of four metabolites (histidine, phenylalanine, spermidine, and phosphatidylcholine C34:4) that discriminated HF stage C from control group similarly to b-type natriuretic peptide (BNP). Authors have suggested that profile of metabolites (another panel identified by Cheng et al.) may provide even better prognostic value in comparison with conventional biomarkers such as BNP in HF patients. Mueller-Hennessen et al.¹⁶ have created a lipid diagnostic panel which improved HFrEF detection, even in mild and asymptomatic stages. Hunter et al.¹⁷ have indicated a group of circulating metabolites which were significantly elevated in HF in comparison with the control group. Moreover it allowed to differentiate HFrEF from heart failure with preserved ejection fraction (HFpEF). Different analytical approaches were used in all of those studies (Cheng et al. – untargeted followed by targeted analysis; Mueller-Hennessen et al. – metabolite profiling; Hunter et al. – targeted analysis). Apart from this, analyses carried out by Cheng et al.¹⁴ and Hunter et al.¹⁷ included healthy control group. As the metabolites’ profile is susceptible to many various external (e.g. pharmacotherapy) and internal (e.g. age, sex, comorbidities, renal function) factors, many differences in basic characteristic of HF and control group (e.g. age, BMI, comorbidities, renal function, pharmacotherapy) observed in those studies may serve as significant confounding factors. On the contrary to previous research, control group in our study was carefully matched in terms of age, BMI, eGFR, IHD occurrence. Despite this, differences in medication were noticed. Additional analyses showed the relation between medications and serum intensities of UA (ACEI treatment), LPC 18:2sn1 (ACEI, statin therapy) and HFrEF (Supplementary Tables 3, 4). However, due to small group size these analyses are biased by strong association between the treatment and the presence of heart failure. Previous metabolomics studies¹⁴ that used untargeted analysis to select metabolites and further create a novel diagnostic panel for HFrEF have been based on classical, hypothetical-driven statistical approaches. In fact, the amount of data gathered from an untargeted metabolomics analyses poses an analytical challenge resulting from a necessity of modelling a multidimensional space of relations between metabolite fingerprint and outcomes, and simplifying the results just enough to facilitate holistic, straightforward conclusions. An application of machine-learning techniques efficiently solves these tasks by incorporating multiple procedures consisting of data resampling and remodelling. In our case, this eventually allows to acquire a single, simple linear model, that reapplied to patients’ metabolic data yields a value, that might be interpreted as a ‘surrogate metabolite’, being a linear combination of set of selected metabolites, which might be used for discrimination of heart failure cases, after applying a single cut-off. To our knowledge, this is the first study that implemented untargeted metabolomics analysis combined with ML algorithms in order to select metabolites creating the top performing diagnostic model for HFrEF. Recently, Verdonschot et al.¹⁸ have shown that the NT pro-BNP-based determination of the dilated cardiomyopathy (DCM) severity might be complemented by the combination of metabolites. In contrast to our study, authors have applied targeted metabolomics approach which might limit objective analysis of changes occurring in the blood/urine metabolites’ profile. The use of non-fasting blood samples, significant differences in diabetes prevalence within the studied groups, lack of chronic inflammatory disease exclusion could potentially influence the disease metabolic phenotype. Nevertheless, in spite of these study limitations the result of the analysis suggest some clinical utility of the combination of metabolomics and ML methods. In our study an untargeted metabolomics analysis was followed by computational modelling which enabled hypothesis free selection of a panel of metabolites which seemed to have non-inferior diagnostic value (based on accuracy) to BNP in chronic HFrEF. Additionally, the results of the likelihood-ratio test performed for GLMs suggested that prediction of the outcome with model utilizing BNP and eight metabolites presents significantly higher goodness-of-fit when compared with BNP alone. When we based our conclusions on data resampling, our fit slightly outperformed the BNP alone, however for such a small set, this clearly remains inconclusive.

According to the previous research, all of the selected and identified metabolites have already been described in various cardiovascular diseases (CVD) including heart failure. However, apart from DHA (possible adjunctive therapy in optimally treated patients with symptomatic HFrEF) and uric acid (useful marker of adverse prognosis in HFrEF patients), clinical significance of changes in residual components of a panel in HFrEF remains unknown. In our study HFrEF patients presented significantly higher serum UA level in comparison with the control group. According to the literature, the incidence of hyperuricemia is high in HFrEF and occurs in 50–55% of patients. An elevation in uric acid level is considered to be a result of increased production via higher xanthine oxidase activity or decreased elimination via kidneys. In our study, both groups (controls and HFrEF) were fitted in terms of estimated glomerular filtration rate (eGFR) and patients with severe chronic kidney disease were excluded, reducing the role of UA excretory disorders. It is still unclear why UA itself has a negative impact on prognosis in HFrEF^19,20 as randomized trials with agents blocking this pathway failed to provide clinical benefit²¹. Hypothesis free selection of UA as a component of a top performing diagnostic panel confirms the need for further research and continuation of a discussion on xanthine oxidase metabolic pathway (including serum uric acid - sUA) role in HFrEF pathophysiology. Lipid disorders have also been recognized as a strong risk factor for CVD. Nevertheless, the vast majority of previous research regarding dyslipidemia in chronic HFrEF concerned solely alteration in cholesterol metabolism. As a result of a growing interest in the matter of metabolomics and lipidomics within the last years, it was possible to detect a substantial dysregulation in metabolism of other lipid fractions, including phospholipids^9,14,16. This suggests more complex lipid metabolism abnormalities in HFrEF. In this study, serum intensities of all LPCs, likewise cholesterol level, were significantly lower in the HFrEF group. Simultaneously, higher percentage of statin use was observed in HFrEF patients. According to the results of additional analysis, it seems that statin therapy was likely to partialy impact the association between LPCs (e.i. LPC 18:2sn1) and HFrEF. Therefore, the differences between both groups (HFrEF and controls) regarding statin therapy might be considered as a possible study limitation. Although, an assessment of each particular metabolite is a kind of simplification as panel compounds should be considered together as a “surrogate metabolite”. In our previous study it has been shown that the greater serum PLs deficit, the worse clinical condition of HF patient (including more severe metabolic dysregulation, impaired renal function and decreased exercise capacity)⁹. Despite the fact that an alteration in PLs metabolism is considered to be related to plethora of processes associated with HFrEF (e.g. immune response, impaired energy metabolism, altered choline metabolism with a possible role of gut microbiota), the exact metabolic mechanism responsible for changes in PLs in HFrEF remains unknown. The dysregulation in phospholipids including phosphatidylcholines (PCs) metabolism has already been observed in various non-related diseases^{9,22,23,24,25}. Lindahl et al.²⁶ suggested that alteration in LPCs concentration may be an indicative of disease in general rather than a disease specific metabolite marker. In HFrEF which is a multisystemic syndrome, changes observed in the LPCs serum intensities considered as a part of the whole metabolites’ profile seem not to be a limitation but an advantage. The presence of LPCs in the diagnostic panel and their correlations with serum cholesterol level points out an importance of dysregulation in various lipid classes in HFrEF. Bile acids (BAs), likewise LPCs and UA, have been described as factors implicated in various cardiac pathologies. Mayerhofer et al.²⁷ have demonstrated that the ratio of primary to secondary BAs has been reduced in chronic heart failure patients. Nevertheless, the association between this pattern of BAs composition and reduced overall survival has been seen solely in univariate analysis. In our study, serum intensity of one secondary BA (deoxycholic acid – DCA) was lower in HFrEF group. Deoxycholic acid is known as one of the two most common secondary bile acids that are synthesized solely by the microbial flora of a small intestine. An impairment in intestinal function and gut microbiome in HF has been intensively studied within the last years^28,29. Reduced intestinal blood flow with further bowel wall oedema has been thought to be responsible for altered intestinal barrier function leading to the passage of bacterial products into the systemic blood circulation. Another component of a diagnostic panel with known anti-inflammatory, anti-arrhythmic and beneficial effects on the endothelial function was docosahexaenoic acid (DHA) classified as omega-3 polyunsaturated fatty acid (PUFA)^30,31. Mozaffarian et al.³² have indicated that circulating omega-3 PUFAs are associated with lower incidence of chronic heart failure. In our study serum intensity of DHA was significantly lower than in the control group. Current European Society of Cardiology (ESC) guidelines for the diagnosis and treatment of acute and chronic heart failure has recommended that n-3 PUFA preparations containing >850 mg/g of eicosapentaenoic acid (EPA) and DHA may be considered as an adjunctive therapy in optimally treated patients with symptomatic HFrEF¹.

Endothelial dysfunction, oxidative stress, systemic inflammatory activation, impaired energy metabolism, altered choline metabolism, apoptosis, intestinal dysbiosis – all of those have been thought to be implicated in HFrEF pathophysiology. As cited above, every metabolite included in the diagnostic panel has already been described to be involved in an activation of aforementioned processes. Moreover their correlations with clinical, biochemical (BNP, renal function), echocardiographic (left ventricular systolic function) and functional parameters (the distance of 6MWT, duration of exercise on a treadmill, peak VO2, VE/VCO2 slope) confirm possible clinical significance of the diagnostic panel components. Therefore, based on the results of this and previous studies, metabolomics seems to be a powerful tool in HFrEF especially when combined with ML algorithms in the detection of the top performing HFrEF diagnostic panel. Presence of additional unidentified metabolite in the panel only confirms great possibilities that the combination of those two unrestricted methods offers in the aspect of broadening the knowledge on HFrEF pathophysiology. As compared to the strategy of a single metabolite, complementarity of a panel compounds gives an opportunity to get a more complete picture of the metabolic changes taking place in the course of HFrEF and as a result may increase accuracy of the diagnostic panel.

Study limitations

Despite the study was carefully planned we are aware that there are several potential limitations. First, number of enrolled patients is relatively small. Nevertheless, our purpose was to select the most homologous HFrEF group using strict exclusion criteria especially in terms of concomitant diseases. Second, there is no information about patients’ diet, PUFAs supplementation or HFrEF duration. Third, there are differences in pharmacotherapy between study and control group. Larger studies with prospectively followed-up groups are needed for clinical validation of the diagnostic panel.

Conclusions

In the present study we demonstrated that the combination of two innovative methods: an untargeted metabolomics and ML algorithms can be a promising tool for the diagnostic workup in HFrEF. The combination of metabolites may provide comparable diagnostic value to BNP. Due to the complementarity of the panel components, changes in the serum intensities of particular metabolites interpreted together as a “surrogate metabolite” might be more specific to the HFrEF. Large scale, multi-center studies using validated targeted methods are crucial to confirm clinical utility of proposed markers.

References

Ponikowski, P. et al. ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure. The task force for the diagnosis and treatment of acute and chronic heart failure of the European Society of Cardiology (ESC). Eur. Heart J. 37, 2129–2200, https://doi.org/10.1093/eurheartj/ehw128 (2016).
Article PubMed Google Scholar
Pfisterer, M. et al. BNP-Guided vs symptom-guided heart failure therapy. The trial of intensified vs standard medical therapy in elderly patients with congestive heart failure (TIME-CHF) randomized trial. JAMA. 301, 383–392, https://doi.org/10.1001/jama.2009.2 (2009).
Article CAS PubMed Google Scholar
Marcinkiewicz-Siemion, M., Ciborowski, M., Kretowski, A., Musial, W. J. & Kaminski, K. A. Metabolomics — a wide-open door to personalized treatment in chronic heart failure? Int. J. Cardiol. 219, 156–163, https://doi.org/10.1016/j.ijcard.2016.06.022 (2016).
Article CAS PubMed Google Scholar
Tabassian, M. et al. Diagnosis of heart failure with preserved ejection fraction: machine learning of spatiotemporal variations in left ventricular deformation. J. Am. Soc. Echocardiogr. 31, 1272–1284, https://doi.org/10.1016/j.echo.2018.07.013 (2018).
Article PubMed Google Scholar
Meyer, A. et al. Machine learning for real-time prediction of complications in critical care: a retrospective study. Lancet Respir. Med. 6, 905–914, https://doi.org/10.1016/S2213-2600(18)30300-X (2018).
Article PubMed Google Scholar
Tripoliti, E. E., Papadopoulos, T. G., Karanasiou, G. S., Naka, K. K. & Fotiadis, D. I. Heart failure: diagnosis, severity estimation and prediction of adverse events through machine learning techniques. Comput. Struct. Biotechnol. J. 15, 26–47, https://doi.org/10.1016/j.csbj.2016.11.001 (2017).
Article PubMed Google Scholar
Mortazavi, B. J. et al. Analysis of machine learning techniques for heart failure readmissions. Circ. Cardiovasc. Qual. Outcomes 9, 629–640, https://doi.org/10.1161/CIRCOUTCOMES.116.003039 (2016).
Article PubMed PubMed Central Google Scholar
Marcinkiewicz-Siemion, M. et al. LC–MS-based serum fingerprinting reveals significant dysregulation of phospholipids in chronic heart failure. J. Pharm. Biomed. Anal. 154, 354–363, https://doi.org/10.1016/j.jpba.2018.03.027 (2018).
Article CAS PubMed Google Scholar
Godzien, J., Alonso-Herranz, V., Brabas, C. & Armitage, E. G. Controlling the quality of metabolomics data: new strategies to get the best out of the QC sample. Metabolomics 11, 518–528, https://doi.org/10.1007/s11306-014-0712-4 (2015).
Article CAS Google Scholar
Armitage, E. G., Godzien, J., Alonso-Herranz, V., Lopez-Gonzalvez, A. & Barbas, C. Missing value imputation strategies for metabolomics data. Electrophoresis. 36, 3050–60, https://doi.org/10.1002/elps.201500352 (2015).
Article CAS PubMed Google Scholar
Leopold, J. A. & Loscalzo, J. Emerging role of precision medicine in cardiovascular disease. Circ. Res. 122, 1302–1315, https://doi.org/10.1161/CIRCRESAHA.117.310782 (2018).
Article CAS PubMed PubMed Central Google Scholar
Tromp, J. et al. Novel endotypes in heart failure: effects on guideline-directed medical therapy. Eur. Heart. J. 39, 4269–4276, https://doi.org/10.1093/eurheartj/ehy712 (2018).
Article CAS PubMed Google Scholar
Triposkiadis, F. et al. The continuous heart failure spectrum: moving beyond an ejection fraction classification. Eur. Heart. J. 40, 2155–2163, https://doi.org/10.1093/eurheartj/ehz158 (2019).
Article PubMed Google Scholar
Cheng, M. L. et al. Metabolic disturbances identified in plasma are associated with outcomes in patients with heart failure: diagnostic and prognostic value of metabolomics. J. Am. Coll. Cardiol. 65, 1509–1520, https://doi.org/10.1016/j.jacc.2015.02.018 (2015).
Article CAS PubMed Google Scholar
Tang, W. H. et al. Prognostic value of elevated levels of intestinal microbe-generated metabolite trimethylamine-N-oxide in patients with heart failure: refining the gut hypothesis. J. Am. Coll. Cardiol. 64, 1908–1914, https://doi.org/10.1016/j.jacc.2014.02.617 (2014).
Article CAS PubMed Google Scholar
Mueller-Hennessen, M. et al. A novel lipid biomarker panel for the detection of heart failure with reduced ejection fraction. Clin. Chem. 63, 267–277, https://doi.org/10.1373/clinchem.2016.257279 (2017).
Article CAS PubMed Google Scholar
Hunter, W. G. et al. Metabolomic profiling identifies novel circulating biomarkers of mitochondrial dysfunction differentially elevated in heart failure with preserved versus reduced ejection fraction: evidence for shared metabolic impairments in clinical heart failure. J. Am. Heart. Assoc. 29, 5, https://doi.org/10.1161/JAHA.115.003190 (2016).
Article Google Scholar
Verdonschot, J. A. J. et al. Metabolic profiling associates with disease severity in non-ischemic dilated cardiomyopathy. J. Card. Fail. 00, 1–11, https://doi.org/10.1016/j.cardfail.2019.09.004 (2019).
Article Google Scholar
Filippatos, G. S. et al. Hyperuricemia, chronic kidney disease, and outcomes in heart failure: potential mechanistic insights from epidemiological data. Eur. Heart J. 32, 712–720, https://doi.org/10.1093/eurheartj/ehq473 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jankowska, E. A. et al. Hypeuricaemia predicts poor outcome in patients with mild to moderate chronic heart failure. Int. J. Cardiol. 115, 151–155, https://doi.org/10.1016/j.ijcard.2005.10.033 (2007).
Article PubMed Google Scholar
Hare, J. M. et al. OPT-CHF Investigators. Impact of oxypurinol in patients with symptomatic heart failure. Results of the OPT-CHF study. J. Am. Coll. Cardiol. 51, 2301–2309, https://doi.org/10.1016/j.jacc.2008.01.068 (2008).
Article CAS PubMed Google Scholar
Wang-Sattler, R. et al. Novel biomarkers for pre-diabetes identified by metabolomics. Mol. Syst. Biol. 8, 615, https://doi.org/10.1038/msb.2012.43 (2012).
Article CAS PubMed PubMed Central Google Scholar
Sparagna, G. C. et al. Loss of cardiac tatralinoleoyl cardiolipin in human and experimental heart failure. J. Lipid. Res. 48, 1559–1570, https://doi.org/10.1194/jlr.M600551-JLR200 (2007).
Article CAS PubMed Google Scholar
Li, J. et al. Discovery of phosphatidic acid, phosphatidylcholine, and phosphatidylserine as biomarkers for early diagnosis of endometriosis. Front. Physiol. 9, 14, https://doi.org/10.3389/fphys.2018.00014 (2018).
Article PubMed PubMed Central Google Scholar
Heimerl, S. et al. Alterations of plasma lysophosphatidylcholine species in obesity and weight loss. PLoS One 9, e111348, https://doi.org/10.1371/journal.pone.0111348 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Lindahl, A., Forshed, J. & Nordström, A. Overlap in serum metabolic profiles between non-related diseases: implications for LC-MS metabolomics biomarker discovery. Biochem. Biophys. Res. Commun. 478, 1472–7, https://doi.org/10.1016/j.bbrc.2016.08.155 (2016).
Article CAS PubMed Google Scholar
Mayerhofer, C. C. K. et al. Increased secondary/primary bile acid-ratio in chronic heart failure. J. Card. Fail. 23, 666–671, https://doi.org/10.1016/j.cardfail.2017.06.007 (2017).
Article CAS PubMed Google Scholar
Krack, A., Sharma, R., Fiqulla, H. R. & Anker, S. D. The importance of the gastrointestinal system in the pathogenesis of heart failure. Eur. Heart J. 26, 2368–2374, https://doi.org/10.1093/eurheartj/ehi389 (2005).
Article CAS PubMed Google Scholar
Kamo, T., Akazawa, H., Suzuki, J. & Komuro, I. Novel concept of heart-gut axis in the pathophysiology of heart failure. Korean Circ. J. 47, 663–669, https://doi.org/10.4070/kcj.2017.0028 (2017).
Article PubMed PubMed Central Google Scholar
Nodari, S. et al. Effects of n-3 polyunsaturated fatty acids on left ventricular function and functional capacity in patients with dilated cardiomyopathy. J. Am. Coll. Cardiol. 57, 870–879, https://doi.org/10.1016/j.jacc.2010.11.017 (2011).
Article CAS PubMed Google Scholar
Aarsetoy, H. et al. Low levels of cellular omega-3 increase the risk of ventricular fibrillation during the acute ischaemic phase of a myocardial infarction. Resuscitation 78, 258–264, https://doi.org/10.1016/j.resuscitation.2008.04.007 (2008).
Article CAS PubMed Google Scholar
Mozaffarian, D. et al. Circulating long-chain omega-3 fatty acids and incidence of congestive heart failure in older adults: the cardiovascular health study. Ann. Intern. Med. 155, 160–170, https://doi.org/10.1059/0003-4819-155-3-201108020-00006 (2011).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the annual grant for Young Scientists 2014, Club 30 of Polish Cardiac Society granted to MMS, statutory grants and funds from National Scientific Leading Center of Medical University of Bialystok. The article has been supported by the Polish National Agency for Academic Exchange under grant number PPI/PZA/2019/1/00001/U/00001.

Author information

Authors and Affiliations

Medical University of Bialystok, Cardiology Department, M. Sklodowskiej-Curie 24A, 15-276, Bialystok, Poland
M. Marcinkiewicz-Siemion, M. Kaminski, K. Ptaszynska-Kopczynska, A. Szpakowicz, A. Lisowska, M. Jasiewicz, E. Tarasiuk, B. Sobkowicz & K. A. Kaminski
Medical University of Bialystok, Clinical Research Centre, M. Sklodowskiej-Curie 24A, 15-276, Bialystok, Poland
M. Ciborowski & A. Kretowski
Department of Population Medicine and Civilization Diseases Prevention, Medical University of Bialystok, Waszyngtona 13A, 15-276, Bialystok, Poland
K. A. Kaminski

Authors

M. Marcinkiewicz-Siemion
View author publications
You can also search for this author in PubMed Google Scholar
M. Kaminski
View author publications
You can also search for this author in PubMed Google Scholar
M. Ciborowski
View author publications
You can also search for this author in PubMed Google Scholar
K. Ptaszynska-Kopczynska
View author publications
You can also search for this author in PubMed Google Scholar
A. Szpakowicz
View author publications
You can also search for this author in PubMed Google Scholar
A. Lisowska
View author publications
You can also search for this author in PubMed Google Scholar
M. Jasiewicz
View author publications
You can also search for this author in PubMed Google Scholar
E. Tarasiuk
View author publications
You can also search for this author in PubMed Google Scholar
A. Kretowski
View author publications
You can also search for this author in PubMed Google Scholar
B. Sobkowicz
View author publications
You can also search for this author in PubMed Google Scholar
K. A. Kaminski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.M.S. and K.A.K. conceived the experiment; M.M.S., K.P.K., A.S., A.L., M.J., E.T. and K.A.K conducted the experiment; M.C. performed metabolomics analyses; M.M.S., M.C. and M.K. carried out statistical assessment; M.M.S., M.C. and K.A.K. analyzed the results; M.M.S., M.C., M.K. and K.A.K. prepared the manuscript; A.K., B.S. supervised the procedures and contributed to the manuscript edition. All authors reviewed the manuscript.

Corresponding author

Correspondence to K. A. Kaminski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Suplementary information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Marcinkiewicz-Siemion, M., Kaminski, M., Ciborowski, M. et al. Machine-learning facilitates selection of a novel diagnostic panel of metabolites for the detection of heart failure. Sci Rep 10, 130 (2020). https://doi.org/10.1038/s41598-019-56889-8

Download citation

Received: 02 July 2019
Accepted: 18 December 2019
Published: 10 January 2020
DOI: https://doi.org/10.1038/s41598-019-56889-8

This article is cited by

Understanding ayahuasca effects in major depressive disorder treatment through in vitro metabolomics and bioinformatics
- Flávia S. Zandonadi
- Alex Ap. Rosini Silva
- Alessandra Sussulini
Analytical and Bioanalytical Chemistry (2023)
Probiotic-mediated p38 MAPK immune signaling prolongs the survival of Caenorhabditis elegans exposed to pathogenic bacteria
- Miroslav Dinić
- Stefan Jakovljević
- Nataša Golić
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A novel preliminary metabolomic panel for IHD diagnostics and pathogenesis

Multinomial machine learning identifies independent biomarkers by integrated metabolic analysis of acute coronary syndrome

Dynamic personalized risk prediction in chronic heart failure patients: a longitudinal, clinical investigation of 92 biomarkers (Bio-SHiFT study)

Introduction

Material and Methods

Study population

Clinical parameters

Metabolic fingerprinting by LC-QTOF-MS

Statistical analysis

Metabolites identification

Equation 1

Results

Clinical group characteristics

Selection of top ranked metabolites which created the diagnostic panel

Discussion

Study limitations

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Suplementary information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Understanding ayahuasca effects in major depressive disorder treatment through in vitro metabolomics and bioinformatics

Probiotic-mediated p38 MAPK immune signaling prolongs the survival of Caenorhabditis elegans exposed to pathogenic bacteria

Comments

Search

Quick links