Database Restricted Access

Hospitalized patients with heart failure: integrating electronic healthcare records and external outcome data

Zhongheng Zhang Yan Zhao Linghong Cao Ziyin Xu Rangui Chen Lukai Lv Ping Xu

Published: Oct. 9, 2020. Version: 1.1

When using this resource, please cite: (show more options)
Zhang, Z., Zhao, Y., Cao, L., Xu, Z., Chen, R., Lv, L., & Xu, P. (2020). Hospitalized patients with heart failure: integrating electronic healthcare records and external outcome data (version 1.1). PhysioNet.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.


Heart failure is a common reason for hospitalization in the elderly and it is associated with significant mortality and morbidity. To facilitate epidemiological studies of heart failure, there is a need for high quality datasets to be made available to researchers. While several heart failure datasets have been established in Western countries, there is a paucity of data available from China. Understanding differences in patient populations and healthcare systems between China and other countries is important in providing optimal care. To help address this issue, we created a retrospective heart failure dataset using electronic health data collected from patients who were admitted to a hospital in Sichuan, China between 2016 and 2019. The dataset includes 168 variables for 2,008 patients with heart failure.


Heart failure is a common reason for hospitalization in the elderly and it is associated with significant mortality and morbidity. The disease affects over 6 million people in the United States, with an estimated incidence of 21 per 1000 people in the elderly [1, 2]. Studies have reported that one year mortality ranges from 20% to 60% after hospitalization for acute heart failure [3-6], depending on comorbidities and co-existing medical conditions.

Many datasets have been developed to support studies on the epidemiology of hospitalized patients with heart failure. For instance, the Cleveland in Heart Disease dataset contains 75 variables in 303 patients and it is widely used in clinical studies [7]. Another widely used dataset is the Nationwide Inpatient Sample (NIS), a publicly available resource from the Healthcare Utilization Project (HCUP) that is supported by the Agency for Healthcare Research and Quality [8].


Our study aimed to establish a heart failure database by extracting data from routinely collected electronic healthcare records. Data on subsequent hospital admissions and mortality were obtained at a mandatory follow‐up visit at 28 days, 3 months and 6 months (if the patient was unable to reach the clinical centre, the follow‐up visit was replaced by a telephone call).

The study was carried out retrospectively, selecting patients for inclusion who had been admitted to Zigong Fourth People’s Hospital with heart failure between December 2016 to June 2019. Informed consent was waived due to retrospective design of the study. The study complies with the Declaration of Helsinki and it was approved by the ethics committee of Zigong Fourth People’s Hospital (Approval Number: 2020-010)

Electronic healthcare records of consecutive patients with the diagnosis of heart failure were reviewed. Heart failure was defined according to the European Society of Cardiology (ESC) criteria [9]:

  • The presence of symptoms and/or signs of heart failure. Typical symptoms include breathlessness, orthopnea, paroxysmal nocturnal dyspnea, reduced excises  tolerance, Fatigue, tiredness, increased time to recover after exercise and Ankle swelling. Typical signs include elevated jugular venous pressure, hepatojugular reflux, third heart sound (gallop rhythm) and laterally displaced apical impulse. 
  • Elevated levels of BNPs (BNP >35 pg/mL and/or NT‐proBNP >125 pg/mL)
  • Objective evidence of other cardiac functional and structural alterations underlying heart failure.
  • In case of uncertainty, a stress test or invasively measured elevated LV filling pressure may be needed to confirm the diagnosis.

Data Description

Baseline clinical characteristics were measured on the day of hospital admission, which included body temperature, pulse, respiration rate, systolic blood pressure, diastolic blood pressure, mean arterial blood pressure, weight, height, body mass index (BMI), type of heart failure, New York Heart Association (NYHA) cardiac function, Killip Grade (Class 1: no rales, no 3rd heart sound; Class 2: rales in <1⁄2 lung field or presence of a 3rd heart sound; Class 3: Rales in >1⁄2 lung field–pulmonary edema; Class 4: cardiogenic shock determined clinically), and Glasgow Coma Scale (GCS).

Echocardiographic findings include left ventricular ejection fraction (LVEF), left ventricular end diastolic diameter, mitral valve peak E wave velocity (m/s), mitral valve peak A wave velocity (m/s), E/A, tricuspid valve regurgitation velocity, and tricuspid valve regurgitation pressure.

Medications administered during the hospitalization are recorded in dat_md.csv. Administration times are not available, but multiple entries in the table indicate that a medication was administered multiple times. Primary drug categories included in our dataset were diuretics, inotropes, vasodilators. The diuretics included furosemide, torsemide and spironolactone. Inotropes included deslanoside, dobutamine, digoxin, isoprenaline and milrinone. Vasodilators included isosorbide mononitrate and nitroglycerin.

Usage Notes

Data were collected with the goal of developing a model to predict emergency readmission of discharged heart failure patients. Another use might be to carry out a descriptive study to compare this patient population with similar cohorts of patients in other hospitals and countries.

A limitation of the dataset is that it is collected from a single center, so models developed using the data may not be generalizable. It should also be noted that the data has been aggregated into a single row per hospitalization, so only high level representations of patient stays are provided. The dataset does not offer time series data over the hospitalization, so, for example, administration times for medications are not available.

Release Notes

This update (V1.1) included:

  1. A table containing details of medications administered during the hospitalization was added.
  2. Minor updates were made to the data to correct errors.
  3. The data dictionary file was improved by adding more detailed descriptions of observations, such as units of measurement and reference range for laboratory tests. 

Conflicts of Interest

The authors have no conflicts of interest to declare.


  1. Benjamin, E. J. et al. Heart Disease and Stroke Statistics-2017 Update: A Report From the American Heart Association. Circulation 135, e146–e603 (2017).
  2. Fang, N., Jiang, M. & Fan, Y. Ideal cardiovascular health metrics and risk of cardiovascular disease or mortality: A meta-analysis. Int. J. Cardiol. 214, 279–283 (2016).
  3. Lombardi, C. et al. In-hospital and long-term mortality for acute heart failure: analysis at the time of admission to the emergency department. ESC Heart Fail ehf2.12847 (2020). doi:10.1002/ehf2.12847
  4. Ye, S.-D. et al. Association between anemia and outcome in patients hospitalized for acute heart failure syndromes: findings from Beijing Acute Heart Failure Registry (Beijing AHF Registry). Intern Emerg Med 151, 457 (2020).
  5. Braunschweig, F. et al. New York Heart Association functional class, QRS duration, and survival in heart failure with reduced ejection fraction: implications for cardiac resychronization therapy. Eur. J. Heart Fail. 19, 366–376 (2017).
  6. Al-Jarallah, M. et al. Incidence and impact of cardiorenal anaemia syndrome on all-cause mortality in acute heart failure patients stratified by left ventricular ejection fraction in the Middle East. ESC Heart Fail 6, 103–110 (2019).
  7. Detrano, R. et al. International application of a new probability algorithm for the diagnosis of coronary artery disease. Am. J. Cardiol. 64, 304–310 (1989).
  8. HCUP National Inpatient Sample (NIS). Healthcare Cost and Utilization Project (HCUP). 2012. Agency for Healthcare Research and Quality, Rockville, MD.
  9. Ponikowski, P. et al. 2016 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure: The Task Force for the diagnosis and treatment of acute and chronic heart failure of the European Society of Cardiology (ESC). Developed with the special contribution of the Heart Failure Association (HFA) of the ESC. European Journal of Heart Failure 18, 891–975 (2016).


Access Policy:
Only logged in users who sign the specified data use agreement can access the files.

License (for files):
PhysioNet Restricted Health Data License 1.5.0

Corresponding Author
You must be logged in to view the contact information.
  • 1.0 - Sept. 9, 2020
  • 1.1 - Oct. 9, 2020