Database Restricted Access

HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis

Meishar Meisel Benjamin Alfred Cohen Meital Baskin Beatrice Tiosano Joachim Behar Eran Berkowitz

Published: Sept. 9, 2025. Version: 1.0.0


When using this resource, please cite: (show more options)
Meisel, M., Cohen, B. A., Baskin, M., Tiosano, B., Behar, J., & Berkowitz, E. (2025). HYAMD High-Resolution Fundus Image Dataset for age related macular degeneration (AMD) Diagnosis (version 1.0.0). PhysioNet. RRID:SCR_007345. https://doi.org/10.13026/ydf1-z238

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220. RRID:SCR_007345.

Abstract

The Hillel Yaffe Age Related Macular Degeneration (HYAMD) longitudinal dataset comprises of 1,560 Digital Fundus Images (DFIs) of 325 patients examined at the Hillel Yaffe Medical Center (Hadera, Israel, Helsinki approval number 0048-24-HYMC) providing follow-up data from 2021 to 2024. The AMD cohort includes 147 patients (73 males, 74 females) aged 54–94, while the control group comprises 190 diabetic retinopathy (DR) patients (111 males, 78 females) aged 24–92. DFIs in the AMD cohort either present with early, intermediate, or late-stage AMD, including neovascular AMD (NVAMD), visible geographic atrophy (GA), or presence of large drusen (LD). Diagnosis was based on a comprehensive clinical ophthalmic examination and supported by Optical Coherence Tomography (OCT) scans, including OCT angiography. Non-AMD DFIs were taken from patients examined in the Retina clinic for DR and without a concurrent diagnosis of AMD. Diagnosis of DR was based on clinical examination, macular OCT, and in relevant cases fluorescein angiography, and widefield imaging. This dataset was developed for the validation of machine learning models intended to the task of AMD identification from single DFI. The annotations are gold-standard, meaning the AMD labels were determined based on a complete ophthalmic evaluation rather than solely on DFIs. The images were taken using a DRI OCT Triton (Topcon) camera, with a 45° field of view (FOV) and a resolution of 1960 × 1934 pixels. HYAMD is also, to the best of our knowledge, the first open access retinal dataset from an Israeli sample.


Background

Age related macular degeneration (AMD) is a retinal disease that threatens high-acuity central vision, and a leading cause of blindness. The main risk factor is advancing age, with the severity of vision loss ranging from mild to severe. There is a 25% risk of early AMD and 8% risk of late AMD in patients over the age of 75, with the number of cases expected to increase because of the aging population. The current global prevalence of this disease is about 200 million today and is expected to reach 288 million in 2040 [1]. Due to the high incidence and risk, it is crucial to improve the efficiency of the screening and diagnosis of AMD. In addition, early detection of AMD significantly improves the patient's visual prognosis by allowing timely intervention, which can prevent severe vision loss and enhance the quality of life.

Diagnosis of the disease requires a dilated fundus examination the examiner evaluates the macula for deposits of drusen, pigmentary changes, geographic atrophy and scar formation. Attention is given to the size, number, and distribution of drusen. A complete eye examination is also performed to rule out other coexisting ocular pathologic conditions. The staging of the disease may largely be based on the examination; however, use of a variety of imaging techniques is now considered essential to correlate examination findings and guide management.

The etiology of AMD remains incompletely understood, with contributing factors including genetic predisposition, chronic photo-destruction, oxidative stress, and nutritional deficiencies [1, 2]. AMD is broadly classified into two categories: Dry AMD (nonexudative) and Wet AMD (neovascular or exudative).

Dry AMD is characterized by the accumulation of drusen and progressive atrophy of the retinal pigment epithelium (RPE), leading to gradual vision loss [3]. In advanced stages, large areas of atrophy and extensive drusen deposits are visible under ophthalmoscopy. Wet AMD, on the other hand, is marked by pathological neovascularization beneath the RPE, resulting in exudation, hemorrhage, and scarring. This form progresses rapidly and causes irreversible damage to photoreceptors, leading to severe and rapid vision loss if untreated [4].

The Age-Related Eye Disease Study (AREDS) developed a classification system to assess the severity of AMD, categorizing it into mild, moderate, and severe stages based on clinical features. Mild AMD is characterized by the presence of small drusen (<63 μm) without significant visual symptoms. Moderate AMD involves medium-sized drusen (≥63 μm and <125 μm) or pigmentary abnormalities, with a higher risk of progression. Severe AMD is marked by large drusen (≥125 μm), geographic atrophy, or neovascular AMD, often accompanied by significant visual impairment. This classification aids in predicting disease progression and guiding management strategies [5]

Recent advancements in deep learning (DL) have revolutionized the field of automated age-related macular degeneration (AMD) detection, with digital fundus imaging (DFI) serving as a primary modality for these applications [6]. However, a persistent limitation in the existing body of research is the reliance on reference labels derived solely from DFI evaluations, rather than comprehensive ophthalmic examinations [7]. This approach reduces the detection task to a subjective interpretation of retinal appearance, which is inherently prone to variability and error. Conditions such as diabetic retinopathy and central retinal vein occlusion may mimic AMD features, further complicating the identification process. Consequently, DL models trained exclusively on DFI data are susceptible to inheriting biases from inconsistent annotations, subjective judgments, and unverified cases, potentially misrepresenting true clinical manifestations of AMD.

To address these critical issues, the Hillel Yaffe AMD Dataset (HYAMD) was specifically developed to provide a robust foundation for AMD detection and classification tasks. Unlike most existing datasets, HYAMD employs gold-standard annotations based on comprehensive ophthalmic evaluations, including optical coherence tomography (OCT), rather than relying solely on subjective DFI-based labeling. By incorporating a comprehensive ophthalmic examination together with multimodal diagnostic data, HYAMD aims to enhance the reliability of DL models and mitigate biases, ensuring that algorithmic outputs align more closely with clinical realities. This resource represents a pivotal step toward improving the accuracy and generalizability of AI-driven AMD detection systems, ultimately benefiting both researchers and clinicians. HYAMD is also, to the best of our knowledge, the first open access retinal dataset from an Israeli population sample.


Methods

This study was approved by the Helsinki Committee at the Hillel Yaffe Medical Center (Helsinki approval number: 0048-24-HYMC). All identifiable patient information was removed to ensure patient privacy.

Study Cohort

The dataset was curated by the Hillel Yaffe Ophthalmology Department Retina Unit, Hadera, Israel. DFIs of both eyes of each patient were captured using a TOPCON DRI OCT Triton retinal camera, with a 45° FOV. The AMD cohort includes 188 patients (95 males, 93 females) aged 57–94, while the control group comprises 192 diabetic retinopathy (DR) patients (113 males, 79 females) aged 24–92.

Data preparation

All DFIs were deidentified, ensuring that any personal identifiers were removed. To maintain consistency, images were cropped to a square format by removing black borders.

Labeling

HYAMD employs gold-standard annotations, meaning that AMD labels were assigned based on a full ophthalmic examination, rather than being inferred solely from DFIs. Patients are diagnosed with AMD based on a comprehensive ophthalmic examination, which includes visual acuity (VA) assessment, intraocular pressure (IOP) measurement, anterior and posterior segment evaluation and posterior pole assessment using a 78 diopter lens while the pupil is dilated. These examinations are supported by OCT scans, including OCT angiography . The AMD cohort is divided in two subgroups: early AMD, with a label AMD=1, and intermediate-to-late AMD, with a label of AMD=2. The intermediate-to-late group either presents with neovascular AMD (NVAMD), visible geographic atrophy (GA), or with the presence of large drusen (LD). Furthermore, all patients in this cohort were followed up in Retina clinic for at least a year validating the labeling accuracy. Each patient had at least three visits a year, during which they underwent clinical examinations, OCT, and fundus photography as part of their routine follow-up. If the stage of an eye changed during the follow-up period, all subsequent images were labeled with the new AMD stage. Patients with erroneous diagnoses were removed from the dataset. Non-AMD DFIs were collected from patients examined in the Retina clinic for DR who did not have a concurrent diagnosis of AMD or other eye diseases. The diagnosis of DR was based on clinical examination, macular OCT, and, where relevant, fluorescein angiography and widefield imaging.

All examinations were carried out by professional Retina specialists, while the images were taken by trained and experienced technicians.


Data Description

The longitudinal dataset comprises 1,560 DFIs from 325 patients, providing follow-up data from 2021 to 2024. The AMD cohort includes 147 patients (73 males, 74 females) aged 54–94, while the control group comprises 190 diabetic retinopathy (DR) patients (111 males, 78 females) aged 24–92.

Number of images per class:

  • Class 0 (non-AMD) – 1,028 images
  • Class 1 (early AMD) – 108 images
  • Class 2 (late AMD) – 424 images

Number of images per patient – Mean: 4.8, Median: 4, Std: 4.3

List of patient IDs and number of images per patient is in the attached CSV file.

Folder structure

  • /images – Folder containing all DFIs in .png format
  • /labels – Folder containing three files:
    • labels.csv – Metadata file with labels and patient-level information
    • README.md – Readme file containing data description
    • images_per_patient_id.csv – CSV file containing number of images per patient ID

The structure of the dataset is as follows:

Images

The dataset consists of DFIs, including:

  • AMD DFIs
  • Non-AMD DFIs

All DFIs are stored in JPG format with a 1:1 aspect ratio. The naming convention follows the format x_y_z.jpg, where:

  • x represents the patient ID
  • y represents the laterality

For example: 981616684_R.jpg

The images in the images/ folder correspond to the patient IDs in labels.csv, where image_id in the CSV matches the filename in images/.

Labels

The labels file contains the following columns:

Column Name Description
Image ID The filename of the DFI
Patient ID The patient ID
Sex Binary classification: Male (M) or Female (F)
Age In years
Eye side Binary classification: OD (right eye) or OS (left eye)
AMD Multiclass classification: 0 – non-AMD (DR control), 1 – early AMD, 2 – intermediate to late AMD

Usage Notes

Potential research applications

  1. This dataset can serve as a benchmark dataset for evaluating existing AMD diagnosis models.
  2. It can be used as a resource for training new deep learning models for automated AMD detection.
  3. This dataset can be a valuable resource for validating AI models across diverse demographics since it contains a cohort of Israeli patients.
  4. The dataset's dual focus on AMD and DR opens additional doors for comparative analysis, possibly used to develop hybrid diagnostic models.

Limitations

  1. Patients’ medical history beyond AMD diagnosis is not provided.
  2. The dataset has limited demographic diversity, as it was collected using a single camera model with a fixed 45° FOV and includes data from a single ethnic group.

Release Notes

Version 1.0.0: Initial release.


Ethics

The authors declare no ethics concerns. This project was approved by the Helsinki Committee at the Hillel Yaffe medical center (Helsinki approval number 0048-24-HYMC).

The dataset complies with all relevant data protection and privacy regulations in Israel, including data de-identification standards and requirements for data sharing.


Conflicts of Interest

The authors have no conflicts of interest to declare.


References

  1. Lim LS, Mitchell P, Seddon JM, Holz FG, Wong TY. Age-related macular degeneration. Lancet. 2012;379(9827):1728-1738. doi:10.1016/S0140-6736(12)60282-7.
  2. Khandhadia S, Lotery AJ. Oxidation and age-related macular degeneration: insights from molecular biology. Expert Rev Mol Med. 2010;12:e34. doi:10.1017/S1462399410001559.
  3. Ferris FL, Wilkinson CP, Bird A, Chakravarthy U, Chew E, Csaky K, et al. Clinical classification of age-related macular degeneration. Ophthalmology. 2013;120(4):844-851. doi:10.1016/j.ophtha.2012.10.036.
  4. Wong TY, Chakravarthy U, Klein R. Epidemiology of age-related macular degeneration. Retina. 2006;26(2):151-160. doi:10.1097/01.iae.0000228827.18836.6e.
  5. Ferris FL, Davis MD, Clemons TE, Lee LY, Chew EY, Lindblad AS, et al. A simplified severity scale for age-related macular degeneration: AREDS Report No. 18. Arch Ophthalmol. 2005;123(11):1570-1574. doi:10.1001/archopht.123.11.1570.
  6. Leingang O, Kuehlewein L, Heimes B, et al. Automated deep learning-based AMD detection and staging in real-world OCT datasets. Sci Rep. 2023;13:46626. doi:10.1038/s41598-023-46626-7.
  7. Araújo T, Silva S, Monteiro M, et al. Robust deep learning for automated AMD screening and staging based on a retinal foundation model. Invest Ophthalmol Vis Sci. 2024;65(7):2833. doi:10.1167/iovs.65.7.2833.

Share
Access

Access Policy:
Only registered users who sign the specified data use agreement can access the files.

License (for files):
PhysioNet Restricted Health Data License 1.5.0

Data Use Agreement:
PhysioNet Restricted Health Data Use Agreement 1.5.0

Discovery
Corresponding Author
You must be logged in to view the contact information.

Files