Feedback
  1. Chest X-Ray Images (Pneumonia)

    • www.kaggle.com
    Updated Mar 24, 2018
  2. NIH Chest X-rays

    • www.kaggle.com
    Updated Feb 21, 2018
  3. NIH Chest X-ray Dataset (2 of 6)

    • www.kaggle.com
    Updated Nov 16, 2017
  4. Center for X-ray Optics (CXRO)

    • catalog.data.gov
    • data.wu.ac.at
    Updated Mar 8, 2017
  5. Random Sample of NIH Chest X-ray Dataset

    • www.kaggle.com
    Updated Nov 23, 2017
  6. X-Ray Diffractometer

    • catalog.data.gov
    Updated Mar 8, 2017
  7. Pulmonary Chest X-Ray Abnormalities

    • www.kaggle.com
    Updated Mar 9, 2018
  8. X-Ray Imaging Technology Development Lab

    • catalog.data.gov
    Updated Mar 8, 2017
  9. a

    NIH Chest X-ray Dataset of 14 Common Thorax Disease Categories

    • academictorrents.com
  10. Chest X-Rays Dataset

    • www.kaggle.com
    Updated Jan 29, 2018
  11. d

    Silicon Wafer X-ray Mirror Project

    • catalog.data.gov
    Updated Aug 1, 2018
  12. X-Ray Diffractometer

    • data.wu.ac.at
    Updated Mar 8, 2017
  13. GOES-12 Solar X-ray Imager Archive

    • catalog.data.gov
    • data.nodc.noaa.gov
    Updated Feb 1, 2017
  14. d

    Magnetically-coupled microcalorimeter arrays for x-ray astrophysics with...

    • catalog.data.gov
    Updated Aug 1, 2018
  15. EUV/X-Ray Calibration Facility

    • catalog.data.gov
    Updated Mar 8, 2017
  16. RSNA Bone Age

    • www.kaggle.com
    Updated Jan 24, 2018
  17. d

    Hybrid Powder - Single Crystal X-Ray Diffraction Instrument for Planetary...

    • catalog.data.gov
    Updated Aug 1, 2018
  18. Single mimivirus particles intercepted and imaged with an X-ray laser (CXIDB...

    • www.osti.gov
    • search.datacite.org
    Published Feb 2, 2011
  19. X-ray Diffraction Images of Pseudomonas aeruginosa DsbA

    • researchdata.ands.org.au
    Updated Dec 14, 2013
  20. d

    Multi-Purpose X-ray System, Phase I

    • catalog.data.gov
    Updated Aug 1, 2018
  21. K

    Lung Masks for Shenzhen Hospital Chest X-ray Set

    • www.kaggle.com
    Updated Mar 3, 2018
  22. d

    The Focusing Optics X-ray Solar Imager (FOXSI): Update & Second Launch...

    • catalog.data.gov
    • datamirror.org
    Updated Aug 1, 2018
  23. d

    Data from: X-ray computed tomography and its potential in ecological...

    • datadryad.org
    Published Jul 17, 2018
  24. X-ray fluorescence scannings and AMS radicarbon dates of sediment core...

    • doi.pangaea.de
    Published Mar 21, 2017
  25. Data from: The X-ray Crystal Structure of Full-length Human Plasminogen

    • researchdata.ands.org.au
    Updated May 3, 2012
  26. w

    Automatic X-ray Diffractometers

    • data.wu.ac.at
  27. d

    X-Ray Facility List

    • catalog.data.gov
    • data.wa.gov
    • +1more
    Updated Feb 3, 2018
  28. X-ray fluorescent (XRF) ratios and diatom groups of sediment core...

    • doi.pangaea.de
    Published May 10, 2016
  29. Raw and normalized X-ray fluorescence (XRF) scannings of IODP Expedition...

    • doi.pangaea.de
    Published May 2, 2012
  30. Biomedical applications of phase contrast x-ray imaging

    • researchdata.ands.org.au
    Published 2014
  31. f

    Coherent X-ray Imaging Data Bank

    • fairsharing.org
  32. Advanced UV and X-ray Probes

    • catalog.data.gov
    • data.wu.ac.at
    Updated Mar 8, 2017
  33. X-ray fluorescence (XRF) scannings and grain size analysis at sites in the...

    • doi.pangaea.de
    Published Mar 26, 2014
  34. d

    Advanced Optical Metrology for XRAY Replication Mandrels ...

    • data.world
    Updated Aug 27, 2018
  35. Tomographic X-ray data of time-dependent 3D cross phantom

    • zenodo.org
    Published Sep 1, 2018
  36. X-Ray Imaging Technology Development Lab

    • data.wu.ac.at
    Updated Mar 8, 2017
  37. d

    Development of X-ray Computed Tomography (CT) Imaging Method for the...

    • catalog.data.gov
    Updated Aug 1, 2018
  38. d

    Whole rock, soil, sediment, x-ray diffraction, and electron microprobe...

    • catalog.data.gov
    • search.datacite.org
    Updated Jun 8, 2018
  39. Data from: Milli X-Ray Fluorescence Spectrometer

    • data.wu.ac.at
    Updated Mar 8, 2017
  40. Iron X-ray fluorescence (XRF) scannings of ODP Holes 165-1001A and 171-1050C

    • doi.pangaea.de
    Published Jan 27, 2001
  41. d

    Soft X-ray Absorbers Enabling Study of the Diffuse X-ray Background

    • catalog.data.gov
    Updated Aug 1, 2018
  42. Raw X-ray fluorescence (XRF) scannings and radiocarbon age of sediment cores...

    • doi.pangaea.de
    Published Aug 19, 2016
  43. d

    Wide Field-of-View (FOV) Soft X-Ray Imager Project

    • catalog.data.gov
    Updated Jul 8, 2015
  44. d

    Novel Magnetically-Tuned TES For Imaging X-ray Spectroscopy Project

    • catalog.data.gov
    Updated Aug 1, 2018
  45. Data from: Single mimivirus particles intercepted and imaged with an X-ray...

    • www.osti.gov
    • search.datacite.org
    Published Feb 2, 2011
  46. d

    Modulated X-ray Sources for Cal X-1

    • catalog.data.gov
    Updated Aug 1, 2018
  47. X-ray diffraction analysis of sediment cores from the ACEX expedition to the...

    • doi.pangaea.de
    Published Oct 2, 2008
  48. NIST X-ray Photoelectron Spectroscopy Database - SRD 20

    • catalog.data.gov
    Updated Aug 10, 2018
  49. f

    Data from: X-ray and Neutron Diffraction Studies on “Li4.4Sn”

    • figshare.com
  50. d

    Hard X-ray Photoelectric Polarimeter

    • data.world
    Updated Sep 4, 2018
Share
Facebook
Twitter
Google+
Email
Click to copy link
Link copied

Random Sample of NIH Chest X-ray Dataset

5,606 images and labels sampled from the NIH Chest X-ray Dataset

  • Dataset updated   Nov 23, 2017
Dataset provided by
National Institutes of Health Chest X-Ray Dataset
License
CC0: Public Domainhttps://creativecommons.org/publicdomain/zero/1.0/
Available download formats from providers
csv
,
zip
Description

NIH Chest X-ray Dataset Sample

National Institutes of Health Chest X-Ray Dataset

Chest X-ray exams are one of the most frequent and cost-effective medical imaging examinations available. However, clinical diagnosis of a chest X-ray can be challenging and sometimes more difficult than diagnosis via chest CT imaging. The lack of large publicly available datasets with annotations means it is still very difficult, if not impossible, to achieve clinically relevant computer-aided detection and diagnosis (CAD) in real world medical sites with chest X-rays. One major hurdle in creating large X-ray image datasets is the lack resources for labeling so many images. Prior to the release of this dataset, Openi was the largest publicly available source of chest X-ray images with 4,143 images available.

This NIH Chest X-ray Dataset is comprised of 112,120 X-ray images with disease labels from 30,805 unique patients. To create these labels, the authors used Natural Language Processing to text-mine disease classifications from the associated radiological reports. The labels are expected to be >90% accurate and suitable for weakly-supervised learning. The original radiology reports are not publicly available but you can find more details on the labeling process in this Open Access paper: "ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases." (Wang et al.)

Link to paper


File contents - This is a random sample (5%) of the full dataset:

  • sample.zip: Contains 5,606 images with size 1024 x 1024

  • sample_labels.csv: Class labels and patient data for the entire dataset

    • Image Index: File name
    • Finding Labels: Disease type (Class label)
    • Follow-up #
    • Patient ID
    • Patient Age
    • Patient Gender
    • View Position: X-ray orientation
    • OriginalImageWidth
    • OriginalImageHeight
    • OriginalImagePixelSpacing_x
    • OriginalImagePixelSpacing_y


Class descriptions

There are 15 classes (14 diseases, and one for "No findings") in the full dataset, but since this is drastically reduced version of the full dataset, some of the classes are sparse with the labeled as "No findings"

  • Hernia - 13 images
  • Pneumonia - 62 images
  • Fibrosis - 84 images
  • Edema - 118 images
  • Emphysema - 127 images
  • Cardiomegaly - 141 images
  • Pleural_Thickening - 176 images
  • Consolidation - 226 images
  • Pneumothorax - 271 images
  • Mass - 284 images
  • Nodule - 313 images
  • Atelectasis - 508 images
  • Effusion - 644 images
  • Infiltration - 967 images
  • No Finding - 3044 images


Full Dataset Content

The full dataset can be found here. There are 12 zip files in total and range from ~2 gb to 4 gb in size.


Data limitations:

  1. The image labels are NLP extracted so there could be some erroneous labels but the NLP labeling accuracy is estimated to be >90%.
  2. Very limited numbers of disease region bounding boxes (See BBox_list_2017.csv)
  3. Chest x-ray radiology reports are not anticipated to be publicly shared. Parties who use this public dataset are encouraged to share their “updated” image labels and/or new bounding boxes in their own studied later, maybe through manual annotation


Modifications to original data

  • Original TAR archives were converted to ZIP archives to be compatible with the Kaggle platform

  • CSV headers slightly modified to be more explicit in comma separation and also to allow fields to be self-explanatory


Citations


Acknowledgements

This work was supported by the Intramural Research Program of the NClinical Center (clinicalcenter.nih.gov) and National Library of Medicine (www.nlm.nih.gov).

Search
Clear search
Close search
Google apps
Main menu