8 datasets found
  1. T

    curated_breast_imaging_ddsm

    • tensorflow.org
    Updated Jul 11, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). curated_breast_imaging_ddsm [Dataset]. https://www.tensorflow.org/datasets/catalog/curated_breast_imaging_ddsm
    Explore at:
    Dataset updated
    Jul 11, 2023
    Description

    The CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information.

    The default config is made of patches extracted from the original mammograms, following the description from (http://arxiv.org/abs/1708.09427), in order to frame the task to solve in a traditional image classification setting.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('curated_breast_imaging_ddsm', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/curated_breast_imaging_ddsm-patches-3.0.0.png" alt="Visualization" width="500px">

  2. k

    CBIS-DDSM--Breast-Cancer-Image-Dataset

    • kaggle.com
    Updated Dec 6, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2013). CBIS-DDSM--Breast-Cancer-Image-Dataset [Dataset]. https://www.kaggle.com/datasets/awsaf49/cbis-ddsm-breast-cancer-image-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 6, 2013
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    Curated Breast Imaging Subset DDSM Dataset (Mammography)

  3. k

    DDSM-Mammography

    • kaggle.com
    Updated Jan 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). DDSM-Mammography [Dataset]. https://www.kaggle.com/skooch/ddsm-mammography/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 21, 2021
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    tfrecords files of scans from the DDSM dataset

  4. k

    CBIS-DDSM--Breast-Cancer-Dataset-of-JPG-Images

    • kaggle.com
    Updated Jan 29, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). CBIS-DDSM--Breast-Cancer-Dataset-of-JPG-Images [Dataset]. https://www.kaggle.com/datasets/debjeetdas/breast-cancer-jpg-image-dataset-of-cbisddsm
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 29, 2023
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This Dataset contains the JPG images of Breast Cancer taken from the CBIS-DDSM.

    https://i.imgur.com/rz4rtQI.png" alt="Breast Cancer Images">

    Descripton

    This dataset contains JPG format images (2.49 GB) of the original CBIS-DDSM dataset (163 GB) which are in DICOM format and by maintaining the same resolution of the images as it was in the original dataset.

    The original dataset was split into train and test by having two cases one is Mass and another is Calcification(Calc) i.e. calc_case_description_test_set.csv, calc_case_description_train_set.csv, mass_case_description_test_set.csv, mass_case_description_train_set.csv, and metadata.csv but in here this dataset is made by converting the images from DICOM to JPG format, removing the unnecessary columns by Data Cleaning and concatenating both the Mass and Calcification(Calc) cases train test into one i.e. calc_case(with_jpg_img).csv, mass_case(with_jpg_img).csv, and metadata(with_jpg_img).csv.


    | Collection | | | --- | --- | | Number of Studies | 6775 | | Number of Series | 6775 | | Number of Participants | 1,566(NB) | | Number of Images | 10239 | | Modalities | MG | | Image Size (GB) | 6(.jpg) |


    NB: The image data for this collection is structured such that each participant has multiple patient IDs. For example, pat_id 00038 has 10 separate patient IDs which provide information about the scans within the IDs (e.g. Calc-Test_P_00038_LEFT_CC, Calc-Test_P_00038_RIGHT_CC_1) This makes it appear as though there are 6,671 participants according to the DICOM metadata, but there are only 1,566 actual participants in the cohort.

    File Description

    1. JPG image folder file structure

    https://i.imgur.com/KtBMlVm.png" alt="Cancer Image file structure"> File naming: - Folder name: Subject ID > Study UID > Series UID - File name: Series Description > img_0 > 1.jpg


    2. CSV files description

    CSV FileDescription
    calc_case(with_jpg_img).csvThis file contains the Calcification cases patients with their patient_id, breast_density, left or right breast, image view, abnormality id, abnormality type, mass shape, mass margins, assessment, pathology, subtlety, jpg_fullMammo_img_path, jpg_crop_img_path, jpg_ROI_img_path
    mass_case(with_jpg_img).csvThis file contains the Mass cases patients with their patient_id, breast_density, left or right breast, image view, abnormality id, abnormality type, mass shape, mass margins, assessment, pathology, subtlety, jpg_fullMammo_img_path, jpg_crop_img_path, jpg_ROI_img_path
    metadata(with_jpg_img).csvThis file contains both of the Mass and Calcification(Calc) patients with their Series UID, Subject ID, Study UID, Series Description, Modality, SOP Class Name, SOP Class UID, Number of Images, jpg_folder_path



    Summary

    The CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information. The scale of the database along with ground truth validation makes the DDSM a useful tool in the development and testing of decision support systems. The CBIS-DDSM collection includes a subset of the DDSM data selected and curated by a trained mammographer. The images have been decompressed and converted to DICOM format. Updated ROI segmentation and bounding boxes, and pathologic diagnosis for training data are also included. A manuscript describing how to use this dataset in detail is available at https://www.nature.com/articles/sdata2017177.

    Published research results from work in developing decision support systems in mammography are difficult to replicate due to the lack of a standard evaluation data set; most computer-aided diagnosis (CADx) and detection (CADe) algorithms for breast cancer in mammography are evaluated on private data sets or on unspecified subsets of public databases. Few well-curated public datasets have been provided for the mammography community. These include the DDSM, the Mammographic Imaging Analysis Society (MIAS) database, and the Image Retrieval in Medical Applications (IRMA) project. Although these public data sets are useful, they are limited in terms of data set size and accessibility.

    For example, most researchers using the DDSM do not leverage all its images for a variety of historical reasons. When the database was released in 1997, computational resources to process hundreds or thousands of images were not widely available. Additionally, the DDSM images are saved in non-standard compression files that require the use of decompression code that has not been updated or maintained for modern computers. Finally, the ROI annotations for the abnormalities in the DDSM were provided to indicate a general position of lesions, but not a precise segmentation for them. Therefore, many researchers must implement segmentation algorithms for accurate feature extraction. This causes an inability to directly compare the performance of methods or to replicate prior results. The CBIS-DDSM collection addresses that challenge by publicly releasing a curated and standardized version of the DDSM for evaluation of future CADx and CADe systems (sometimes referred to generally as CAD) research in mammography.

    Source of the Original dataset

    https://doi.org/10.7937/K9/TCIA.2016.7O02S9CY

    Citations & Data Usage Policy

    Users must abide by the TCIA Data Usage Policy and Restrictions. Attribution should include references to the following citations:


    CBIS-DDSM Citation

    Sawyer-Lee, R., Gimenez, F., Hoogi, A., & Rubin, D. (2016). Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM) (Version 1) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2016.7O02S9CY


    Publication Citation

    Lee, R. S., Gimenez, F., Hoogi, A., Miyake, K. K., Gorovoy, M., & Rubin, D. L. (2017). A curated mammography data set for use in computer-aided detection and diagnosis research. In Scientific Data (Vol. 4, Issue 1). Springer Science and Business Media LLC. https://doi.org/10.1038/sdata.2017.177


    TCIA Citation

    Clark, K., Vendt, B., Smith, K., Freymann, J., Kirby, J., Koppel, P., Moore, S., Phillips, S., Maffitt, D., Pringle, M., Tarbox, L., & Prior, F. (2013). The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository. In Journal of Digital Imaging (Vol. 26, Issue 6, pp. 1045–1057). Springer Science and Business Media LLC. https://doi.org/10.1007/s10278-013-9622-7



    NOTE: The CBIS-DDSM dataset lack DICOM image for cropped and ROI images, so there are some cropped and ROI images that may not be found corresponding to their given path, so it is recommended to use the full mammography images from this dataset (only)

  5. d

    Dataset of Breast mammography images with Masses - Dataset - B2FIND

    • b2find.dkrz.de
    Updated Oct 23, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2023). Dataset of Breast mammography images with Masses - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/94fb7bbf-3128-5fa3-ac32-989c171fb9b1
    Explore at:
    Dataset updated
    Oct 23, 2023
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The dataset contains mammography with benign and malignant masses. Images in this dataset were first extracted 106 masses images from INbreast dataset, 53 masses images from MIAS dataset, and 2188 masses images DDSM dataset. Then we use data augmentation and contrast-limited adaptive histogram equalization to preprocess our images. After data augmentation, Inbreast dataset has 7632 images, MIAS dataset has 3816 images, DDSM dataset has 13128 images. In addition, we also integrate INbreast, MIAS, DDSM together. All the images were resized to 227*227 pixels.

  6. DDSM-mammography-positive-case

    • kaggle.com
    zip
    Updated Jan 13, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Laurent Pourchot (2023). DDSM-mammography-positive-case [Dataset]. https://www.kaggle.com/datasets/pourchot/ddsm-mammography-positive-case
    Explore at:
    zip(19549837664 bytes)Available download formats
    Dataset updated
    Jan 13, 2023
    Authors
    Laurent Pourchot
    Description

    Dataset

    This dataset was created by Laurent Pourchot

    Contents

  7. CBIS-DDSM Data Set Preprocessed

    • kaggle.com
    zip
    Updated Jul 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    MrTicha (2023). CBIS-DDSM Data Set Preprocessed [Dataset]. https://www.kaggle.com/datasets/mohamedbenticha/cbis-ddsm
    Explore at:
    zip(10829167796 bytes)Available download formats
    Dataset updated
    Jul 11, 2023
    Authors
    MrTicha
    Description

    This is a prepossessed version of the CBIS-DDSM data set for mammography segmentation.

    The preprocessing pipeline used on this data is inspired from this github repository : https://github.com/CleonWong/Can-You-Find-The-Tumour

    The Data is structered as follows : https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F13630556%2Fc7a7ba31b83470782decd9b2049ea9ef%2Ffolder.png?generation=1699352159040166&alt=media" alt="">

  8. o

    Data from: Connected-UNets: a deep learning architecture for breast mass...

    • omicsdi.org
    xml
    Updated Feb 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Baccouche A (2024). Connected-UNets: a deep learning architecture for breast mass segmentation. [Dataset]. https://www.omicsdi.org/dataset/biostudies-literature/S-EPMC8640011
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Feb 29, 2024
    Authors
    Baccouche A
    Variables measured
    Unknown
    Description

    Breast cancer analysis implies that radiologists inspect mammograms to detect suspicious breast lesions and identify mass tumors. Artificial intelligence techniques offer automatic systems for breast mass segmentation to assist radiologists in their diagnosis. With the rapid development of deep learning and its application to medical imaging challenges, UNet and its variations is one of the state-of-the-art models for medical image segmentation that showed promising performance on mammography. In this paper, we propose an architecture, called Connected-UNets, which connects two UNets using additional modified skip connections. We integrate Atrous Spatial Pyramid Pooling (ASPP) in the two standard UNets to emphasize the contextual information within the encoder-decoder network architecture. We also apply the proposed architecture on the Attention UNet (AUNet) and the Residual UNet (ResUNet). We evaluated the proposed architectures on two publically available datasets, the Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM) and INbreast, and additionally on a private dataset. Experiments were also conducted using additional synthetic data using the cycle-consistent Generative Adversarial Network (CycleGAN) model between two unpaired datasets to augment and enhance the images. Qualitative and quantitative results show that the proposed architecture can achieve better automatic mass segmentation with a high Dice score of 89.52%, 95.28%, and 95.88% and Intersection over Union (IoU) score of 80.02%, 91.03%, and 92.27%, respectively, on CBIS-DDSM, INbreast, and the private dataset.

  9. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
(2023). curated_breast_imaging_ddsm [Dataset]. https://www.tensorflow.org/datasets/catalog/curated_breast_imaging_ddsm

curated_breast_imaging_ddsm

Related Article
Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Jul 11, 2023
Description

The CBIS-DDSM (Curated Breast Imaging Subset of DDSM) is an updated and standardized version of the Digital Database for Screening Mammography (DDSM). The DDSM is a database of 2,620 scanned film mammography studies. It contains normal, benign, and malignant cases with verified pathology information.

The default config is made of patches extracted from the original mammograms, following the description from (http://arxiv.org/abs/1708.09427), in order to frame the task to solve in a traditional image classification setting.

To use this dataset:

import tensorflow_datasets as tfds

ds = tfds.load('curated_breast_imaging_ddsm', split='train')
for ex in ds.take(4):
 print(ex)

See the guide for more informations on tensorflow_datasets.

https://storage.googleapis.com/tfds-data/visualization/fig/curated_breast_imaging_ddsm-patches-3.0.0.png" alt="Visualization" width="500px">

Search
Clear search
Close search
Google apps
Main menu