Search
Clear search
Close search
Main menu
Google apps
100+ datasets found
  1. Z

    MuMu: Multimodal Music Dataset

    • data.niaid.nih.gov
    • zenodo.org
    Updated Dec 6, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oramas, Sergio (2022). MuMu: Multimodal Music Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_831188
    Explore at:
    Dataset updated
    Dec 6, 2022
    Dataset authored and provided by
    Oramas, Sergio
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MuMu is a Multimodal Music dataset with multi-label genre annotations that combines information from the Amazon Reviews dataset and the Million Song Dataset (MSD). The former contains millions of album customer reviews and album metadata gathered from Amazon.com. The latter is a collection of metadata and precomputed audio features for a million songs.

    To map the information from both datasets we use MusicBrainz. This process yields the final set of 147,295 songs, which belong to 31,471 albums. For the mapped set of albums, there are 447,583 customer reviews from the Amazon Dataset. The dataset have been used for multi-label music genre classification experiments in the related publication. In addition to genre annotations, this dataset provides further information about each album, such as genre annotations, average rating, selling rank, similar products, and cover image url. For every text review it also provides helpfulness score of the reviews, average rating, and summary of the review.

    The mapping between the three datasets (Amazon, MusicBrainz and MSD), genre annotations, metadata, data splits, text reviews and links to images are available here. Images and audio files can not be released due to copyright issues.

    MuMu dataset (mapping, metadata, annotations and text reviews)

    Data splits and multimodal feature embeddings for ISMIR multi-label classification experiments

    These data can be used together with the Tartarus deep learning library https://github.com/sergiooramas/tartarus.

    NOTE: This version provides simplified files with metadata and splits.

    Scientific References

    Please cite the following papers if using MuMu dataset or Tartarus library.

    Oramas, S., Barbieri, F., Nieto, O., and Serra, X (2018). Multimodal Deep Learning for Music Genre Classification, Transactions of the International Society for Music Information Retrieval, V(1).

    Oramas S., Nieto O., Barbieri F., & Serra X. (2017). Multi-label Music Genre Classification from audio, text and images using Deep Features. In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017). https://arxiv.org/abs/1707.04916

  2. Electronic Music Features Dataset

    • kaggle.com
    Updated Dec 4, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Caparrini (2017). Electronic Music Features Dataset [Dataset]. https://www.kaggle.com/datasets/caparrini/beatsdataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 4, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Caparrini
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Context

    What makes us, humans, able to tell apart two songs of different genres? Maybe you have ever been in the diffcult situation to explain show it sounds the music style that you like to someone. Then, could an automatic genre classifcation be possible?

    Content

    Each row is an electronic music song. The dataset contains 100 song for each genre among 23 electronic music genres, they were the top (100) songs of their genres on November 2016. The 71 columns are audio features extracted of a two random minutes sample of the file audio. These features have been extracted using pyAudioAnalysis (https://github.com/tyiannak/pyAudioAnalysis).

    The song names for each track is exactly the same as in the https://www.kaggle.com/caparrini/electronic-music-features-201611-beatporttop100 dataset.

  3. Indian Regional Music Dataset

    • zenodo.org
    bin
    Updated May 27, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yeshwant Singh; Yeshwant Singh; Anupam Biswas; Anupam Biswas (2022). Indian Regional Music Dataset [Dataset]. http://doi.org/10.5281/zenodo.5882452
    Explore at:
    binAvailable download formats
    Dataset updated
    May 27, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Yeshwant Singh; Yeshwant Singh; Anupam Biswas; Anupam Biswas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a collection of mel-spectrogram features extracted from Indian regional music containing the following languages:
    Hindi, Gujarati, Marathi, Konkani, Bengali, Oriya, Kashmiri, Assamese, Nepali, Konyak, Manipuri, Khasi & Jaintia, Tamil, Malayalam, Punjabi, Telugu, Kannada.

    Five recordings are collected for each language for four artists (2Male + 2Female) each. 2 artists out of 4 for each language are old veteran performers, and the remaining 2 are contemporary performers. Overall, the dataset includes 17 languages and 68 artists (34 Males and 34 Females). There are 340 recordings in the dataset, with a total duration of 29.3 hrs.

    Mel-spectrogram is extracted from a 1-second segment with a 1/2 second sliding window for each song. Extracted mel-spectrogram for each segment is annotated with language, location, local_song_index, global_song_index, language_id, location_id, artist_id, gender_id and no_of_artists.

    _

    This project was funded under the grant number: ECR/2018/000204 by the Science & Engineering Research Board (SERB).

  4. m

    Music Dataset: Lyrics and Metadata from 1950 to 2019

    • data.mendeley.com
    • narcis.nl
    Updated Oct 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luan Moura (2020). Music Dataset: Lyrics and Metadata from 1950 to 2019 [Dataset]. http://doi.org/10.17632/3t9vbwxgr5.3
    Explore at:
    Dataset updated
    Oct 23, 2020
    Authors
    Luan Moura
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset was studied on Temporal Analysis and Visualisation of Music paper, in the following link:

           https://sol.sbc.org.br/index.php/eniac/article/view/12155
    

    This dataset provides a list of lyrics from 1950 to 2019 describing music metadata as sadness, danceability, loudness, acousticness, etc. We also provide some informations as lyrics which can be used to natural language processing.

    The audio data was scraped using Echo Nest® API integrated engine with spotipy Python’s package. The spotipy API permits the user to search for specific genres, artists,songs, release date, etc. To obtain the lyrics we used the Lyrics Genius® API as baseURL for requesting data based on the song title and artist name.

  5. MGD: Music Genre Dataset

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated May 28, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel P. Oliveira; Gabriel P. Oliveira; Mariana O. Silva; Mariana O. Silva; Danilo B. Seufitelli; Danilo B. Seufitelli; Anisio Lacerda; Mirella M. Moro; Mirella M. Moro; Anisio Lacerda (2021). MGD: Music Genre Dataset [Dataset]. http://doi.org/10.5281/zenodo.4778563
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 28, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gabriel P. Oliveira; Gabriel P. Oliveira; Mariana O. Silva; Mariana O. Silva; Danilo B. Seufitelli; Danilo B. Seufitelli; Anisio Lacerda; Mirella M. Moro; Mirella M. Moro; Anisio Lacerda
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MGD: Music Genre Dataset

    Over recent years, the world has seen a dramatic change in the way people consume music, moving from physical records to streaming services. Since 2017, such services have become the main source of revenue within the global recorded music market.
    Therefore, this dataset is built by using data from Spotify. It provides a weekly chart of the 200 most streamed songs for each country and territory it is present, as well as an aggregated global chart.

    Considering that countries behave differently when it comes to musical tastes, we use chart data from global and regional markets from January 2017 to December 2019, considering eight of the top 10 music markets according to IFPI: United States (1st), Japan (2nd), United Kingdom (3rd), Germany (4th), France (5th), Canada (8th), Australia (9th), and Brazil (10th).

    We also provide information about the hit songs and artists present in the charts, such as all collaborating artists within a song (since the charts only provide the main ones) and their respective genres, which is the core of this work. MGD also provides data about musical collaboration, as we build collaboration networks based on artist partnerships in hit songs. Therefore, this dataset contains:

    • Genre Networks: Success-based genre collaboration networks
    • Genre Mapping: Genre mapping from Spotify genres to super-genres
    • Artist Networks: Success-based artist collaboration networks
    • Artists: Some artist data
    • Hit Songs: Hit Song data and features
    • Charts: Enhanced data from Spotify Weekly Top 200 Charts

    This dataset was originally built for a conference paper at ISMIR 2020. If you make use of the dataset, please also cite the following paper:

    Gabriel P. Oliveira, Mariana O. Silva, Danilo B. Seufitelli, Anisio Lacerda, and Mirella M. Moro. Detecting Collaboration Profiles in Success-based Music Genre Networks. In Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR 2020), 2020.

    @inproceedings{ismir/OliveiraSSLM20,
     title = {Detecting Collaboration Profiles in Success-based Music Genre Networks},
     author = {Gabriel P. Oliveira and 
          Mariana O. Silva and 
          Danilo B. Seufitelli and 
          Anisio Lacerda and
          Mirella M. Moro},
     booktitle = {21st International Society for Music Information Retrieval Conference}
     pages = {726--732},
     year = {2020}
    }

  6. music_genre

    • huggingface.co
    Updated Sep 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CCMUSIC Database (2023). music_genre [Dataset]. https://huggingface.co/datasets/ccmusic-database/music_genre
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 30, 2023
    Dataset provided by
    CCMusic.com
    Authors
    CCMUSIC Database
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Dataset Card for Music Genre

    The Default dataset comprises approximately 1,700 musical pieces in .mp3 format, sourced from the NetEase music. The lengths of these pieces range from 270 to 300 seconds. All are sampled at the rate of 22,050 Hz. As the website providing the audio music includes style labels for the downloaded music, there are no specific annotators involved. Validation is achieved concurrently with the downloading process. They are categorized into a total of 16… See the full description on the dataset page: https://huggingface.co/datasets/ccmusic-database/music_genre.

  7. MusicCaps

    • huggingface.co
    Updated Jan 27, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google (2023). MusicCaps [Dataset]. https://huggingface.co/datasets/google/MusicCaps
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 27, 2023
    Dataset authored and provided by
    Googlehttp://google.com/
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Dataset Card for MusicCaps

      Dataset Summary
    

    The MusicCaps dataset contains 5,521 music examples, each of which is labeled with an English aspect list and a free text caption written by musicians. An aspect list is for example "pop, tinny wide hi hats, mellow piano melody, high pitched female vocal melody, sustained pulsating synth lead", while the caption consists of multiple sentences about the music, e.g., "A low sounding male voice is rapping over a fast paced… See the full description on the dataset page: https://huggingface.co/datasets/google/MusicCaps.

  8. Data from: MusicOSet: An Enhanced Open Dataset for Music Data Mining

    • zenodo.org
    • data.niaid.nih.gov
    bin, zip
    Updated Jun 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota (2021). MusicOSet: An Enhanced Open Dataset for Music Data Mining [Dataset]. http://doi.org/10.5281/zenodo.4904639
    Explore at:
    zip, binAvailable download formats
    Dataset updated
    Jun 7, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Mariana O. Silva; Mariana O. Silva; Laís Mota; Mirella M. Moro; Mirella M. Moro; Laís Mota
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    MusicOSet is an open and enhanced dataset of musical elements (artists, songs and albums) based on musical popularity classification. Provides a directly accessible collection of data suitable for numerous tasks in music data mining (e.g., data visualization, classification, clustering, similarity search, MIR, HSS and so forth). To create MusicOSet, the potential information sources were divided into three main categories: music popularity sources, metadata sources, and acoustic and lyrical features sources. Data from all three categories were initially collected between January and May 2019. Nevertheless, the update and enhancement of the data happened in June 2019.

    The attractive features of MusicOSet include:

    • Integration and centralization of different musical data sources
    • Calculation of popularity scores and classification of hits and non-hits musical elements, varying from 1962 to 2018
    • Enriched metadata for music, artists, and albums from the US popular music industry
    • Availability of acoustic and lyrical resources
    • Unrestricted access in two formats: SQL database and compressed .csv files
    |    Data    | # Records |
    |:-----------------:|:---------:|
    | Songs       | 20,405  |
    | Artists      | 11,518  |
    | Albums      | 26,522  |
    | Lyrics      | 19,664  |
    | Acoustic Features | 20,405  |
    | Genres      | 1,561   |
  9. P

    MUSIC-AVQA Dataset

    • paperswithcode.com
    Updated Dec 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guangyao Li; Yake Wei; Yapeng Tian; Chenliang Xu; Ji-Rong Wen; Di Hu (2023). MUSIC-AVQA Dataset [Dataset]. https://paperswithcode.com/dataset/music-avqa
    Explore at:
    Dataset updated
    Dec 5, 2023
    Authors
    Guangyao Li; Yake Wei; Yapeng Tian; Chenliang Xu; Ji-Rong Wen; Di Hu
    Description

    The large-scale MUSIC-AVQA dataset of musical performance contains 45,867 question-answer pairs, distributed in 9,288 videos for over 150 hours. All QA pairs types are divided into 3 modal scenarios, which contain 9 question types and 33 question templates. Finally, as an open-ended problem of our AVQA tasks, all 42 kinds of answers constitute a set for selection.

  10. Ways to discover new music worldwide 2022, by age

    • statista.com
    Updated Aug 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Ways to discover new music worldwide 2022, by age [Dataset]. https://www.statista.com/statistics/1273351/new-music-discovery-by-age-worldwide/
    Explore at:
    Dataset updated
    Aug 2, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Nov 2021
    Area covered
    Worldwide
    Description

    According to a study on music consumption worldwide in 2022, younger generations tended to find new songs via music apps and social media, while older generations also used the radio as a format to discover new audio content.

  11. Z

    Song Describer Dataset

    • data.niaid.nih.gov
    • huggingface.co
    • +2more
    Updated Jul 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Won, Minz (2024). Song Describer Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10072000
    Explore at:
    Dataset updated
    Jul 10, 2024
    Dataset provided by
    Weck, Benno
    Won, Minz
    Manco, Ilaria
    Bogdanov, Dmitry
    Tovstogan, Philip
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

    A retro-futurist drum machine groove drenched in bubbly synthetic sound effects and a hint of an acid bassline. The Song Describer Dataset (SDD) contains ~1.1k captions for 706 permissively licensed music recordings. It is designed for use in evaluation of models that address music-and-language (M&L) tasks such as music captioning, text-to-music generation and music-language retrieval. More information about the data, collection method and validation is provided in the paper describing the dataset. If you use this dataset, please cite our paper: The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation, Manco, Ilaria and Weck, Benno and Doh, Seungheon and Won, Minz and Zhang, Yixiao and Bogdanov, Dmitry and Wu, Yusong and Chen, Ke and Tovstogan, Philip and Benetos, Emmanouil and Quinton, Elio and Fazekas, György and Nam, Juhan, Machine Learning for Audio Workshop at NeurIPS 2023, 2023

  12. h

    music-fingerprint-dataset

    • huggingface.co
    Updated Sep 22, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aditya Kumar (2022). music-fingerprint-dataset [Dataset]. https://huggingface.co/datasets/arch-raven/music-fingerprint-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 22, 2022
    Authors
    Aditya Kumar
    Description

    Neural Audio Fingerprint Dataset

    (c) 2021 by Sungkyun Chang https://github.com/mimbres/neural-audio-fp This dataset includes all music sources, background noise and impulse-reponses (IR) samples that have been used in the work "https://arxiv.org/abs/2010.11910">"Neural Audio Fingerprint for High-specific Audio Retrieval based on Contrastive Learning".

      Format:
    

    16-bit PCM Mono WAV, Sampling rate 8000 Hz

      Description:
    

    /… See the full description on the dataset page: https://huggingface.co/datasets/arch-raven/music-fingerprint-dataset.

  13. Music Dataset

    • brightdata.com
    .json, .csv, .xlsx
    Updated Jan 6, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bright Data (2017). Music Dataset [Dataset]. https://brightdata.com/products/datasets/music
    Explore at:
    .json, .csv, .xlsxAvailable download formats
    Dataset updated
    Jan 6, 2017
    Dataset authored and provided by
    Bright Datahttps://brightdata.com/
    License

    https://brightdata.com/licensehttps://brightdata.com/license

    Area covered
    Worldwide
    Description

    Unlock powerful insights with our custom music datasets, offering access to millions of records from popular music platforms like Spotify, SoundCloud, Amazon Music, YouTube Music, and more. These datasets provide comprehensive data points such as track titles, artists, albums, genres, release dates, play counts, playlist details, popularity scores, user-generated tags, and much more, allowing you to analyze music trends, listener behavior, and industry patterns with precision. Use these datasets to optimize your music strategies by identifying trending tracks, analyzing artist performance, understanding playlist dynamics, and tracking audience preferences across platforms. Gain valuable insights into streaming habits, regional popularity, and emerging genres to make data-driven decisions that enhance your marketing campaigns, content creation, and audience engagement. Whether you’re a music producer, marketer, data analyst, or researcher, our music datasets empower you with the data needed to stay ahead in the ever-evolving music industry. Available in various formats such as JSON, CSV, and Parquet, and delivered via flexible options like API, S3, or email, these datasets ensure seamless integration into your workflows.

  14. Music Genre fMRI Dataset

    • openneuro.org
    Updated Aug 23, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomoya Nakai; Naoko Koide-Majima; Shinji Nishimoto (2023). Music Genre fMRI Dataset [Dataset]. http://doi.org/10.18112/openneuro.ds003720.v1.0.1
    Explore at:
    Dataset updated
    Aug 23, 2023
    Dataset provided by
    OpenNeurohttps://openneuro.org/
    Authors
    Tomoya Nakai; Naoko Koide-Majima; Shinji Nishimoto
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Music Genre fMRI Dataset by Tomoya Nakai, Naoko Koide-Majima, and Shinji Nishimoto

    References:

    1. Nakai, Koide-Majima, and Nishimoto (2021). Correspondence of categorical and feature-based representations of music in the human brain. Brain and Behavior. 11(1), e01936. https://doi.org/10.1002/brb3.1936

    2. Nakai, Koide-Majima, and Nishimoto (2022). Music genre neuroimaging dataset. Data in Brief. 40, 107675. https://doi.org/10.1016/j.dib.2021.107675

    We measured brain activity using functional MRI while five subjects (“sub-001”, …, “sub-005”) listened to music stimuli of 10 different genres.

    The entire folder consists of subject-wise subfolders (“sub-001”,…). Each subject’s folder contains the following subfolders:

    1. anat: T1-weighted structural images

    2. func: functional signals (multi-band echo-planar images)

    Each subject performed 18 runs consisting of 12 training runs and 6 test runs. The training and test data were assigned with the following notations:

    1. Training data: sub-00*_task-Training_run-**_bold.json

    2. Test data: sub-00*_task-Test_run-**_bold.json

    Each *_event.tsv file contains following information:

    1. Onset: stimulus onset

    2. Genre: genre type (out of 10 genres)

    3. Track: index to identify the original track

    4. Start: onset of excerpt from the original track (second)

    5. End: offset of excerpt from the original track (second)

    The duration of all stimuli is 15s. For each clip, 2 s of fade-in and fade-out effects were applied, and the overall signal intensity was normalized in terms of the root mean square.

    For the training runs, the 1st stimulus (0-15s) is the same as the last stimulus of the previous run (600-615s). For the test runs, the1st stimulus (0-15s) is the same as the last stimulus of the same run (600-615s).

    Preprocessed data are available from Zenodo (https://doi.org/10.5281/zenodo.8275363). Experimental stimuli can be generated using GTZAN_Preprocess.py included in the same repository.

    The original music stimuli (GTZAN dataset) can be found here: https://www.kaggle.com/datasets/andradaolteanu/gtzan-dataset-music-genre-classification

    Caution This dataset can be used for research purposes only. The data were anonymized, and users shall not perform analyses to re-identify individual subjects.

  15. Music Album Reviews and Ratings Dataset

    • kaggle.com
    Updated Aug 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Bryant (2022). Music Album Reviews and Ratings Dataset [Dataset]. https://www.kaggle.com/datasets/michaelbryantds/78k-music-album-reviews
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 3, 2022
    Dataset provided by
    Kaggle
    Authors
    Michael Bryant
    Description

    Context

    This dataset contains 80K music album reviews by the users of rateyourmusic.com.

    Content

    The dataset was acquired by scraping on May 2022. It contains 79922 album reviews and ratings (if available).

    The scraper can be found at this GitHub repo.

    Acknowledgements

    The album chart of albums from which the reviews were scraped from can be found here.

    Inspiration

    This dataset can be used to practice NLP.

  16. MuSe: The Musical Sentiment Dataset

    • zenodo.org
    • data.niaid.nih.gov
    bin, csv
    Updated Mar 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christopher Akiki; Christopher Akiki; Manuel Burghardt; Manuel Burghardt (2021). MuSe: The Musical Sentiment Dataset [Dataset]. http://doi.org/10.5281/zenodo.4281165
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    Mar 1, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Christopher Akiki; Christopher Akiki; Manuel Burghardt; Manuel Burghardt
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The MuSe (Music Sentiment) dataset contains sentiment information for 90,408 songs. We computed scores for the affective dimensions of valence, dominance and arousal, based on the user-generated tags that are available for each song via Last.fm. In addition, we provide artist and title metadata as well as a Spotify ID and a MusicBrainz ID, which allow researchers to extend the dataset with further metadata, such as genre or year.

    Though the tags themselves cannot be included in the dataset, we include a jupyter notebook in our accompanying Github repository that demonstrates how to fetch the tags of a given song from the Last.fm API (Last.fm_API.ipynb)

    We further include a jupyter notebook in the same repository that demonstrates how one might enrich the dataset with audio features using different endpoints of the Spotify API using the included Spotify IDs (spotify_API.ipynb). Please note that in its current form, the dataset only contains tentative spotify IDs for a subset (around 68%) of the songs.

  17. Indian Folk Music Dataset

    • zenodo.org
    • data.niaid.nih.gov
    bin
    Updated May 27, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yeshwant Singh; Yeshwant Singh; Lilapati Waikhom; Lilapati Waikhom; Vivek Meena; Vivek Meena; Anupam Biswas; Anupam Biswas (2022). Indian Folk Music Dataset [Dataset]. http://doi.org/10.5281/zenodo.6584021
    Explore at:
    binAvailable download formats
    Dataset updated
    May 27, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Yeshwant Singh; Yeshwant Singh; Lilapati Waikhom; Lilapati Waikhom; Vivek Meena; Vivek Meena; Anupam Biswas; Anupam Biswas
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a collection of mel-spectrogram features extracted from Indian folk music containing the following 15 folk styles:
    Bauls, Bhavageethe, Garba, Kajri, Maand, Sohar, Tamang Selo, Veeragase, Bhatiali, Bihu, Gidha, Lavani, Naatupura Paatu, Sufi, Uttarakhandi.

    The number of recordings varies from 16 to 50 in the mentioned folk styles representing the scarcity of availability of given folk styles on the Internet. There are at least 4 artists and a maximum of 22. Overall there are 125 artists (34 female + 91 male) in these 15 folk styles.

    There is a total of 606 recordings in the dataset, with a total duration of 54.45 hrs.
    Mel-spectrogram is extracted from a 3-second segment with each song's 1/2 second sliding window. Extracted mel-spectrogram for each segment is annotated with folk_style, state, artist, gender, song, source, no_of_artists, folk_style_id, state_id, artist_id, gender_id.
    _
    This project was funded under the grant number: ECR/2018/000204 by the Science & Engineering Research Board (SERB).

  18. t

    MUSIC dataset - Dataset - LDM

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). MUSIC dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/music-dataset
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The MUSIC dataset used in this paper for spectral CT reconstruction.

  19. Most popular music streaming services in the U.S. 2018-2019, by audience

    • statista.com
    Updated May 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2024). Most popular music streaming services in the U.S. 2018-2019, by audience [Dataset]. https://www.statista.com/statistics/798125/most-popular-us-music-streaming-services-ranked-by-audience/
    Explore at:
    Dataset updated
    May 29, 2024
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Mar 2018 - Sep 2019
    Area covered
    United States
    Description

    The most successful music streaming service in the United States was Apple Music as of September, with the most up to date information showing that 49.5 million users accessed the platform each month. Spotify closely followed, with a similarly impressive 47.7 million monthly users.

    What is a music streaming service?

    Music streaming services provide their users with a database compiled of songs, playlists, albums and videos, where content can be accessed online, downloaded, shared, bookmarked and organized.

    The music streaming business is huge, and has sometimes been lauded as the savior of the music industry. The biggest two services are in constant competition for the monopoly of the market. Apple Music was launched in 2015, whereas Spotify has been around since 2008. Other popular streaming services include Deezer, SoundCloud and iHeartRadio.

    Do artists make a lot of money from streaming services? 

    In short, unfortunately not. Both Apple Music and Spotify have been frequently criticized for the tiny royalty payments they offer artists. Particularly for emerging talent, streaming services are far from a lucrative source of income. Bigger, established stars like Taylor Swift are more likely to regularly make a good amount of money this way. But either way, a track needs to go viral or be streamed several million times before it earns any real cash.

  20. Moroccan Music Data

    • kaggle.com
    Updated Jun 12, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ac0Hik (2023). Moroccan Music Data [Dataset]. https://www.kaggle.com/datasets/ac0hik/moroccan-music-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 12, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Ac0Hik
    Description

    Our Moroccan Music Dataset is a comprehensive collection of audio samples representing six prominent genres of Moroccan music. The dataset is structured into six folders, each dedicated to a specific genre: Chaabi, Rap, Andalusian, Rai, Gnawa, and Imazighn.

    Each folder contains a diverse range of audio samples showcasing the unique characteristics and styles of the respective genre. From the infectious beats of Chaabi to the powerful lyrics of Rap, the soulful melodies of Andalusian, the energetic rhythms of Rai, the spiritual chants of Gnawa, and the traditional sounds of Imazighn, the dataset encompasses the rich musical landscape of Morocco.

    thumbnail : https://www.freepik.com/free-photo/still-life-art-photography-concept-with-antique-samovar-violin-isolated-black-background_13499739.htm#query=moroccan%20music&position=24&from_view=search&track=ais

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Oramas, Sergio (2022). MuMu: Multimodal Music Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_831188

MuMu: Multimodal Music Dataset

Explore at:
Dataset updated
Dec 6, 2022
Dataset authored and provided by
Oramas, Sergio
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

MuMu is a Multimodal Music dataset with multi-label genre annotations that combines information from the Amazon Reviews dataset and the Million Song Dataset (MSD). The former contains millions of album customer reviews and album metadata gathered from Amazon.com. The latter is a collection of metadata and precomputed audio features for a million songs.

To map the information from both datasets we use MusicBrainz. This process yields the final set of 147,295 songs, which belong to 31,471 albums. For the mapped set of albums, there are 447,583 customer reviews from the Amazon Dataset. The dataset have been used for multi-label music genre classification experiments in the related publication. In addition to genre annotations, this dataset provides further information about each album, such as genre annotations, average rating, selling rank, similar products, and cover image url. For every text review it also provides helpfulness score of the reviews, average rating, and summary of the review.

The mapping between the three datasets (Amazon, MusicBrainz and MSD), genre annotations, metadata, data splits, text reviews and links to images are available here. Images and audio files can not be released due to copyright issues.

MuMu dataset (mapping, metadata, annotations and text reviews)

Data splits and multimodal feature embeddings for ISMIR multi-label classification experiments

These data can be used together with the Tartarus deep learning library https://github.com/sergiooramas/tartarus.

NOTE: This version provides simplified files with metadata and splits.

Scientific References

Please cite the following papers if using MuMu dataset or Tartarus library.

Oramas, S., Barbieri, F., Nieto, O., and Serra, X (2018). Multimodal Deep Learning for Music Genre Classification, Transactions of the International Society for Music Information Retrieval, V(1).

Oramas S., Nieto O., Barbieri F., & Serra X. (2017). Multi-label Music Genre Classification from audio, text and images using Deep Features. In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR 2017). https://arxiv.org/abs/1707.04916