Search
Clear search
Close search
Main menu
Google apps
100+ datasets found
  1. P

    UDC Dataset

    • paperswithcode.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau, UDC Dataset [Dataset]. https://paperswithcode.com/dataset/ubuntu-dialogue-corpus
    Explore at:
    Authors
    Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau
    Description

    Ubuntu Dialogue Corpus (UDC) is a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This provides a unique resource for research into building dialogue managers based on neural language models that can make use of large amounts of unlabeled data. The dataset has both the multi-turn property of conversations in the Dialog State Tracking Challenge datasets, and the unstructured nature of interactions from microblog services such as Twitter.

  2. t

    Ubuntu Dialogue Corpus (UDC) - Dataset - LDM

    • service.tib.eu
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Ubuntu Dialogue Corpus (UDC) - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-corpus--udc-
    Explore at:
    Dataset updated
    Jan 2, 2025
    Description

    The Ubuntu Dialogue Corpus (UDC) dataset was extracted from the Ubuntu Relay Chat Channel. Although the topics in the dataset are not as diverse as in the MTC, the dataset is very large, containing about 1.85 million conversations with an average of 5 utterances per conversation.

  3. h

    tags-ask-ubuntu

    • huggingface.co
    Updated Apr 4, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    tags-ask-ubuntu [Dataset]. https://huggingface.co/datasets/SauravMaheshkar/tags-ask-ubuntu
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 4, 2024
    Authors
    Saurav Maheshkar
    License

    https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/

    Description

    Source Paper: https://arxiv.org/abs/1802.06916

      Usage
    

    from torch_geometric.datasets.cornell import CornellTemporalHyperGraphDataset

    dataset = CornellTemporalHyperGraphDataset(root = "./", name="tags-ask-ubuntu", split="train")

      Citation
    

    @article{Benson-2018-simplicial, author = {Benson, Austin R. and Abebe, Rediet and Schaub, Michael T. and Jadbabaie, Ali and Kleinberg, Jon}, title = {Simplicial closure and higher-order link prediction}, year = {2018}, doi =… See the full description on the dataset page: https://huggingface.co/datasets/SauravMaheshkar/tags-ask-ubuntu.

  4. t

    Lowe et al. (2024). Dataset: Ubuntu Dialogue Corpus....

    • service.tib.eu
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Lowe et al. (2024). Dataset: Ubuntu Dialogue Corpus. https://doi.org/10.57702/tirz36oi [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-corpus
    Explore at:
    Dataset updated
    Dec 2, 2024
    Description

    The Ubuntu Dialogue Corpus is the largest freely available multi-turn based dialogue corpus which consists of almost one million two-way conversations extracted from the Ubuntu chat logs.

  5. t

    Ubuntu Dialogue dataset - Dataset - LDM

    • service.tib.eu
    Updated Nov 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Ubuntu Dialogue dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-dataset
    Explore at:
    Dataset updated
    Nov 25, 2024
    Description

    The Ubuntu Dialogue dataset consists of about 1.85 million conversations, each with an average of 5 utterances per conversation, ideal for training dialogue models that can provide expert knowledge or recommendations in domain-specific conversations.

  6. E

    Ubuntu

    • live.european-language-grid.eu
    tmx
    Updated Mar 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Ubuntu [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7339
    Explore at:
    tmxAvailable download formats
    Dataset updated
    Mar 26, 2022
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    EN-IS parallel corpus of Ubuntu localization files, 10,572 TUs, EN-IS, Domain: Software interface. The data originally came in aligned format from the Arni Magnusson Institute in Iceland. The following processing was performed: manual spot-check for quality.

  7. C

    Ubuntu Statistics By Market Share, Traffic Share, Usage Metrics and Facts

    • coolest-gadgets.com
    Updated Feb 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Coolest Gadgets (2025). Ubuntu Statistics By Market Share, Traffic Share, Usage Metrics and Facts [Dataset]. https://www.coolest-gadgets.com/ubuntu-statistics/
    Explore at:
    Dataset updated
    Feb 10, 2025
    Dataset authored and provided by
    Coolest Gadgets
    License

    https://www.coolest-gadgets.com/privacy-policyhttps://www.coolest-gadgets.com/privacy-policy

    Time period covered
    2022 - 2032
    Area covered
    Global
    Description

    Introduction

    Ubuntu Statistics: Ubuntu has a great reputation as one of the most widely used Linux distributions due to its simplicity, reliability, and outstanding community support. Now, in 2024, this has never changed, as it is a choice for personal and professional use. It is versatile and able to run on everything from their desktop to cloud servers and devices on the IOT.

    This article discusses the latest Ubuntu statistics, trends, and insights into what is happening in terms of its growth, usage, and market position in 2025.

  8. h

    my-test-dataset-ubuntu

    • huggingface.co
    Updated Apr 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    excode (2023). my-test-dataset-ubuntu [Dataset]. https://huggingface.co/datasets/excode/my-test-dataset-ubuntu
    Explore at:
    Dataset updated
    Apr 21, 2023
    Authors
    excode
    Description

    excode/my-test-dataset-ubuntu dataset hosted on Hugging Face and contributed by the HF Datasets community

  9. r

    List of 234,623 Canonical Ubuntu Customers

    • readycontacts.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). List of 234,623 Canonical Ubuntu Customers [Dataset]. https://www.readycontacts.com/target-account-profiling/canonical-ubuntu/
    Explore at:
    Dataset updated
    Feb 5, 2025
    Description

    Our Canonical Ubuntu Users List helps you reach your targeted prospects across the globe. Get Free customized Canonical Ubuntu Users Email List today and boost ROI.

  10. h

    misc-ubuntu-latest-3.8

    • huggingface.co
    Updated Sep 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Optimum-Benchmark (2024). misc-ubuntu-latest-3.8 [Dataset]. https://huggingface.co/datasets/optimum-benchmark/misc-ubuntu-latest-3.8
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 21, 2024
    Dataset authored and provided by
    Optimum-Benchmark
    Description

    optimum-benchmark/misc-ubuntu-latest-3.8 dataset hosted on Hugging Face and contributed by the HF Datasets community

  11. w

    Subjects of Ubuntu 8.10 Linux bible

    • workwithdata.com
    Updated Feb 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2025). Subjects of Ubuntu 8.10 Linux bible [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=book&fop0=%3D&fval0=Ubuntu+8.10+Linux+bible
    Explore at:
    Dataset updated
    Feb 3, 2025
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects and is filtered where the books is Ubuntu 8.10 Linux bible, featuring 10 columns including authors, average publication date, book publishers, book subject, and books. The preview is ordered by number of books (descending).

  12. w

    Subjects of Ubuntu server administration

    • workwithdata.com
    Updated Jul 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2024). Subjects of Ubuntu server administration [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=book&fop0=%3D&fval0=Ubuntu+server+administration
    Explore at:
    Dataset updated
    Jul 1, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is about book subjects and is filtered where the books is Ubuntu server administration. It has 10 columns such as authors, average publication date, book publishers, book subject, and books. The data is ordered by earliest publication date (descending).

  13. Ubuntu ovm

    • kaggle.com
    zip
    Updated Dec 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Đức Nguyễn (2024). Ubuntu ovm [Dataset]. https://www.kaggle.com/datasets/ducnguyen998/ubuntu-ovm
    Explore at:
    zip(8680166643 bytes)Available download formats
    Dataset updated
    Dec 17, 2024
    Authors
    Đức Nguyễn
    Description

    Dataset

    This dataset was created by Đức Nguyễn

    Contents

  14. h

    misc-ubuntu-latest-3.12

    • huggingface.co
    Updated Sep 21, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Optimum-Benchmark (2024). misc-ubuntu-latest-3.12 [Dataset]. https://huggingface.co/datasets/optimum-benchmark/misc-ubuntu-latest-3.12
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 21, 2024
    Dataset authored and provided by
    Optimum-Benchmark
    Description

    optimum-benchmark/misc-ubuntu-latest-3.12 dataset hosted on Hugging Face and contributed by the HF Datasets community

  15. P

    AskUbuntu Dataset

    • paperswithcode.com
    Updated Apr 13, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AskUbuntu Dataset [Dataset]. https://paperswithcode.com/dataset/askubuntu
    Explore at:
    Dataset updated
    Apr 13, 2021
    Description

    AskUbuntu question dataset is a preprocessed collection of questions taken from the AskUbuntu.com 2014 corpus dump. It also comes with 400*20 manual annotations, marking pairs of questions as "similar" or "non-similar".

  16. f

    Symbolic consumption and representation of self: a study of interactions in...

    • figshare.com
    • scielo.figshare.com
    jpeg
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    OSÍRIS LUÍS DA CUNHA FERNANDES; NELSON DA CRUZ MONTEIRO FERNANDES; FERNANDO GOMES DE PAIVA JÚNIOR; ANDRÉ LUIZ MARANHÃO DE SOUZA LEÃO; MARCONI FREITAS DA COSTA (2023). Symbolic consumption and representation of self: a study of interactions in a virtual community of Ubuntu-Br users [Dataset]. http://doi.org/10.6084/m9.figshare.11350673.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    SciELO journals
    Authors
    OSÍRIS LUÍS DA CUNHA FERNANDES; NELSON DA CRUZ MONTEIRO FERNANDES; FERNANDO GOMES DE PAIVA JÚNIOR; ANDRÉ LUIZ MARANHÃO DE SOUZA LEÃO; MARCONI FREITAS DA COSTA
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Abstract This study aims to explain how the symbolic consumption of the Ubuntu operating system is used for the representation of self in interactions in the Ubuntu virtual community from Brazil. We adopted the Goffmanian concept of self, the netnography of communication as the research method, and case study as a research strategy. The paralinguistic, the extralinguistic, and the definition of “I” are aspects used in virtual interactions. They have the linguistic function of corroborating and praising the statements of migration of Windows users to Ubuntu, emphasizing the distinctive features of the concept of Ubuntu, highlighting its expression of shared feelings of love and freedom, as ways of projecting the self of humanity to each other. In the case of the operating system, this characteristic is represented through the provision of support among users at the forum of the virtual community.

  17. f

    The Ubuntu Apache2 default page | Arts And Entertainment Data | Arts And...

    • datastore.forage.ai
    Updated Sep 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). The Ubuntu Apache2 default page | Arts And Entertainment Data | Arts And Entertainment [Dataset]. https://datastore.forage.ai/searchresults/?resource_keyword=Arts
    Explore at:
    Dataset updated
    Sep 24, 2024
    Description

    The Ubuntu Apache2 default page provides a brief introduction to the Apache2 server, a popular open-source web server software. This page serves as a diagnostic tool to test the installation and configuration of the Apache2 server on Ubuntu systems. It also provides a taste of the documentation available for the web server and its configuration options.

    The Ubuntu Apache2 default page is designed to be simple and easy to understand, with minimal technical jargon. The page describes the main configuration files and directories used by the Apache2 server, as well as how to manage and customize these settings. The page also provides an overview of the default document roots and how to configure additional document roots for virtual hosts.

  18. i

    Grant Giving Statistics for Ubuntu Kdce Foundation

    • instrumentl.com
    Updated Oct 15, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Grant Giving Statistics for Ubuntu Kdce Foundation [Dataset]. https://www.instrumentl.com/990-report/ubuntu-kdce-foundation
    Explore at:
    Dataset updated
    Oct 15, 2021
    Variables measured
    Total Assets, Total Giving, Average Grant Amount
    Description

    Financial overview and grant giving statistics of Ubuntu Kdce Foundation

  19. stockfish-ubuntu-x86-64-avx2

    • kaggle.com
    zip
    Updated Dec 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    kaggleuseer (2024). stockfish-ubuntu-x86-64-avx2 [Dataset]. https://www.kaggle.com/datasets/kaggleuseer/stockfish-ubuntu-x86-64-avx2/code
    Explore at:
    zip(64828140 bytes)Available download formats
    Dataset updated
    Dec 3, 2024
    Authors
    kaggleuseer
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by kaggleuseer

    Released under Apache 2.0

    Contents

  20. w

    ubuntu-service.com - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Updated Jan 4, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc (2025). ubuntu-service.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/ubuntu-service.com/
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jan 4, 2025
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Feb 25, 2025
    Description

    Explore the historical Whois records related to ubuntu-service.com (Domain). Get insights into ownership history and changes over time.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau, UDC Dataset [Dataset]. https://paperswithcode.com/dataset/ubuntu-dialogue-corpus

UDC Dataset

Ubuntu Dialogue Corpus

Explore at:
Authors
Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau
Description

Ubuntu Dialogue Corpus (UDC) is a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This provides a unique resource for research into building dialogue managers based on neural language models that can make use of large amounts of unlabeled data. The dataset has both the multi-turn property of conversations in the Dialog State Tracking Challenge datasets, and the unstructured nature of interactions from microblog services such as Twitter.