100+ datasets found

UDC Dataset
paperswithcode.com
Cite
Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau, UDC Dataset [Dataset]. https://paperswithcode.com/dataset/ubuntu-dialogue-corpus
Authors
Ryan Lowe; Nissan Pow; Iulian Serban; Joelle Pineau
Description
Ubuntu Dialogue Corpus (UDC) is a dataset containing almost 1 million multi-turn dialogues, with a total of over 7 million utterances and 100 million words. This provides a unique resource for research into building dialogue managers based on neural language models that can make use of large amounts of unlabeled data. The dataset has both the multi-turn property of conversations in the Dialog State Tracking Challenge datasets, and the unstructured nature of interactions from microblog services such as Twitter.
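As a rough sanity check on the scale figures quoted above, the implied averages can be estimated directly. This is only a sketch: the inputs are the rounded figures from the description ("almost 1 million", "over 7 million", "100 million"), not exact corpus counts, and the function name is illustrative rather than part of any dataset tooling.

```python
def rough_averages(dialogues: int, utterances: int, words: int) -> tuple[float, float]:
    """Return (utterances per dialogue, words per utterance)."""
    return utterances / dialogues, words / utterances


# Rounded figures taken from the description; real counts differ slightly.
turns, words_per_turn = rough_averages(1_000_000, 7_000_000, 100_000_000)
print(f"~{turns:.1f} utterances per dialogue, ~{words_per_turn:.1f} words per utterance")
# prints "~7.0 utterances per dialogue, ~14.3 words per utterance"
```

Because the dialogue count is rounded up and the utterance count rounded down, the true utterances-per-dialogue average is somewhat higher than this estimate.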
Ubuntu Dialogue Corpus (UDC) - Dataset - LDM
service.tib.eu
Updated Jan 2, 2025
Cite
(2025). Ubuntu Dialogue Corpus (UDC) - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-corpus--udc-
Dataset updated
Jan 2, 2025
Description
The Ubuntu Dialogue Corpus (UDC) dataset was extracted from the Ubuntu Relay Chat Channel. Although the topics in the dataset are not as diverse as in the MTC, the dataset is very large, containing about 1.85 million conversations with an average of 5 utterances per conversation.
tags-ask-ubuntu
huggingface.co
Updated Apr 4, 2024
+ more versions
Cite
tags-ask-ubuntu [Dataset]. https://huggingface.co/datasets/SauravMaheshkar/tags-ask-ubuntu
Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
Dataset updated
Apr 4, 2024
Authors
Saurav Maheshkar
License
https://choosealicense.com/licenses/unknown/
Description
Source Paper: https://arxiv.org/abs/1802.06916

Usage

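# Requires PyTorch Geometric (torch_geometric).
from torch_geometric.datasets.cornell import CornellTemporalHyperGraphDataset

# Load the training split; root is the directory where the data is stored.
dataset = CornellTemporalHyperGraphDataset(root="./", name="tags-ask-ubuntu", split="train")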

Citation

@article{Benson-2018-simplicial, author = {Benson, Austin R. and Abebe, Rediet and Schaub, Michael T. and Jadbabaie, Ali and Kleinberg, Jon}, title = {Simplicial closure and higher-order link prediction}, year = {2018}, doi =… See the full description on the dataset page: https://huggingface.co/datasets/SauravMaheshkar/tags-ask-ubuntu.
Lowe et al. (2024). Dataset: Ubuntu Dialogue Corpus....
service.tib.eu
Updated Dec 2, 2024
Cite
(2024). Lowe et al. (2024). Dataset: Ubuntu Dialogue Corpus. https://doi.org/10.57702/tirz36oi [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-corpus
Dataset updated
Dec 2, 2024
Description
The Ubuntu Dialogue Corpus is the largest freely available multi-turn dialogue corpus, consisting of almost one million two-way conversations extracted from the Ubuntu chat logs.
Ubuntu Dialogue dataset - Dataset - LDM
service.tib.eu
Updated Nov 25, 2024
+ more versions
Cite
(2024). Ubuntu Dialogue dataset - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/ubuntu-dialogue-dataset
Dataset updated
Nov 25, 2024
Description
The Ubuntu Dialogue dataset consists of about 1.85 million conversations with an average of 5 utterances each, making it well suited to training dialogue models that provide expert knowledge or recommendations in domain-specific conversations.
Ubuntu
live.european-language-grid.eu
tmx
Updated Mar 26, 2022
Cite
(2022). Ubuntu [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7339
Available download formats: tmx
Dataset updated
Mar 26, 2022
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0): https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
EN-IS parallel corpus of Ubuntu localization files, 10,572 TUs, EN-IS, Domain: Software interface. The data originally came in aligned format from the Arni Magnusson Institute in Iceland. The following processing was performed: manual spot-check for quality.
Ubuntu Statistics By Market Share, Traffic Share, Usage Metrics and Facts
coolest-gadgets.com
Updated Feb 10, 2025
+ more versions
Cite
Coolest Gadgets (2025). Ubuntu Statistics By Market Share, Traffic Share, Usage Metrics and Facts [Dataset]. https://www.coolest-gadgets.com/ubuntu-statistics/
Dataset updated
Feb 10, 2025
Dataset authored and provided by
Coolest Gadgets
License
https://www.coolest-gadgets.com/privacy-policy
Time period covered
2022 - 2032
Area covered
Global
Description
Introduction

Ubuntu Statistics: Ubuntu has a strong reputation as one of the most widely used Linux distributions, thanks to its simplicity, reliability, and community support. As of 2024 this has not changed: it remains a popular choice for both personal and professional use. It is versatile, running on everything from desktops to cloud servers and IoT devices.

This article discusses the latest Ubuntu statistics, trends, and insights into what is happening in terms of its growth, usage, and market position in 2025.
my-test-dataset-ubuntu
huggingface.co
Updated Apr 21, 2023
+ more versions
Cite
excode (2023). my-test-dataset-ubuntu [Dataset]. https://huggingface.co/datasets/excode/my-test-dataset-ubuntu
Dataset updated
Apr 21, 2023
Authors
excode
Description
excode/my-test-dataset-ubuntu dataset hosted on Hugging Face and contributed by the HF Datasets community
List of 234,623 Canonical Ubuntu Customers
readycontacts.com
Updated Feb 5, 2025
Cite
(2025). List of 234,623 Canonical Ubuntu Customers [Dataset]. https://www.readycontacts.com/target-account-profiling/canonical-ubuntu/
Dataset updated
Feb 5, 2025
Description
Our Canonical Ubuntu Users List helps you reach your targeted prospects across the globe. Get Free customized Canonical Ubuntu Users Email List today and boost ROI.
misc-ubuntu-latest-3.8
huggingface.co
Updated Sep 21, 2024
+ more versions
Cite
Optimum-Benchmark (2024). misc-ubuntu-latest-3.8 [Dataset]. https://huggingface.co/datasets/optimum-benchmark/misc-ubuntu-latest-3.8
Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
Dataset updated
Sep 21, 2024
Dataset authored and provided by
Optimum-Benchmark
Description
optimum-benchmark/misc-ubuntu-latest-3.8 dataset hosted on Hugging Face and contributed by the HF Datasets community
Subjects of Ubuntu 8.10 Linux bible
workwithdata.com
Updated Feb 3, 2025
Cite
Work With Data (2025). Subjects of Ubuntu 8.10 Linux bible [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=book&fop0=%3D&fval0=Ubuntu+8.10+Linux+bible
Dataset updated
Feb 3, 2025
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about book subjects and is filtered to the book Ubuntu 8.10 Linux bible. It features 10 columns, including authors, average publication date, book publishers, book subject, and books. The preview is ordered by number of books (descending).
Subjects of Ubuntu server administration
workwithdata.com
Updated Jul 1, 2024
Cite
Work With Data (2024). Subjects of Ubuntu server administration [Dataset]. https://www.workwithdata.com/datasets/book-subjects?f=1&fcol0=book&fop0=%3D&fval0=Ubuntu+server+administration
Dataset updated
Jul 1, 2024
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is about book subjects and is filtered to the book Ubuntu server administration. It has 10 columns, including authors, average publication date, book publishers, book subject, and books. The data is ordered by earliest publication date (descending).
Ubuntu ovm
kaggle.com
zip
Updated Dec 17, 2024
Cite
Đức Nguyễn (2024). Ubuntu ovm [Dataset]. https://www.kaggle.com/datasets/ducnguyen998/ubuntu-ovm
Available download formats: zip (8680166643 bytes)
Dataset updated
Dec 17, 2024
Authors
Đức Nguyễn
Description
Dataset

This dataset was created by Đức Nguyễn

Contents
misc-ubuntu-latest-3.12
huggingface.co
Updated Sep 21, 2024
+ more versions
Cite
Optimum-Benchmark (2024). misc-ubuntu-latest-3.12 [Dataset]. https://huggingface.co/datasets/optimum-benchmark/misc-ubuntu-latest-3.12
Croissant: a format for machine-learning datasets. Learn more at mlcommons.org/croissant.
Dataset updated
Sep 21, 2024
Dataset authored and provided by
Optimum-Benchmark
Description
optimum-benchmark/misc-ubuntu-latest-3.12 dataset hosted on Hugging Face and contributed by the HF Datasets community
AskUbuntu Dataset
paperswithcode.com
Updated Apr 13, 2021
Cite
AskUbuntu Dataset [Dataset]. https://paperswithcode.com/dataset/askubuntu
Dataset updated
Apr 13, 2021
Description
AskUbuntu question dataset is a preprocessed collection of questions taken from the AskUbuntu.com 2014 corpus dump. It also comes with 400*20 manual annotations, marking pairs of questions as "similar" or "non-similar".
Symbolic consumption and representation of self: a study of interactions in...
figshare.com
scielo.figshare.com
jpeg
Updated Jun 1, 2023
Cite
OSÍRIS LUÍS DA CUNHA FERNANDES; NELSON DA CRUZ MONTEIRO FERNANDES; FERNANDO GOMES DE PAIVA JÚNIOR; ANDRÉ LUIZ MARANHÃO DE SOUZA LEÃO; MARCONI FREITAS DA COSTA (2023). Symbolic consumption and representation of self: a study of interactions in a virtual community of Ubuntu-Br users [Dataset]. http://doi.org/10.6084/m9.figshare.11350673.v1
Available download formats: jpeg
Unique identifier
https://doi.org/10.6084/m9.figshare.11350673.v1
Dataset updated
Jun 1, 2023
Dataset provided by
SciELO journals
Authors
OSÍRIS LUÍS DA CUNHA FERNANDES; NELSON DA CRUZ MONTEIRO FERNANDES; FERNANDO GOMES DE PAIVA JÚNIOR; ANDRÉ LUIZ MARANHÃO DE SOUZA LEÃO; MARCONI FREITAS DA COSTA
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Abstract: This study aims to explain how the symbolic consumption of the Ubuntu operating system is used for the representation of self in interactions in the Ubuntu virtual community from Brazil. We adopted the Goffmanian concept of self, the netnography of communication as the research method, and case study as a research strategy. The paralinguistic, the extralinguistic, and the definition of “I” are aspects used in virtual interactions. They have the linguistic function of corroborating and praising the statements of migration of Windows users to Ubuntu, emphasizing the distinctive features of the concept of Ubuntu, highlighting its expression of shared feelings of love and freedom, as ways of projecting the self of humanity to each other. In the case of the operating system, this characteristic is represented through the provision of support among users at the forum of the virtual community.
The Ubuntu Apache2 default page | Arts And Entertainment Data | Arts And...
datastore.forage.ai
Updated Sep 24, 2024
Cite
(2024). The Ubuntu Apache2 default page | Arts And Entertainment Data | Arts And Entertainment [Dataset]. https://datastore.forage.ai/searchresults/?resource_keyword=Arts
Dataset updated
Sep 24, 2024
Description
The Ubuntu Apache2 default page provides a brief introduction to the Apache2 server, a popular open-source web server software. This page serves as a diagnostic tool to test the installation and configuration of the Apache2 server on Ubuntu systems. It also provides a taste of the documentation available for the web server and its configuration options.

The Ubuntu Apache2 default page is designed to be simple and easy to understand, with minimal technical jargon. The page describes the main configuration files and directories used by the Apache2 server, as well as how to manage and customize these settings. The page also provides an overview of the default document roots and how to configure additional document roots for virtual hosts.
Grant Giving Statistics for Ubuntu Kdce Foundation
instrumentl.com
Updated Oct 15, 2021
Cite
(2021). Grant Giving Statistics for Ubuntu Kdce Foundation [Dataset]. https://www.instrumentl.com/990-report/ubuntu-kdce-foundation
Dataset updated
Oct 15, 2021
Variables measured
Total Assets, Total Giving, Average Grant Amount
Description
Financial overview and grant giving statistics of Ubuntu Kdce Foundation
stockfish-ubuntu-x86-64-avx2
kaggle.com
zip
Updated Dec 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
kaggleuseer (2024). stockfish-ubuntu-x86-64-avx2 [Dataset]. https://www.kaggle.com/datasets/kaggleuseer/stockfish-ubuntu-x86-64-avx2/code
Available download formats: zip (64828140 bytes)
Dataset updated
Dec 3, 2024
Authors
kaggleuseer
License
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by kaggleuseer

Released under Apache 2.0

Contents
ubuntu-service.com - Historical whois Lookup
whoisdatacenter.com
csv
Updated Jan 4, 2025
+ more versions
Cite
AllHeart Web Inc (2025). ubuntu-service.com - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/ubuntu-service.com/
Available download formats: csv
Dataset updated
Jan 4, 2025
Dataset authored and provided by
AllHeart Web Inc
License
https://whoisdatacenter.com/terms-of-use/
Time period covered
Mar 15, 1985 - Feb 25, 2025
Description
Explore the historical Whois records related to ubuntu-service.com (Domain). Get insights into ownership history and changes over time.
