32 datasets found

h
esperanto
huggingface.co
Updated Jun 7, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Infinitestarcode (2023). esperanto [Dataset]. https://huggingface.co/datasets/Infinitestarcode/esperanto
Explore at:
Dataset updated
Jun 7, 2023
Authors
Infinitestarcode
Description
Esperanto Dataset

Mostly English

Vortlisto

https://github.com/paulmakepeace/vortlisto

License

The original word list was created in the 90's for a now-defunct exam (see README.md for more details). It's unclear what the copyright status of that text is. Bill Walker with others' help provided translations. Those translations came from various sources. (Who owns the translation of a word?) Bill Walker has kindly given permission for his gcselist.htm… See the full description on the dataset page: https://huggingface.co/datasets/Infinitestarcode/esperanto.
E
Arbobanko (Esperanto Treebank)
catalog.elra.info
live.european-language-grid.eu
Updated Nov 18, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2019). Arbobanko (Esperanto Treebank) [Dataset]. https://catalog.elra.info/en-us/repository/browse/ELRA-W0129/
Explore at:
Dataset updated
Nov 18, 2019
Dataset provided by
ELRA (European Language Resources Association)
ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
License
https://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf
https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf
Description
The Arbobanko (Esperanto Treebank) is a 52,000 token dependency treebank of Esperanto with texts from the MONATO news magazine, consisting of random excerpts from the period 2000-2010. All words were annotated for lemma, part-of-speech, inflection, compounding and affixing, syntactic function, dependency links, NER types, semantic types of nouns and adjectives, and verb frame categories.Morphosyntactic and dependency annotation was performed with the EspGram parser, and manually revised. Semantic categories were added in a second round of annotation, and are also manually revised and disambiguated. The format is native Constraint Grammar sgml, with token-based tag lines, xml with feature-attribute pairs or CoNNL tab format.
w
Esperanto badge
workwithdata.com
Updated Apr 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2024). Esperanto badge [Dataset]. https://www.workwithdata.com/object/rijks-100880
Explore at:
Dataset updated
Apr 17, 2024
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explore Esperanto badge through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets
d
Radios in Esperanto
deepfo.com
csv, excel, html, xml
Updated Jan 7, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deepfo.com by Polyolbion SL, Barcelona, Spain (2011). Radios in Esperanto [Dataset]. https://deepfo.com/en/most/Radios-in-Esperanto
Explore at:
xml, excel, csv, htmlAvailable download formats
Dataset updated
Jan 7, 2011
Dataset authored and provided by
Deepfo.com by Polyolbion SL, Barcelona, Spain
License
https://deepfo.com/documentacion.php?idioma=enhttps://deepfo.com/documentacion.php?idioma=en
Description
Radios in Esperanto. name, image, date Commenced operations, date founded, Frequency, city Headquarters, administrative division Headquarters, country Headquarters, continent Headquarters, Country, continent, coverage, Language, Prefix, date dissolved, Website, Owner
d
Esperanto-English LMF Apertium Bilingual dictionary - Dataset - B2FIND
b2find.dkrz.de
Updated Mar 5, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Esperanto-English LMF Apertium Bilingual dictionary - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/af1acc7e-94e7-583e-b13c-bdd87985ec74
Explore at:
Dataset updated
Mar 5, 2024
Description
This is the LMF version of the Apertium bilingual dictionary for Esperanto and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as Esperanto-English). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.
C
Esperanto-Catalan LMF Apertium Bilingual dictionary
dataverse.csuc.cat
dtd, txt, xml, zip
Updated Oct 13, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CORA.Repositori de Dades de Recerca (2023). Esperanto-Catalan LMF Apertium Bilingual dictionary [Dataset]. http://doi.org/10.34810/data316
Explore at:
txt(147), txt(1832), zip(993893), dtd(7509), xml(12331), txt(35147)Available download formats
Unique identifier
https://doi.org/10.34810/data316
Dataset updated
Oct 13, 2023
Dataset provided by
CORA.Repositori de Dades de Recerca
License
https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data316https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data316
Description
This is the LMF version of the Apertium bilingual dictionary for Esperanto and Catalanlanguages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as Esperanto-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.
d
Livelanguage Workspaces Ukc lexicons Esperanto UKC Lexicon
livepeople.datascientia.eu
Updated Aug 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Livelanguage Workspaces Ukc lexicons Esperanto UKC Lexicon [Dataset]. https://livepeople.datascientia.eu/dataset/ukc-lexicon-epo
Explore at:
Dataset updated
Aug 29, 2022
Description
Esperanto is a language from the Constructed family, spoken in Eurasia. The UKC Lexicon of Esperanto is represented as a lexico-semantic network. It consists of words, word senses, synsets, as well as sense-level and synset-level relationships.
h
esperanto
huggingface.co
Updated Aug 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Chris Murphy (2023). esperanto [Dataset]. https://huggingface.co/datasets/chriswmurphy/esperanto
Explore at:
Dataset updated
Aug 4, 2023
Authors
Chris Murphy
Description
Dataset Card for "esperanto"

More Information needed
v
Esperanto Jeans's Company profile with phone,email, buyers, suppliers,...
volza.com
csv
Updated Feb 1, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Volza.LLC (2024). Esperanto Jeans's Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/esperanto-jeans-company-30894130
Explore at:
csvAvailable download formats
Dataset updated
Feb 1, 2024
Dataset provided by
Volza.LLC
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2014 - Sep 30, 2021
Variables measured
Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
Description
Credit report of Esperanto Jeans contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.
v
International Ready Esperanto's Company profile with phone,email, buyers,...
volza.com
csv
Updated Nov 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Volza.LLC (2023). International Ready Esperanto's Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/international-ready-esperanto-22824521
Explore at:
csvAvailable download formats
Dataset updated
Nov 8, 2023
Dataset provided by
Volza.LLC
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2014 - Sep 30, 2021
Variables measured
Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
Description
Credit report of International Ready Esperanto contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.
d
magazines in Esperanto
deepfo.com
csv, excel, html, xml
Updated Jan 18, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deepfo.com by Polyolbion SL, Barcelona, Spain (2021). magazines in Esperanto [Dataset]. https://deepfo.com/en/most/magazines-in-Esperanto
Explore at:
html, csv, xml, excelAvailable download formats
Dataset updated
Jan 18, 2021
Dataset authored and provided by
Deepfo.com by Polyolbion SL, Barcelona, Spain
License
https://deepfo.com/documentacion.php?idioma=enhttps://deepfo.com/documentacion.php?idioma=en
Description
magazines in Esperanto. name, image, categories, date Closed, date first issue, date founded, Frequency, city Headquarters, administrative division Headquarters, country Headquarters, continent Headquarters, Country, continent, ISSN, Website
Ontolex-lemon and TIAD versions of Apertium Esperanto-English dictionary
zenodo.org
explore.openaire.eu
bin, tsv
Updated Sep 3, 2020
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Christian Chiarcos; Christian Chiarcos; Maxim Ionov; Maxim Ionov (2020). Ontolex-lemon and TIAD versions of Apertium Esperanto-English dictionary [Dataset]. http://doi.org/10.5281/zenodo.4012300
Explore at:
tsv, binAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.4012300
Dataset updated
Sep 3, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Christian Chiarcos; Christian Chiarcos; Maxim Ionov; Maxim Ionov
License
https://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.htmlhttps://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.html
Description
OntoLex-lemon and TSV conversion of Apertium Bidix. For more details, see https://www.aclweb.org/anthology/2020.lrec-1.401/

Authors of the original data:

(c) 2008--2009, Jacob Nordfalk (c) 2009, Hèctor Alòs i Font (c) 2005--2007, Universitat d'Alacant (Transducens group) -- English data (c) 2005--2007, Universitat Pompeu Fabra -- English data
w
Data from: Secondary school Esperanto
workwithdata.com
Updated Jan 3, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2022). Secondary school Esperanto [Dataset]. https://www.workwithdata.com/book/Secondary%20school%20Esperanto_450013
Explore at:
Dataset updated
Jan 3, 2022
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explore Secondary school Esperanto through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets
w
The accepted Esperanto dictionary = Leksikono de oficialaj vortoj
workwithdata.com
Updated Jun 24, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Work With Data (2022). The accepted Esperanto dictionary = Leksikono de oficialaj vortoj [Dataset]. https://www.workwithdata.com/book/the-accepted-esperanto-dictionary-leksikono-de-oficialaj-vortoj-book-by-edward-ockey-0000
Explore at:
Dataset updated
Jun 24, 2022
Dataset authored and provided by
Work With Data
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Explore The accepted Esperanto dictionary = Leksikono de oficialaj vortoj through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets
esperanto_leipzig
kaggle.com
zip
Updated May 29, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Anton Popov (2021). esperanto_leipzig [Dataset]. https://www.kaggle.com/calmscout/esperanto-leipzig
Explore at:
zip(217743886 bytes)Available download formats
Dataset updated
May 29, 2021
Authors
Anton Popov
Area covered
Leipzig
Description
Dataset

This dataset was created by Anton Popov

Contents

It contains the following files:
s
Data from: Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for...
eprints.soton.ac.uk
Updated Apr 15, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Kaffee, Lucie-Aimée; Elsahar, Hady; Vougiouklis, Pavlos; Gravier, Christophe; Laforest, Frederique; Hare, Jonathon; Simperl, Elena (2018). Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata [Dataset]. https://eprints.soton.ac.uk/427379/
Explore at:
Dataset updated
Apr 15, 2018
Dataset provided by
University of Southampton
Authors
Kaffee, Lucie-Aimée; Elsahar, Hady; Vougiouklis, Pavlos; Gravier, Christophe; Laforest, Frederique; Hare, Jonathon; Simperl, Elena
Description
The associated repository contains the code and the corpora that were used in order to build a "learnable" system that generates open-domain textual summaries in Arabic and Esperanto given a set of Wikidata triples as input. The two corpora that have been used for the experiments are included in the repository: (i) Wikidata triples aligned with Wikipedia summaries in Arabic and (ii) Wikidata triples aligned with Wikipedia summaries in Esperanto.
w
Apertium
data.wu.ac.at
Updated Oct 10, 2013
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Dictionaries (2013). Apertium [Dataset]. https://data.wu.ac.at/odso/datahub_io/Mjg1YzZkYzktOTc5Ny00NDhmLTljMWUtY2RlYzljMTJiOTEz
Explore at:
Dataset updated
Oct 10, 2013
Dataset provided by
Dictionaries
Description
Description

"Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs."

Language-pair data includes:

Spanish ⇆ Catalan (apertium-es-ca)

Spanish ← Romanian (apertium-es-ro)

French ⇆ Catalan (apertium-fr-ca)

Occitan ⇆ Catalan (apertium-oc-ca)

English ⇆ Galician (apertium-en-gl)

Swedish → Danish (apertium-sv-da)

Occitan ⇆ Spanish (apertium-oc-es)

Spanish ⇆ Portuguese (apertium-es-pt)

English ⇆ Catalan (apertium-en-ca)

English ⇆ Spanish (apertium-en-es)

English ⇆ Esperanto (apertium-en-eo)

Spanish ⇆ Galician (apertium-es-gl)

French ⇆ Spanish (apertium-fr-es)

Esperanto ← Spanish (apertium-eo-es)

Welsh → English (apertium-cy-en)

Breton → French (apertium-br-fr)

Esperanto ← Catalan (apertium-eo-ca)

Portuguese ⇆ Catalan (apertium-pt-ca)

Portuguese ⇆ Galician (apertium-pt-gl)

Basque → Spanish (apertium-eu-es)

Norwegian Nynorsk ⇆ Norwegian Bokmål (apertium-nn-nb)

The above are the "released" language pairs, data includes:

dictionaries for morphological analysis and generation

disambiguation (statistical models, rules, in some cases Constraint Grammars)

bilingual (transfer) dictionaries

structural transfer rules

There is also a lot of data of the above kinds for unreleased language pairs, eg. Icelandic → English, North Sámi → Lule Sámi; and tools to maintain such data.

License

COPYING file in language pair data archive contains a copy of the GPL.
E
Text to Terminological Concept System
live.european-language-grid.eu
Updated Jul 14, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2021). Text to Terminological Concept System [Dataset]. https://live.european-language-grid.eu/catalogue/tool-service/8122
Explore at:
Dataset updated
Jul 14, 2021
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Text2TCS automatically extracts terminological concept systems from natural language text. Terms are domain-specific natural language expressions that describe domain-specific concepts. It extracts terms, concepts and concept relations and represent them in a terminological concept system, building on a prespecified relation typology: generic, partitive, activity, associative, causal, spatial, instrumental, origination, and property relations. Syonyms are detected and finally grouped in the output format (text and TBX/XML).
The system has been trained on English and German but builds on a pre-trained multilingual neural model (XLM-R) that allows Text2TCS to transfer its functionality to the following languages: Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Basque, Belarusian, Bengali, Bengali Romanized, Bosnian, Breton, Bulgarian, Burmese, Catalan, Chinese (Simplified), Chinese (Traditional), Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Hausa, Hebrew, Hindi, Hindi Romanized, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Kurdish (Kurmanji), Kyrgyz, Lao, Latin, Latvian, Lithuanian, Macedonian, Malagasy, Malay, Malayalam, Marathi, Mongolian, Nepali, Norwegian, Oriya, Oromo, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskri, Scottish, Gaelic, Serbian, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tamil, Tamil Romanized, Telugu, Telugu Romanized, Thai, Turkish, Ukrainian, Urdu, Urdu Romanized, Uyghur, Uzbek, Vietnamese, Welsh, Western, Frisian, Xhosa, Yiddish.
The list of input and output languages below is more restrictive since we utilize an automated language recognition tool and a sentence tokenizer. The indicated languages represent the languages officially supported by those two tools and XLM-R, even though our application might be able to also process other languages from the list above.
h
audio_letters_eo
huggingface.co
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Xekri Dragon (2023). audio_letters_eo [Dataset]. https://huggingface.co/datasets/xekri/audio_letters_eo
Explore at:
Dataset updated
Jun 1, 2023
Authors
Xekri Dragon
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Audio files sampled at 48000Hz of an American male pronouncing the names of the Esperanto letters in three ways. Retroflex-r and trilled-r are included.
d
Radios en Esperanto
deepfo.com
csv, excel, html, xml
Updated Jul 1, 2011
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Deepfo.com by Polyolbion SL, Barcelona, Spain (2011). Radios en Esperanto [Dataset]. https://deepfo.com/es/most/Radios-en-Esperanto
Explore at:
xml, csv, excel, htmlAvailable download formats
Dataset updated
Jul 1, 2011
Dataset authored and provided by
Deepfo.com by Polyolbion SL, Barcelona, Spain
License
https://deepfo.com/documentacion.php?idioma=eshttps://deepfo.com/documentacion.php?idioma=es
Description
Radios en Esperanto. nombre, imagen, Fecha inicio operaciones, Fecha de fundación, Frecuencia, ciudad sede, división administrativa sede, país sede, continente sede, País, continente, cobertura, Idioma, Prefijo, Fecha de disolución, Sitio web, Owner

Facebook

Twitter

Click to copy link

Link copied

Cite

Infinitestarcode (2023). esperanto [Dataset]. https://huggingface.co/datasets/Infinitestarcode/esperanto

esperanto

Infinitestarcode/esperanto

Explore at:

Dataset updated

Jun 7, 2023

Authors

Infinitestarcode

Description

Esperanto Dataset

Mostly English

  Vortlisto

https://github.com/paulmakepeace/vortlisto

  License

The original word list was created in the 90's for a now-defunct exam (see README.md for more details). It's unclear what the copyright status of that text is. Bill Walker with others' help provided translations. Those translations came from various sources. (Who owns the translation of a word?) Bill Walker has kindly given permission for his gcselist.htm… See the full description on the dataset page: https://huggingface.co/datasets/Infinitestarcode/esperanto.

Clear search

Close search

Google apps

Main menu

esperanto

Arbobanko (Esperanto Treebank)

Esperanto badge

Radios in Esperanto

Esperanto-English LMF Apertium Bilingual dictionary - Dataset - B2FIND

Esperanto-Catalan LMF Apertium Bilingual dictionary

Livelanguage Workspaces Ukc lexicons Esperanto UKC Lexicon

esperanto

Esperanto Jeans's Company profile with phone,email, buyers, suppliers,...

International Ready Esperanto's Company profile with phone,email, buyers,...

magazines in Esperanto

Ontolex-lemon and TIAD versions of Apertium Esperanto-English dictionary

Data from: Secondary school Esperanto

The accepted Esperanto dictionary = Leksikono de oficialaj vortoj

esperanto_leipzig

Dataset

Contents

Data from: Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for...

Apertium

Description

License

Text to Terminological Concept System

audio_letters_eo

Radios en Esperanto

esperanto

Infinitestarcode/esperanto