32 datasets found
  1. h

    esperanto

    • huggingface.co
    Updated Jun 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Infinitestarcode (2023). esperanto [Dataset]. https://huggingface.co/datasets/Infinitestarcode/esperanto
    Explore at:
    Dataset updated
    Jun 7, 2023
    Authors
    Infinitestarcode
    Description

    Esperanto Dataset

    Mostly English

      Vortlisto
    

    https://github.com/paulmakepeace/vortlisto

      License
    

    The original word list was created in the 90's for a now-defunct exam (see README.md for more details). It's unclear what the copyright status of that text is. Bill Walker with others' help provided translations. Those translations came from various sources. (Who owns the translation of a word?) Bill Walker has kindly given permission for his gcselist.htm… See the full description on the dataset page: https://huggingface.co/datasets/Infinitestarcode/esperanto.

  2. E

    Arbobanko (Esperanto Treebank)

    • catalog.elra.info
    • live.european-language-grid.eu
    Updated Nov 18, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2019). Arbobanko (Esperanto Treebank) [Dataset]. https://catalog.elra.info/en-us/repository/browse/ELRA-W0129/
    Explore at:
    Dataset updated
    Nov 18, 2019
    Dataset provided by
    ELRA (European Language Resources Association)
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
    License

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf

    Description

    The Arbobanko (Esperanto Treebank) is a 52,000 token dependency treebank of Esperanto with texts from the MONATO news magazine, consisting of random excerpts from the period 2000-2010. All words were annotated for lemma, part-of-speech, inflection, compounding and affixing, syntactic function, dependency links, NER types, semantic types of nouns and adjectives, and verb frame categories.Morphosyntactic and dependency annotation was performed with the EspGram parser, and manually revised. Semantic categories were added in a second round of annotation, and are also manually revised and disambiguated. The format is native Constraint Grammar sgml, with token-based tag lines, xml with feature-attribute pairs or CoNNL tab format.

  3. w

    Esperanto badge

    • workwithdata.com
    Updated Apr 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2024). Esperanto badge [Dataset]. https://www.workwithdata.com/object/rijks-100880
    Explore at:
    Dataset updated
    Apr 17, 2024
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore Esperanto badge through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets

  4. d

    Radios in Esperanto

    • deepfo.com
    csv, excel, html, xml
    Updated Jan 7, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepfo.com by Polyolbion SL, Barcelona, Spain (2011). Radios in Esperanto [Dataset]. https://deepfo.com/en/most/Radios-in-Esperanto
    Explore at:
    xml, excel, csv, htmlAvailable download formats
    Dataset updated
    Jan 7, 2011
    Dataset authored and provided by
    Deepfo.com by Polyolbion SL, Barcelona, Spain
    License

    https://deepfo.com/documentacion.php?idioma=enhttps://deepfo.com/documentacion.php?idioma=en

    Description

    Radios in Esperanto. name, image, date Commenced operations, date founded, Frequency, city Headquarters, administrative division Headquarters, country Headquarters, continent Headquarters, Country, continent, coverage, Language, Prefix, date dissolved, Website, Owner

  5. d

    Esperanto-English LMF Apertium Bilingual dictionary - Dataset - B2FIND

    • b2find.dkrz.de
    Updated Mar 5, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Esperanto-English LMF Apertium Bilingual dictionary - Dataset - B2FIND [Dataset]. https://b2find.dkrz.de/dataset/af1acc7e-94e7-583e-b13c-bdd87985ec74
    Explore at:
    Dataset updated
    Mar 5, 2024
    Description

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as Esperanto-English). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.

  6. C

    Esperanto-Catalan LMF Apertium Bilingual dictionary

    • dataverse.csuc.cat
    dtd, txt, xml, zip
    Updated Oct 13, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CORA.Repositori de Dades de Recerca (2023). Esperanto-Catalan LMF Apertium Bilingual dictionary [Dataset]. http://doi.org/10.34810/data316
    Explore at:
    txt(147), txt(1832), zip(993893), dtd(7509), xml(12331), txt(35147)Available download formats
    Dataset updated
    Oct 13, 2023
    Dataset provided by
    CORA.Repositori de Dades de Recerca
    License

    https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data316https://dataverse.csuc.cat/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34810/data316

    Description

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and Catalanlanguages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) were generated in addition to the bilingual correspondence (SenseAxis) element. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as Esperanto-Catalan). The platform provides: a language-independent machine translation engine; tools to manage the linguistic data necessary to build a machine translation system for a given language pair and linguistic data for a growing number of language pairs.

  7. d

    Livelanguage Workspaces Ukc lexicons Esperanto UKC Lexicon

    • livepeople.datascientia.eu
    Updated Aug 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). Livelanguage Workspaces Ukc lexicons Esperanto UKC Lexicon [Dataset]. https://livepeople.datascientia.eu/dataset/ukc-lexicon-epo
    Explore at:
    Dataset updated
    Aug 29, 2022
    Description

    Esperanto is a language from the Constructed family, spoken in Eurasia. The UKC Lexicon of Esperanto is represented as a lexico-semantic network. It consists of words, word senses, synsets, as well as sense-level and synset-level relationships.

  8. h

    esperanto

    • huggingface.co
    Updated Aug 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chris Murphy (2023). esperanto [Dataset]. https://huggingface.co/datasets/chriswmurphy/esperanto
    Explore at:
    Dataset updated
    Aug 4, 2023
    Authors
    Chris Murphy
    Description

    Dataset Card for "esperanto"

    More Information needed

  9. v

    Esperanto Jeans's Company profile with phone,email, buyers, suppliers,...

    • volza.com
    csv
    Updated Feb 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza.LLC (2024). Esperanto Jeans's Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/esperanto-jeans-company-30894130
    Explore at:
    csvAvailable download formats
    Dataset updated
    Feb 1, 2024
    Dataset provided by
    Volza.LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2014 - Sep 30, 2021
    Variables measured
    Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
    Description

    Credit report of Esperanto Jeans contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.

  10. v

    International Ready Esperanto's Company profile with phone,email, buyers,...

    • volza.com
    csv
    Updated Nov 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Volza.LLC (2023). International Ready Esperanto's Company profile with phone,email, buyers, suppliers, price, export import shipments. [Dataset]. https://www.volza.com/company-profile/international-ready-esperanto-22824521
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 8, 2023
    Dataset provided by
    Volza.LLC
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2014 - Sep 30, 2021
    Variables measured
    Count of exporters, Count of importers, Sum of export value, Sum of import value, Count of export shipments, Count of import shipments
    Description

    Credit report of International Ready Esperanto contains unique and detailed export import market intelligence with it's phone, email, Linkedin and details of each import and export shipment like product, quantity, price, buyer, supplier names, country and date of shipment.

  11. d

    magazines in Esperanto

    • deepfo.com
    csv, excel, html, xml
    Updated Jan 18, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepfo.com by Polyolbion SL, Barcelona, Spain (2021). magazines in Esperanto [Dataset]. https://deepfo.com/en/most/magazines-in-Esperanto
    Explore at:
    html, csv, xml, excelAvailable download formats
    Dataset updated
    Jan 18, 2021
    Dataset authored and provided by
    Deepfo.com by Polyolbion SL, Barcelona, Spain
    License

    https://deepfo.com/documentacion.php?idioma=enhttps://deepfo.com/documentacion.php?idioma=en

    Description

    magazines in Esperanto. name, image, categories, date Closed, date first issue, date founded, Frequency, city Headquarters, administrative division Headquarters, country Headquarters, continent Headquarters, Country, continent, ISSN, Website

  12. Ontolex-lemon and TIAD versions of Apertium Esperanto-English dictionary

    • zenodo.org
    • explore.openaire.eu
    bin, tsv
    Updated Sep 3, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Christian Chiarcos; Christian Chiarcos; Maxim Ionov; Maxim Ionov (2020). Ontolex-lemon and TIAD versions of Apertium Esperanto-English dictionary [Dataset]. http://doi.org/10.5281/zenodo.4012300
    Explore at:
    tsv, binAvailable download formats
    Dataset updated
    Sep 3, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Christian Chiarcos; Christian Chiarcos; Maxim Ionov; Maxim Ionov
    License

    https://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.htmlhttps://www.gnu.org/licenses/old-licenses/gpl-2.0-standalone.html

    Description

    OntoLex-lemon and TSV conversion of Apertium Bidix. For more details, see https://www.aclweb.org/anthology/2020.lrec-1.401/

    Authors of the original data:

    (c) 2008--2009, Jacob Nordfalk (c) 2009, Hèctor Alòs i Font (c) 2005--2007, Universitat d'Alacant (Transducens group) -- English data (c) 2005--2007, Universitat Pompeu Fabra -- English data

  13. w

    Data from: Secondary school Esperanto

    • workwithdata.com
    Updated Jan 3, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2022). Secondary school Esperanto [Dataset]. https://www.workwithdata.com/book/Secondary%20school%20Esperanto_450013
    Explore at:
    Dataset updated
    Jan 3, 2022
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore Secondary school Esperanto through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets

  14. w

    The accepted Esperanto dictionary = Leksikono de oficialaj vortoj

    • workwithdata.com
    Updated Jun 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Work With Data (2022). The accepted Esperanto dictionary = Leksikono de oficialaj vortoj [Dataset]. https://www.workwithdata.com/book/the-accepted-esperanto-dictionary-leksikono-de-oficialaj-vortoj-book-by-edward-ockey-0000
    Explore at:
    Dataset updated
    Jun 24, 2022
    Dataset authored and provided by
    Work With Data
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore The accepted Esperanto dictionary = Leksikono de oficialaj vortoj through unique data from multiples sources: key facts, real-time news, interactive charts, detailed maps & open datasets

  15. esperanto_leipzig

    • kaggle.com
    zip
    Updated May 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anton Popov (2021). esperanto_leipzig [Dataset]. https://www.kaggle.com/calmscout/esperanto-leipzig
    Explore at:
    zip(217743886 bytes)Available download formats
    Dataset updated
    May 29, 2021
    Authors
    Anton Popov
    Area covered
    Leipzig
    Description

    Dataset

    This dataset was created by Anton Popov

    Contents

    It contains the following files:

  16. s

    Data from: Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for...

    • eprints.soton.ac.uk
    Updated Apr 15, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kaffee, Lucie-Aimée; Elsahar, Hady; Vougiouklis, Pavlos; Gravier, Christophe; Laforest, Frederique; Hare, Jonathon; Simperl, Elena (2018). Wikidata2Wikipedia: Learning to Generate Wikipedia Summaries for Underserved Languages from Wikidata [Dataset]. https://eprints.soton.ac.uk/427379/
    Explore at:
    Dataset updated
    Apr 15, 2018
    Dataset provided by
    University of Southampton
    Authors
    Kaffee, Lucie-Aimée; Elsahar, Hady; Vougiouklis, Pavlos; Gravier, Christophe; Laforest, Frederique; Hare, Jonathon; Simperl, Elena
    Description

    The associated repository contains the code and the corpora that were used in order to build a "learnable" system that generates open-domain textual summaries in Arabic and Esperanto given a set of Wikidata triples as input. The two corpora that have been used for the experiments are included in the repository: (i) Wikidata triples aligned with Wikipedia summaries in Arabic and (ii) Wikidata triples aligned with Wikipedia summaries in Esperanto.

  17. w

    Apertium

    • data.wu.ac.at
    Updated Oct 10, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dictionaries (2013). Apertium [Dataset]. https://data.wu.ac.at/odso/datahub_io/Mjg1YzZkYzktOTc5Ny00NDhmLTljMWUtY2RlYzljMTJiOTEz
    Explore at:
    Dataset updated
    Oct 10, 2013
    Dataset provided by
    Dictionaries
    Description

    Description

    "Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs."

    Language-pair data includes:

    • Spanish ⇆ Catalan (apertium-es-ca)
    • Spanish ← Romanian (apertium-es-ro)
    • French ⇆ Catalan (apertium-fr-ca)
    • Occitan ⇆ Catalan (apertium-oc-ca)
    • English ⇆ Galician (apertium-en-gl)
    • Swedish → Danish (apertium-sv-da)
    • Occitan ⇆ Spanish (apertium-oc-es)
    • Spanish ⇆ Portuguese (apertium-es-pt)
    • English ⇆ Catalan (apertium-en-ca)
    • English ⇆ Spanish (apertium-en-es)
    • English ⇆ Esperanto (apertium-en-eo)
    • Spanish ⇆ Galician (apertium-es-gl)
    • French ⇆ Spanish (apertium-fr-es)
    • Esperanto ← Spanish (apertium-eo-es)
    • Welsh → English (apertium-cy-en)
    • Breton → French (apertium-br-fr)
    • Esperanto ← Catalan (apertium-eo-ca)
    • Portuguese ⇆ Catalan (apertium-pt-ca)
    • Portuguese ⇆ Galician (apertium-pt-gl)
    • Basque → Spanish (apertium-eu-es)
    • Norwegian Nynorsk ⇆ Norwegian Bokmål (apertium-nn-nb)

    The above are the "released" language pairs, data includes:

    • dictionaries for morphological analysis and generation
    • disambiguation (statistical models, rules, in some cases Constraint Grammars)
    • bilingual (transfer) dictionaries
    • structural transfer rules

    There is also a lot of data of the above kinds for unreleased language pairs, eg. Icelandic → English, North Sámi → Lule Sámi; and tools to maintain such data.

    License

    COPYING file in language pair data archive contains a copy of the GPL.

  18. E

    Text to Terminological Concept System

    • live.european-language-grid.eu
    Updated Jul 14, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). Text to Terminological Concept System [Dataset]. https://live.european-language-grid.eu/catalogue/tool-service/8122
    Explore at:
    Dataset updated
    Jul 14, 2021
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Text2TCS automatically extracts terminological concept systems from natural language text. Terms are domain-specific natural language expressions that describe domain-specific concepts. It extracts terms, concepts and concept relations and represent them in a terminological concept system, building on a prespecified relation typology: generic, partitive, activity, associative, causal, spatial, instrumental, origination, and property relations. Syonyms are detected and finally grouped in the output format (text and TBX/XML).

    The system has been trained on English and German but builds on a pre-trained multilingual neural model (XLM-R) that allows Text2TCS to transfer its functionality to the following languages: Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Basque, Belarusian, Bengali, Bengali Romanized, Bosnian, Breton, Bulgarian, Burmese, Catalan, Chinese (Simplified), Chinese (Traditional), Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Hausa, Hebrew, Hindi, Hindi Romanized, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Kurdish (Kurmanji), Kyrgyz, Lao, Latin, Latvian, Lithuanian, Macedonian, Malagasy, Malay, Malayalam, Marathi, Mongolian, Nepali, Norwegian, Oriya, Oromo, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskri, Scottish, Gaelic, Serbian, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tamil, Tamil Romanized, Telugu, Telugu Romanized, Thai, Turkish, Ukrainian, Urdu, Urdu Romanized, Uyghur, Uzbek, Vietnamese, Welsh, Western, Frisian, Xhosa, Yiddish.

    The list of input and output languages below is more restrictive since we utilize an automated language recognition tool and a sentence tokenizer. The indicated languages represent the languages officially supported by those two tools and XLM-R, even though our application might be able to also process other languages from the list above.

  19. h

    audio_letters_eo

    • huggingface.co
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xekri Dragon (2023). audio_letters_eo [Dataset]. https://huggingface.co/datasets/xekri/audio_letters_eo
    Explore at:
    Dataset updated
    Jun 1, 2023
    Authors
    Xekri Dragon
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Audio files sampled at 48000Hz of an American male pronouncing the names of the Esperanto letters in three ways. Retroflex-r and trilled-r are included.

  20. d

    Radios en Esperanto

    • deepfo.com
    csv, excel, html, xml
    Updated Jul 1, 2011
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deepfo.com by Polyolbion SL, Barcelona, Spain (2011). Radios en Esperanto [Dataset]. https://deepfo.com/es/most/Radios-en-Esperanto
    Explore at:
    xml, csv, excel, htmlAvailable download formats
    Dataset updated
    Jul 1, 2011
    Dataset authored and provided by
    Deepfo.com by Polyolbion SL, Barcelona, Spain
    License

    https://deepfo.com/documentacion.php?idioma=eshttps://deepfo.com/documentacion.php?idioma=es

    Description

    Radios en Esperanto. nombre, imagen, Fecha inicio operaciones, Fecha de fundación, Frecuencia, ciudad sede, división administrativa sede, país sede, continente sede, País, continente, cobertura, Idioma, Prefijo, Fecha de disolución, Sitio web, Owner

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Infinitestarcode (2023). esperanto [Dataset]. https://huggingface.co/datasets/Infinitestarcode/esperanto

esperanto

Infinitestarcode/esperanto

Explore at:
Dataset updated
Jun 7, 2023
Authors
Infinitestarcode
Description

Esperanto Dataset

Mostly English

  Vortlisto

https://github.com/paulmakepeace/vortlisto

  License

The original word list was created in the 90's for a now-defunct exam (see README.md for more details). It's unclear what the copyright status of that text is. Bill Walker with others' help provided translations. Those translations came from various sources. (Who owns the translation of a word?) Bill Walker has kindly given permission for his gcselist.htm… See the full description on the dataset page: https://huggingface.co/datasets/Infinitestarcode/esperanto.

Search
Clear search
Close search
Google apps
Main menu