Czech language datasets and dictionaries

Dataset Components

Inflections variants icon

Inflections & Variants

words icon

Words & Expressions

 
translation icon

Translations

Pronunciation icon

Pronunciation

Phonetic transcription notes tooltip

senses icon

Senses

Definitions notes tooltip
Disambiguators notes tooltip

Examples of usage icon

Examples of Usage

Full sentences notes tooltip
Short phrases notes tooltip

usage labels icon

Usage Labels

Range of application notes tooltip
Register notes tooltip
Geographical region notes tooltip
Sentiment notes tooltip

fetures icon

Features

Spell check notes tooltip
Geo multilingual table

grammar icon

Grammar

Part of speech notes tooltip
Gender notes tooltip
Number notes tooltip
Subcategorization notes tooltip
Valency notes tooltip

semantic icon

Semantic Labels

Synonyms notes tooltip
Antonyms notes tooltip
Context notes tooltip
Domains notes tooltip

Notes notes tooltip

Translations

From Czech:

Afrikaans, Albanian*, Arabic, Armenian*, Azerbaijani, Bulgarian, Catalan, Chinese Simplified, Chinese Traditional, Croatian, Danish, Dari*, Dutch, English, Estonian, Farsi, Finnish, French, Frisian, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malay, Norwegian, Pashto*, Polish, Portuguese Brazil, Portuguese Portugal, Romanian, Russian, Serbian, Slovak, Slovene, Spanish, Swedish, Thai, Turkish, Ukrainian, Urdu, Valencian*, Vietnamese, Welsh*
* not complete

 

To Czech:

English

  Global Czech Dictonary

A series of multi-layered lexicographic datasets for 25 languages including Czech.

Each language resource is developed from scratch, using a methodology based on corpus evidence and sharing a consistent overall framework and technical infrastructure across all languages.

The underlying monolingual layer can either be used on its own or serve as a base for adding translation equivalents in other languages and producing bilingual and multilingual versions, which can also be cross linked to the other language sets.

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms

Statistics

Total entries

23.093

Senses

55.337

 

Examples

33.606

   Password English to  Czech Dictonary

An English to Czech dictionary database including translations in nearly 50 languages.
The core was originally developed for intermediate level learners, including over 29,000 entries with 39,000 senses, 37,000 examples, and usage notes, along with translation equivalents in the other languages.

The data is complemented by human voice audio files for the headwords and the multiword expressions, including distinction between American and British English pronunciation, as well as with supplements on English language and grammar.

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms
  • audio

   Multigloss Czech Dictionary

A series of innovative multlingual glossaries, based on a human-edited bilingual index of each language to English that is semi-automatically generated to translations in 45 more languages, currently available for 22 language.

  • single word lemmas
  • multiword expressions
  • part of speech
  • translation equivalents