English Language Datasets and Dictionaries

The Lexicala data stems from numerous lexicographic resources compiled by experienced lexicographers at

K Dictionaries, for the creation of monolingual, bilingual, multilingual and learner’s dictionaries.

 

Each dictionary prototype was conceived with its own macrostructure and entry microstructure to serve

a unique purpose and satisfy a specific target user group.

 

The following dictionary series are available online, in mobile apps, and in print.

DATASET COMPONENT

words icon
Words & expressions
Inflections variants icon
Inflections & variants
etimology icon
Etimology
translation icon
Translations
Pronunciation icon
Pronunciation

Phonetic transcription notes tooltip
Alternative script notes tooltip
Audio notes tooltip

Senses

Definitions notes tooltip
Dismbiguators notes tooltip

Examples of usage icon
Examples of usage

Full sentences notes tooltip
Short phrases notes tooltip

usage labels icon
Usage labels

Range of application notes tooltip
Register notes tooltip
Geographical region notes tooltip
Sentiment notes tooltip

Features

Frequency notes tooltip
Spell check notes tooltip
Geo multilingual table
Biographical entries
Geographical entries

grammar icon
Grammar

Part of speech notes tooltip
Gender notes tooltip
Number notes tooltip
Subcategorization notes tooltip
Valency notes tooltip

semantic icon
Semantic labels

Synonyms notes tooltip
Antonyms notes tooltip
Context notes tooltip
Domains notes tooltip

Notes notes tooltip

Components

words icon

Words and Expressions

Inflections variants icon

Inflections and Variants

translation icon

Translations

etimology icon

Etimology

senses icon
Senses

Definitions

Disambiguators

semantic icon
Semantic labels

Synonyms

Antonyms

Context

Domain

usage labels icon
Usage labels

Range of Application

Register

Geographical region

Sentiment

grammar icon
Grammar

Part of Speech

Gram. Gender

Gram. Number

Subcategorization

Valency

fetures icon
Features

Frequency

Spell check

Geo multilingual table

Geographical entries

Biographical entries

 

Examples of usage icon
Examples of Usage

Full sentence

Short phrases

Pronunciation icon
Pronunciation

Phonetic transcription

Alternative script

Note i-icon from hudi
Notes

extra information

on language and

grammar

Translations

From English:

 

Afrikaans, Albanian*, Arabic, Armenian*, Azerbaijani, Bulgarian, Catalan, Chinese Simplified, Chinese Traditonal, Croatian, Czech, Danish, Dari*, Dutch, Estonian, Farsi, Finnish, French, Frisian, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malay, Norwegian, Pashto*, Polish, Portuguese Brazil, Portuguese Portugal, Romanian, Russian, Serbian, Slovak, Slovene, Spanish, Swedish, Thai, Turkish, Ukrainian, Urdu, Valencian*, Vietnamese, Welsh*

 

To English:

 

Arabic, Catalan, Chinese Simplified, Croatian, Danish, Dutch, Estonian, French, Frisian, German, Hebrew, Hungarian, Indonesian, Italian, Japanese, Malay, Norwegian, Polish, Portuguese Brazil, Portuguese Portugal, Russian, Slovene, S panish, Swedish, Ukrainian

Dictonary types 

Global Multilingual Series

 

A series of multi-layered lexicographic datasets for 25 languages including Arabic.

 

Each language resource is developed from scratch, using a methodology based on corpus evidence and sharing a consistent overall framework and technical infrastructure across all languages.

 

 

The underlying monolingual layer can either be used on its own or serve as a base for adding translation equivalents in other languages and producing bilingual and multilingual versions, which can also be cross linked to the other language sets.

 

 

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms

 Password English Multilingual Dictonary

  

An English to Hebrew dictionary database including translations in nearly 50 languages.

 

The core was originally developed for intermediate level learners, including over 29,000 entries with 39,000 senses, 37,000 examples, and usage notes, along with translation equivalents in the other languages.

 

The data is complemented by human voice audio files for the headwords and the multiword expressions, including distinction between American and British English pronunciation, as well as with supplements on English language and grammar.

 

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms
  • audio

Passport English Learner’s Dictionaries

 

A series of semi bilingual English dictionaries for beginners, including 12,000 entries with 15,000 senses and their translations and 20,000 examples of usage, a bilingual index from the learner’s language to English, supplements, illustrations, and word games.

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms

Random House Webster’s Colege Dictionary

 

random icon

This legacy English monolingual dictonary was originally conceived for American students and general users.

Following the closure of its reference division, Random House has entrusted K Dictionaries with using its brand name for this linguistic treasure internationally and continuing to update the content and upgrade the data. RHWCD includes 133,000 entries with 191,000 sense.

  • single word lemmas
  • multiword expressions
  • phonetic inscription
  • definition
  • examples of usage
  • synonyms
  • antonyms

Contact

Contact us to ask about our resources and services.