data Icon  LEXICAL DATA

Our resources rely on deep lexical analyses of human languages, meticulously deciphering and mapping their linguistic DNA, identifying and categorizing the different elements, and linking them to each other and across multiple languages.

These datasets have been developed over the years by K Dictionaries and published by its partners in various media – serving millions of users around the world.

The content is human-curated, enriched by automatic language generation and supplemented by morphological word form lists, language and grammar guides, biographical and geographical tables, phonetic transcription (IPA), alternative scripts, and vocal pronunciation.

The data are available in XML, JSON and JSON-LD (RDF) formats.

COMPONENTS

words icon
Words
and expressions
Inflections variants icon
Inflections
and variants
etimology icon
Etimology
translation icon
Translations
Pronunciation icon
Pronunciation

Phonetic transcription notes tooltip
Alternative script notes tooltip
Audio notes tooltip

Senses

Definitions notes tooltip
Disambiguators notes tooltip

Examples of usage icon
Examples of usage

Full sentences notes tooltip
Short phrases notes tooltip

usage labels icon
Usage labels

Range of application notes tooltip
Register notes tooltip
Geographical region notes tooltip
Sentiment notes tooltip

Features

Frequency notes tooltip
Spell check notes tooltip
Geo multilingual table
Biographical entries
Geographical entries

grammar icon
Grammar

Part of speech notes tooltip
Gender notes tooltip
Number notes tooltip
Subcategorization notes tooltip
Valency notes tooltip

semantic icon
Semantic labels

Synonyms notes tooltip
Antonyms notes tooltip
Context notes tooltip
Domain notes tooltip

Notes notes tooltip

IN USE BY

industry icon    Industry

 

  • natural language processing integrators
  • software and technology companies
  • language learning providers
  • online dictionary websites
  • mobile app developers
  • publishers

academia icon   Academia

 

  • exchange with language and research institutes
  • invited talks and workshops
  • publication of scholarly papers
  • sponsorship for conferences and academic activity
  • internships to university students of lexicography,
    linguistics, translation, NLP and computer science
api  API

Most of our data are available via API. Our REST API enables flexible search options and returns JSON responses, whether complete dictionary entries or specific components – featuring rich syntactic and semantic information, sense definitions and various means of disambiguation, examples of usage and multiword expressions, translations and more – allowing easy processing and seamless integration with other applications. 

You can read documentation, register and gain access on our API website, just click the link below.

CUSTOM-MADE DATA

We can research and create data exactly for your needs.
Reach out to discuss in detail.

custom data
Font Resize
Contrast