data Icon Lexical Data 

Our resources rely on deep lexical analysis of human languages, meticulously deciphering and mapping

their linguistic DNA, identifying and categorizing the different elements, and linking them to each other and

across multiple languages.

 

The datasets have been developed over the years by K Dictionaries and published by our partners in

various media, serving millions of users around the world.

 

The content is human-curated, enriched by automatic language generation and supplemented by

morphological word form lists, language and grammar guides, biographical and geographical tables, 

phonetic transcription (IPA), alternative scripts, frequency, and vocal pronunciation.

 

The data are available in XML, JSON and JSON-LD (RDF) formats. 

Data Components

words icon

Words and Expressions

Inflections variants icon

Inflections and Variants

translation icon

Translations

etimology icon

Etimology

senses icon
Senses

Definitions

Disambiguators

semantic icon
Semantic labels

Synonyms

Antonyms

Context

Domain

usage labels icon
Usage labels

Range of Application

Register

Geographical region

Sentiment

grammar icon
Grammar

Part of Speech

Gram. Gender

Gram. Number

Subcategorization

Valency

fetures icon
Features

Frequency

Spell check

Geo multilingual table

Geographical entries

Biographical entries

Examples of usage icon
Examples of Usage

Full sentences

Short phrases

Pronunciation icon
Pronunciation

Phonetic transcription

Alternative script

Note i-icon from hudi
Notes

extra information

on language and

grammar

Data Sample (JSON)

{
    “id”: “DE_DE00019883”,
    “source”: “global”,
    “language”: “de”,
    “version”: 1,
    “headword”: {
        “text”: “Schloss”,
         “pronunciation”: {
            “value”: “ʃlɔs”
        },
        “pos”: “noun”,
        “gender”: “neuter”,
        “inflections”: [
            {
                “text”: “Schlosses”,
                “number”: “singular”,
                “case”: “genitive”
            },
            {
                “text”: “Schlösser” ,
                “pronunciation”: {
                    “value”: “ˈʃlœsɐ”
                },
                “number”: “plural”,
                “case”: “nominative”
            }
        ]
    },
    “senses”: [
               

In Use In

 
  • language models
  • machine translation
  • natural language processing
  • language learning solutions
  • online dictionary websites
  • mobile applications
  • research and innovation projects
  • internship programs

api  

Most of our data are available on Lexicala API.

Our REST API enables flexible search options and returns JSON responses with full dictionary

entries or  specific components – featuring syntactic and semantic details, sense definitions and

various disambiguation forms, examples of usage and multiword expressions, translations and

more – allowing easy processing and seamless integration with other applications.

For the API documentation, registration and access, click the link below.

Custom-Made Data

We research and create language data to suit your requirements. 

 

Contact us to discuss in detail.

custom data