Lexical Data

Our resources rely on deep lexical analysis of human languages, meticulously deciphering and mapping their linguistic DNA, identifying and categorizing the different elements, and linking them to each other and across multiple languages.

 

The datasets have been developed over the years by K Dictionaries and published by our partners in various media, serving millions of users around the world.

 

The content is human-curated, enriched by automatic language generation and supplemented by

morphological word form lists, language and grammar guides, biographical and geographical tables,

phonetic transcription (IPA), alternative scripts, frequency, and vocal pronunciation.

The data are available in XML, JSON and JSON-LD (RDF) formats.

Data Components

Words and Expressions

Inflections and Variants

Translations

Etimology

Senses

Definitions
Disambiguators

Semantic labels

Synonyms
Antonyms
Context
Domain

Usage labels

Range of Application
Register
Geographical region
Sentiment

Grammar

Part of Speech
Gram. Gender
Gram. Number
Subcategorization
Valency

Features

Frequency
Spell check
Geo multilingual table
Geographical entries
Biographical entries

Examples of Usage

Full sentences
Short phrases

Pronunciation

Phonetic transcription
Alternative script

Notes

Extra information
on language and grammar

Data Sample (JSON)

				
					{
  "id": "DE_DE00019883",
  "source": "global",
  "language": "de",
  "version": 1,
  "headword": {
    "text": "Schloss",
    "pronunciation": {
      "value": "ʃlɔs"
    },
    "pos": "noun",
    "gender": "neuter",
    "inflections": [
      {
        "text": "Schlosses",
        "number": "singular",
        "case": "genitive"
      },
      {
        "text": "Schlösser",
        "pronunciation": {
          "value": "ˈʃlœsɐ"
        },
        "number": "plural",
        "case": "nominative"
      }
    ]
  },
  "senses": [
    {    
				
			

In Use In

  • language models
  • machine translation
  • natural language processing
  • language learning solutions
  • online dictionary websites
  • mobile applications
  • research and innovation projects
  • internship programs

API

Most of our data are available on Lexicala API.

Our REST API enables flexible search options and returns JSON responses with full dictionary

entries or  specific components – featuring syntactic and semantic details, sense definitions and

various disambiguation forms, examples of usage and multiword expressions, translations and

more – allowing easy processing and seamless integration with other applications.

For the API documentation, registration and access, click below.