Lexical Data

Home » Lexical Data

Our resources rely on deep lexical analysis of human languages, meticulously deciphering and mapping their linguistic DNA, identifying and categorizing the different elements, and linking them to each other and across multiple languages.

The datasets have been developed over the years by K Dictionaries and published by our partners in various media, serving millions of users around the world.

The content is human-curated, enriched by automatic language generation and supplemented by

morphological word form lists, language and grammar guides, biographical and geographical tables,

phonetic transcription (IPA), alternative scripts, frequency, and vocal pronunciation.

The data are available in XML, JSON and JSON-LD (RDF) formats.

Data Components

Words and Expressions

Inflections and Variants

Translations

Etimology

Senses

Definitions
Disambiguators

Semantic labels

Synonyms
Antonyms
Context
Domain

Usage labels

Range of Application
Register
Geographical region
Sentiment

Grammar

Part of Speech
Gram. Gender
Gram. Number
Subcategorization
Valency

Features

Frequency
Spell check
Geo multilingual table
Geographical entries
Biographical entries

Examples of Usage

Full sentences
Short phrases

Pronunciation

Phonetic transcription
Alternative script

Notes

Extra information
on language and grammar

Data Sample (JSON)

				
					{
  "id": "DE_DE00019883",
  "source": "global",
  "language": "de",
  "version": 1,
  "headword": {
    "text": "Schloss",
    "pronunciation": {
      "value": "ʃlɔs"
    },
    "pos": "noun",
    "gender": "neuter",
    "inflections": [
      {
        "text": "Schlosses",
        "number": "singular",
        "case": "genitive"
      },
      {
        "text": "Schlösser",
        "pronunciation": {
          "value": "ˈʃlœsɐ"
        },
        "number": "plural",
        "case": "nominative"
      }
    ]
  },
  "senses": [
    {

In Use In

language models
machine translation
natural language processing
language learning solutions
online dictionary websites
mobile applications
research and innovation projects
internship programs

API

Most of our data are available on Lexicala API.

Our REST API enables flexible search options and returns JSON responses with full dictionary

entries or specific components – featuring syntactic and semantic details, sense definitions and

various disambiguation forms, examples of usage and multiword expressions, translations and

more – allowing easy processing and seamless integration with other applications.

For the API documentation, registration and access, click below.

Spanish	Hebrew
El navío atracó en la noche.	הספינה הגיעה למזח בלילה.
los macizos alpinos	רכסי האלפים
La masa leuda.	הבצק תּוֹפֵחַ.
¡No te preocupes!	אל תדאג
el bosquejo de una pintura	סקיצת ציור
La palabra “mesa” es de género femenino.	המילה “צלחת” היא ממין נקבה.
una obra de teatro en cinco actos	מחזה בחמש מערכות
la masa atomica de qualqer cosa	המסה האטומית של דבר מה
¿Cómo se dice “luna” en inglés?	איך אומרים “ירח” באנגלית?
abonarse al cable	לעשות מינוי לכבלים

Lexical Data

Data Components

Data Sample (JSON)

In Use In

API

Arabic

German

Spanish

Hebrew

ARABIC	CHINESE	domain
زوجي السابق	前夫
عقاب بالسجن عشرين سنة	判二十年的牢狱
مقطوعة موسيقية كلاسيكية لباخ	巴特前奏曲	music
ملأ دجاجة بالحشوة	把一只鸡塞满馅料	culinary
رسم دائرة	画圆	geometry
طرد شخصا ما من دولة	将某人从国家中驱逐
مفرد وجمع كلمة	一个词的单复数	grammar
عمل حاصل جمع عدة أرقام	做几笔数目的总额	mathematics
رياح شمالية	北风
منظر خيالي	不真实的景象

ARABIC	DANISH	domain
السفارة الألمانية في باريس	den tyske ambassade i Paris	politics
قامت الشرطة بالقبض على المجرم.	Politiet har fanget forbryderen.	law
تقع برلين على دائرة عرض 52 درجة شمالاً وعلى خط طول 13 درجة شرقًا.	Berlin ligger omtrent på 52 grader nordlig bredde og 13 grader østlig længde.	geography
تمركز كل المشتركين على خط الانطلاق.	Alle konkurrencedeltagerne står på startlinjen.	sport
قطة أليفة	en tillidsfuld kat
حزمة من الفجل/الثوم	et bundt purløg/radiser
قانون الجاذبية	tyngdeloven	mathematics, physics
“لقد فعلها!” – “كم هذا مبهر، خاصة مع كل المساعدة التي تلقاها!”	“Han klarede det!‟ – “Det tror pokker, med al den hjælp, han har fået!‟
بذور دوار الشمس	solsikkekerne	botanics
اشتد السيل على نحو مخيف، لكن هذا الرعب انتهى بعد دقائق معدودة.	Det haglede frygteligt, men efter et par minutter var ubehaget overstået.

ARABIC	DUTCH	domain
أغنية من ألبومها الغنائي الجديد	een lied uit haar laatste album	music
مراسلنا في المنطقة المنكوبة	onze verslaggever uit het crisisgebied	journalism
عش السنونو	zwaluwennest	zoology
الولايات المتحدة الأمريكية وحلفائها	de USA en haar bondgenoten	politics
يضخ القلب الدم عبر الأوعية الدموية.	Het hart pompt het bloed door de aderen.	anatomy
المفعول به يكون في حالة النصب.	Het directe object is accusatief.	grammar
روض نمرا	een tijger temmen
نشر خبرا	een bericht verspreiden
دراسة الحقوق	rechten studeren
مثل صيني	een Chinees spreekwoord

ARABIC	ENGLISH	domain
فيلم روائي	feature film	cinema, television
حالة طقس هادئة	calm weather	meteorology
الفيلم عبارة عن تقليد هزلي لأفلام الغرب الأمريكية القديمة.	The film is a parody of the old Hollywood westerns.	television