Hebrew Language Data and Dictionaries

Data Components

Words and Expressions

Inflections and Variants

Translations

Etymology

Senses

Definitions
Disambiguators

Semantic labels

Synonyms
Antonyms
Context
Domain

Usage labels

Range of Application
Register
Geographical region
Sentiment

Grammar

Part of Speech
Grammatical Gender
Grammatical Number
Subcategorization
Valency

Features

Frequency
Spell check
Geo multilingual table
Geographical entries
Biographical entries

Examples of Usage

Full sentences
Short phrases

Pronunciation

Phonetic transcription
Alternative script

Notes

Extra information on language and grammar
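The components listed above can be pictured as fields of a single entry record. The following is a minimal sketch only: the class and field names (Entry, Sense, translations_into) are hypothetical and do not describe the actual delivery format, and the sample entry is an illustration, not an excerpt from the dataset.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Sense:
    """One meaning of a headword, with its labels and translations (hypothetical layout)."""
    definition: str
    part_of_speech: Optional[str] = None
    register: Optional[str] = None
    synonyms: List[str] = field(default_factory=list)
    antonyms: List[str] = field(default_factory=list)
    examples: List[str] = field(default_factory=list)
    # Maps a language code to the translation equivalents for this sense.
    translations: Dict[str, List[str]] = field(default_factory=dict)


@dataclass
class Entry:
    """A word or expression with its senses (hypothetical layout)."""
    headword: str
    transcription: Optional[str] = None
    inflections: List[str] = field(default_factory=list)
    senses: List[Sense] = field(default_factory=list)

    def translations_into(self, lang: str) -> List[str]:
        """Collect translations for one language across all senses, without duplicates."""
        collected: List[str] = []
        for sense in self.senses:
            for candidate in sense.translations.get(lang, []):
                if candidate not in collected:
                    collected.append(candidate)
        return collected


# Illustrative entry only; the romanization is informal.
entry = Entry(
    headword="שלום",
    transcription="shalom",
    senses=[
        Sense(
            definition="a state without war or conflict",
            part_of_speech="noun",
            translations={"en": ["peace"]},
        ),
        Sense(
            definition="a greeting used on meeting or parting",
            part_of_speech="interjection",
            translations={"en": ["hello", "goodbye"]},
        ),
    ],
)

print(entry.translations_into("en"))
```

Keeping translations at sense level rather than entry level mirrors the outline above, where definitions and disambiguators belong to senses.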

Hebrew Lexical Data Sample

Available Translations

From Hebrew:

Afrikaans, Albanian*, Arabic, Armenian*, Azerbaijani, Bulgarian, Catalan, Chinese Simplified, Chinese Traditional, Croatian, Czech, Dari*, Danish, Dutch, English, Estonian, Farsi, Finnish, French, Frisian, German, Greek, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Korean, Latvian, Lithuanian, Malay, Norwegian, Pashto*, Polish, Portuguese Brazil, Portuguese Portugal, Romanian, Russian, Serbian, Slovak, Slovene, Spanish, Swedish, Thai, Turkish, Ukrainian, Urdu, Valencian*, Vietnamese, Welsh

To Hebrew:

English, French

Parallel Corpora for Hebrew Language

Parallel corpora for the Hebrew language consist of manually curated full sentences and short phrases with translation equivalents. They are based on corpus evidence and frequency, and were originally created by our editors and translators worldwide as examples of usage for dictionary entries.

The data can be used to boost the performance of language service providers, to train machine learning models, and to enhance neural machine translation solutions.
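As a rough illustration of preparing such sentence pairs for model training, the sketch below reads tab-separated source and target text and drops incomplete rows. The tab-separated layout, the SAMPLE_TSV content and the load_pairs helper are assumptions made for this example and do not describe the actual delivery format.

```python
import csv
import io
from typing import List, TextIO, Tuple

# Hypothetical sample: one Hebrew phrase and its English equivalent per line.
# The third row has no source text and should be skipped.
SAMPLE_TSV = (
    "בוקר טוב\tGood morning\n"
    "תודה רבה\tThank you very much\n"
    "\tmissing source\n"
)


def load_pairs(handle: TextIO) -> List[Tuple[str, str]]:
    """Read (source, target) pairs, skipping malformed or empty rows."""
    pairs: List[Tuple[str, str]] = []
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 2:
            continue  # not exactly one source and one target column
        source, target = row[0].strip(), row[1].strip()
        if source and target:
            pairs.append((source, target))
    return pairs


pairs = load_pairs(io.StringIO(SAMPLE_TSV))
for source, target in pairs:
    print(source, "->", target)
```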

Dictionary types

  Global Hebrew Dictionary

A series of multi-layered lexicographic datasets for 25 languages, including Hebrew.

 

Each language resource is developed from scratch, using a methodology based on corpus evidence and sharing a consistent overall framework and technical infrastructure across all languages.

 

The underlying monolingual layer can either be used on its own or serve as a base for adding translation equivalents in other languages and producing bilingual and multilingual versions, which can also be cross linked to the other language sets.

 

  • single word lemmas
  • multiword expressions
  • phonetic transcription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms

Password English to Hebrew Dictionary

An English to Hebrew dictionary database including translations in nearly 50 languages.


The core was originally developed for intermediate level learners, including over 29,000 entries with 39,000 senses, 37,000 examples, and usage notes, along with translation equivalents in the other languages.

 

The data is complemented by human voice audio files for the headwords and the multiword expressions, including distinction between American and British English pronunciation, as well as with supplements on English language and grammar.

 

  • single word lemmas
  • multiword expressions
  • phonetic transcription
  • definitions
  • examples of usage
  • translation equivalents
  • synonyms
  • antonyms
  • audio

Multigloss Hebrew Dictionary

A series of innovative multilingual glossaries, based on a human-edited bilingual index from each language to English, which is semi-automatically extended with translations in 45 more languages. Currently available for 22 languages.

 

  • single word lemmas
  • multiword expressions
  • part of speech
  • translation equivalents