Language, Data and Knowledge (LDK 2021) conference

Multilingual Data & Knowledge

Language, Data and Knowledge
(LDK 2021) conference

From September 1-4 this year, the third edition of the Language, Data and Knowledge (LDK 2021) conference will take place as a hybrid event in Zaragoza, Spain. This conference will build on previous successful editions in Leipzig, Germany in 2019 and in Galway, Ireland in 2017. It aims to bring together researchers from a variety of disciplines working on linguistic data and data science. This includes applications in data science, natural language processing and machine learning as well as digital humanities and commercial applications of these technologies. This year the conference is supported by the NexusLinguarum COST Action for a “European network for web-centred linguistics data science.”

Language data is of vital importance to a large number of methodologies in machine learning, natural language processing and Semantic Web research, and these applications crucially depend on linguistic and semantic annotation and the existence of high-quality language resources. As such the conference is concerned with the acquisition, provenance, representation, maintenance, usability, quality as well as legal, organizational and infrastructure aspects of linguistic data. In addition, the application of knowledge graphs and their exploitation in use cases in industry, such as biomedical, fintech and legaltech applications, as well as use cases in humanities, and social sciences is a particular focus of the conference.

This year, we are delighted to welcome three distinguished invited speakers: Mathieu Lafourcade of the University of Montpellier will talk about his work on Games With A Purpose (GWAP) and how these can be used in the development of lexical and semantic networks. Sara Tonelli from the Fondazione Bruno Kessler will give a talk entitled “A Smell is Worth a Thousand Words: Olfactory Information Extraction and Semantic Processing in a Multilingual Perspective.” Finally, Mikel Forcada of the University of Alicante will present the development of the Apertium dictionaries in a talk entitled “Free/open-source machine translation for the low-resource languages of Spain.” Besides this we have an exciting programme with 21 accepted papers and 17 posters to be presented over the two main conference days.

There is also a packed programme of workshops and tutorials both before and after the main conference, with September 1 hosting tutorials on the Linked Data in Latin (LiLa) project and the DBpedia project. In addition, there is the “1st Workshop on Sentiment Analysis & Linguistic Linked Data”, “4th shared task for Translation Inference Across Dictionaries”, “2nd International Workshop on Artificial Intelligence for Historical Image and Visual Cultural Artefacts Enrichment” and a “Multisensory Data & Knowledge Workshop”, all taking place on that day. After the main conference on September 4, there are meetings of World Wide Web Consortium (W3C) community groups, in particular the Linked Data for Language Technologies (LD4LT) group will meet to discuss the harmonization of linguistic annotation, while the Ontology-Lexicon (OntoLex) group will consider its ongoing work.

Due to the ongoing pandemic, we have made this conference a hybrid event so that all participants can attend. We aim to provide live streams of the talks so that remote participants can be fully involved. In addition, there is no registration fee for either remote or in-person participation due to the support of NexusLinguarum. We hope to see you all either in person or online on September 1-4 for an exciting conference!

John P. McCrae and Thierry Declerck

LDK 2021 Chairs

Spanish	Hebrew
El navío atracó en la noche.	הספינה הגיעה למזח בלילה.
los macizos alpinos	רכסי האלפים
La masa leuda.	הבצק תּוֹפֵחַ.
¡No te preocupes!	אל תדאג
el bosquejo de una pintura	סקיצת ציור
La palabra “mesa” es de género femenino.	המילה “צלחת” היא ממין נקבה.
una obra de teatro en cinco actos	מחזה בחמש מערכות
la masa atomica de qualqer cosa	המסה האטומית של דבר מה
¿Cómo se dice “luna” en inglés?	איך אומרים “ירח” באנגלית?
abonarse al cable	לעשות מינוי לכבלים

ARABIC	CHINESE	domain
زوجي السابق	前夫
عقاب بالسجن عشرين سنة	判二十年的牢狱
مقطوعة موسيقية كلاسيكية لباخ	巴特前奏曲	music
ملأ دجاجة بالحشوة	把一只鸡塞满馅料	culinary
رسم دائرة	画圆	geometry
طرد شخصا ما من دولة	将某人从国家中驱逐
مفرد وجمع كلمة	一个词的单复数	grammar
عمل حاصل جمع عدة أرقام	做几笔数目的总额	mathematics
رياح شمالية	北风
منظر خيالي	不真实的景象

ARABIC	DANISH	domain
السفارة الألمانية في باريس	den tyske ambassade i Paris	politics
قامت الشرطة بالقبض على المجرم.	Politiet har fanget forbryderen.	law
تقع برلين على دائرة عرض 52 درجة شمالاً وعلى خط طول 13 درجة شرقًا.	Berlin ligger omtrent på 52 grader nordlig bredde og 13 grader østlig længde.	geography
تمركز كل المشتركين على خط الانطلاق.	Alle konkurrencedeltagerne står på startlinjen.	sport
قطة أليفة	en tillidsfuld kat
حزمة من الفجل/الثوم	et bundt purløg/radiser
قانون الجاذبية	tyngdeloven	mathematics, physics
“لقد فعلها!” – “كم هذا مبهر، خاصة مع كل المساعدة التي تلقاها!”	“Han klarede det!‟ – “Det tror pokker, med al den hjælp, han har fået!‟
بذور دوار الشمس	solsikkekerne	botanics
اشتد السيل على نحو مخيف، لكن هذا الرعب انتهى بعد دقائق معدودة.	Det haglede frygteligt, men efter et par minutter var ubehaget overstået.

ARABIC	DUTCH	domain
أغنية من ألبومها الغنائي الجديد	een lied uit haar laatste album	music
مراسلنا في المنطقة المنكوبة	onze verslaggever uit het crisisgebied	journalism
عش السنونو	zwaluwennest	zoology
الولايات المتحدة الأمريكية وحلفائها	de USA en haar bondgenoten	politics
يضخ القلب الدم عبر الأوعية الدموية.	Het hart pompt het bloed door de aderen.	anatomy
المفعول به يكون في حالة النصب.	Het directe object is accusatief.	grammar
روض نمرا	een tijger temmen
نشر خبرا	een bericht verspreiden
دراسة الحقوق	rechten studeren
مثل صيني	een Chinees spreekwoord

ARABIC	ENGLISH	domain
فيلم روائي	feature film	cinema, television
حالة طقس هادئة	calm weather	meteorology
الفيلم عبارة عن تقليد هزلي لأفلام الغرب الأمريكية القديمة.	The film is a parody of the old Hollywood westerns.	television

SHARE ON