National Language Technology Platform for Public Administration

  • Marko Tadić
  • , Daša Farkaš
  • , Matea Filko
  • , Artūrs Vasiļevskis
  • , Andrejs Vasiļjevs
  • , Jānis Ziediņš
  • , Željka Motika
  • , Mark Fishel
  • , Hrafn Loftsson
  • , Jón Guðnason
  • , Claudia Borg
  • , Keith Cortis
  • , Judie Attard
  • , Donatienne Spiteri

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This article presents the work in progress on the collaborative project of several European countries to develop National Language Technology Platform (NLTP). The project aims at combining the most advanced Language Technology tools and solutions in a new, state-of-the-art, Artificial Intelligence driven, National Language Technology Platform for five EU/EEA official and lower-resourced languages.

Original languageEnglish
Title of host publicationTowards Digital Language Equality Workshop, TDLE 2022 - as part of the International Conference on Language Resources and Evaluation, LREC 2022
EditorsItziar Aldabe, Begona Altuna, Aritz Farwell, German Rigau
PublisherEuropean Language Resources Association (ELRA)
Pages46-51
Number of pages6
ISBN (Electronic)9782493814036
Publication statusPublished - 2022
Event2022 Towards Digital Language Equality Workshop, TDLE 2022 - Marseille, France
Duration: 20 Jun 2022 → …

Publication series

NameTowards Digital Language Equality Workshop, TDLE 2022 - as part of the International Conference on Language Resources and Evaluation, LREC 2022

Conference

Conference2022 Towards Digital Language Equality Workshop, TDLE 2022
Country/TerritoryFrance
CityMarseille
Period20/06/22 → …

Bibliographical note

Funding Information: The work reported here was supported by the European Commission in the CEF Telecom Programme (Action No: 2020-EU-IA-0082, Grant Agreement No: INEA/CEF/ ICT/A2020/2278398). Funding Information: Technological support for Croatian has progressed in a number of LT areas compared to the state of affairs described in the META-NET White Paper (Tadić et al., 2012). Digital language resources have both increased in number and volume while they also improved in quality and variety. Resources, basic NLP tools and LT services are provided by academia, research institutes and occasionally private companies as outputs of various research projects, usually coordinated by academic institutions, predominantly funded by EU or national funds, and rarely self-funded. Some significant progress has been made with respect to available corpora and lexica, language models, text processing tools, and MT, while there is still a serious underdevelopment in the field of speech processing (both synthesis and recognition). The available datasets originate from a variety of sources and they cover several thematic domains, text types; they are available as raw or annotated; and come as monolingual, bilingual or multilingual resources. However, their individual size is lagging behind in terms of appropriateness for building large language models or robust, ready to use tools and applications. Publisher Copyright: © European Language Resources Association (ELRA), licensed under CC-BY-NC-4.0.

Other keywords

  • CAT tools
  • National Language Technology Platform
  • machine translation
  • parallel corpora

Fingerprint

Dive into the research topics of 'National Language Technology Platform for Public Administration'. Together they form a unique fingerprint.

Cite this