Projekti - Naslovnica - Ured za istraživanje

Ured za
istraživanje

PROJEKTI

Project

Acronym:

CESAR

Name:

CEntral and South-east europeAn Resources
[PROJECT URL | CORDIS URL]

Project status:

From: 2011-02-01 To: 2012-02-01 (Completed)

Contract number:

271022

Action line:

ICT Policy Support Programme, PILOT TYPE B

Type (Programme):

CIP

Instrument:

Project cost:

4.160.002,00 EUR

Project funding:

2.080.000,00 EUR

Project coordinator

Organisation Name:

Nyelvtudomanyi Intezet, Magyar Tudomanyos Akademia

Organisation adress:

Benzcur utca 33, 1068 BUDAPEST

Organisation country:

Mađarska

Contact person name:

Tamás Váradi

Contact person email:

Croatian partner

Organisation name:

Filozofski fakultet

Organisation address:

Ivana Lucica 3

Contact person name:

Marko Tadic

Contact person tel:

00385 1 6120142

Contact person fax:

00385 1 6156879

Contact person e-mail:

Partners

Organisation name

Country

Short description of project

Human language technologies crucially depend on language resources and tools that are usable, useful and available. However, even where language resources and respective tools are available they have been developed mostly in a sporadic manner, in response to specific project needs, with relatively little regard to their long-term sustainability, IPR status, interoperability, reusability in different contexts as well as to their potential deployment in multilingual applications. The CESAR project, in close harmony with META-NET intends to address this issue by enhancing, upgrading, standardising and cross-linking a wide variety of language resources and tools and making them available, thus contributing to an open linguistic infrastructure. Partners in the CESAR consortium are key players in their respective language community with a proven track record in European language technology projects including infrastructure initiatives, such as TELRI in the past and notably CLARIN at present. The project will make available a comprehensive set of language resources and tools covering the Hungarian, Polish, Croatian, Serbian, Bulgarian and Slovak languages. Resources will include interoperable mono and multilingual speech databases, corpora, dictionaries and wordnets and relevant language technology processing tools such as tokenisers, lemmatisers, taggers and parsers. A comprehensive initial list of tools and resources identified as potential targets of project activity can be found in Section 3.2. Depending on local capacities and efficiency considerations, the resources may be either made available as web services at partners' site or contributed to the META_SHARE digital exchange facility. In addition to the technical work required, great effort will be made to ensure sustainability through mobilising the LT community, raising awareness of the fundamental role of language resources among the R&D policy makers, the media and the general public.

Short description of the task performed by Croatian partner

Design by: M. Mačinković

(C)opyright by Sveučilište u Zagrebu,