MONOTRANS

Statistical Machine Translation Using Monolingual Corpora

Coordinator: UNIVERSITAT ROVIRA I VIRGILI

Organization address: CARRER DE ESCORXADOR
city: TARRAGONA
postcode: 43003

Contact info
Title: Dr.
First name: M. Dolores
Last name: Jimenez Lopez
Telephone: -560486
Fax: -560540

Coordinator nationality: Spain [ES]
Total cost: 207,884 €
EC contribution: 207,884 €
Programme: FP7-PEOPLE
Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)
Call code: FP7-PEOPLE-2007-2-1-IEF
Funding scheme: MC-IEF
Start year: 2009
Period (year-month-day): 2009-03-01 to 2011-02-28

Participants

#    participant                     country           role           EC contrib. [€]
1    UNIVERSITAT ROVIRA I VIRGILI    ES (TARRAGONA)    coordinator    0.00



Project objective

As evidenced by a number of machine translation competitions, statistical machine translation produces encouraging results for language pairs where large corpora of previously translated texts are available for training. In practice, however, the availability of such data is often a severe bottleneck. We therefore propose a methodology that requires only a bilingual dictionary and monolingual text corpora of the source and the target language, which should considerably relieve the data acquisition problem. What we suggest is a two-stage procedure. In the first stage, we create a database of translation equivalents by extracting them from a pair of comparable monolingual corpora, using a bilingual dictionary in combination with automatically generated thesauri of related words. In the second stage, we translate new sentences by retrieving appropriate translation equivalents from the database and merging them using a combinatorial approach.
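The two-stage procedure can be illustrated with a minimal sketch. It is not the project's implementation. All names (`DICTIONARY`, `SOURCE_SEGMENTS`, `TARGET_SEGMENTS`, `build_equivalents`, `translate`) and the German-English toy data are hypothetical. The matching rule (equal length, every source word has a dictionary translation in the target segment) is a simplifying assumption, and the thesauri of related words mentioned above are left out.

```python
from itertools import product

# Hypothetical toy bilingual dictionary: source word -> possible target words.
DICTIONARY = {
    "das": {"the"},
    "haus": {"house"},
    "ist": {"is"},
    "klein": {"small", "little"},
    "gross": {"big", "large"},
}

# Hypothetical segments taken from two comparable (not parallel) corpora.
SOURCE_SEGMENTS = ["das haus", "ist klein", "ist gross"]
TARGET_SEGMENTS = ["the house", "is small", "is big", "the garden"]


def is_equivalent(src_tokens, tgt_tokens, dictionary):
    """Assumed rule: same length, and each source word has a dictionary
    translation somewhere in the target segment (word order is ignored)."""
    if len(src_tokens) != len(tgt_tokens):
        return False
    tgt_set = set(tgt_tokens)
    return all(dictionary.get(word, set()) & tgt_set for word in src_tokens)


def build_equivalents(source_segments, target_segments, dictionary):
    """Stage 1: build a database mapping source segments to target segments."""
    db = {}
    for src in source_segments:
        for tgt in target_segments:
            if is_equivalent(src.split(), tgt.split(), dictionary):
                db.setdefault(src, []).append(tgt)
    return db


def segmentations(tokens, db):
    """Yield every split of tokens into contiguous segments found in db."""
    if not tokens:
        yield []
        return
    for i in range(1, len(tokens) + 1):
        head = " ".join(tokens[:i])
        if head in db:
            for rest in segmentations(tokens[i:], db):
                yield [head] + rest


def translate(sentence, db):
    """Stage 2: retrieve equivalents for each segment and combine them.
    Returns every candidate; ranking the candidates is out of scope here."""
    candidates = []
    for segments in segmentations(sentence.split(), db):
        for combo in product(*(db[seg] for seg in segments)):
            candidates.append(" ".join(combo))
    return candidates


if __name__ == "__main__":
    database = build_equivalents(SOURCE_SEGMENTS, TARGET_SEGMENTS, DICTIONARY)
    print(translate("das haus ist klein", database))
```

The sketch returns an empty list when a sentence cannot be covered by stored segments. A real system would need a fallback for uncovered words and a way to score competing combinations, neither of which the description above specifies.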
