Opendata, web and dolomites

EMBEDDIA SIGNED

Cross-Lingual Embeddings for Less-Represented Languages in European News Media

Project "EMBEDDIA" data sheet

The following table provides information about the project.

Coordinator	INSTITUT JOZEF STEFAN (Jamova 39, 1000 LJUBLJANA; www.ijs.si)
Coordinator Country	Slovenia [SI]
Total cost	2˙998˙850 €
EC max contribution	2˙998˙850 € (100%)
Programme	1. H2020-EU.2.1.1. (INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT))
Code Call	H2020-ICT-2018-2
Funding Scheme	RIA
Starting year	2019
Duration (year-month-day)	from 2019-01-01 to 2021-12-31

Partnership

Take a look at the project's partnership.

#	participants	country	role	EC contrib. [€]
1	INSTITUT JOZEF STEFAN (Jamova 39, 1000 LJUBLJANA; www.ijs.si)	SI (LJUBLJANA)	coordinator	560˙059.00
2	QUEEN MARY UNIVERSITY OF LONDON (327 MILE END ROAD, E1 4NS LONDON; http://www.qmul.ac.uk)	UK (LONDON)	participant	451˙800.00
3	HELSINGIN YLIOPISTO (YLIOPISTONKATU 3, 14 HELSINGIN YLIOPISTO; www.helsinki.fi)	FI (HELSINGIN YLIOPISTO)	participant	448˙125.00
4	UNIVERSITE DE LA ROCHELLE (Avenue Albert-Einstein 23, 17031 LA ROCHELLE; www.univ-lr.fr)	FR (LA ROCHELLE)	participant	372˙500.00
5	UNIVERZA V LJUBLJANI (KONGRESNI TRG 12, 1000 LJUBLJANA; http://www.uni-lj.si)	SI (LJUBLJANA)	participant	323˙750.00
6	TEXTA OU (ASULA TN 3, 11312 TALLINN)	EE (TALLINN)	participant	306˙250.00
7	THE UNIVERSITY OF EDINBURGH (OLD COLLEGE, SOUTH BRIDGE, EH8 9YL EDINBURGH; www.ed.ac.uk)	UK (EDINBURGH)	participant	175˙000.00
8	TRIKODER DRUSTVO S OGRANICENOM ODGOVORNOSCU ZA RAZVOJ INTERNET SUSTAVAI OBLIKOVANJE (ULICA MIROSLAVA MIHOLICA 2, 10000 ZAGREB GRAD ZAGREB)	HR (ZAGREB GRAD ZAGREB)	participant	125˙176.00
9	AS EKSPRESS MEEDIA (NARVA MNT 13, 10151 TALLINN)	EE (TALLINN)	participant	113˙437.00
10	OY SUOMEN TIETOTOIMISTO - FINSKA NOTISBYRAN AB (MALMINKATU 16A, 100 HELSINKI)	FI (HELSINKI)	participant	111˙737.00
11	STYRIA MEDIJSKI SERVISI DOO ZA TRGOVINU I USLUGE (ORESKOVICEVA 6H/1, 10000 ZAGREB)	HR (ZAGREB)	participant	11˙013.00

Project objective

Access to the internet is no longer a luxury; it is a basic component of everyday life and civic engagement, but one in which language continues to be a challenge for fair and equitable access. As Europe becomes more multicultural, and personal and professional mobility between cultures rapidly increases, access to fundamental resources such as local news and government services is limited by the great diversity of the EU's 37 languages. The internet developed mostly in English, without clear planning for how language issues might form barriers to access and engagement, or for how multilingualism might be supported. In the EU, websites and online services for citizens have developed national local language resources, and often provide a second language (usually English) only when absolutely needed; but the great proliferation of web content, multiple and fast-changing content streams, and an expanding user interest base make this approach untenable. And while advanced natural language research and resources exist for a few dominant languages (English, French, German), many of Europe's smaller language communities, and the news media industry that serves them, lack appropriate tools for multilingual internet development.

For the EU to realise a truly equitable, open, multilingual future internet, new tools allowing high quality transformations (not translations) between languages are urgently needed. The EMBEDDIA project seeks to address these challenges by leveraging innovations in the use of cross-lingual embeddings coupled with deep neural networks to allow existing monolingual resources to be used across languages, leveraging their high speed of operation for near real-time applications, without the need for large computational resources. Across three years, the project's six academic and four industry partners will develop novel solutions including for under-represented languages, and test them in real-world news and media production contexts.

Publications

List of publications.
year	authors and title	journal	last update
2019	Miok, Kristian; Nguyen-Doan, Dong; Zaharie, Daniela; Robnik-Šikonja, Marko. Generating Data using Monte Carlo Dropout. DOI: 10.5281/zenodo.3559060	1	2020-03-05
2019	Pivovarova, Lidia; Marjanen, Jani; Zosa, Elaine. Word Clustering for Historical Newspapers Analysis. pages: 3-10, DOI: 10.5281/zenodo.3402940	Proceedings of the Workshop on Language Technology for Digital Historical Archives in conjunction with RANLP-2019	2020-03-05
2019	Shamila Nasreen; Matthew Purver; Julian Hough. A Corpus Study on Questions, Responses and Misunderstanding Signals in Conversations with Alzheimer's Patients. DOI: 10.5281/zenodo.3689456	Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue 13	2020-03-05
2019	Andraž Repar, Matej Martinc, Senja Pollak. Reproduction, replication, analysis and adaptation of a term alignment approach. ISSN: 1574-020X, DOI: 10.1007/s10579-019-09477-1	Language Resources and Evaluation	2020-03-05
2019	Jani Marjanen; Lidia Pivovarova; Elaine Zosa; Jussi Kurunmäki. Clustering Ideological Terms in Historical Newspaper Data with Diachronic Word Embeddings. DOI: 10.5281/zenodo.3689467	HistoInformatics 2019: International Workshop on Computational History 2019	2020-03-05
2019	Kristian Miok, Dong Nguyen-Doan, Daniela Zaharie, and Marko Robnik-Šikonja. Generating Data using Monte Carlo Dropout.	IEEE 15th International Conference on Intelligent Computer Communication and Processing (ICCP 2019)	2020-02-11
2019	Matej Martinc, Senja Pollak. Combining n-grams and deep convolutional features for language variety classification. pages: 607-632, ISSN: 1351-3249, DOI: 10.1017/S1351324919000299	Natural Language Engineering 25/5	2020-02-11
2019	Andraž Repar, Vid Podpečan, Anže Vavpetič, Nada Lavrač, Senja Pollak. TermEnsembler. pages: 93-120, ISSN: 0929-9971, DOI: 10.1075/term.00029.rep	Terminology 25/1	2020-02-11
2019	Senja Pollak, Andraž Repar, Matej Martinc, and Vid Podpečan. Karst exploration: Extracting terms and definitions from karst.	Proceedings of the 6th biennial conference on electronic lexicography, eLex 2019	2020-02-11
2019	Martinc, Matej; Škrlj, Blaž; Pollak, Senja. Fake or Not: Distinguishing Between Bots, Males and Females.	Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum 2	2020-02-11
2019	Martinc, Matej; Škrlj, Blaž; Pollak, Senja. Who is hot and who is not? Profiling celebs on Twitter.	Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum 6	2020-02-11
2019	Jani Marjanen, Lidia Pivovarova, Elaine Zosa, and Jussi Kurunmäki. Clustering Ideological Terms in Historical Newspaper Data with Diachronic Word Embeddings.	Proceedings of the 5th International Workshop on Computational History	2020-02-11
2019	Lidia Pivovarova, Elaine Zosa, and Jussi Kurunmäki. Word Clustering for Historical Newspapers Analysis.	Proceedings of the Workshop on Language Technology for Digital Historical Archives	2020-02-11
2019	Tadej Škvorc, Simon Krek, Senja Pollak, Špela Arhar Holdt, Marko Robnik-Šikonja. Predicting Slovene Text Complexity Using Readability Measures. ISSN: 2463-7807	In Contributions to Contemporary History	2020-02-11
2019	Andraž Pelicon, Matej Martinc, and Petra Kralj Novak. Embeddia at SemEval-2019 Task 6: Detecting hate with neural network and transfer learning approaches.	Proceedings of The 13th International Workshop on Semantic Evaluation (SemEval)	2020-02-11
2019	Morteza Rohanian, Julian Hough, Matthew Purver. Detecting Depression with Word-Level Multimodal Fusion. pages: 1443-1447, DOI: 10.21437/interspeech.2019-2283	Interspeech 2019	2020-02-11
2019	Jose G. Moreno, Elvys Linhares Pontes, Mickael Coustaty, Antoine Doucet. TLR at BSNLP2019: A Multilingual Named Entity Recognition System. pages: 83-88, DOI: 10.18653/v1/w19-3711	Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing	2020-02-11
2019	Matej Martinc, Senja Pollak. Pooled LSTM for Dutch cross-genre gender classification.	Proceedings of the Shared Task on Cross-Genre Gender Detection in Dutch at Computational Linguistic in Netherlands (CLIN 2019) conference	2020-02-11

The information about "EMBEDDIA" is provided by the European Opendata Portal: CORDIS opendata.
