Opendata, web and dolomites

LEXICAL SIGNED

Lexical Acquisition Across Languages

Total Cost €

0

EC-Contrib. €

0

Partnership

0

Views

0

Project "LEXICAL" data sheet

The following table provides information about the project.

Coordinator
THE CHANCELLOR MASTERS AND SCHOLARSOF THE UNIVERSITY OF CAMBRIDGE 

Organization address
address: TRINITY LANE THE OLD SCHOOLS
city: CAMBRIDGE
postcode: CB2 1TN
website: www.cam.ac.uk

contact info
title: n.a.
name: n.a.
surname: n.a.
function: n.a.
email: n.a.
telephone: n.a.
fax: n.a.

 Coordinator Country United Kingdom [UK]
 Project website http://ltl.mml.cam.ac.uk/projects/lexical/
 Total cost 1˙989˙203 €
 EC max contribution 1˙989˙203 € (100%)
 Programme 1. H2020-EU.1.1. (EXCELLENT SCIENCE - European Research Council (ERC))
 Code Call ERC-2014-CoG
 Funding Scheme ERC-COG
 Starting year 2015
 Duration (year-month-day) from 2015-09-01   to  2020-08-31

 Partnership

Take a look of project's partnership.

# participants  country  role  EC contrib. [€] 
1    THE CHANCELLOR MASTERS AND SCHOLARSOF THE UNIVERSITY OF CAMBRIDGE UK (CAMBRIDGE) coordinator 1˙989˙203.00

Map

 Project objective

Due to the growing volume of textual information available in multiple languages, there is a great demand for Natural Language Processing (NLP) techniques that can automatically process and manage multi-lingual texts, supporting information access and communication in core areas of society (e.g. healthcare, business, science). Many NLP tasks and applications rely on task-specific lexicons (e.g. dictionaries, word classifications) for optimal performance. Recently, automatic acquisition of lexicons from relevant texts has proved a promising, cost-effective alternative to manual lexicography. It has the potential to considerably enhance the viability and portability of NLP technology both within and across languages. However, this approach has been explored for a very small number of resource-rich languages only, leaving the vast majority of worlds’ languages without useful technology. The ambitious goal of this project is to take research in lexical acquisition to the level where it can support multi-lingual NLP, involving also languages for which no parallel language resources (e.g. corpora, knowledge resources) are available. Building on an emerging line of research which uses mainly naturally occurring supervision (connections between languages) to guide cross-lingual NLP, we will develop a radically novel approach to lexical acquisition. This approach will transfer lexical knowledge from one language to another as well as will learn it simultaneously for a diverse set of languages using new methodology based on guiding joint learning and inference with rich knowledge about cross-lingual connections. We not only aim to create next generation lexical acquisition technology but also aim to take cross-lingual NLP a big step toward to the direction where it is no longer dependent on parallel resources. We will use our approach to support fundamental tasks and applications aimed at broadening the global reach of NLP to areas where it is now critically needed.

 Publications

year authors and title journal last update
List of publications.
2019 Billy Chiu, Simon Baker, Martha Palmer, Anna Korhonen
Enhancing biomedical word embeddings by retrofitting to verb clusters
published pages: 125-134, ISSN: , DOI: 10.18653/v1/w19-5014
Proceedings of the 18th BioNLP Workshop and Shared Task 2020-04-24
2019 Edoardo Maria Ponti, Helen O’Horan, Yevgeni Berzak, Ivan Vulić, Roi Reichart, Thierry Poibeau, Ekaterina Shutova, Anna Korhonen
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
published pages: 559-601, ISSN: 0891-2017, DOI: 10.1162/coli_a_00357
Computational Linguistics 45/3 2020-04-24
2019 Billy Chiu, Olga Majewska, Sampo Pyysalo, Laura Wey, Ulla Stenius, Anna Korhonen, Martha Palmer
A neural classification method for supporting the creation of BioVerbNet
published pages: , ISSN: 2041-1480, DOI: 10.1186/s13326-018-0193-x
Journal of Biomedical Semantics 10/1 2020-04-24
2019 Ehsan Shareghi, Daniela Gerz, Ivan Vulić, Anna Korhonen
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines
published pages: 4113-4118, ISSN: , DOI: 10.18653/v1/n19-1417
Proceedings of the 2019 Conference of the North 2020-04-24
2018 Daniela Gerz, Ivan Vulić, Edoardo Maria Ponti, Roi Reichart, Anna Korhonen
On the Relation between Linguistic Typology and (Limitations of) Multilingual Language Modeling
published pages: 316-327, ISSN: , DOI: 10.18653/v1/d18-1029
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2020-04-24
2019 Ehsan Shareghi, Yingzhen Li, Yi Zhu, Roi Reichart, Anna Korhonen
Bayesian learning for neural dependency parsing
published pages: 3509-3519, ISSN: , DOI: 10.18653/v1/n19-1354
Proceedings of the 2019 Conference of the North 2020-04-24
2019 Edoardo Maria Ponti, Ivan Vulić, Ryan Cotterell, Roi Reichart, Anna Korhonen
Towards Zero-shot Language Modeling
published pages: 2900-2910, ISSN: , DOI: 10.18653/v1/d19-1288
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2020-04-23
2019 Yi Zhu, Benjamin Heinzerling, Ivan Vulić, Michael Strube, Roi Reichart, Anna Korhonen
On the Importance of Subword Information for Morphological Tasks in Truly Low-Resource Languages
published pages: 216-226, ISSN: , DOI: 10.18653/v1/k19-1021
Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL) 2020-04-23
2019 Paweł Budzianowski, Ivan Vulić
Hello, It’s GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems
published pages: 15-22, ISSN: , DOI: 10.18653/v1/d19-5602
Proceedings of the 3rd Workshop on Neural Generation and Translation 2020-04-23
2019 Qianchu Liu, Diana McCarthy, Ivan Vulić, Anna Korhonen
Investigating Cross-Lingual Alignment Methods for Contextualized Embeddings with Token-Level Evaluation
published pages: 33-43, ISSN: , DOI: 10.18653/v1/k19-1004
Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL) 2020-04-23
2019 Ivan Vulić, Goran Glavaš, Roi Reichart, and Anna Korhonen
Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
published pages: 4406-4417, ISSN: , DOI: 10.18653/v1/d19-1449
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019) 2020-04-23
2019 Edoardo Maria Ponti, Ivan Vulić, Goran Glavaš, Roi Reichart, Anna Korhonen
Cross-lingual Semantic Specialization via Lexical Relation Induction
published pages: 2206-2217, ISSN: , DOI: 10.18653/v1/d19-1226
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2020-04-23
2020 Anne Lauscher, Goran Glavaš, Simone Paolo Ponzetto, and Ivan Vulić
A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces
published pages: , ISSN: , DOI:
Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI 2020) long paper, to appear 2020-04-23
2019 Aishwarya Kamath, Jonas Pfeiffer, Edoardo Maria Ponti, Goran Glavaš, Ivan Vulić
Specializing Distributional Vectors of All Words for Lexical Entailment
published pages: 72-83, ISSN: , DOI: 10.18653/v1/w19-4310
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) 2020-04-23
2018 Daniela Gerz*, Ivan Vulić*, Edoardo Maria Ponti, Roi Reichart, and Anna Korhonen
On the Relation between Linguistic Typology and (Limitations of) Multilingual Language Modeling
published pages: 316--327, ISSN: , DOI:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2020-04-23
2018 Ponti, Edoardo Maria; Vulić, Ivan; Glavaš, Goran; Mrkšić, Nikola; Korhonen, Anna
Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization
published pages: 282--293, ISSN: , DOI: 10.18653/v1/D18-1026
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 12 2020-04-23
2018 Daniela Gerz, Ivan Vulić, Edoardo Maria Ponti, Jason Naradowsky, Roi Reichart, and Anna Korhonen
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction
published pages: pp. 451-465, ISSN: , DOI:
Presented at EMNLP 2018 vol. 6 2020-04-23
2017 Ivan Vulić, Nikola Mrkšić, Roi Reichart, Diarmuid Ó Séaghdha, Steve Young, Anna Korhonen
Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules
published pages: 56-68, ISSN: , DOI: 10.18653/v1/P17-1006
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2020-04-23
2018 Edoardo Maria Ponti, Roi Reichart, Anna Korhonen, and Ivan Vulić
Isomorphic Transfer of Syntactic Structures in Cross-Lingual Natural Language Processing
published pages: pp. 1531-1542, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018) long paper 2020-04-23
2018 Anders Søgaard, Sebastian Ruder, Ivan Vulić
On the Limitations of Unsupervised Bilingual Dictionary Induction
published pages: pp. 778-788, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2020-04-23
2017 Ivan Vulić
Cross-Lingual Syntactically Informed Distributed Word Representations
published pages: 408-414, ISSN: , DOI:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017) 2020-04-23
2017 Geert Heyman, Ivan Vulić, and Marie-Francine Moens
Bilingual Lexicon Induction by Learning to Combine Word-Level and Character-Level Representations
published pages: 1085-1095, ISSN: , DOI:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017) 2020-04-23
2016 Ivan Vulic and Anna Korhonen
On the Role of Seed Lexicons in Learning Bilingual Word Embeddings
published pages: 247-257, ISSN: , DOI: 10.17863/CAM.9717
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers 2020-04-23
2016 Ivan Vulic and Anna Korhonen
\"Is \"\"Universal Syntax\"\" Universally Useful for Learning Distributed Word Representations?\"
published pages: 518-524, ISSN: , DOI:
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics Volume 2: Short Papers 2020-04-23
2018 Goran Glavaš and Ivan Vulić
Discriminating between Lexico-Semantic Relations with the Specialization Tensor Model
published pages: pp. 181-187, ISSN: , DOI:
Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018) 2020-04-23
2018 Goran Glavaš and Ivan Vulić
Explicit Retrofitting of Distributional Word Vectors
published pages: pp. 34-45, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018) Volume 1: Long Papers 2020-04-23
2017 Nikola Mrkšić, Ivan Vulić, Diarmuid Ó Séaghdha, Roi Reichart, Ira Leviant, Milica Gašić, Anna Korhonen, and Steve Young
Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
published pages: 309-324, ISSN: 2307-387X, DOI:
Transactions of the Association for Computational Linguistics, presented at EMNLP 2017 vol. 5 2020-04-23
2018 Ivan Vulić, Goran Glavaš, Nikola Mrkšić, and Anna Korhonen
Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources
published pages: , ISSN: , DOI:
Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018) long paper 2020-04-23
2018 Ivan Vulić and Anna Korhonen
Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation
published pages: 137-143, ISSN: , DOI:
Proceedings of The Third Workshop on Representation Learning for NLP 2020-04-23
2016 Ivan Vulić, Douwe Kiela, Stephen Clark, and Marie-Francine Moens
Multi-Modal Representations for Improved Bilingual Lexicon Learning
published pages: 188-194, ISSN: , DOI:
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics Volume 2: Short Papers 2020-04-23
2016 Ivan Vulić and Marie-Francine Moens
Bilingual Distributed Word Representations from Document-Aligned Comparable Data
published pages: 953-994, ISSN: 1076-9757, DOI: 10.1613/jair.4986
Journal of Artificial Intelligence Research Volume 55 2020-04-23
2018 Olga Majewska, Diana McCarthy, Ivan Vulić, and Anna Korhonen
Acquiring Verb Classes through Bottom-Up Semantic Verb Clustering
published pages: pp. 952-958, ISSN: , DOI:
Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018) 2020-04-23
2018 Guy Rotman, Ivan Vulić, and Roi Reichart
Bridging Languages through Images with Deep Partial Canonical Correlation Analysis
published pages: pp. 910-921, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018) long paper 2020-04-23
2017 Ponti, Edoardo Maria and Anna Korhonen
Event-Related Features in Feedforward Neural Networks Contribute to Identifying Implicit Causal Relations in Discourse
published pages: 25-30, ISSN: , DOI:
Proceedings of the 2nd Workshop on Linking Models of Lexical, Sentential and Discourse-level Semantics 2020-04-23
2018 Marek Rei, Daniela Gerz, and Ivan Vulić
Scoring Lexical Entailment with a Supervised Directional Similarity Network
published pages: 638-643, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2020-04-23
2018 Vulić, Ivan; Mrkšić, Nikola
Specialising Word Vectors for Lexical Entailment
published pages: 1134 -1145, ISSN: , DOI:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Volume 1 (Long Papers) 2020-04-23
2017 Ivan Vulić, Daniela Gerz, Douwe Kiela, Felix Hill, Anna Korhonen
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment
published pages: 781-835, ISSN: 0891-2017, DOI: 10.1162/COLI_a_00301
Computational Linguistics 43/4 2020-04-23
2018 Nikola Mrkšić and Ivan Vulić
Fully Statistical Neural Belief Tracking
published pages: pp. 108-113, ISSN: , DOI:
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018) short paper 2020-04-23
2018 Billy Chiu, Sampo Pyysalo, Ivan Vulić, Anna Korhonen
Bio-SimVerb and Bio-SimLex: wide-coverage evaluation sets of word similarity in biomedicine
published pages: , ISSN: 1471-2105, DOI: 10.1186/s12859-018-2039-z
BMC Bioinformatics 19/1 2020-04-23
2017 Ivan Vulić, Roy Schwartz, Ari Rappoport, Roi Reichart, Anna Korhonen
Automatic Selection of Context Configurations for Improved Class-Specific Word Representations
published pages: 112-122, ISSN: , DOI: 10.18653/v1/K17-1013
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017) 2020-04-23
2018 Ivan Vulić and Nikola Mrkšić
Specialising Word Vectors for Lexical Entailment
published pages: , ISSN: , DOI:
Proceedings of the 16th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018) long paper 2020-04-23
2017 Olga Majewska, Ivan Vulić, Diana McCarthy, Yan Huang, Akira Murakami, Veronika Laippala, Anna Korhonen
Investigating the cross-lingual translatability of VerbNet-style classification
published pages: , ISSN: 1574-020X, DOI: 10.1007/s10579-017-9403-x
Language Resources and Evaluation 2020-04-23
2017 Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen
Decoding Sentiment from Distributed Representations of Sentences
published pages: 22-32, ISSN: , DOI: 10.18653/v1/S17-1003
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (*SEM 2017) 2020-04-23

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "LEXICAL" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an  email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "LEXICAL" are provided by the European Opendata Portal: CORDIS opendata.

More projects from the same programme (H2020-EU.1.1.)

HYPATIA (2019)

Privacy and Utility Allied

Read More  

EAST (2020)

Using Evolutionary Algorithms to Understand and Secure Web/Enterprise Systems

Read More  

GRAPH-IC (2019)

Silicon-Integrated Graphene Photodetectors for Future Photonic Integrated Circuits – Graph-IC

Read More