#	Pagina
attuale pagina	/open-h2020/projects/206936/index.html

Opendata, web and dolomites

FASTPARSE SIGNED

Fast Natural Language Parsing for Large-Scale NLP

Total Cost €

EC-Contrib. €

Partnership

Views

Outcomes and
results

FASTPARSE project word cloud

Explore the words cloud of the FASTPARSE project. It provides you a very rough idea of what is the project "FASTPARSE" about.

societies data small speed regularities modern opinion machine amounts ed search eliminate roadblock extract nlp forms techniques cognitively natural communicate extraction digest constitutes unstructured patterns unprecedented translation calculation fast individuals answering discover intermediate algorithms public popularization accurate fundamental syntactic circulating explicit model recode language suitable hardware documents barriers engines asset acceptable break hundred written mining monitor calculations generate parsing fastparse redundant technologies context unfortunately linguistic sentences inspired web compress granted leveraging people bottleneck summarization reuse fronts form parsers internet rely vast kinds prohibitive standard analyze avoiding joint faster question computational human power convenient

Project "FASTPARSE" data sheet

The following table provides information about the project.

Coordinator	UNIVERSIDADE DA CORUNA Organization address address: CALLE DE LA MAESTRANZA 9 city: LA CORUNA postcode: 15001 website: http://www.udc.es contact info title: n.a. name: n.a. surname: n.a. function: n.a. email: n.a. telephone: n.a. fax: n.a.
Coordinator Country	Spain [ES]
Project website	http://fastparse.grupolys.org
Total cost	1˙481˙747 €
EC max contribution	1˙481˙747 € (100%)
Programme	1. H2020-EU.1.1. (EXCELLENT SCIENCE - European Research Council (ERC))
Code Call	ERC-2016-STG
Funding Scheme	ERC-STG
Starting year	2017
Duration (year-month-day)	from 2017-02-01 to 2022-01-31

Partnership

Take a look of project's partnership.

#	participants	country	role	EC contrib. [€]
1	UNIVERSIDADE DA CORUNA UNIVERSIDADE DA CORUNA Organization address address: CALLE DE LA MAESTRANZA 9 city: LA CORUNA postcode: 15001 website: http://www.udc.es contact info title: n.a. name: n.a. surname: n.a. function: n.a. email: n.a. telephone: n.a. fax: n.a.	ES (LA CORUNA)	coordinator	1˙481˙747.00

Map

Project objective

The popularization of information technology and the Internet has resulted in an unprecedented growth in the scale at which individuals and institutions generate, communicate and access information. In this context, the effective leveraging of the vast amounts of available data to discover and address people's needs is a fundamental problem of modern societies.

Since most of this circulating information is in the form of written or spoken human language, natural language processing (NLP) technologies are a key asset for this crucial goal. NLP can be used to break language barriers (machine translation), find required information (search engines, question answering), monitor public opinion (opinion mining), or digest large amounts of unstructured text into more convenient forms (information extraction, summarization), among other applications.

These and other NLP technologies rely on accurate syntactic parsing to extract or analyze the meaning of sentences. Unfortunately, current state-of-the-art parsing algorithms have high computational costs, processing less than a hundred sentences per second on standard hardware. While this is acceptable for working on small sets of documents, it is clearly prohibitive for large-scale processing, and thus constitutes a major roadblock for the widespread application of NLP.

The goal of this project is to eliminate this bottleneck by developing fast parsers that are suitable for web-scale processing. To do so, FASTPARSE will improve the speed of parsers on several fronts: by avoiding redundant calculations through the reuse of intermediate results from previous sentences; by applying a cognitively-inspired model to compress and recode linguistic information; and by exploiting regularities in human language to find patterns that the parsers can take for granted, avoiding their explicit calculation. The joint application of these techniques will result in much faster parsers that can power all kinds of web-scale NLP applications.

Publications

List of publications.
year	authors and title	journal	last update
2020	Daniel FernÃ¡ndez-GonzÃ¡lez, Carlos GÃ³mez-RodrÃguez Discontinuous Constituent Parsing with Pointer Networks published pages: In press, ISSN: , DOI:	Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20)	2020-03-11
2020	David Vilares, Michalina Strzyz, Anders SÃ¸gaard, Carlos GÃ³mez-RodrÃguez Parsing as Pretraining published pages: In press, ISSN: , DOI:	Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20)	2020-03-11
2020	Carlos GÃ³mez-RodrÃguez, Morten H. Christiansen, Ramon Ferrer-i-Cancho Cognitive Constraints Built into Formal Grammars: Implications for Language Evolution published pages: , ISSN: , DOI: 10.17617/2.3190925	The Evolution of Language: Proceedings of the 13th International Conference (EvoLang13)	2020-03-11
2019	Michalina Strzyz, David Vilares, Carlos GÃ³mez-RodrÃguez Sequence Tagging for Fast Dependency Parsing published pages: 49, ISSN: 2504-3900, DOI: 10.3390/proceedings2019021049	Proceedings 21/1	2020-03-11
2018	Mark DÃ¡ibhidh Anderson, David Vilares Increasing NLP Parsing Efficiency with Chunking published pages: 1160, ISSN: 2504-3900, DOI: 10.3390/proceedings2181160	Proceedings 2/18	2019-08-29
2019	Daniel FernÃ¡ndez-GonzÃ¡lez, Carlos GÃ³mez-RodrÃguez Faster shift-reduce constituent parsing with a non-binary, bottom-up strategy published pages: 559-574, ISSN: 0004-3702, DOI: 10.1016/j.artint.2019.07.006	Artificial Intelligence 275	2019-08-29
2019	Michalina Strzyz, Carlos GÃ³mez-RodrÃguez Speeding Up Natural Language Parsing by Reusing Partial Results published pages: , ISSN: , DOI:	Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2019)	2019-08-29
2019	Yerai Doval, Carlos GÃ³mez-RodrÃguez Comparing neural- and N-gram-based language models for word segmentation published pages: 187-197, ISSN: 1532-2882, DOI: 10.1002/asi.24082	Journal of the Association for Information Science and Technology 70/2	2019-08-29
2017	GÃ³mez RodrÃguez, Carlos Towards fast natural language parsing: FASTPARSE ERC Starting Grant published pages: 121-124, ISSN: 1135-5948, DOI:	Procesamiento del Lenguaje Natural 59	2019-06-13
2019	Carlos GÃ³mez-RodrÃguez, Iago Alonso-Alonso, David Vilares How important is syntactic parsing accuracy? An empirical evaluation on rule-based sentiment analysis published pages: , ISSN: 0269-2821, DOI: 10.1007/s10462-017-9584-0	Artificial Intelligence Review	2019-06-13
2018	R. Ferrer-i-Cancho, C. GÃ³mez-RodrÃguez, J.L. Esteban Are crossing dependencies really scarce? published pages: 311-329, ISSN: 0378-4371, DOI: 10.1016/j.physa.2017.10.048	Physica A: Statistical Mechanics and its Applications 493	2019-06-13
2017	David Vilares, Miguel A. Alonso, Carlos GÃ³mez-RodrÃguez Tratamiento sintÃ¡ctico de la negaciÃ³n en anÃ¡lisis del sentimiento monolingÃ¼e y multilingÃ¼e published pages: 39-46, ISSN: , DOI:	Proc. of Taller de NEGaciÃ³n en ESpaÃ±ol (NEGES 2017)	2019-06-13
2018	MARCOS GARCIA, CARLOS GÃ“MEZ-RODRÃGUEZ, MIGUEL A. ALONSO New treebank or repurposed? On the feasibility of cross-lingual parsing of Romance languages with Universal Dependencies published pages: 91-122, ISSN: 1351-3249, DOI: 10.1017/S1351324917000377	Natural Language Engineering 24/01	2019-06-13
2017	Carlos GÃ³mez-RodrÃguez, Ramon Ferrer-i-Cancho Scarcity of crossing dependencies: A direct outcome of a specific constraint? published pages: 62304, ISSN: 2470-0045, DOI: 10.1103/PhysRevE.96.062304	Physical Review E 96/6	2019-06-13
2018	Chen, Xinying; GÃ³mez RodrÃguez, Carlos; Ferrer Cancho, Ramon A dependency look at the reality of constituency published pages: 104-106, ISSN: 1617-8351, DOI:	Glottometrics 40	2019-06-13
2017	Carlos GÃ³mez-RodrÃguez On the relation between dependency distance, crossing dependencies, and parsing published pages: 200-203, ISSN: 1571-0645, DOI: 10.1016/j.plrev.2017.05.007	Physics of Life Reviews 21	2019-06-13
2018	Carlos GÃ³mez-RodrÃguez Natural Language Parsing: Progress and Challenges published pages: 159-175, ISSN: 1889-3805, DOI:	BoletÃn de EstadÃstica e InvestigaciÃ³n Operativa 34(2)	2019-06-13
2018	Yerai Doval, David Vilares On the Processing and Analysis of Microtexts: From Normalization to Semantics published pages: 1170, ISSN: 2504-3900, DOI: 10.3390/proceedings2181170	Proceedings 2/18	2019-08-29
2019	Ramon Ferrer-i-Cancho, Carlos GÃ³mez-RodrÃguez Anti dependency distance minimization in short sequences. A graph theoretic approach published pages: In press, ISSN: 0929-6174, DOI:	Journal of Quantitative Linguistics	2019-08-29
2019	Mark Anderson, David Vilares, Carlos GÃ³mez-RodrÃguez Artificially Evolved Chunks for Morphosyntactic Analysis published pages: To appear, ISSN: , DOI:	Proceedings of the 18th International Workshop on Treebanks and Linguistic Theories	2019-08-29

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "FASTPARSE" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "FASTPARSE" are provided by the European Opendata Portal: CORDIS opendata.