#	Pagina
attuale pagina	/open-h2020/projects/201445/index.html
-1	/open-h2020/projects/199891/index.html
-2	/open-h2020/projects/218399/index.html
-3	/open-h2020/projects/215428/index.html
-4	/open-fp7/projects/95816/index.html

Opendata, web and dolomites

TalkingHeads SIGNED

TalkingHeads: Audiovisual Speech Recognition in-the-wild

Total Cost €

EC-Contrib. €

Partnership

Views

Outcomes and
results

TalkingHeads project word cloud

Explore the words cloud of the TalkingHeads project. It provides you a very rough idea of what is the project "TalkingHeads" about.

career literature facial background complementing researcher conducting audio setting collaborators databases automatic publishing area movement skills diarization training noise occasionally er correlation mainly mouth expression extensive multimedia proposes industry talkingheads establishing recognizing noisy videos recognition academia auditory expertise plan assumed time video leadership listener personal research internationally talented speech speakers patterns maturity achievable alignment purely environment journals world wild multiple conferences vision detection laboratory computer visual supervisor brings unconstrained first supervisory tracking explored collected independent fusion perceives network refers attain

Project "TalkingHeads" data sheet

The following table provides information about the project.

Coordinator	THE UNIVERSITY OF NOTTINGHAM Organization address address: University Park city: NOTTINGHAM postcode: NG7 2RD website: www.nottingham.ac.uk contact info title: n.a. name: n.a. surname: n.a. function: n.a. email: n.a. telephone: n.a. fax: n.a.
Coordinator Country	United Kingdom [UK]
Project website	http://www.talking-heads.eu
Total cost	183˙454 €
EC max contribution	183˙454 € (100%)
Programme	1. H2020-EU.1.3.2. (Nurturing excellence by means of cross-border and cross-sector mobility)
Code Call	H2020-MSCA-IF-2015
Funding Scheme	MSCA-IF-EF-ST
Starting year	2016
Duration (year-month-day)	from 2016-06-01 to 2018-05-31

Partnership

Take a look of project's partnership.

#	participants	country	role	EC contrib. [€]
1	THE UNIVERSITY OF NOTTINGHAM THE UNIVERSITY OF NOTTINGHAM Organization address address: University Park city: NOTTINGHAM postcode: NG7 2RD website: www.nottingham.ac.uk contact info title: n.a. name: n.a. surname: n.a. function: n.a. email: n.a. telephone: n.a. fax: n.a.	UK (NOTTINGHAM)	coordinator	183˙454.00

Map

Project objective

Audio-visual speech recognition refers to the problem of recognizing speech using both audio and video information. Speech is not a purely auditory process but the way that the listener perceives it is also through the recognition of the visual patterns associated with the mouth movement. This correlation of the audio-visual information has been occasionally explored in literature in order to develop more robust automatic speech recognition systems for cases in which the auditory environment is noisy (e.g. background noise, multiple speakers). However, the problem of audio-visual speech recognition has been mainly studied in controlled, laboratory conditions. TalkingHeads proposes, for the first time, the problem of audio-visual speech recognition in unconstrained (in-the-wild) videos collected from real-world multimedia databases and a set of methodologies that will work well under the assumed in-the-wild setting.

TalkingHeads brings together a talented but experienced researcher (ER) with expertise in speech analysis (diarization and recognition) and the Supervisor with large research experience in Computer Vision for face analysis in-the-wild (recognition, detection, alignment and tracking, and facial expression analysis). TalkingHeads will establish the ER as an independent and internationally recognized researcher in the area of audio-visual fusion and speech recognition. Through TalkingHeads’ achievable work plan, the ER will attain a high level of research maturity by (a) complementing his expertise on speech analysis through extensive training in Computer Vision, (b) conducting research on a challenging research problem (audio-visual speech recognition in-the-wild) with significant career opportunities in both the academia and the industry, (c) publishing at high impact factor conferences and journals, (d) establishing a network of research collaborators, and (e) enhancing personal skills (e.g. supervisory experience, leadership and management skills).

Publications

List of publications.
year	authors and title	journal	last update
2018	Themos Stafylakis, Georgios Tzimiropoulos Zero-shot keyword spotting for visual speech recognition in-the-wild published pages: , ISSN: , DOI:	European Conference on Computer Vision (ECCV)	2019-06-13
2017	Themos Stafylakis, Georgios Tzimiropoulos Combining Residual Networks with LSTMs for Lipreading published pages: , ISSN: , DOI:	Interspeech	2019-06-13
2016	Kong-Aik Lee, Ville HautamÃ¤ki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto The I4U submission to the 2016 NIST speaker recognition evaluation published pages: , ISSN: , DOI:	NIST SRE 2016 Workshop	2019-06-13
2017	Kong-Aik Lee, Ville HautamÃ¤ki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016 published pages: , ISSN: , DOI:	Interspeech	2019-06-13
2018	Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic End-to-end Audiovisual Speech Recognition published pages: , ISSN: , DOI:	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	2019-06-13
2018	Stafylakis, Themos; Tzimiropoulos, Georgios Deep word embeddings for visual speech recognition published pages: , ISSN: , DOI:	IEEE International Conference on Acoustics, Speech, and Signal Processing	2019-06-13
2018	Brummer, Niko; Silnova, Anna; Burget, Lukas; Stafylakis, Themos Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model published pages: , ISSN: , DOI:	Proceedings of Odyssey	2019-06-13

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "TALKINGHEADS" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "TALKINGHEADS" are provided by the European Opendata Portal: CORDIS opendata.