The following table provides information about the project.
Coordinator | THE UNIVERSITY OF NOTTINGHAM |
Organization address contact info |
Coordinator Country | United Kingdom [UK] |
Project website | http://www.talking-heads.eu |
Total cost | 183˙454 € |
EC max contribution | 183˙454 € (100%) |
Programme | H2020-EU.1.3.2. (Nurturing excellence by means of cross-border and cross-sector mobility) |
Code Call | H2020-MSCA-IF-2015 |
Funding Scheme | MSCA-IF-EF-ST |
Starting year | 2016 |
Duration (year-month-day) | from 2016-06-01 to 2018-05-31 |
Take a look at the project's partnership.
# | participant | country (city) | role | EC contribution (€) |
---|---|---|---|---|
1 | THE UNIVERSITY OF NOTTINGHAM | UK (NOTTINGHAM) | coordinator | 183˙454.00 |
Audio-visual speech recognition is the problem of recognizing speech using both audio and video information. Speech perception is not purely auditory: listeners also rely on the visual patterns associated with mouth movement. This correlation between audio and visual information has occasionally been explored in the literature to develop more robust automatic speech recognition systems for noisy auditory environments (e.g. background noise, multiple speakers). However, audio-visual speech recognition has mainly been studied in controlled laboratory conditions. TalkingHeads addresses, for the first time, audio-visual speech recognition in unconstrained (in-the-wild) videos collected from real-world multimedia databases, and proposes a set of methodologies designed to work well in this in-the-wild setting.
TalkingHeads brings together a talented and experienced researcher (ER) with expertise in speech analysis (diarization and recognition) and a Supervisor with extensive research experience in Computer Vision for face analysis in-the-wild (recognition, detection, alignment and tracking, and facial expression analysis). TalkingHeads will establish the ER as an independent and internationally recognized researcher in the area of audio-visual fusion and speech recognition. Through TalkingHeads' achievable work plan, the ER will attain a high level of research maturity by (a) complementing his expertise in speech analysis through extensive training in Computer Vision, (b) conducting research on a challenging problem (audio-visual speech recognition in-the-wild) with significant career opportunities in both academia and industry, (c) publishing in high-impact conferences and journals, (d) establishing a network of research collaborators, and (e) enhancing personal skills (e.g. supervisory experience, leadership and management skills).
year | authors and title | journal | last update |
---|---|---|---|
2018 | Themos Stafylakis, Georgios Tzimiropoulos. Zero-shot keyword spotting for visual speech recognition in-the-wild | European Conference on Computer Vision (ECCV) | 2019-06-13 |
2017 | Themos Stafylakis, Georgios Tzimiropoulos. Combining Residual Networks with LSTMs for Lipreading | Interspeech | 2019-06-13 |
2016 | Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto. The I4U submission to the 2016 NIST speaker recognition evaluation | NIST SRE 2016 Workshop | 2019-06-13 |
2017 | Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto. The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016 | Interspeech | 2019-06-13 |
2018 | Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic. End-to-end Audiovisual Speech Recognition | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 2019-06-13 |
2018 | Stafylakis, Themos; Tzimiropoulos, Georgios. Deep word embeddings for visual speech recognition | IEEE International Conference on Acoustics, Speech, and Signal Processing | 2019-06-13 |
2018 | Brummer, Niko; Silnova, Anna; Burget, Lukas; Stafylakis, Themos. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model | Proceedings of Odyssey | 2019-06-13 |
The information about "TALKINGHEADS" is provided by the European Opendata Portal: CORDIS opendata.