Opendata, web and dolomites

FTHPC SIGNED

Fault Tolerant High Performance Computing

Total Cost €

0

EC-Contrib. €

0

Partnership

0

Views

0

Project "FTHPC" data sheet

The following table provides information about the project.

Coordinator
THE HEBREW UNIVERSITY OF JERUSALEM 

Organization address
address: EDMOND J SAFRA CAMPUS GIVAT RAM
city: JERUSALEM
postcode: 91904
website: www.huji.ac.il

contact info
title: n.a.
name: n.a.
surname: n.a.
function: n.a.
email: n.a.
telephone: n.a.
fax: n.a.

 Coordinator Country Israel [IL]
 Total cost 1˙824˙467 €
 EC max contribution 1˙824˙467 € (100%)
 Programme 1. H2020-EU.1.1. (EXCELLENT SCIENCE - European Research Council (ERC))
 Code Call ERC-2018-COG
 Funding Scheme ERC-COG
 Starting year 2019
 Duration (year-month-day) from 2019-06-01   to  2024-05-31

 Partnership

Take a look of project's partnership.

# participants  country  role  EC contrib. [€] 
1    THE HEBREW UNIVERSITY OF JERUSALEM IL (JERUSALEM) coordinator 1˙824˙467.00

Map

 Project objective

Supercomputers are strategically crucial for facilitating advances in science and technology: in climate change research, accelerated genome sequencing towards cancer treatments, cutting edge physics, devising engineering innovative solutions, and many other compute intensive problems. However, the future of super-computing depends on our ability to cope with the ever increasing rate of faults (bit flips and component failure), resulting from the steadily increasing machine size and decreasing operating voltage. Indeed, hardware trends predict at least two faults per minute for next generation (exascale) supercomputers.

The challenge of ascertaining fault tolerance for high-performance computing is not new, and has been the focus of extensive research for over two decades. However, most solutions are either (i) general purpose, requiring little to no algorithmic effort, but severely degrading performance (e.g., checkpoint-restart), or (ii) tailored to specific applications and very efficient, but requiring high expertise and significantly increasing programmers' workload. We seek the best of both worlds: high performance and general purpose fault resilience.

Efficient general purpose solutions (e.g., via error correcting codes) have revolutionized memory and communication devices over two decades ago, enabling programmers to effectively disregard the very likely memory and communication errors. The time has come for a similar paradigm shift in the computing regimen. I argue that exciting recent advances in error correcting codes, and in short probabilistically checkable proofs, make this goal feasible. Success along these lines will eliminate the bottleneck of required fault-tolerance expertise, and open exascale computing to all algorithm designers and programmers, for the benefit of the scientific, engineering, and industrial communities.

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "FTHPC" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an  email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "FTHPC" are provided by the European Opendata Portal: CORDIS opendata.

More projects from the same programme (H2020-EU.1.1.)

NanoPD_P (2020)

High throughput multiplexed trace-analyte screening for diagnostics applications

Read More  

Malaria POC (2019)

Ultrasensitive detection of transmissible malaria

Read More  

EVOMENS (2020)

The evolution of menstruation in primates

Read More