June 21st 2025 @ Tokyo, Japan
Location: Building 121 Floor B1 Room 112
Held in conjunction with ISCA 2025
As semiconductor technologies continue to advance and chiplet integration becomes more widespread, the demand for highly reliable systems has never been greater. Modern chips are increasingly susceptible to reliability-related mechanisms (described by different fault models) such as manufacturing defects, device variability, marginalities, aging, and voltage drop. The resulting faults pose significant challenges to the design of robust systems because their impact can cascade from the physical level to the system level, affecting architecture, software, performance, functionality, and safety. Affected systems either observably fail to operate (crash) or, even worse, produce corrupted outputs that go unnoticed (silent errors, silent data corruptions). To address these challenges, there is a pressing need for efficient modeling and simulation frameworks that can accurately capture and analyze the effects of these fault mechanisms across multiple abstraction levels.
This workshop aims to bring together researchers, practitioners, and industry experts to discuss state-of-the-art techniques for modeling, simulation, and, building on these, mitigation of reliability-related mechanisms in modern silicon chips and the systems built on them. We invite contributions that explore innovative approaches to fault modeling, simulation environments, and methodologies for evaluating and improving system reliability in the presence of silicon faults and marginalities.
Time | Topic | Speakers | Institution |
---|---|---|---|
8:30 - 9:00 | Coffee and Registration | | |
9:00 - 9:40 | Aging-Induced Faults in Systolic Arrays in Mission Critical Machine Learning Applications | Firas Ramadan | Technion |
9:40 - 10:20 | Understanding the Error Sensitivity of Privacy-Aware Computing | Augusto Vega | IBM Research |
10:20 - 11:00 | Expanding SoCurity for NoC-level Reliability Monitoring | Naorin Hossain | IBM Research |
11:00 - 11:30 | Coffee Break | | |
11:30 - 12:30 | Plenary Talk: From Performance to Reliability: The Generative AI Redshift in Reliable Cloud-Scale Systems | Amir Yazdanbakhsh | |
13:00 - 14:00 | Lunch Break | | |
Submit your paper here
Please note that you must be a registered CMT user to submit.
Register to CMT here
Prof. Freddy Gabbay, The Hebrew University.
Prof. Dimitris Gizopoulos, University of Athens.
Any questions may be directed to: freddy.gabbay@mail.huji.ac.il or dgizop@di.uoa.gr