research-article

Predictable, system-level fault tolerance in composite

Authors:
Jiguo Song

The George Washington University, Washington, DC

The George Washington University, Washington, DC
View Profile

,
Gabriel Parmer

The George Washington University, Washington, DC

The George Washington University, Washington, DC
View Profile

Authors Info & Claims

ACM SIGBED Review Volume 10 Issue 2July 2013pp 31https://doi.org/10.1145/2518148.2518169

Published:01 July 2013Publication History

ACM SIGBED Review

Abstract

Intermittent faults are an increasingly challenging difficulty in embedded and real-time systems. As process technologies shrink circuitry, it becomes increasingly susceptible to transient faults from radiation sources such as cosmic rays. Additionally, as software complexity increases, intermittent faults such as race conditions challenge software reliability. Given these motivations, research has approached the paired problems of recovering from a fault, and doing so predictably. However, most past research has been limited in focus to the predictable recovery of faults at the application-level. Examples include systems infrastructures [2] enabling application fault recovery, and scheduling theory [3] that considers periodic faults, and the impact on schedulability for recovery and re-execution of failed applications.

References

The Composite component-based system: http://composite.seas.gwu.edu.Google Scholar
A. Egan, D. Kutz, D. Mikulin, R. Melhem, and D. Mosse. Fault-tolerant rt-mach and an application to real-time train control. Software Practice and Experience, 1999. Google ScholarDigital Library
P. Mejia-Alvarez and H. Aydin. Scheduling optional computations in fault-tolerant real-time systems. In RTCSA, 2000. Google ScholarDigital Library
K. Pattabiraman, V. Grover, and B. Zorn. Protecting critical data in unsafe languages. In Eurosys, 2008. Google ScholarDigital Library

Recommendations

Application-Level Fault Tolerance as a Complement to System-Level Fault Tolerance
Special issue on embedded fault-tolerance systems

As multiprocessor systems become more complex, their reliability will need to increase as well. In this paper we propose a novel technique which is applicable to a wide variety of distributed real-time systems, especially those exhibiting data ...
Read More
Using dynamic task level redundancy for OpenMP fault tolerance
ARCS'12: Proceedings of the 25th international conference on Architecture of Computing Systems

Obtaining fault tolerant applications and systems is one of today's most important topics of research. Fault tolerance is becoming more and more essential in shared memory parallel programs and in multi/many core architectures due to the decreasing size ...
Read More
Application-Aware Byzantine Fault Tolerance
DASC '14: Proceedings of the 2014 IEEE 12th International Conference on Dependable, Autonomic and Secure Computing

Byzantine fault tolerance has been intensively studied over the past decade as a way to enhance the intrusion resilience of computer systems. However, state-machine-based Byzantine fault tolerance algorithms require deterministic application processing ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM SIGBED Review Volume 10, Issue 2
Special Issue on the Work-in-Progress (WiP) session of the 33rd IEEE Real-Time Systems Symposium (RTSS'12)
July 2013
30 pages
EISSN:1551-3688
DOI:10.1145/2518148
Issue’s Table of Contents

Copyright © 2013 Authors
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 July 2013
Check for updates
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 24
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Predictable, system-level fault tolerance in composite

ACM SIGBED Review

Abstract

References

Cited By

Recommendations

Application-Level Fault Tolerance as a Complement to System-Level Fault Tolerance

Using dynamic task level redundancy for OpenMP fault tolerance

Application-Aware Byzantine Fault Tolerance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Predictable, system-level fault tolerance in composite

ACM SIGBED Review

Abstract

References

Cited By

Recommendations

Application-Level Fault Tolerance as a Complement to System-Level Fault Tolerance

Using dynamic task level redundancy for OpenMP fault tolerance

Application-Aware Byzantine Fault Tolerance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media