ABSTRACT
We investigate a unified approach for fault tolerance and dynamic power management in distributed real-time embedded systems. Coordinated checkpointing is used to achieve fault tolerance, and power management is carried out using dynamic voltage scaling. We present feasibility-of-scheduling tests for coordinated checkpointing schemes for a constant processor speed as well as for DVS-enabled processors that can operate at variable speeds. Simulation results based on the CORDS hardware/software co-synthesis system show that, compared to fault-oblivious methods, the proposed approach significantly reduces power consumption while guaranteeing timely task completion in the presence of faults.
- G. Quan and X. Hu, "Energy efficient fixed-priority scheduling for real-time systems on variable voltage processors", Proc. DAC, pp. 828--833, 2001. Google ScholarDigital Library
- Y. Zhang and K. Chakrabarty, "Energy-aware adaptive checkpointing in embedded real-time systems", Proc. DATE, pp. 918--923, 2003. Google ScholarDigital Library
- L. Li et al, "Adaptive error protection for energy efficiency", Proc. ICCAD, pp. 2--7, 2003. Google ScholarDigital Library
- D. Marculescu et al., "Fault-tolerant techniques for ambient intelligent distributed systems", Proc. ICCAD, pp. 348--355, 2003. Google ScholarDigital Library
- J. Liu et al., "Communication speed selection for embedded systems with networked voltage-scalable processors", Proc. CODES, pp. 169--174, 2002. Google ScholarDigital Library
- E. N. Elnozahy et al., "A survey of rollback-recovery protocols in message-passing systems", ACM Computing Surveys, vol. 34, pp. 375--408, September 2002. Google ScholarDigital Library
- M. Chandy and L. Lamport, "Distributed snapshots: Determining global states of distributed systems", ACM Transactions on Computer Systems, pp. 63--75, February 1985. Google ScholarDigital Library
- R. P. Dick and N. K. Jha, "CORDS: Hardware-Software Co-Synthesis of Reconfigurable Real-Time Distributed Embedded Systems", Proc. ICCAD, pp. 62--68, 1998. Google ScholarDigital Library
- M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, W. H. Freeman, New York, NY, 1979. Google ScholarDigital Library
- Embedded System Synthesis Benchmarks Suite (E3S): http://www.ece.northwestern.edu/ dickrp/e3s/.Google Scholar
- Embedded Microprocessor Benchmark Consortium (EEMBC): http://www.eembc.org.Google Scholar
- C.-Y. Lin et al., "A checkpointing tool for Palm operating system", Proc. DSN, pp. 71--76, 2001. Google ScholarDigital Library
Index Terms
Energy-aware deterministic fault tolerance in distributed real-time embedded systems
Recommendations
A unified approach for fault tolerance and dynamic power management in fixed-priority real-time embedded systems
This paper investigates an integrated approach for achieving fault tolerance and energy savings in real-time embedded systems. Fault tolerance is achieved via checkpointing, and energy is saved using dynamic voltage scaling (DVS). The authors present a ...
Low power fault tolerance scheduling scheme in DVS-enabled real-time systems
This paper investigates the power management and fault tolerance in DVS-enabled real-time systems based on the EDF scheme. The power management is achieved via dynamic voltage scaling DVS and fault tolerance is achieved via checkpoint scheme. We ...
A Fault-Tolerant Scheduling Algorithm Based on Checkpointing and Redundancy for Distributed Real-Time Systems
Real-time systems are becoming ever more widely used in life-critical applications, and the need for fault-tolerant scheduling can only grow in the years ahead. This article presents a novel fault tolerance approach for tolerating transient faults in ...
Comments