Skip to main content
Log in

Probabilistic Schedulability Analysis of Harmonic Multi-Task Systems with Dual-Modular Temporal Redundancy

  • Published:
Real-Time Systems Aims and scope Submit manuscript

Abstract

To improve reliability of real-time control systems, various fault-tolerance methods have been designed and implemented. We propose a highly reliable control system using modular and temporal redundancy, called dual-modular temporal redundancy (DMTR). Assuming that transient faults occur and recover with exponential probability distributions, we analyze the probabilistic schedulability of DMTR for multiple tasks with harmonic periods (DMTR-HP). After formulating a discrete-time reliability model for DMTR-HP, we formulate an efficient recursive computation algorithm for rapidly obtaining the probabilistic schedulability of the overall system. Considering the overhead for checkpointing in a DMTR-HP control system, we obtain the optimal number of subslots for maximum reliability using our DMTR-HP reliability model. In addition, we compare the reliabilities of DMTR-HP, DMTR using GCDP scheduling (DMTR-GCDP), and conventional dual-modular redundancy (DMR).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Avizienis, A., and Kelly, J. P. 1984. Fault-tolerance by design diversity. IEEE Transactions on Computers 17: 67-80.

    Google Scholar 

  • Bose, B., and Metzner, J. 1986. Coding Theory for Fault-Tolerant Systems. Englewood Cliffs, NJ: Prentice Hall.

    Google Scholar 

  • Chung, K. J. 1995. Optimal test-times for intermittent faults. IEEE Transactions on Reliability 44: 645-647.

    Google Scholar 

  • Crater, W. C., and Bouricius, W. G. 1971. A survey of fault-tolerant computer architecture and its evaluation. IEEE Transactions on Computer 4: 9-16.

    Google Scholar 

  • Geist, R., Raynolds, R., and Westall, J. 1988. Selection of a checkpoint interval in time-critical environments. IEEE Transactions on Reliability 37: 395-400.

    Google Scholar 

  • Ghosh, S., Melhem, R., Mosse, D., and Sarma, J. S. 1998. Fault-tolerant rate-monotonic scheduling. Real-Time Systems: The International Journal of Time-Critical Computing Systems 15: 149-181.

    Google Scholar 

  • Hou, K. J., and Shin, K. G. 1996. Determination of an optimal retry time in multiple modular computing systems. IEEE Transactions on Computers 45: 374-379.

    Google Scholar 

  • Kameyama, M., and Higuchi, T. 1980. Design of dependent failure-tolerant microprocessor system using triple modular redundancy. IEEE Transactions on Reliability C-29: 202-205.

    Google Scholar 

  • Kim, B. K. 1999. Reliability analysis of real-time controllers with dual-modular temporal redundancy. In Proceedings RTCSA'99, Hong Kong, pp. 364-371.

  • Kim, H., and Shin, K. G. 1994. Modeling of externally-induced/common-cause faults in fault-tolerant systems. IEEE/AIAA Digital Avionics System Conference, pp. 402-407.

  • Kim, H., and Shin, K. G. 1996. Design and analysis of an optimal instruction retry policy for TMR controller computers. IEEE Transactions on Computers 45: 1217-1225.

    Google Scholar 

  • Kim, H. White, A. L., and Shin, K. G. 1996. Susceptibility of controller computers to environmental disruptions and its effects on system stability. In IEEE Aerospace Applications Conference, pp. 309-320.

  • Koren, I., and Stephen, Y. H. S. 1979. Reliability analysis of N-modular redundancy systems with intermittent and permanent faults. IEEE Transactions on Computers 28: 514-520.

    Google Scholar 

  • Krishna, C. M., and Shin, K. G. 1997. Real-Time Systems. McGraw-Hill.

  • Krishna, C. M., and Singh, A. D. 1993. Reliability of checkpointed real-time systems using time redundancy. IEEE Transactions on Reliability 42: 427-435.

    Google Scholar 

  • Lin, T. H., and Shin, K. G. 1994. An optimal retry polity based on fault classification. IEEE Transactions on Computers 43: 1014-1025.

    Google Scholar 

  • Lorczak, P. R., Caglayan, A. K., and Eckhardt, D. E. 1989. A theoretical investigation of generalized voters for redundant systems. In Proceedings of the Fault-Tolerant Computing Symposium, Los Alamitos, CA: IEEE, pp. 444-451.

    Google Scholar 

  • Randel, B. 1975. System structure for software fault-tolerance. IEEE Transactions on Software Engineering SE-1: 220-232.

    Google Scholar 

  • Saleh, A. M., and Patel, J. H. 1988. Transient fault analysis for retry techniques. IEEE Transactions on Reliability 37: 323-330.

    Google Scholar 

  • Shin, K. G., and Kim, H. 1994. A time redundancy approach to TMR failures using fault-state likelihoods. IEEE Transactions on Computers 43: 1151-1162.

    Google Scholar 

  • Shin, K. G., Lin, T. H., and Lee, Y. H. 1987. Optimal checkpointing of real-time tasks. IEEE Transactions on Computers 36: 1328-1341.

    Google Scholar 

  • Ziv, A. 1995. Analysis and performance optimization of checkpointing schemes with task duplication. Ph.D. Dissertation, Stanford University.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, J.K., Kim, B.K. Probabilistic Schedulability Analysis of Harmonic Multi-Task Systems with Dual-Modular Temporal Redundancy. Real-Time Systems 26, 199–222 (2004). https://doi.org/10.1023/B:TIME.0000016130.91111.75

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:TIME.0000016130.91111.75

Navigation