skip to main content
10.1145/800027.808467acmconferencesArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article
Free access

System structure for software fault tolerance

Published: 01 April 1975 Publication History

Abstract

The paper presents, and discusses the rationale behind, a method for structuring complex computing systems by the use of what we term “recovery blocks”, “conversations” and “fault-tolerant interfaces”. The aim is to facilitate the provision of dependable error detection and recovery facilities which can cope with errors caused by residual design inadequacies, particularly in the system software, rather than merely the occasional malfunctioning of hardware components.

References

[1]
T.A. Anderson. Provably Safe Programs. Tech. Report, Computing Laboratory, University of Newcastle upon Tyne (in preparation).
[2]
C.T. Davies. Recovery Semantics for a DB/DC System. Proc. 1973 ACM National Conference. ACM, New York (1973) pp.136-141.
[3]
E.W. Dijkstra. The Structure of the 'THE'-Multiprogramming System. Comm. ACM 11, 5 (1968) pp. 341-346.
[4]
C. Hewitt. PLANNER: A language for proving theorems in robots. Proc. Int. Joint Conf. on Artificial Intelligence. Mitre Corp., Bedford, Mass. (1969) pp.295-301.
[5]
J.J. Horning, H.C. Lauer, P.M. Melliar-Smith and B. Randell. A Program Structure for Error Detection and Recovery. Proc. Conf. on Operating Systems: Theoretical and Practical Aspects, 23-25th April 1974. IRIA (1974) pp.177-193.
[6]
H.C. Lauer. Protection and Hierarchical Addressing Structures. Proc, International Workshop on Protection in Operating Systems. IRIA (1974) pp.137-148.
[7]
H.C. Lauer and D. Wyeth. A Recursive Virtual Machine Architecture. Tech. Report 54, Computing Laboratory, University of Newcastle upon Tyne (Sept. 1973).
[8]
S.E. Madnick and J.W. Alsop II. A Modular Approach to File System Design, 1969 S.J.C.C., Vol. 34, AFIPS Conference Proceedings, AFIPS, Montvale, N.J. (1969) pp.1-13.
[9]
D.L. Parnas. Response to Detected Errors in Well-Structured Programs. Technical Report, Department of Computer Science, Carnegie-Mellon University (July 1972).
[10]
W.H. Pierce. Failure-Tolerant Computer Design. Academic Press, New York (1965).
[11]
B. Randell. Highly Reliable Computing Systems. Tech. Report 20, Computing Laboratory, University of Newcastle upon Tyne (July 1971).
[12]
B. Randell. Research on Computing System Reliability at the University of Newcastle upon Tyne, 1972/73. Tech. Report 57, Computing Laboratory, University of Newcastle upon Tyne (Jan. 1974).

Cited By

View all
  • (2020)Dynamic Network Reconfiguration in Safety-Critical Aeronautical Systems2020 AIAA/IEEE 39th Digital Avionics Systems Conference (DASC)10.1109/DASC50938.2020.9256497(1-8)Online publication date: 11-Oct-2020
  • (2019)EC-Shuffle: Dynamic Erasure Coding Optimization for Efficient and Reliable Shuffle in Spark2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)10.1109/CCGRID.2019.00014(41-51)Online publication date: May-2019
  • (2018)Compiler-Assisted Code Randomization2018 IEEE Symposium on Security and Privacy (SP)10.1109/SP.2018.00029(461-477)Online publication date: May-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
Proceedings of the international conference on Reliable software
April 1975
567 pages
ISBN:9781450373852
DOI:10.1145/800027
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 April 1975

Check for updates

Author Tags

  1. Acceptance test
  2. Alternate block
  3. Checkpoint
  4. Conversation
  5. Error detection
  6. Error recovery
  7. Recovery block
  8. Recursive cache

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)230
  • Downloads (Last 6 weeks)34
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Dynamic Network Reconfiguration in Safety-Critical Aeronautical Systems2020 AIAA/IEEE 39th Digital Avionics Systems Conference (DASC)10.1109/DASC50938.2020.9256497(1-8)Online publication date: 11-Oct-2020
  • (2019)EC-Shuffle: Dynamic Erasure Coding Optimization for Efficient and Reliable Shuffle in Spark2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)10.1109/CCGRID.2019.00014(41-51)Online publication date: May-2019
  • (2018)Compiler-Assisted Code Randomization2018 IEEE Symposium on Security and Privacy (SP)10.1109/SP.2018.00029(461-477)Online publication date: May-2018
  • (2018)Diversity on State Machine Replication2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA)10.1109/AINA.2018.00071(429-436)Online publication date: May-2018
  • (2018)Static Analysis-Based Approaches for�Secure Software DevelopmentSecurity in Computer and Information Sciences10.1007/978-3-319-95189-8_13(142-157)Online publication date: 14-Jul-2018
  • (2017)Designing and Implementing a Diversity Policy for Intrusion-Tolerant SystemsIEICE Transactions on Information and Systems10.1587/transinf.2015EDP7478E100.D:1(118-129)Online publication date: 2017
  • (2017)Experience Report: Study of Vulnerabilities of Enterprise Operating Systems2017 IEEE 28th International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE.2017.20(205-215)Online publication date: Oct-2017
  • (2017)A Formal Approach to implement java exceptions in cooperative systemsJournal of Systems and Software10.1016/j.jss.2016.07.033131:C(475-490)Online publication date: 1-Sep-2017
  • (2016)Searching for software diversityProceedings of the 2016 New Security Paradigms Workshop10.1145/3011883.3011891(80-91)Online publication date: 26-Sep-2016
  • (2016)Adaptive and Power-Aware Resilience for Extreme-Scale Computing2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld)10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0111(671-679)Online publication date: Jul-2016
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media