skip to main content
10.1145/2000259.2000274acmconferencesArticle/Chapter ViewAbstractPublication PagescomparchConference Proceedingsconference-collections
research-article

Reliability prediction for fault-tolerant software architectures

Published: 20 June 2011 Publication History

Abstract

Software fault tolerance mechanisms aim at improving the reliability of software systems. Their effectiveness (i.e., reliability impact) is highly application-specific and depends on the overall system architecture and usage profile. When examining multiple architecture configurations, such as in software product lines, it is a complex and error-prone task to include fault tolerance mechanisms effectively. Existing approaches for reliability analysis of software architectures either do not support modelling fault tolerance mechanisms or are not designed for an efficient evaluation of multiple architecture variants. We present a novel approach to analyse the effect of software fault tolerance mechanisms in varying architecture configurations. We have validated the approach in multiple case studies, including a large-scale industrial system, demonstrating its ability to support architecture design, and its robustness against imprecise input data.

References

[1]
M. Auerswald, M. Herrmann, S. Kowalewski, and V. Schulte-Coerne. Software Product-Family Engineering, volume 2290 of LNCS, chapter Reliability-Oriented Product Line Engineering of Embedded Systems, pages 237--280. Springer, 2001.
[2]
S. Becker, H. Koziolek, and R. Reussner. The Palladio Component Model for model-driven performance prediction. Journal of Systems and Software, 82(1):3--22, 2009.
[3]
S. Bernardi, J. Merseguer, and D. Petriu. A dependability profile within MARTE. Software and Systems Modeling, pages 1--24, 2009.
[4]
F. Brosch, H. Koziolek, B. Buhnova, and R. Reussner. Parameterized reliability prediction for component-based software architectures. In Proc. of QoSA'10, volume 6093 of LNCS, pages 36--51. Springer, 2010.
[5]
L. Cheung, R. Roshandel, N. Medvidovic, and L. Golubchik. Early prediction of software component reliability. In Proc. of ICSE'08, pages 111--120. ACM Press, 2008.
[6]
R. C. Cheung. A user-oriented software reliability model. IEEE Trans. Softw. Eng., 6(2):118--125, 1980.
[7]
P. Clements and L. Northrop. Software Product Lines: Practices and Patterns. Addison-Wesley, 2001.
[8]
V. Cortellessa, H. Singh, and B. Cukic. Early reliability assessment of UML based software models. In Proc. of WOSP'02, pages 302--309. ACM, 2002.
[9]
J. Dehlinger and R. R. Lutz. Plfaultcat: A product-line software fault tree analysis tool. Automated Software Engineering, 13(1):169--193, 2006.
[10]
A. Filieri, C. Ghezzi, V. Grassi, and R. Mirandola. Reliability analysis of component-based systems with multiple failure modes. In Proc. of CBSE'10, volume 6092 of LNCS, pages 1--20. Springer, 2010.
[11]
S. S. Gokhale. Architecture-based software reliability analysis: Overview and limitations. IEEE Trans. on Dependable and Secure Computing, 4(1):32--40, 2007.
[12]
K. Goseva-Popstojanova, A. Hassan, A. Guedem, W. Abdelmoez, D. E. M. Nassar, H. Ammar, and A. Mili. Architectural-level risk analysis using UML. IEEE Trans. on Softw. Eng., 29(10):946--960, 2003.
[13]
K. Goseva-Popstojanova and K. S. Trivedi. Architecture-based approach to reliability assessment of software systems. Performance Evaluation, 45(2--3):179--204, 2001.
[14]
A. Immonen. Software Product Lines, chapter A Method for Predicting Reliability and Availability at the Architecture Level, pages 373--422. Springer, 2006.
[15]
A. Immonen and E. Niemel�. Survey of reliability and availability prediction methods from the viewpoint of software architecture. Software and Systems Modeling, 7(1):49--65, 2008.
[16]
K. Kanoun and M. Ortalo-Borrel. Fault-tolerant system dependability-explicit modeling of hardware and software component-interactions. IEEE Transactions on Reliability, 49(4):363--376, 2000.
[17]
H. Koziolek, B. Schlich, and C. Bilich. A large-scale industrial case study on architecture-based software reliability analysis. In Proc. of ISSRE'10, pages 279--288. IEEE Computer Society, 2010.
[18]
H. Muccini and A. Romanovsky. Architecting fault tolerant systems. Technical Report CS-TR-1051, University of Newcastle upon Tyne, 2007.
[19]
F. G. Olumofin and V. B. Misic. Extending the atam architecture evaluation to product line architectures. In Proc. of WICSA'05, pages 45--56. IEEE Computer Society, 2005.
[20]
B. Randell. System structure for software fault tolerance. In Proc. Int. Conf. on Reliable software, pages 437--449. ACM, 1975.
[21]
R. H. Reussner, H. W. Schmidt, and I. H. Poernomo. Reliability prediction for component-based software architectures. Journal of Systems and Software, 66(3):241--252, 2003.
[22]
N. Sato and K. S. Trivedi. Accurate and efficient stochastic reliability analysis of composite services using their compact Markov reward model representations. In Proc. of SCC'07, pages 114--121. IEEE Computer Society, 2007.
[23]
B. Schroeder and G. A. Gibson. Understanding disk failure rates: What does an mttf of 1,000,000 hours mean to you? ACM Trans. Storage, 3(3):8, 2007.
[24]
V. Sharma and K. Trivedi. Quantifying software performance, reliability and security: An architecture-based approach. Journal of Systems and Software, 80:493--509, 2007.
[25]
V. S. Sharma and K. S. Trivedi. Reliability and performance of component based software systems with restarts, retries, reboots and repairs. In Proc. of ISSRE'06, pages 299--310. IEEE, 2006.
[26]
W.-L. Wang, D. Pan, and M.-H. Chen. Architecture-based software reliability modeling. Journal of Systems and Software, 79(1):132--146, 2006.
[27]
Z. Zheng and M. R. Lyu. Collaborative reliability prediction of service-oriented systems. In Proc. of ICSE'10, pages 35--44. ACM Press, 2010.

Cited By

View all
  • (2022)Continuous and Proactive Software Architecture Evaluation: An IoT CaseACM Transactions on Software Engineering and Methodology10.1145/349276231:3(1-54)Online publication date: 15-Mar-2022
  • (2020)An Algorithm for Multi-objective Software Performance Optimisation at the Architecture LevelProceedings of the 2020 4th International Conference on Electronic Information Technology and Computer Engineering10.1145/3443467.3443914(1113-1119)Online publication date: 6-Nov-2020
  • (2020)Model-Based Analysis of Microservice Resiliency Patterns2020 IEEE International Conference on Software Architecture (ICSA)10.1109/ICSA47634.2020.00019(114-124)Online publication date: Mar-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
QoSA-ISARCS '11: Proceedings of the joint ACM SIGSOFT conference -- QoSA and ACM SIGSOFT symposium -- ISARCS on Quality of software architectures -- QoSA and architecting critical systems -- ISARCS
June 2011
206 pages
ISBN:9781450307246
DOI:10.1145/2000259
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. component-based software architectures
  2. fault tolerance
  3. reliability prediction
  4. software product lines

Qualifiers

  • Research-article

Conference

Comparch '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 46 of 131 submissions, 35%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Continuous and Proactive Software Architecture Evaluation: An IoT CaseACM Transactions on Software Engineering and Methodology10.1145/349276231:3(1-54)Online publication date: 15-Mar-2022
  • (2020)An Algorithm for Multi-objective Software Performance Optimisation at the Architecture LevelProceedings of the 2020 4th International Conference on Electronic Information Technology and Computer Engineering10.1145/3443467.3443914(1113-1119)Online publication date: 6-Nov-2020
  • (2020)Model-Based Analysis of Microservice Resiliency Patterns2020 IEEE International Conference on Software Architecture (ICSA)10.1109/ICSA47634.2020.00019(114-124)Online publication date: Mar-2020
  • (2017)A multi-scale modeling approach for systems of systems architecturesACM SIGAPP Applied Computing Review10.1145/3161534.316153617:3(17-26)Online publication date: 14-Nov-2017
  • (2017)Iterative multi-scale modeling of software-intensive systems of systems architecturesProceedings of the Symposium on Applied Computing10.1145/3019612.3019801(1781-1786)Online publication date: 3-Apr-2017
  • (2016)Software Architectures: Multi-Scale Refinement2016 IEEE 14th International Conference on Software Engineering Research, Management and Applications (SERA)10.1109/SERA.2016.7516155(265-272)Online publication date: Jun-2016
  • (2016)Functional Safety in Product Lines -- A Systematic Mapping Study2016 42th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)10.1109/SEAA.2016.58(313-322)Online publication date: Aug-2016
  • (2016)Modeling the Failure Pathology of Software Components2016 12th International ACM SIGSOFT Conference on Quality of Software Architectures (QoSA)10.1109/QoSA.2016.15(41-49)Online publication date: Apr-2016
  • (2015)Modeling Fault Tolerance Tactics with Reusable AspectsProceedings of the 11th International ACM SIGSOFT Conference on Quality of Software Architectures10.1145/2737182.2737189(43-52)Online publication date: 4-May-2015
  • (2015)Estimating Software System Reliability: An Architecture-Based Approach2015 9th International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing10.1109/IMIS.2015.30(184-191)Online publication date: Jul-2015
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media