skip to main content
10.1145/3637543.3654620acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
invited-talk

IO-SEA: Storage I/O and Data Management for Exascale Architectures

Published: 01 July 2024 Publication History

Abstract

The new emerging scientific workloads to be executed in the upcoming exascale supercomputers face major challenges in terms of storage, given their extreme volume of data. In particular, intelligent data placement, instrumentation, and workflow handling are central to application performance. The IO-SEA project developed multiple solutions to aid the scientific community in adressing these challenges: a Workflow Manager, a hierarchical storage management system, and a semantic API for storage. All of these major products incorporate additional minor products that support their mission. In this paper, we discuss both the roles of all these products and how they can assist the scientific community in achieving exascale performance.

References

[1]
Mohammadamin Ajdari, Patrick Raaf, Mostafa Kishani, Reza Salkhordeh, Hossein Asadi, and Andr� Brinkmann. 2022. An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems. Proc. ACM Meas. Anal. Comput. Syst. 6, 2 (2022), 30:1--30:27. https://doi.org/10.1145/3530896
[2]
Daniel Ara�jo De Medeiros, Stefano Markidis, and Ivy Bo Peng. 2023. LibCOS: Enabling Converged HPC and Cloud Data Stores with MPI. In Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region. 106--116.
[3]
Peter Braam. 2019. The Lustre storage architecture. arXiv preprint arXiv:1903.01955 (2019).
[4]
Robert G. Edwards and Balint Joo. 2005. The Chroma software system for Lattice QCD. Nucl. Phys. B Proc. Suppl. 140 (2005), 832. https://doi.org/10.1016/j.nuclphysbps.2004.11.254 arXiv:hep-lat/0409003
[5]
Wolfgang Frings, Felix Wolf, and Ventsislav Petkov. 2009. Scalable massively parallel I/O to task-local files. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (Portland, Oregon) (SC '09). Association for Computing Machinery, New York, NY, USA, Article 17, 11 pages. https://doi.org/10.1145/1654059.1654077
[6]
Sam Hartman, Karthik Jaganathan, and Larry Zhu. 2005. The Kerberos Version 5 Generic Security Service Application Program Interface (GSS-API) Mechanism: Version 2. RFC 4121. https://doi.org/10.17487/RFC4121
[7]
Dorian Krause. 2019. JUWELS: Modular Tier-0/1 supercomputer at the J�lich supercomputing centre. Journal of large-scale research facilities JLSRF 5 (2019), A135-A135.
[8]
John Linn. 2000. Generic Security Service Application Program Interface Version 2, Update 1. RFC 2743. https://doi.org/10.17487/RFC2743
[9]
Jay Lofstead, Ivo Jimenez, Carlos Maltzahn, Quincey Koziol, John Bent, and Eric Barton. 2016. DAOS and friends: a proposal for an exascale storage system. In SC'16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, 585--596.
[10]
Nafiseh Moti, Andr� Brinkmann, Marc-Andr� Vef, Philippe Deniel, Jes�s Carretero, Philip H. Carns, Jean-Thomas Acquaviva, and Reza Salkhordeh. 2023. The I/O Trace Initiative: Building a Collaborative I/O Archive to Advance HPC. In Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W2023, Denver, CO, USA, November 12-17, 2023. ACM, 1216--1222. https://doi.org/10.1145/3624062.3624192
[11]
Sai Narasimhamurthy, Nikita Danilov, Sining Wu, Ganesan Umanesan, Stefano Markidis, Sergio Rivas-Gomez, Ivy Bo Peng, Erwin Laure, Dirk Pleiter, and Shaun De Witt. 2019. Sage: percipient storage for exascale data centric computing. Parallel computing 83 (2019), 22--33.
[12]
Dr. Clifford Neuman and Theodore Ts'o. 1993. The Kerberos Network Authentication Service (V5). RFC 1510. https://doi.org/10.17487/RFC1510
[13]
Simon D. Smart, Tiago Quintino, and Baudouin Raoult. 2017. A Scalable Object Store for Meteorological and Climate Data. In Proceedings of the Platform for Advanced Scientific Computing Conference (Lugano, Switzerland) (PASC '17). Association for Computing Machinery, New York, NY, USA, Article 13, 8 pages. https://doi.org/10.1145/3093172.3093238
[14]
Simon D. Smart, Tiago Quintino, and Baudouin Raoult. 2019. A High-Performance Distributed Object-Store for Exascale Numerical Weather Prediction and Climate. In Proceedings of the Platform for Advanced Scientific Computing Conference (Zurich, Switzerland) (PASC '19). Association for Computing Machinery, New York, NY, USA, Article 16, 11 pages. https://doi.org/10.1145/3324989.3325726
[15]
Estela Suarez, Norbert Eicker, and Thomas Lippert. 2019. Modular supercomputing architecture: from idea to production. In Contemporary high performance computing. CRC Press, 223--255.
[16]
Philipp Th�rnig. 2021. JURECA: Data centric and booster modules implementing the modular supercomputing architecture at J�lich supercomputing centre. Journal of large-scale research facilities JLSRF 7 (2021), A182-A182.
[17]
Sage Weil, Scott A Brandt, Ethan L Miller, Darrell DE Long, and Carlos Maltzahn. 2006. Ceph: A scalable, high-performance distributed file system. In Proceedings of the 7th Conference on Operating Systems Design and Implementation (OSDI'06). 307--320.
[18]
Sage A Weil, Andrew W Leung, Scott A Brandt, and Carlos Maltzahn. 2007. Rados: a scalable, reliable storage service for petabyte-scale storage clusters. In Proceedings of the 2nd international workshop on Petascale data storage: held in conjunction with Supercomputing'07. 35--44.

Index Terms

  1. IO-SEA: Storage I/O and Data Management for Exascale Architectures

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CF '24 Companion: Proceedings of the 21st ACM International Conference on Computing Frontiers: Workshops and Special Sessions
        May 2024
        163 pages
        ISBN:9798400704925
        DOI:10.1145/3637543
        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 01 July 2024

        Check for updates

        Author Tags

        1. data movement
        2. exascale
        3. hierarchical storage
        4. semantic interface
        5. storage
        6. workflows

        Qualifiers

        • Invited-talk
        • Research
        • Refereed limited

        Funding Sources

        • The European High Performance Computing Joint Undertaking

        Conference

        CF '24
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 273 of 785 submissions, 35%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 55
          Total Downloads
        • Downloads (Last 12 months)55
        • Downloads (Last 6 weeks)16
        Reflects downloads up to 17 Oct 2024

        Other Metrics

        Citations

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media