skip to main content
10.5555/1960475.1960481acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfastConference Proceedingsconference-collections
Article

CAFTL: a content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives

Published: 15 February 2011 Publication History

Abstract

Although Flash Memory based Solid State Drive (SSD) exhibits high performance and low power consumption, a critical concern is its limited lifespan along with the associated reliability issues. In this paper, we propose to build a Content-Aware Flash Translation Layer (CAFTL) to enhance the endurance of SSDs at the device level. With no need of any semantic information from the host, CAFTL can effectively reduce write traffic to flash memory by removing unnecessary duplicate writes and can also substantially extend available free flash memory space by coalescing redundant data in SSDs, which further improves the efficiency of garbage collection and wear-leveling. In order to retain high data access performance, we have also designed a set of acceleration techniques to reduce the runtime overhead and minimize the performance impact caused by extra computational cost. Our experimental results show that our solution can effectively identify up to 86.2% of the duplicate writes, which translates to a write traffic reduction of up to 24.2% and extends the flash space by a factor of up to 31.2%. Meanwhile, CAFTL only incurs a minimized performance overhead by a factor of up to 0.5%.

References

[1]
FIPS 180-1, Secure Hash Standard, April 1995.
[2]
Hadoop. http://hadoop.apache.org/, 2010.
[3]
Personal communications with an SSD architect, 2010.
[4]
SimpleScalar 4.0. http://www.simplescalar.com/v4test.html, 2010.
[5]
SSD extension for DiskSim simulation environment. http://research.microsoft.com/en-us/downloads/b41019e2- 1d2b-44d8-b512-ba35ab814cd4/, 2010.
[6]
AGRAWAL, N., PRABHAKARAN, V., WOBBER, T., DAVIS, J. D., MANASSE, M., AND PANIGRAHY, R. Design tradeoffs for SSD performance. In Proceedings of USENIX'08 (Boston, MA, June 2008).
[7]
ANDERSEN, D. G., AND SWANSON, S. Rethinking flash in the data center. In IEEE Micro (July/Aug 2010).
[8]
ARM. Cortex R4. http://www.arm.com/products/ processors/cortex-r/cortex-r4.php, 2010.
[9]
BALAKRISHNAN, M., KADAV, A., PRABHAKARAN, V., AND MALKHI, D. Differential RAID: Rethinking RAID for SSD Reliability. In Proceedings of EuroSys'10 (Paris, France, April 2010).
[10]
BARROSO, L. A. Warehouse-scale computing. In Keynote in the SIGMOD'10 conference (Indianapolis, IN, June 2010).
[11]
BHAGWAT, D., ESHGHI, K., LONG, D. D. E., AND LILLIBRIDGE, M. Extreme binning: Scalable, parallel deduplication for chunk-based file backup. In Proceedings of MASCOTS'09 (London, UK, September 2009).
[12]
BLOOM, B. H. Space/time trade-offs in hash coding with allowable errors. In Communications of the ACM (1970), vol. 13(7), pp. 422-426.
[13]
BOBOILA, S., AND DESNOYERS, P. Write endurance in flash drives: Measurements and analysis. In Proceedings of FAST'10 (San Jose, CA, February 2010).
[14]
BUCY, J., SCHINDLER, J., SCHLOSSER, S., AND GANGER, G. DiskSim 4.0. http://www.pdl.cmu.edu/DiskSim, 2010.
[15]
CHEN, F., JIANG, S., AND ZHANG, X. SmartSaver: Turning flash drive into a disk energy saver for mobile computers. In Proceedings of ISLPED'06 (Tegernsee, Germany, October 2006).
[16]
CHEN, F., KOUFATY, D. A., AND ZHANG, X. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of SIGMETRICS/Performance'09 (Seattle, WA, June 2009).
[17]
CHEN, F., LEE, R., AND ZHANG, X. Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing. In Proceedings of HPCA'11 (San Antonio, TX, Feb 2011).
[18]
CHEN, S. FlashLogging: Exploiting flash devices for synchronous logging performance. In Proceedings of SIGMOD'09 (Providence, RI, June 2009).
[19]
DEBNATH, B., SENGUPTA, S., AND LI, J. ChunkStash: Speeding up inline storage deduplication using flash memory. In Proceedings of USENIX'10 (Boston, MA, June 2010).
[20]
DIRIK, C., AND JACOB, B. The performance of PC solid-state disks (SSDs) as a function of bandwidth, concurrency, device, architecture, and system organization. In Proceedings of ISCA'09 (Austin, TX, June 2009).
[21]
GAL, E., AND TOLEDO, S. Algorithms and data structures for flash memories. In ACM Computing Survey'05 (2005), vol. 37(2), pp. 138-163.
[22]
GRUPP, L. M., CAULFIELD, A. M., COBURN, J., SWANSON, S., YAAKOBI, E., SIEGEL, P. H., AND WOLF, J. K. Characterizing flash memory: Anomalies, observations, and applications. In Proceedings of MICRO'09 (New York, NY, December 2009).
[23]
GUPTA, A., KIM, Y., AND URGAONKAR, B. DFTL: a flash translation layer employing demand-based selective caching of page-level address mappings. In Proceedings of ASPLOS'09 (Washington, D.C., March 2009).
[24]
GUPTA, D., LEE, S., VRABLE, M., SAVAGE, S., SNOEREN, A. C., VARGHESE, G., VOELKER, G. M., AND VAHDAT, A. Difference Engine: Harnessing memory redundancy in virtual machines. In Proceedings of OSDI'08 (San Diego, CA, 2008).
[25]
INTEL. Intel X25-E extreme SATA solid-state drive. http://www.intel.com/design/flash/nand/extreme, 2008.
[26]
JOSEPHSON, W. K., BONGO, L. A., FLYNN, D., AND LI, K. DFS: A file system for virtualized flash storage. In Proceedings of FAST'10 (San Jose, CA, February 2010).
[27]
KAWAGUCHI, A., NISHIOKA, S., AND MOTODA, H. A flash-memory based file system. In Proceedings of USENIX Winter (New Orleans, LA, Jan 1995), pp. 155-164.
[28]
KIM, H., AND AHN, S. BPLRU: A buffer management scheme for improving random writes in flash storage. In Proceedings of FAST'08 (San Jose, CA, February 2008).
[29]
LEE, S., AND MOON, B. Design of flash-based DBMS: An inpage logging approach. In Proceedings of SIGMOD'07 (Beijing, China, June 2007).
[30]
LILLIBRIDGE, M., ESHGHI, K., BHAGWAT, D., DEOLALIKAR, V., TREZISE, G., AND CAMBLE, P. Sparse indexing: Large scale, inline deduplication using sampling and locality. In Proceedings of FAST'09 (San Jose, CA, 2009).
[31]
MAKATOS, T., KLONATOS, Y., MARAZAKIS, M., FLOURIS, M. D., AND BILAS, A. Using transparent compression to improve SSD-based I/O caches. In Proceedings of EuroSys'10 (Paris, France, April 2010).
[32]
MENEZES, A. J., V. OORSCHOT, P. C., AND VANSTONE, S. A. Handbook of applied cryptography. In CRC Press (1996).
[33]
MOHAN, V., SIDDIQUA, T., GURUMURTHI, S., AND STAN, M. R. How I learned to stop worrying and love flash endurance. In Proceedings of HotStorage'10 (Boston, MA, June 2010).
[34]
NARAYANAN, D., THERESKA, E., DONNELLY, A., ELNIKETY, S., AND ROWSTRON, A. Migrating enterprise storage to SSDs: analysis of tradeoffs. In Proceedings of EuroSys'09 (Nuremberg, Germany, March 2009).
[35]
NETCONSOLE. http://www.kernel.org/doc/Documentation/networking/ netconsole.txt, 2010.
[36]
PRABHAKARAN, V., BALAKRISHNAN, M., DAVIS, J. D., AND WOBBER, T. Depletable storage systems. In Proceedings of HotStorage'10 (Boston, MA, June 2010).
[37]
PRABHAKARAN, V., RODEHEFFEER, T. L., AND ZHOU, L. Transactional flash. In Proceedings of OSDI'08 (San Diego, CA, December 2008).
[38]
PRITCHETT, T., AND THOTTETHODI, M. SieveStore: A highly-selective, ensemble-level disk cache for cost-performance. In Proceedings of ISCA'10 (Saint-Malo, France, June 2010).
[39]
RAMAKRISHNAN, R., AND GEHRKE, J. Database managment systems. McGraw-Hill, 2030.
[40]
RIVEST, R. The MD5 message-digest algorithm. http://www.ietf.org/rfc/rfc1321.txt, April 1992.
[41]
ROSENBLUM, M., AND OUSTERHOUT, J. K. The design and implementation of a log-structured file system. In ACM Transactions on Computer Systems (1992), vol. 10(1):26-52.
[42]
SOUNDARARAJAN, G., PRABHAKARAN, V., BALAKRISHNAN, M., AND WOBBER, T. Extending SSD lifetimes with disk-based write caches. In Proceedings of FAST'10 (San Jose, CA, February 2010).
[43]
SUN, G., JOO, Y., CHEN, Y., NIU, D., XIE, Y., CHEN, Y., AND LI, H. A hybrid solid-state storage architecture for the performance, energy consumption, and lifetime improvement. In Proceedings of HPCA'10 (Bangalore, India, Jan 2010).
[44]
TSIROGIANNIS, D., HARIZOPOULOS, S., AND SHAH, M. A. Query processing techniques for solid state drives. In Proceedings of SIGMOD'09 (Providence, RI, June 2009).
[45]
UNGUREANU, C., ATKIN, B., ARANYA, A., GOKHALE, S., RAGO, S., CALKOWSKI, G., DUBNICKI, C., AND BOHRA, A. HydraFS: A high-throughput file system for the HYDRAstor content-addressable storage system. In Proceedings of FAST'10 (San Jose, CA, 2010).
[46]
WIKIPEDIA. Battery or supercap. http://en.wikipedia.org/wiki/ Solid-state-drive#Battery_or_SuperCap, 2010.
[47]
ZHU, B., LI, K., AND PATTERSON, H. Avoiding the disk bottleneck in the data domain deduplication file system. In Proceedings of FAST'08 (San Jose, CA, 2008).

Cited By

View all
  • (2024)FSDedup: Feature-Aware and Selective Deduplication for Improving Performance of Encrypted Non-Volatile Main MemoryACM Transactions on Storage10.1145/366273620:4(1-33)Online publication date: 1-May-2024
  • (2024)Eliminating Storage Management Overhead of Deduplication over SSD Arrays Through a Hardware/Software Co-DesignProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640368(320-335)Online publication date: 27-Apr-2024
  • (2023)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/3627992Online publication date: 16-Oct-2023
  • Show More Cited By
  1. CAFTL: a content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    FAST'11: Proceedings of the 9th USENIX conference on File and stroage technologies
    February 2011
    20 pages
    ISBN:9781931971829

    Sponsors

    • OFS: OrangeFS
    • NetApp
    • Google Inc.
    • DELL
    • USENIX Assoc: USENIX Assoc

    In-Cooperation

    Publisher

    USENIX Association

    United States

    Publication History

    Published: 15 February 2011

    Check for updates

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)FSDedup: Feature-Aware and Selective Deduplication for Improving Performance of Encrypted Non-Volatile Main MemoryACM Transactions on Storage10.1145/366273620:4(1-33)Online publication date: 1-May-2024
    • (2024)Eliminating Storage Management Overhead of Deduplication over SSD Arrays Through a Hardware/Software Co-DesignProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640368(320-335)Online publication date: 27-Apr-2024
    • (2023)Explorations and Exploitation for Parity-based RAIDs with Ultra-fast SSDsACM Transactions on Storage10.1145/3627992Online publication date: 16-Oct-2023
    • (2023)LeaFTL: A Learning-Based Flash Translation Layer for Solid-State DrivesProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3575693.3575744(442-456)Online publication date: 27-Jan-2023
    • (2022)An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage SystemsProceedings of the ACM on Measurement and Analysis of Computing Systems10.1145/35308966:2(1-27)Online publication date: 6-Jun-2022
    • (2021)GoSeed: Optimal Seeding Plan for Deduplicated StorageACM Transactions on Storage10.1145/345330117:3(1-28)Online publication date: 16-Aug-2021
    • (2020)GoSeedProceedings of the 18th USENIX Conference on File and Storage Technologies10.5555/3386691.3386710(193-208)Online publication date: 24-Feb-2020
    • (2020)Cache What You Need to CacheACM Transactions on Storage10.1145/339776616:3(1-24)Online publication date: 16-Jul-2020
    • (2019)SmartdedupProceedings of the 2019 USENIX Conference on Usenix Annual Technical Conference10.5555/3358807.3358861(633-646)Online publication date: 10-Jul-2019
    • (2019)Latency/Wearout in a Flash-based Storage System with Replication on WriteProceedings of the 24th Conference of Open Innovations Association FRUCT10.5555/3338290.3338340(360-366)Online publication date: 15-Apr-2019
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media