在块存储设备上快速大量高效地写入小数据的研究

基本信息
批准号:61572487
项目类别:面上项目
资助金额:64.00
负责人:江松
学科分类:
依托单位:中国科学院深圳先进技术研究院
批准年份:2015
结题年份:2019
起止时间:2016-01-01 - 2019-12-31
项目状态: 已结题
项目参与者:赵毓斌,张瑞,田盼,曾经纬,赵娟娟,赵东辉
关键词:
存储虚拟化操作系统内核I/O调度文件与存储系统
结项摘要

While big data poses a big challenge to storage systems on their capability and efficiency, small data also presents an equally serious threat on their access efficiency. Examples of small data include filesystem’s metadata, lookup table entries in virtual storage devices, and small updates. Almost all storage devices use block interface. When small data (e.g., 128Bytes or less) is substantially smaller than ablock (e.g., 4KB), the actual access size would be much larger than the data itself, potentially resulting in wasted device bandwidth and significantly reduced I/O efficiency. Because of demand on immediate data persistency, writes of the small-data continue to be the Achilles’ heel of block devices..There are multiple software layers in the I/O stack interacting with the block interface, where small data writes can inflict performance penalty. The first layer is the virtual block device, which is hosted onblock device(s) to provide a block interface to virtual machines. For every write of a data block to a virtual device, the device needs a small write recording where the data block is stored. The second layer is the I/O scheduler, where requests of block writes are received and then sent to the (virtual) storage device. Small writes that can be identified and removed include small update in a data block, a data block that can be substantially reduced by compression, and a data block that can be removed by on-line deduplication. The third layer is file system, where metadata often need to be saved immediately after data write. Servicing these small writes via block interface is not efficient due to significant write amplification and often requires order between writes of corresponding data block and metadata..To address these challenges raised by small writes, we propose a highly effective and fundamental approach that is applicable at any layers in an I/O stack. Instead of relying on any special hardware support or demanding any interface changes, this approach leverages data compression technique. As it is observed that small data usually accompany data blocks, they can be embedded into the data blocks should the data in the blocks be compressible. This use of compression technique represents a deviation from its conventional employment for reducing data volume, and holds much larger performance advantages. First, because small data are often not contiguous with data blocks, eliminating them not only reduces disk wear (for flash-based SSD) but also removes extra disk seeks (for hard disks). Second, when small data are metadata of a data block and two (or more) writes of them are required to complete a write request of the data block, the writes must follow a certain order to ensure on-disk data consistency. Using flushes to enforce the order would seriously degrade I/O performance. By integrating the metadata and data into one block, which is written in one atomic write, the order issue can be efficiently addressed.

大数据向存储系统发起全方位挑战的同时,小数据则威胁到存储系统的访问效率。典型的小数据包括文件系统的元数据,虚拟存储设备中的查找表项等。如今,绝大多数存储设备使用块接口,基于I/O栈上的虚拟块设备、I/O调度器和文件系统都需与块接口交互。由于小数据远小于一个块单位,实际访问成本远大于数据本身,导致设备带宽的浪费,显著降低I/O效率。尤其在即时持久化的要求下,小数据写入成为块设备的致命弱点。针对该问题,本项目提出一种高效、彻底的解决方案。不同于某些依赖特殊硬件或需要更改接口的方法,本方案运用了数据压缩技术,小数据通常跟随数据块,当该数据块被压缩时,小数据便可被嵌入数据块中,从而大幅减少小数据写入。不仅降低磁盘磨损,还省去多余的硬盘查找以及为强制顺序执行而使用的昂贵的flush命令。不同于传统数据压缩技术中追求节省存储空间的目标,该方案首次提出利用压缩技术来突破块设备接口的限制以大大提高性能。

项目摘要

尽管大数据在向存储系统发起全方位的挑战,小数据则威胁到存储系统的访问效率。典型的小数据包括文件系统的元数据,虚拟存储设备中的查找表项等。如今,绝大多数存储设备使用 块接口,基于I/O栈上的虚拟块设备、I/O调度器和文件系统都需与块接口交互。由于小数据远小于一个块单位,实际访问成本远大于数据本身,导致设备带宽的浪费,显著降低I/O效率。尤其在即时持久化的要求下,小数据写入成为块设备的致命弱点。针对该问题,本项目提出了一种高效、彻底的解决方案。不同于某些依赖特殊硬件或需要更改接口的方法,本方案运用了数据压缩技术,小数据通常跟随数据块,当该数据块被压缩时,小数据便可被嵌入数据块中,从而大幅减少小数据写入。不仅降低磁盘磨损,还省去多余的硬盘查找以及为强制顺序执行而使用的昂贵的flush命令。不同于传统数据压缩技术中追求节省存储空间的目标,该方案首次提出利用压缩技术来突破块设备接口的限制以大大提高性能。目前,项目已经大体上完成了预定目标,相信其落地应用会对存储系统技术产生一定影响。

项目成果
{{index+1}}

{{i.achievement_title}}

{{i.achievement_title}}

DOI:{{i.doi}}
发表时间:{{i.publish_year}}

暂无此项成果

数据更新时间:2023-05-31

其他相关文献

1

Efficient photocatalytic degradation of organic dyes and reaction mechanism with Ag2CO3/Bi2O2CO3 photocatalyst under visible light irradiation

Efficient photocatalytic degradation of organic dyes and reaction mechanism with Ag2CO3/Bi2O2CO3 photocatalyst under visible light irradiation

DOI:
发表时间:2016
2

Intensive photocatalytic activity enhancement of Bi5O7I via coupling with band structure and content adjustable BiOBrxI1-x

Intensive photocatalytic activity enhancement of Bi5O7I via coupling with band structure and content adjustable BiOBrxI1-x

DOI:10.1016/j.scib.2017.12.016
发表时间:2018
3

硬件木马:关键问题研究进展及新动向

硬件木马:关键问题研究进展及新动向

DOI:
发表时间:2018
4

The Role of Osteokines in Sarcopenia: Therapeutic Directions and Application Prospects

The Role of Osteokines in Sarcopenia: Therapeutic Directions and Application Prospects

DOI:10.3389/fcell.2021.735374
发表时间:2021
5

面向云工作流安全的任务调度方法

面向云工作流安全的任务调度方法

DOI:10.7544/issn1000-1239.2018.20170425
发表时间:2018

江松的其他基金

批准号:91330000
批准年份:2013
资助金额:300.00
项目类别:重大研究计划
批准号:91230000
批准年份:2012
资助金额:300.00
项目类别:重大研究计划
批准号:11051001
批准年份:2010
资助金额:10.00
项目类别:专项基金项目
批准号:11026009
批准年份:2010
资助金额:10.00
项目类别:数学天元基金项目
批准号:91730000
批准年份:2017
资助金额:300.00
项目类别:重大研究计划
批准号:91530000
批准年份:2015
资助金额:300.00
项目类别:重大研究计划

相似国自然基金

1

基于访问模式分析的存储设备高效利用方法研究

批准号:61103020
批准年份:2011
负责人:王瑀屏
学科分类:F0202
资助金额:24.00
项目类别:青年科学基金项目
2

大数据环境下基于新型存储设备的I/O系统数据布局优化研究

批准号:61572377
批准年份:2015
负责人:何水兵
学科分类:F0204
资助金额:67.00
项目类别:面上项目
3

由光折变空间孤子写入并存储波导的研究

批准号:69678018
批准年份:1996
负责人:刘思敏
学科分类:F0505
资助金额:10.00
项目类别:面上项目
4

机械振动无线传感器网络大量数据低功耗高效可靠传输方法研究

批准号:51675067
批准年份:2016
负责人:汤宝平
学科分类:E0511
资助金额:62.00
项目类别:面上项目