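Project title: IO Stream Interference Analysis and Data Layout Optimization for RAID Shared by Heterogeneous Servers
Project number: No.61202106
Project type: Young Scientists Fund
Year of approval: 2013
Discipline: Computer Science
Project author: 刘艳 (Liu Yan)
Affiliation: 华侨大学 (Huaqiao University)
Funding: 230,000 yuan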
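Chinese abstract: In today's many small and medium-sized IT enterprises, a RAID storage subsystem is often shared by multiple heterogeneous servers. Contention for storage resources, together with the frequent disk head seeks caused by mutual interference among the IO streams of the heterogeneous servers, degrades shared RAID performance and defeats the original purpose of enterprise storage sharing. This project proposes to address the problem with an optimized data layout scheme for shared RAID, built on online analysis of IO accesses combined with dynamic data relocation. A cache area is set aside in the physical storage space of the shared RAID and used to configure one "sub-RAID" for each server. Online analysis of IO access characteristics then determines the structure of each sub-RAID, following the principle of maximizing either the number of data blocks accessed sequentially on a single disk or the number of array disks accessed in parallel, and the best data set is selected and dynamically relocated from the large disk array to the sub-RAID. Each sub-RAID thus always acts as a "dedicated logical cache" for its server, which isolates the IO stream interference among heterogeneous servers and raises the IO throughput of the storage system. The results of the project allow a shared RAID to provide optimized, balanced storage service to heterogeneous servers, achieving true enterprise RAID storage sharing.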
Chinese keywords: storage system;heterogeneous server;IO request;data replica;RAID
English abstract: In the IT departments of small and medium-sized enterprises, RAID storage systems are generally shared by multiple heterogeneous servers. A shared RAID often shows a rapid decrease in IO performance as a direct result of resource contention and of the frequent disk head movements caused by intermixed disk requests from different servers, which goes against the aim of enterprise storage sharing. Based on online profiling of disk IO operations from the multiple servers sharing the disk array, combined with data migration, we propose a new dynamic data relocation technique for shared RAID storage to optimize performance. A physical partition of the entire shared array is set up as a "cache area", providing one sub-RAID for each server sharing the storage. Each sub-RAID adopts its own RAID configuration to match the IO access pattern of its corresponding server. By analyzing the profiling data, we determine the optimal data relocation, the portion of hot data, and the partition of disks in the shared RAID so as to maximize large sequential block accesses on individual disks and, at the same time, maximize parallel accesses across the disks in the array. In effect, the "cache area" stores active data and works in a similar way to a second-level storage cache with a capacity several orders of magnitude larger than a RAM cache. Besides, each su
English keywords: storage system;heterogeneous server;IO request;data replica;RAID
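
Illustrative sketch (not part of the project record): the abstracts describe profiling each server's IO stream, choosing a sub-RAID structure that favors either long sequential runs on a single disk or parallel access across many disks, and selecting hot data to relocate into the cache area. The record does not give the actual algorithms, so everything below is an assumption made for illustration: the names Request, sequential_ratio, choose_layout and hot_extents, the 0.5 threshold, the rule for picking a stripe unit, and ranking extents by the start block of each request are all hypothetical.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Request:
    lba: int     # starting logical block address
    blocks: int  # request length in blocks


def sequential_ratio(trace: List[Request]) -> float:
    """Fraction of requests that start exactly where the previous one ended."""
    if len(trace) < 2:
        return 0.0
    hits = sum(1 for prev, cur in zip(trace, trace[1:])
               if cur.lba == prev.lba + prev.blocks)
    return hits / (len(trace) - 1)


def longest_run_blocks(trace: List[Request]) -> int:
    """Length in blocks of the longest contiguous run of requests."""
    best = run = 0
    prev_end = None
    for r in trace:
        run = run + r.blocks if r.lba == prev_end else r.blocks
        best = max(best, run)
        prev_end = r.lba + r.blocks
    return best


def choose_layout(trace: List[Request], disks: int,
                  threshold: float = 0.5) -> Tuple[str, int]:
    """Return (policy, stripe_unit_blocks) for one server's sub-RAID.

    Assumed heuristic: a mostly sequential stream gets a stripe unit as
    large as its longest run, so the run stays on one disk; any other
    stream gets a small stripe unit so an average request spans all disks.
    """
    if not trace or disks < 1:
        raise ValueError("need a non-empty trace and at least one disk")
    if sequential_ratio(trace) >= threshold:
        return ("sequential", longest_run_blocks(trace))
    mean_blocks = sum(r.blocks for r in trace) // len(trace)
    return ("parallel", max(1, mean_blocks // disks))


def hot_extents(trace: List[Request], extent_blocks: int,
                capacity: int) -> List[int]:
    """Pick the most frequently touched extents, up to the cache capacity.

    Only the starting block of each request is counted (a simplification).
    """
    counts = Counter(r.lba // extent_blocks for r in trace)
    return [extent for extent, _ in counts.most_common(capacity)]


if __name__ == "__main__":
    streaming = [Request(0, 8), Request(8, 8), Request(16, 8), Request(100, 8)]
    scattered = [Request(0, 64), Request(1000, 64), Request(50, 64)]
    print(choose_layout(streaming, disks=4))
    print(choose_layout(scattered, disks=4))
    print(hot_extents(streaming, extent_blocks=64, capacity=1))
```

In this sketch each server's trace would be profiled separately, so that every sub-RAID gets its own policy. A real system would also need to handle reads versus writes, parity overhead, and the cost of relocating data, none of which are modeled here.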