Magnetic tapes are often considered as an outdated storage technology, yet they are still used to store huge amounts of data. Their main interests are a large capacity and a low price per gigabyte, which come at the cost of a much larger file access time than on disks. With tapes, finding the right ordering of multiple file accesses is thus key to performance. Moving the reading head back and forth along a kilometer long tape has a non-negligible cost and unnecessary movements thus have to be avoided. However, the optimization of tape request ordering has then rarely been studied in the scheduling literature, much less than I/O scheduling on disks. For instance, minimizing the average service time for several read requests on a linear tape remains an open question. Therefore, in this paper, we aim at improving the quality of service experienced by users of tape storage systems, and not only the peak performance of such systems. To this end, we propose a reasonable polynomial-time exact algorithm while this problem and simpler variants have been conjectured NP-hard. We also refine the proposed model by considering U-turn penalty costs accounting for inherent mechanical accelerations. Then, we propose a low-cost variant of our optimal algorithm by restricting the solution space, yet still yielding an accurate suboptimal solution. Finally, we compare our algorithms to existing solutions from the literature on logs of the mass storage management system of a major datacenter. This allows us to assess the quality of previous solutions and the improvement achieved by our low-cost algorithm. Aiming for reproducibility, we make available the complete implementation of the algorithms used in our evaluation, alongside the dataset of tape requests that is, to the best of our knowledge, the first of its kind to be publicly released.
翻译:磁带通常被视为一种过时的存储技术,但它们仍然用于存储大量数据。 它们的主要利益是巨大的容量和低价每千兆字节, 其成本比磁盘上的文件存取时间要大得多。 因此, 磁带可以找到多个文件存取的正确顺序, 因而是性能的关键。 将读头沿千米长的磁带移动, 其成本是不可忽略的, 因而必须避免不必要的移动。 然而, 磁带订购的优化在排程文献中很少研究, 远低于磁盘上的I/ O排程。 例如, 最大限度地减少线性磁带上若干读取请求的平均服务时间。 因此, 在本文中, 我们的目标是提高磁带存储系统用户所经历的服务质量, 而不是仅仅提高这种系统的高峰性能。 为此, 我们提出合理的多米时精确算算法, 这个问题和最简单的变体已经硬化了。 我们还改进了拟议的模型, 考虑用U 将惩罚成本计算算算算算算出我们内部的磁带存取的磁带存磁带磁带存带存带存取时间, 的精度评估, 然后, 将我们现有的磁带存储率加速化数据转换算法的预变法, 将我们的现有数据转换到现有的磁带解法, 将我们现有的磁带存取到现有的磁带存取的原始变法 。