This paper presents the Container Profiler, a software tool that measures and records the resource usage of any containerized task. The tool profiles the CPU, memory, disk, and network utilization of containerized tasks, collecting over fifty Linux operating system metrics at the virtual machine, container, and process levels. The Container Profiler supports time series profiling at a configurable sampling interval, enabling continuous monitoring of the resources consumed by containerized tasks and pipelines. To investigate the utility of the Container Profiler, we profile the resource utilization of a multi-stage bioinformatics analytical pipeline (RNA sequencing using unique molecular identifiers). We examine the profiling metrics to assess patterns of CPU, disk, and network resource utilization across the different stages of the pipeline. We also quantify the overhead of profiling a running pipeline at different levels of profiling granularity and verify that the impact is negligible. The Container Profiler can be used to continuously monitor the resource consumption of long and complex containerized applications that run locally or in the cloud. Such monitoring can help identify bottlenecks where additional resources would improve performance.
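To make the idea of time series profiling at a configurable sampling interval concrete, the following is a minimal Python sketch. It is not the Container Profiler's implementation: the function names (`parse_cpu_times`, `read_proc_stat`, `profile`) and the JSON output are assumptions made for illustration, and it samples only the aggregate CPU counters that Linux exposes in `/proc/stat`, which correspond roughly to the virtual machine level. A fuller profiler would presumably also read cgroup and per-process files to cover the container and process levels.

```python
import json
import time
from typing import Callable, Dict, List

# Field order of the aggregate "cpu" line in /proc/stat (see proc(5)).
CPU_FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq", "steal")


def parse_cpu_times(proc_stat_text: str) -> Dict[str, int]:
    """Return cumulative CPU times (in clock ticks) from /proc/stat content."""
    for line in proc_stat_text.splitlines():
        parts = line.split()
        # The aggregate line is labelled exactly "cpu"; per-core lines are "cpu0", "cpu1", ...
        if parts and parts[0] == "cpu":
            values = [int(v) for v in parts[1:1 + len(CPU_FIELDS)]]
            return dict(zip(CPU_FIELDS, values))
    raise ValueError("no aggregate 'cpu' line found")


def read_proc_stat() -> str:
    """Read the live counters; only works on Linux."""
    with open("/proc/stat") as f:
        return f.read()


def profile(
    interval_s: float,
    samples: int,
    reader: Callable[[], str] = read_proc_stat,
    sleep: Callable[[float], None] = time.sleep,
) -> List[Dict[str, float]]:
    """Collect `samples` timestamped records, pausing `interval_s` seconds between them."""
    records: List[Dict[str, float]] = []
    for i in range(samples):
        record: Dict[str, float] = {"timestamp": time.time()}
        record.update(parse_cpu_times(reader()))
        records.append(record)
        if i < samples - 1:
            sleep(interval_s)
    return records


if __name__ == "__main__":
    # Hypothetical usage: three samples, one second apart, printed as JSON.
    print(json.dumps(profile(interval_s=1.0, samples=3), indent=2))
```

Because the counters are cumulative, utilization over an interval would be obtained by differencing consecutive records. The `reader` and `sleep` parameters are injectable only so that the sketch can be exercised without a live `/proc` filesystem.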