Distributed monitoring plays a crucial role in managing the activities of cloud-based datacenters. System administrators have long relied on monitoring systems such as Nagios and Ganglia to obtain status alerts on their desktop-class machines. However, the popularity of mobile devices is pushing the community to develop datacenter monitoring solutions for smartphone-class devices. Here we lay out desirable characteristics of such smartphone-based monitoring and identify quantitatively the shortcomings from directly applying existing solutions to this domain. Then we introduce a possible design that addresses some of these shortcomings and provide results from an early prototype, called MAVIS, using one month of monitoring data from approximately 3,000 machines hosted by Purdue's central IT organization.
翻译:分布式监测在管理基于云的数据中心活动方面发挥着关键作用。系统管理员长期以来一直依靠诸如Nagios和Ganglia等监测系统获得桌面级机器的状态警报。然而,移动设备的普及性正在推动社区为智能手机级设备开发数据中心监测解决方案。这里我们列出了这种基于智能手机的监测的可取特征,并从数量上确定直接将现有解决方案应用于该领域的缺点。然后我们引入了一种可能的设计,以解决其中一些缺点,并提供一个早期原型(称为MAVIS)的结果,即利用Purdue中央信息技术组织托管的大约3 000台机器一个月的监测数据。