Star sampling (SS) is a random sampling procedure on a graph wherein each sample consists of a randomly selected vertex (the star center) and its (one-hop) neighbors (the star points). We consider the use of SS to find any member of a target set of vertices in a graph, where the figure of merit (cost) is either the expected number of samples (unit cost) or the expected number of star centers plus star points (linear cost) until a vertex in the target set is encountered, either as a star center or as a star point. We analyze these two performance measures on three related star sampling paradigms: SS with replacement (SSR), SS without center replacement (SSC), and SS without star replacement (SSS). Exact and approximate expressions are derived for the expected unit and linear costs of SSR, SSC, and SSS on Erd\H{o}s-R\'{e}nyi (ER) random graphs. The approximations are seen to be accurate. SSC/SSS are notably better than SSR under unit cost for low-density ER graphs, while SSS is notably better than SSR/SSC under linear cost for low- to moderate-density ER graphs. Simulations on twelve "real-world" graphs shows the cost approximations to be of variable quality: the SSR and SSC approximations are uniformly accurate, while the SSS approximation, derived for an ER graph, is of variable accuracy.
翻译:星取样(SS)是一个随机抽样程序,每个样本由随机选择的顶点(星中心)及其(一秒)相邻(星点)组成。我们考虑使用SS在图形中找到一组目标顶点的任何成员,其优点数字(成本)要么是样本的预期数量(单位成本),要么是星中心加上恒点的预期数量(线性成本),直到目标组遇到一个顶点(线性成本),或者作为星中心,或者作为星点。我们分析了三个相关的恒取样模式的这两个性能措施:替换的SS(SSR)、没有中心替换的SS(SSC)和没有更换星点的SS(SS)。我们考虑使用SS在图表中找到一组目标顶点(SS)的任何成员。根据三个相关恒取样模式分析这两种性能措施,SSS的性能措施:SSS的单位和线性价比S-SB的精确度值,而SIS的精确性价比SIM的精确性价比SB的精确性价比。SSS的SIS的精确性平流-直流-slimal-ralal-ral-ral-ral-slational-slations 和Sylal-sural-sural-s 的SIS-slal-slal-slal-smaisal-s