In eDiscovery, a party to a lawsuit or similar action must search through available information to identify those documents and files that are relevant to the suit. Search efforts tend to identify less than 100% of the relevant documents and courts are frequently asked to adjudicate whether the search effort has been reasonable, or whether additional effort to find more of the relevant documents is justified. This article provides a method for estimating the probability that significant additional information will be found from extended effort. Modeling and two data sets indicate that the probability that facts/topics exist among the so-far unidentified documents that have not been observed in the identified documents is low for even moderate levels of Recall.
翻译:在e Discovery案中,诉讼或类似行动的当事方必须通过现有资料进行搜索,以查明与诉讼有关的文件和档案; 往往要求搜索努力查明不到100%的相关文件和法院,以裁定搜索努力是否合理,或是否有理由为寻找更多相关文件作出更多努力; 本条提供了一种方法,用以估计从长期努力中可能找到大量额外信息的可能性; 建模和两套数据集表明,在所查明的文件中未观察到的远未识别文件中存在事实/专题的可能性很小,即使是中等程度的回调也很低。