In eDiscovery, it is critical to ensure that each page produced in legal proceedings conforms with the requirements of court or government agency production requests. Errors in productions could have severe consequences in a case, putting a party in an adverse position. The volume of pages produced continues to increase, and tremendous time and effort has been taken to ensure quality control of document productions. This has historically been a manual and laborious process. This paper demonstrates a novel automated production quality control application which leverages deep learning-based image recognition technology to extract Bates Number and Confidentiality Stamping from legal case production images and validate their correctness. Effectiveness of the method is verified with an experiment using a real-world production data.
翻译:在eDiscovery案中,必须确保在法律诉讼中产生的每页都符合法院或政府机构生产请求的要求; 制作中的错误对案件可能产生严重后果,使当事方处于不利的地位; 制作的页数继续增加,为确保文件制作的质量控制花费了大量时间和精力; 这历来是一个人工和艰苦的过程; 本文展示了一种新型的自动化生产质量控制应用,利用深层次的基于学习的图像识别技术,从法律案例制作图像中提取Bates编号和保密印章,并验证其正确性; 方法的有效性由使用真实世界生产数据的实验加以验证。