When arranging for third-party data annotation, it can be hard to compare how well the competing providers apply best practices to create high-quality datasets. This leads to a "race to the bottom," where competition based solely on price makes it hard for vendors to charge for high-quality annotation. We propose a voluntary rubric which can be used (a) as a scorecard to compare vendors' offerings, (b) to communicate our expectations of the vendors more clearly and consistently than today, (c) to justify the expense of choosing someone other than the lowest bidder, and (d) to encourage annotation providers to improve their practices.
翻译:在安排第三方数据说明时,很难比较相互竞争的供应商采用最佳做法创建高质量数据集的最佳办法的好坏,这导致“竞相逐下”竞争,因为完全以价格为基础的竞争使得供应商很难收取高质量说明的费用。 我们提议一个自愿的标语,可以(a) 用作比较供应商报价的记分卡,(b) 以比现在更清楚和一致的方式传达我们对供应商的期望,(c) 证明选择除最低投标人以外的人的费用是合理的,以及(d) 鼓励批注供应商改进其做法。