National governments use border information to efficiently manage the biosecurity risk presented by travel and commerce. In the Australian border biosecurity system, data about cargo consignments are collected from records of directions: that is, the records of actions taken by the biosecurity regulator. This data collection is complicated by the way directions for a given entry are recorded. An entry is a collection of import lines where each line is a single type of item or commodity. Analysis is simple when the data are recorded in line mode: the directions are recorded individually for each line. The challenge comes when data are recorded in container mode, because the same direction is recorded against each line in the entry. In other words, if at least one line in an entry has a non-compliant inspection result, then all lines in that entry are recorded as non-compliant. Therefore, container mode data creates a challenge for estimating the probability that certain items are non-compliant, because matching the records of non-compliance to the line information is impossible. We develop a statistical model to use container mode data to help inform biosecurity risk of items. We use asymptotic analysis to estimate the value of container mode data compared to line mode data, do a simulation study to verify that we can accurately estimate parameters in a large dataset, and we apply our methods to a real dataset, for which important information about the risk of non-compliance is recovered using the new model.
翻译:在澳大利亚边境生物安保系统中,货物托运的数据是从方向记录中收集的:即生物安保监管机构的行动记录。这种数据收集由于记录特定条目的方向而变得复杂。条目是每条线为单一种类物品或商品的进口线的收集。当数据记录在线条模式中时,分析是简单的:每条线是单独记录方向。当数据记录在集装箱模式中时,挑战就出现。当数据记录在集装箱模式中时,因为输入的每行中都记录着相同的方向。换句话说,如果一个条目中至少有一条线的检查结果不符合要求,那么该条目中的所有行都记录为不符合要求。因此,集装箱模式数据对估计某些物品不符合要求的可能性构成挑战,因为将不遵守要求的记录与线条信息相匹配是不可能的。我们开发了一个统计模型,用集装箱模式数据来帮助了解物品的生物安保风险。我们使用随机分析来估计集装箱模式数据的价值,与线条数据模式中的数据相比较,进行模拟研究,以核实某些物品不符合要求的可能性。