We propose a data format for Monte Carlo (MC) events, or any structured data, including experimental data, in a compact binary form using variable-size integer encoding as implemented in Google's Protocol Buffers package. This approach is implemented in the ProMC library, which produces smaller files for MC records than the existing input-output libraries used in high-energy physics (HEP). Other important features are the separation of abstract data layouts from concrete programming implementations, self-description, and random access. Data stored in ProMC files can be written, read, and manipulated in a number of programming languages, such as C++, Java, and Python.
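To illustrate why variable-size integer encoding yields compact records, the following is a minimal sketch of the base-128 "varint" scheme used by Protocol Buffers for unsigned integers. It is not ProMC code: the function names `encode_varint` and `decode_varint` are hypothetical and chosen only for this illustration. Each byte carries 7 bits of payload, and the high bit signals whether more bytes follow, so small values occupy a single byte instead of a fixed 4 or 8.

```python
from typing import Tuple


def encode_varint(value: int) -> bytes:
    """Encode a non-negative integer as a base-128 varint (illustrative helper)."""
    if value < 0:
        raise ValueError("this sketch handles unsigned integers only")
    out = bytearray()
    while True:
        byte = value & 0x7F          # take the lowest 7 bits
        value >>= 7
        if value:
            out.append(byte | 0x80)  # set the continuation bit: more bytes follow
        else:
            out.append(byte)         # last byte: continuation bit is clear
            return bytes(out)


def decode_varint(data: bytes, pos: int = 0) -> Tuple[int, int]:
    """Decode one varint starting at `pos`; return (value, next position)."""
    result = 0
    shift = 0
    while True:
        byte = data[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:          # continuation bit clear: value is complete
            return result, pos
        shift += 7


if __name__ == "__main__":
    for n in (1, 127, 128, 300, 1_000_000):
        encoded = encode_varint(n)
        decoded, _ = decode_varint(encoded)
        print(n, len(encoded), decoded == n)
```

In this scheme, values below 128 take one byte and values below 16384 take two, which suggests why records dominated by small integers can be stored more compactly than with fixed-width fields.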