We describe an instantiation of a new concept for multimodal multisensor data collection of real life in-the-wild free standing social interactions in the form of a Conference Living Lab (ConfLab). ConfLab contains high fidelity data of 49 people during a real-life professional networking event capturing a diverse mix of status, acquaintanceship, and networking motivations at an international conference. Recording such a dataset is challenging due to the delicate trade-off between participant privacy and fidelity of the data, and the technical and logistic challenges involved. We improve upon prior datasets in the fidelity of most of our modalities: 8-camera overhead setup, personal wearable sensors recording body motion (9-axis IMU), Bluetooth-based proximity, and low-frequency audio. Additionally, we use a state-of-the-art hardware synchronization solution and time-efficient continuous technique for annotating body keypoints and actions at high frequencies. We argue that our improvements are essential for a deeper study of interaction dynamics at finer time scales. Our research tasks showcase some of the open challenges related to in-the-wild privacy-preserving social data analysis: keypoints detection from overhead camera views, skeleton based no-audio speaker detection, and F-formation detection. With the ConfLab dataset, we aim to bridge the gap between traditional computer vision tasks and in-the-wild ecologically valid socially-motivated tasks.
翻译:我们描述了以 " 会议生活实验室 " (ConfLab)为形式,对现实生活中的真实生活进行多式联运、多传感器或数据收集的新概念的即时化。 " 康特拉 " 包含在一次真实的、专业的网络活动期间49人的高度忠诚数据,在一次国际会议上捕捉各种身份、熟识和网络动机的多种组合;记录这样一个数据集具有挑战性,因为参与者隐私和数据忠诚以及所涉及的技术和后勤挑战之间的微妙取舍,因此具有挑战性。我们改进了我们大多数方式的忠实性先前数据集:8个摄像头安装、个人可磨损传感器记录身体运动(9-xasis IMU)、蓝牙近距离和低频率音频。此外,我们使用了一种最先进的硬件同步解决方案和具有时间效率的连续技术,用于说明机构关键点和高频率的行动。我们主张,改进我们的改进对于更深入地研究更精确的时间尺度的互动动态至关重要。我们的研究任务展示了与以下两个方面的实际差距:在视频内部检测机头的检测、基于直观的直观的直径的直径定位、基于直径定位的直径定位的直径定位的直径定位的直径定位分析、断的直径直径定位的图像定位分析社会目标的数据分析。