The predicted increase in demand for data-intensive solution development is driving the need for software, data, and domain experts to effectively collaborate in multi-disciplinary data-intensive software teams (MDSTs). We conducted a socio-technical grounded theory study through interviews with 24 practitioners in MDSTs to better understand the challenges these teams face when delivering data-intensive software solutions. The interviews provided perspectives across different types of roles including domain, data and software experts, and covered different organisational levels from team members, team managers to executive leaders. We found that the key concern for these teams is dealing with data-related challenges. In this paper, we present the theory of dealing with data challenges to explain the challenges faced by MDSTs including gaining access to data, aligning data, understanding data, and resolving data quality issues; the context in and condition under which these challenges occur, the causes that lead to the challenges, and the related consequences such as having to conduct remediation activities, inability to achieve expected outcomes and lack of trust in the delivered solutions. We also identified contingencies or strategies applied to address the challenges including high-level strategic approaches such as implementing data governance, implementing new tools and techniques such as data quality visualisation and monitoring tools, as well as building stronger teams by focusing on people dynamics, communication skill development and cross-skilling. Our findings have direct implications for practitioners and researchers to better understand the landscape of data challenges and how to deal with them.
翻译:预计对数据密集型解决方案开发的需求将增加,这促使需要软件、数据和领域专家在多学科数据密集型软件团队(MDST)中进行有效合作。我们通过采访MDST的24名从业者,进行了社会-技术理论研究,以更好地了解这些团队在提供数据密集型软件解决方案时面临的挑战。访谈提供了不同类型角色的视角,包括域、数据和软件专家,并涵盖小组成员、团队管理人员和行政首长的不同组织层面。我们发现,这些团队的主要关切是应对数据相关挑战。在本文件中,我们提出了应对数据挑战的理论,以解释MDST面临的挑战,包括获取数据、统一数据、理解数据和解决数据质量问题;这些挑战的背景和条件、导致挑战的原因,以及相关后果,如必须开展补救活动、无法实现预期结果和对交付解决方案缺乏信任。我们还确定了应对挑战的应急或战略,包括实施数据治理、实施新工具和技术,如数据质量整合、数据质量和数据质量问题解决数据质量问题以及数据质量问题解决;通过建立更强有力的数据质量和数据动态分析工具,通过建立更强有力地分析人和掌握数据技能,从而更好地了解数据动态分析工具,从而更准确地了解数据质量和掌握动态工具。