In the big data era, the relationship between entries becomes more and more complex. Many graph (or network) algorithms have already paid attention to dynamic networks, which are more suitable than static ones for fitting the complex real-world scenarios with evolving structures and features. To contribute to the dynamic network representation learning and mining research, we provide a new bunch of label-adequate, dynamics-meaningful, and attribute-sufficient dynamic networks from the health domain. To be specific, in our proposed repository DPPIN, we totally have 12 individual dynamic network datasets at different scales, and each dataset is a dynamic protein-protein interaction network describing protein-level interactions of yeast cells. We hope these domain-specific node features, structure evolution patterns, and node and graph labels could inspire the regularization techniques to increase the performance of graph machine learning algorithms in a more complex setting. Also, we link potential applications with our DPPIN by designing various dynamic graph experiments, where DPPIN could indicate future research opportunities for some tasks by presenting challenges on state-of-the-art baseline algorithms. Finally, we identify future directions to improve the utility of this repository and welcome constructive inputs from the community. All resources (e.g., data and code) of this work are deployed and publicly available at https://github.com/DongqiFu/DPPIN.
翻译:在大数据时代,条目之间的关系越来越复杂。许多图表(或网络)算法已经注意到动态网络,这些网络比静态网络更适合以不断变化的结构和特点来适应复杂的现实世界情景。为了促进动态网络代表性学习和采矿研究,我们从卫生领域提供了一套新的标签充分、动态感知和属性无量的动态网络。具体地说,在我们提议的数据库DPPIN中,我们完全拥有12个不同尺度的个体动态网络数据集,每个数据集都是一个动态蛋白质-蛋白质互动网络,描述酵母细胞蛋白质层面的相互作用。我们希望这些特定领域的节点特征、结构演变模式以及节点和图表标签能够激励正规化技术,以提高图表机器学习算法在更复杂环境下的性能。此外,我们通过设计各种动态图形实验,将潜在应用程序与我们的DPPIN联系起来,DPIN可以通过对状态/艺术基线算法提出挑战来显示某些任务的未来研究机会。最后,我们确定未来方向,以改善该数据库的实用性、结构/数字代码(httpsalgiu/qi 部署的所有投入)。