GitEvolve: Predicting the Evolution of GitHub Repositories - 专知 Papers


October 9, 2020


Honglu Zhou, Hareesh Ravi, Carlos M. Muniz, Vahid Azizi, Linda Ness, Gerard de Melo, Mubbasir Kapadia

Software development is becoming increasingly open and collaborative with the advent of platforms such as GitHub. Given its crucial role, there is a need to better understand and model the dynamics of GitHub as a social platform. Previous work has mostly considered the dynamics of traditional social networking sites like Twitter and Facebook. We propose GitEvolve, a system to predict the evolution of GitHub repositories and the different ways in which users interact with them. To this end, we develop an end-to-end multi-task sequential deep neural network that, given some seed events, simultaneously predicts which user group will next interact with a given repository, what the type of the interaction is, and when it happens. To facilitate learning, we use graph-based representation learning to encode relationships between repositories. We map users to groups by modelling common interests, both to better predict popularity and to generalize to unseen users during inference. We introduce an artificial event type to better model the varying levels of activity of repositories in the dataset. The proposed multi-task architecture is generic and can be extended to model information diffusion in other social networks. In a series of experiments, we demonstrate the effectiveness of the proposed model using multiple metrics and baselines. Qualitative analysis of the model's ability to predict popularity and forecast trends supports its applicability.
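The abstract describes the model only at a high level. As a rough illustration of what a multi-task sequential predictor can look like, the following is a minimal sketch with untrained random weights. It assumes a plain tanh recurrent cell, one-hot event encodings, and an unweighted sum of three losses. All names, sizes, and design choices here are hypothetical and are not taken from the paper; the graph-based repository embeddings, the user grouping, and the artificial event type are not reproduced.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration (not from the paper).
N_GROUPS, N_EVENT_TYPES, HIDDEN = 5, 4, 8
INPUT = N_GROUPS + N_EVENT_TYPES + 1  # one-hot group + one-hot type + time gap


def init(shape):
    """Small random weights; a real model would learn these by training."""
    return rng.normal(scale=0.1, size=shape)


params = {
    "W_in": init((INPUT, HIDDEN)),          # event -> hidden
    "W_h": init((HIDDEN, HIDDEN)),          # hidden -> hidden (recurrence)
    "W_group": init((HIDDEN, N_GROUPS)),    # head 1: which user group
    "W_type": init((HIDDEN, N_EVENT_TYPES)),  # head 2: which event type
    "w_time": init((HIDDEN,)),              # head 3: time until next event
}


def encode_event(group, etype, gap_hours):
    """Encode one event as [one-hot group | one-hot type | log(1 + gap)]."""
    x = np.zeros(INPUT)
    x[group] = 1.0
    x[N_GROUPS + etype] = 1.0
    x[-1] = np.log1p(gap_hours)
    return x


def softmax(z):
    e = np.exp(z - z.max())  # subtract max for numerical stability
    return e / e.sum()


def predict_next(seed_events, p=params):
    """Run the shared recurrent encoder over the seed events, then apply
    three task-specific heads to the final hidden state."""
    h = np.zeros(HIDDEN)
    for ev in seed_events:
        h = np.tanh(encode_event(*ev) @ p["W_in"] + h @ p["W_h"])
    group_probs = softmax(h @ p["W_group"])
    type_probs = softmax(h @ p["W_type"])
    gap = np.log1p(np.exp(h @ p["w_time"]))  # softplus keeps the gap positive
    return group_probs, type_probs, gap


def multitask_loss(pred, target):
    """Unweighted sum of two cross-entropy terms and a squared error on
    the log-scaled time gap (an assumed, simple way to combine the tasks)."""
    group_probs, type_probs, gap = pred
    true_group, true_type, true_gap = target
    time_err = (np.log1p(gap) - np.log1p(true_gap)) ** 2
    return -np.log(group_probs[true_group]) - np.log(type_probs[true_type]) + time_err


# Seed events as (user_group, event_type, hours_since_previous_event).
seed = [(0, 1, 2.0), (3, 0, 0.5), (1, 2, 12.0)]
prediction = predict_next(seed)
loss = multitask_loss(prediction, target=(2, 1, 3.0))
```

Because the three heads share one hidden state, gradients from all three losses would update the same encoder during training, which is the general idea behind a multi-task setup; the weights here are random, so the outputs carry no predictive meaning.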


