October 14, 2020

The Music Streaming Sessions Dataset

Brian Brost, Rishabh Mehrotra, Tristan Jehan

From arXiv; Web Conference 2019 version with updated link to dataset

At the core of many important machine learning problems faced by online streaming services is a need to model how users interact with the content they are served. Unfortunately, there are no public datasets currently available that enable researchers to explore this topic. In order to spur that research, we release the Music Streaming Sessions Dataset (MSSD), which consists of 160 million listening sessions and associated user actions. Furthermore, we provide audio features and metadata for the approximately 3.7 million unique tracks referred to in the logs. This is the largest collection of such track metadata currently available to the public. This dataset enables research on important problems including how to model user listening and interaction behavior in streaming, as well as Music Information Retrieval (MIR) and session-based sequential recommendations. Additionally, a subset of sessions was collected using a uniformly random recommendation setting, enabling their use for counterfactual evaluation of such sequential recommendations. Finally, we provide an analysis of user behavior and suggest further research problems which can be addressed using the dataset.

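The abstract notes that sessions logged under a uniformly random recommendation setting can be used for counterfactual evaluation. One common way to do this is inverse propensity scoring (IPS); the abstract does not say which estimator the authors use, so the following is only a minimal sketch. The event layout, the `ips_value` function, and the "not skipped" reward are hypothetical illustrations and do not reflect the actual MSSD schema.

```python
from typing import Callable, Hashable, Sequence, Tuple

# One logged interaction: (context, candidate_tracks, played_track, reward).
# This layout is illustrative only and is NOT the MSSD schema.
LoggedEvent = Tuple[Hashable, Sequence[str], str, float]
Policy = Callable[[Hashable, Sequence[str]], str]


def ips_value(logs: Sequence[LoggedEvent], target_policy: Policy) -> float:
    """IPS estimate of the mean reward of a deterministic target policy.

    Assumes the logging policy picked uniformly at random among the
    candidates, so the logged track's propensity is 1 / len(candidates).
    """
    if not logs:
        raise ValueError("need at least one logged event")
    total = 0.0
    for context, candidates, played, reward in logs:
        propensity = 1.0 / len(candidates)
        # Only events where the target policy agrees with the log contribute.
        if target_policy(context, candidates) == played:
            total += reward / propensity
    return total / len(logs)


def always_first(context: Hashable, candidates: Sequence[str]) -> str:
    """Toy target policy: always recommend the first candidate."""
    return candidates[0]


# Toy logs; reward 1.0 means the track was not skipped (an assumed reward).
toy_logs = [
    ("s1", ["a", "b"], "a", 1.0),
    ("s2", ["a", "b"], "b", 0.0),
    ("s3", ["a", "b"], "a", 0.0),
    ("s4", ["a", "b"], "b", 1.0),
]

estimate = ips_value(toy_logs, always_first)
print(estimate)
```

Because the logging policy is uniform, every propensity is known exactly, which is what makes such logs convenient for unbiased off-policy estimates of this kind.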
