ED助理:支持在计算笔记本中进行关于现场代码搜索和建议的探索性数据分析 (EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation) - 专知论文

会员服务 ·

0

EDA · Extensibility · Jupyter · 数据分析 · INTERACT ·

2021 年 12 月 15 日

EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In-Situ Code Search and Recommendation

翻译：ED助理:支持在计算笔记本中进行关于现场代码搜索和建议的探索性数据分析

Xingjun Li,Yizhi Zhang,Justin Leung,Chengnian Sun,Jian Zhao

Using computational notebooks (e.g., Jupyter Notebook), data scientists rationalize their exploratory data analysis (EDA) based on their prior experience and external knowledge such as online examples. For novices or data scientists who lack specific knowledge about the dataset or problem to investigate, effectively obtaining and understanding the external information is critical to carry out EDA. This paper presents EDAssistant, a JupyterLab extension that supports EDA with in-situ search of example notebooks and recommendation of useful APIs, powered by novel interactive visualization of search results. The code search and recommendation are enabled by state-of-the-art machine learning models, trained on a large corpus of EDA notebooks collected online. A user study is conducted to investigate both EDAssistant and data scientists' current practice (i.e., using external search engines). The results demonstrate the effectiveness and usefulness of EDAssistant, and participants appreciated its smooth and in-context support of EDA. We also report several design implications regarding code recommendation tools.

翻译：利用计算笔记本(如Jupyter Notesbook),数据科学家根据他们以往的经验和网上实例等外部知识,使其探索性数据分析合理化(EDA),对于缺乏关于数据集或问题的具体知识以调查、有效获取和理解外部信息的新学者或数据科学家来说,这是实施EDA的关键。本文介绍了EDA助理公司,这是一个JupyterLab扩展公司,支持EDA的现场搜索示例笔记本和有用的API的建议,其动力是新颖的交互可视化搜索结果。代码搜索和建议是由最新的机器学习模型促成的,该模型在网上收集了大量的EDA笔记本上接受培训。用户研究是为了调查ED A助理公司和数据科学家的现行做法(即使用外部搜索引擎),结果显示EDA助理公司的有效性和效用,与会者赞赏EDA助理公司的平稳和文字支持。我们还报告了关于代码建议工具的若干设计影响。

0

相关内容

EDA

电子设计自动化（英语：Electronic design automation，缩写：EDA）是指利用计算机辅助设计（CAD）软件，来完成超大规模集成电路（VLSI）芯片的功能设计、综合、验证、物理设计（包括布局、布线、版图、设计规则检查等）等流程的设计方式。

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【干货书】Python高级数据科学分析，424页pdf

【干货书】Python高级数据科学分析，424页pdf

专知会员服务

117+阅读 · 2020年8月7日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

【报告推荐】线上食品推荐中的数据分析（Computational Data Analytics on the Web for Better Food Decision Making）

【报告推荐】线上食品推荐中的数据分析（Computational Data Analytics on the Web for Better Food Decision Making）

专知会员服务

16+阅读 · 2019年10月2日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

计算机类 | PLDI 2020等国际会议信息6条

计算机类 | PLDI 2020等国际会议信息6条

Call4Papers

3+阅读 · 2019年7月8日

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

开放知识图谱

5+阅读 · 2019年4月16日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

LibRec 精选：连通知识图谱与推荐系统

LibRec 精选：连通知识图谱与推荐系统

LibRec智能推荐

3+阅读 · 2018年8月9日

LibRec 每周精选：近期推荐系统论文及进展

LibRec 每周精选：近期推荐系统论文及进展

LibRec智能推荐

30+阅读 · 2018年2月5日

已删除

将门创投

3+阅读 · 2017年10月27日

Machine learning models and facial regions videos for estimating heart rate: a review on Patents, Datasets and Literature

Arxiv

0+阅读 · 2022年2月17日

Improving Rating and Relevance with Point-of-Interest Recommender System

Improving Rating and Relevance with Point-of-Interest Recommender System

Arxiv

0+阅读 · 2022年2月17日

Conjugate priors and bias reduction for logistic regression models

Arxiv

0+阅读 · 2022年2月17日

The Development and Prospect of Code Clone

Arxiv

0+阅读 · 2022年2月17日

Heterogeneous Graph Learning for Explainable Recommendation over Academic Networks

Arxiv

0+阅读 · 2022年2月16日

Learning to Personalize for Web Search Sessions

Arxiv

7+阅读 · 2020年9月17日

A Context-Aware Citation Recommendation Model with BERT and Graph Convolutional Networks

A Context-Aware Citation Recommendation Model with BERT and Graph Convolutional Networks

Arxiv

9+阅读 · 2019年3月15日

Modeling Cognitive Processes in Social Tagging to Improve Tag Recommendations

Arxiv

3+阅读 · 2018年5月30日

Sequence-Aware Recommender Systems

Arxiv

8+阅读 · 2018年2月23日

CryptoRec: Secure Recommendations as a Service

Arxiv

6+阅读 · 2018年2月7日

VIP会员

文章信息

相关主题

相关VIP内容

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【干货书】Python高级数据科学分析，424页pdf

【干货书】Python高级数据科学分析，424页pdf

专知会员服务

117+阅读 · 2020年8月7日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

Yoshua Bengio，使算法知道“为什么”

Yoshua Bengio，使算法知道“为什么”

专知会员服务

8+阅读 · 2019年10月10日

【报告推荐】线上食品推荐中的数据分析（Computational Data Analytics on the Web for Better Food Decision Making）

【报告推荐】线上食品推荐中的数据分析（Computational Data Analytics on the Web for Better Food Decision Making）

专知会员服务

16+阅读 · 2019年10月2日

【电子书推荐】Data Science with Python and Dask

【电子书推荐】Data Science with Python and Dask

专知会员服务

44+阅读 · 2019年6月1日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

计算机类 | PLDI 2020等国际会议信息6条

计算机类 | PLDI 2020等国际会议信息6条

Call4Papers

3+阅读 · 2019年7月8日

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

学术会议 | 知识图谱顶会 ISWC 征稿：Poster/Demo

开放知识图谱

5+阅读 · 2019年4月16日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec 精选：基于LSTM的序列推荐实现（PyTorch）

LibRec智能推荐

50+阅读 · 2018年8月27日

LibRec 精选：连通知识图谱与推荐系统

LibRec 精选：连通知识图谱与推荐系统

LibRec智能推荐

3+阅读 · 2018年8月9日

LibRec 每周精选：近期推荐系统论文及进展

LibRec 每周精选：近期推荐系统论文及进展

LibRec智能推荐

30+阅读 · 2018年2月5日

已删除

将门创投

3+阅读 · 2017年10月27日

相关论文

Machine learning models and facial regions videos for estimating heart rate: a review on Patents, Datasets and Literature

Arxiv

0+阅读 · 2022年2月17日

Improving Rating and Relevance with Point-of-Interest Recommender System

Improving Rating and Relevance with Point-of-Interest Recommender System

Arxiv

0+阅读 · 2022年2月17日

Conjugate priors and bias reduction for logistic regression models

Arxiv

0+阅读 · 2022年2月17日

The Development and Prospect of Code Clone

Arxiv

0+阅读 · 2022年2月17日

Heterogeneous Graph Learning for Explainable Recommendation over Academic Networks

Arxiv

0+阅读 · 2022年2月16日

Learning to Personalize for Web Search Sessions

Arxiv

7+阅读 · 2020年9月17日

A Context-Aware Citation Recommendation Model with BERT and Graph Convolutional Networks

A Context-Aware Citation Recommendation Model with BERT and Graph Convolutional Networks

Arxiv

9+阅读 · 2019年3月15日

Modeling Cognitive Processes in Social Tagging to Improve Tag Recommendations

Arxiv

3+阅读 · 2018年5月30日

Sequence-Aware Recommender Systems

Arxiv

8+阅读 · 2018年2月23日

CryptoRec: Secure Recommendations as a Service

Arxiv

6+阅读 · 2018年2月7日

微信扫码咨询专知VIP会员