训练和评价一名Jupyter笔记本数据科学助理 (Training and Evaluating a Jupyter Notebook Data Science Assistant) - 专知论文

会员服务 ·

0

Jupyter · 可理解性 · Markdown · 统计量 · 变换 ·

2022 年 1 月 30 日

Training and Evaluating a Jupyter Notebook Data Science Assistant

翻译：训练和评价一名Jupyter笔记本数据科学助理

Shubham Chandel,Colin B. Clement,Guillermo Serrato,Neel Sundaresan

We study the feasibility of a Data Science assistant powered by a sequence-to-sequence transformer by training a new model JuPyT5 on all publicly available Jupyter Notebook GitHub repositories and developing a new metric: Data Science Problems (DSP). DSP is a collection of 1119 problems curated from 306 pedagogical notebooks with 92 dataset dependencies, natural language and Markdown problem descriptions, and assert-based unit tests. These notebooks were designed to test university students' mastery of various Python implementations of Math and Data Science, and we now leverage them to study the ability of JuPyT5 to understand and pass the tests. We analyze the content of DSP, validate its quality, and we find that given 100 sampling attempts JuPyT5 is able to solve 77.5\% of the DSP problems. We further present various ablation and statistical analyses and compare DSP to other recent natural language to code benchmarks.

翻译：我们研究数据科学助理的可行性,该助理以序列到序列变压器为动力,对所有公开的Jupyter Notesbook GitHub 库进行新的模型JuPyT5 培训,并开发新的指标:数据科学问题(DSP)。DSP是306个教学笔记本汇编的1119个问题,其中有92个数据集依赖性、自然语言和马克唐问题说明,以及基于维护的单位测试。这些笔记本旨在测试大学生掌握各种数学和数据科学的Python应用能力,我们现在利用这些笔记本研究JuPyT5 了解和通过测试的能力。我们分析了DSP的内容,验证其质量,我们发现有100个抽样尝试JuPyT5 能够解决DSP问题中的77.5 ⁇ 。我们还提出了各种通货膨胀和统计分析,并将DSP与其他最近自然语言的代码基准进行比较。

0

相关内容

Jupyter

Jupyter Notebook是以网页的形式打开，可以在网页页面中直接编写代码和运行代码，代码的运行结果也会直接在代码块下显示的程序。如在编程过程中需要编写说明文档，可在同一个页面中直接编写，便于作及时的说明和解释。

【经典书】用Python学数据科学(Data Science from Scratch)，464页pdf

【经典书】用Python学数据科学(Data Science from Scratch)，464页pdf

专知会员服务

43+阅读 · 2021年2月13日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【Strata Data Conference】用于自然语言处理的深度学习方法

【Strata Data Conference】用于自然语言处理的深度学习方法

专知会员服务

49+阅读 · 2019年9月23日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

基于血脑PK-PD和结构方程模型的银杏叶提取物多组分协同抗脑缺血作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

栅极调制的气体传感器量子输运特性的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

压缩感知与稀疏信号恢复

国家自然科学基金

2+阅读 · 2014年12月31日

连续变量量子误差修正的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

脊髓损伤微环境对BMSCs内质网应激反应的影响及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RHIC能量扫描中逐事件净质子数分布的高阶矩实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

隔壁塔与反应精馏耦合过程的设计与优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于集对分析的交通信号控制评价及优化方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于常规逻辑设计理论与技术移植的较大规模可逆逻辑电路设计方法

国家自然科学基金

0+阅读 · 2012年12月31日

具重复数据删除大规模存储系统可靠性技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

Model Reduction via Dynamic Mode Decomposition

Model Reduction via Dynamic Mode Decomposition

Arxiv

0+阅读 · 2022年4月20日

Contrastive Demonstration Tuning for Pre-trained Language Models

Arxiv

0+阅读 · 2022年4月18日

METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

Arxiv

0+阅读 · 2022年4月16日

Evaluation Benchmarks for Spanish Sentence Representations

Arxiv

0+阅读 · 2022年4月15日

Evaluating few shot and Contrastive learning Methods for Code Clone Detection

Evaluating few shot and Contrastive learning Methods for Code Clone Detection

Arxiv

0+阅读 · 2022年4月15日

HIVE: Evaluating the Human Interpretability of Visual Explanations

Arxiv

0+阅读 · 2022年4月15日

So2Sat POP -- A Curated Benchmark Data Set for Population Estimation from Space on a Continental Scale

Arxiv

0+阅读 · 2022年4月7日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

Exploring Models and Data for Remote Sensing Image Caption Generation

Arxiv

14+阅读 · 2017年12月21日

VIP会员

文章信息

相关主题

相关VIP内容

【经典书】用Python学数据科学(Data Science from Scratch)，464页pdf

【经典书】用Python学数据科学(Data Science from Scratch)，464页pdf

专知会员服务

43+阅读 · 2021年2月13日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【Strata Data Conference】用于自然语言处理的深度学习方法

【Strata Data Conference】用于自然语言处理的深度学习方法

专知会员服务

49+阅读 · 2019年9月23日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人水面艇的实战应用》最新42页报告

《评估人工智能在判定自卫行动之必要性与相称性中的作用》报告

人工智能代理提升战时舰船战备水平

《利用虚拟现实与增强现实技术加强海港海岸线监测》报告

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

相关论文

Model Reduction via Dynamic Mode Decomposition

Model Reduction via Dynamic Mode Decomposition

Arxiv

0+阅读 · 2022年4月20日

Contrastive Demonstration Tuning for Pre-trained Language Models

Arxiv

0+阅读 · 2022年4月18日

METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

Arxiv

0+阅读 · 2022年4月16日

Evaluation Benchmarks for Spanish Sentence Representations

Arxiv

0+阅读 · 2022年4月15日

Evaluating few shot and Contrastive learning Methods for Code Clone Detection

Evaluating few shot and Contrastive learning Methods for Code Clone Detection

Arxiv

0+阅读 · 2022年4月15日

HIVE: Evaluating the Human Interpretability of Visual Explanations

Arxiv

0+阅读 · 2022年4月15日

So2Sat POP -- A Curated Benchmark Data Set for Population Estimation from Space on a Continental Scale

Arxiv

0+阅读 · 2022年4月7日

Pre-Trained Models: Past, Present and Future

Arxiv

19+阅读 · 2021年6月15日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

Exploring Models and Data for Remote Sensing Image Caption Generation

Arxiv

14+阅读 · 2017年12月21日

相关基金

基于血脑PK-PD和结构方程模型的银杏叶提取物多组分协同抗脑缺血作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

栅极调制的气体传感器量子输运特性的理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

压缩感知与稀疏信号恢复

国家自然科学基金

2+阅读 · 2014年12月31日

连续变量量子误差修正的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

脊髓损伤微环境对BMSCs内质网应激反应的影响及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

RHIC能量扫描中逐事件净质子数分布的高阶矩实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

隔壁塔与反应精馏耦合过程的设计与优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于集对分析的交通信号控制评价及优化方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于常规逻辑设计理论与技术移植的较大规模可逆逻辑电路设计方法

国家自然科学基金

0+阅读 · 2012年12月31日

具重复数据删除大规模存储系统可靠性技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员