Machine learning (ML) models can fail in unexpected ways in the real world, but not all model failures are equal. With finite time and resources, ML practitioners are forced to prioritize their model debugging and improvement efforts. Through interviews with 13 ML practitioners at Apple, we found that practitioners construct small targeted test sets to estimate an error's nature, scope, and impact on users. We built on this insight in a case study with machine translation models, and developed Angler, an interactive visual analytics tool to help practitioners prioritize model improvements. In a user study with 7 machine translation experts, we used Angler to understand prioritization practices when the input space is infinite, and obtaining reliable signals of model quality is expensive. Our study revealed that participants could form more interesting and user-focused hypotheses for prioritization by analyzing quantitative summary statistics and qualitatively assessing data by reading sentences.
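To make the combined quantitative-and-qualitative workflow concrete, below is a minimal sketch in Python. The candidate sentences, topic labels, and `score` field are invented placeholders, not Angler's actual data model or pipeline; the abstract only tells us that practitioners summarize subsets numerically (summary statistics) and then read individual sentences to judge them.

```python
from statistics import mean

# Hypothetical candidate sentences flagged by some upstream quality signal
# (e.g., low round-trip translation similarity). All values are placeholders
# for illustration only.
candidates = [
    {"text": "Set a timer for 10 minutes.", "topic": "commands", "score": 0.91},
    {"text": "break a leg before the show", "topic": "idioms", "score": 0.42},
    {"text": "it's raining cats and dogs", "topic": "idioms", "score": 0.38},
    {"text": "Turn left at the next light.", "topic": "navigation", "score": 0.88},
]

# Quantitative pass: per-topic summary statistics to decide where to look first.
by_topic = {}
for c in candidates:
    by_topic.setdefault(c["topic"], []).append(c)

for topic, items in sorted(by_topic.items(),
                           key=lambda kv: mean(x["score"] for x in kv[1])):
    print(f"{topic}: n={len(items)}, mean score={mean(x['score'] for x in items):.2f}")

# Qualitative pass: read the lowest-scoring sentences in the riskiest topic,
# mirroring how participants assessed data by reading sentences.
worst_topic = min(by_topic, key=lambda t: mean(x["score"] for x in by_topic[t]))
for c in sorted(by_topic[worst_topic], key=lambda x: x["score"])[:5]:
    print(c["score"], c["text"])
```

In this sketch, the numeric summary points a reviewer toward the weakest subset, while reading its worst sentences supports the kind of user-focused hypothesis formation the study describes.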