Parameter Shrinkage Estimation of Reading Count Data Models


September 28, 2021


Minh Thu Bui, Cornelis J. Potgieter, Akihito Kamata

From arXiv; 20 pages, 6 figures, 3 tables

The paper investigates the efficacy of parameter shrinkage on count data models through the use of penalized likelihood methods. The goal is to fit models to count data where multiple independent count variables are observed with only a moderate sample size per variable. Zero-inflated counts are also plausible for the data. In the context considered here, elementary school-aged children were given passages of different lengths to read. We aim to find a suitable model that accurately captures their oral reading fluency (ORF) as measured by the number of words read incorrectly (WRI). The dataset contains the length of each passage (number of words) and the WRI scores obtained from recorded reading sessions. The idea is to find passage-level parameter estimates with good mean squared error (MSE) properties. Improvement over the MSE of maximum likelihood is sought by appending penalty functions to the negative log-likelihood. Three statistical models are considered for WRI scores, namely the binomial, zero-inflated binomial, and beta-binomial. The paper explores two types of penalty functions, resulting in estimators that are shrunk either toward $0$ or toward the equivalent parameters of other passages. The efficacy of the shrinkage methods is explored in an extensive simulation study.
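The penalized-likelihood idea in the abstract can be illustrated with a minimal sketch for the plain binomial model only. The abstract does not state the exact form of the penalties, so the two quadratic penalties below (one pulling each passage-level error probability toward 0, one pulling it toward the mean across passages) are assumptions, as are the function names, the tuning constant `lam`, the coordinate-descent optimizer, and the made-up counts. The zero-inflated and beta-binomial models would need different likelihoods.

```python
import math

def neg_log_lik(p, errors, trials):
    """Binomial negative log-likelihood summed over passages (constants dropped)."""
    total = 0.0
    for p_j, s_j, t_j in zip(p, errors, trials):
        total -= s_j * math.log(p_j) + (t_j - s_j) * math.log(1.0 - p_j)
    return total

def penalty_to_zero(p):
    """Assumed penalty: shrinks every passage-level probability toward 0."""
    return sum(p_j ** 2 for p_j in p)

def penalty_to_mean(p):
    """Assumed penalty: shrinks the probabilities toward their common mean."""
    p_bar = sum(p) / len(p)
    return sum((p_j - p_bar) ** 2 for p_j in p)

def fit(errors, trials, lam, penalty, sweeps=50, eps=1e-9):
    """Minimize neg_log_lik + lam * penalty by coordinate descent.

    Each coordinate update is a ternary search on (eps, 1 - eps), which is
    valid because the objective is convex in each p_j for these penalties.
    """
    # Start from the maximum likelihood estimates, kept strictly inside (0, 1).
    p = [min(max(s / t, 1e-6), 1.0 - 1e-6) for s, t in zip(errors, trials)]

    def objective(q):
        return neg_log_lik(q, errors, trials) + lam * penalty(q)

    for _ in range(sweeps):
        for j in range(len(p)):
            lo, hi = eps, 1.0 - eps
            for _ in range(60):
                m1 = lo + (hi - lo) / 3.0
                m2 = hi - (hi - lo) / 3.0
                p[j] = m1
                f1 = objective(p)
                p[j] = m2
                f2 = objective(p)
                if f1 < f2:
                    hi = m2
                else:
                    lo = m1
            p[j] = 0.5 * (lo + hi)
    return p

# Hypothetical data: per passage, total words read incorrectly and total words
# attempted, pooled over students (these are not values from the paper).
errors = [12, 30, 5]
trials = [400, 500, 450]

mle = fit(errors, trials, lam=0.0, penalty=penalty_to_zero)
toward_zero = fit(errors, trials, lam=1000.0, penalty=penalty_to_zero)
toward_mean = fit(errors, trials, lam=5000.0, penalty=penalty_to_mean)

print("MLE:        ", [round(v, 4) for v in mle])
print("Toward zero:", [round(v, 4) for v in toward_zero])
print("Toward mean:", [round(v, 4) for v in toward_mean])
```

With `lam=0` the fit reduces to the per-passage maximum likelihood estimate (errors divided by trials). In practice `lam` would be chosen by a criterion such as cross-validation; how the authors select it is not stated in the abstract.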

