A Machine Learning approach of Ecological Modeling: A New method to find Similarity Index

Similarity · Similarity measure · Co-occurrence · LDA · Maximum likelihood estimation

April 3, 2023

Srijan Chattopadhyay,Swapnaneel Bhattachayya

From arXiv, 19 pages, 29 figures

In many areas of scientific research, it is often imperative to determine whether pairs of entities are similar. Standard approaches to this problem include the Jaccard, Sørensen-Dice, and Simpson indices. Recently, a better index for the analysis of co-occurrence and similarity, alpha, was developed; it reversed the results obtained with the standard indices and supported theoretical predictions. In this paper, we propose a new similarity method based on MLE, PCA, LDA, and clustering. Before randomness in prevalence is introduced, our index depends strongly on the data: it is strongly dependent on prevalence and hence follows the pattern of the Jaccard index. We then propose a new randomization method that changes the whole pattern of the results: after randomization, the results reverse and follow those of alpha. We also show some limitations of alpha, which we try to resolve through different pathways.
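For readers unfamiliar with the standard indices named in the abstract, the following is a minimal sketch of their textbook definitions on presence-absence data. It does not reproduce the paper's proposed index or the alpha index; the function names and the example vectors are illustrative assumptions, and returning 0.0 when a denominator is zero is a convention chosen here, not something the abstract specifies.

```python
from typing import Sequence, Tuple


def cooccurrence_counts(a: Sequence[int], b: Sequence[int]) -> Tuple[int, int, int]:
    """Return (n_a, n_b, n_ab): sites where a occurs, b occurs, and both occur."""
    if len(a) != len(b):
        raise ValueError("presence-absence vectors must have the same length")
    n_a = sum(1 for x in a if x)
    n_b = sum(1 for y in b if y)
    n_ab = sum(1 for x, y in zip(a, b) if x and y)
    return n_a, n_b, n_ab


def jaccard(a: Sequence[int], b: Sequence[int]) -> float:
    """Jaccard index: shared sites divided by sites where either entity occurs."""
    n_a, n_b, n_ab = cooccurrence_counts(a, b)
    union = n_a + n_b - n_ab
    return n_ab / union if union else 0.0  # 0.0 for empty data is an assumption


def sorensen_dice(a: Sequence[int], b: Sequence[int]) -> float:
    """Sørensen-Dice index: twice the shared sites divided by the sum of prevalences."""
    n_a, n_b, n_ab = cooccurrence_counts(a, b)
    total = n_a + n_b
    return 2 * n_ab / total if total else 0.0


def simpson(a: Sequence[int], b: Sequence[int]) -> float:
    """Simpson (overlap) index: shared sites divided by the smaller prevalence."""
    n_a, n_b, n_ab = cooccurrence_counts(a, b)
    smaller = min(n_a, n_b)
    return n_ab / smaller if smaller else 0.0


# Hypothetical data: entity b occurs only at sites where entity a also occurs.
site_a = [1, 1, 1, 0, 0, 0, 1, 0]  # prevalence 4
site_b = [1, 1, 0, 0, 0, 0, 0, 0]  # prevalence 2
print(jaccard(site_a, site_b), sorensen_dice(site_a, site_b), simpson(site_a, site_b))
```

In this hypothetical example the second entity is fully nested within the first, yet the Jaccard value is only 0.5 because the two prevalences differ, while the Simpson value is 1.0. This is one simple way to see the kind of prevalence dependence the abstract refers to.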
