Taxonomy: Zhuanzhi Curated Resources
Getting Started
Entity Extraction
Relation Prediction
Datasets / Shared Tasks
- SemEval-2015 Task 17: Taxonomy Extraction Evaluation (TExEval-1), Home, Report
- SemEval-2016 Task 13: Taxonomy Extraction Evaluation (TExEval-2), Home, Report
- SemEval-2016 Task 14: Semantic Taxonomy Enrichment, Home, Report
- SemEval-2018 Task 9: Hypernym Discovery, Home, Report
- UnsupervisedHypernymy, Home, EACL 2017 paper, including 4 datasets:
  - (Hypernymy Detection): EVAL, BLESS, LEDS (a.k.a. Lenci/Benotto), Weeds
- HypernymySuite, Home, ACL 2018 paper, including (somewhat modified) datasets:
  - (Hypernymy Detection): BLESS, LEDS, EVAL, SHWARTZ, WBLESS
  - (Hypernymy Direction): BLESS, WBLESS, BIBLESS
  - (Graded Entailment): HyperLex
- NCBI Taxonomy Harvest
- GBIF Backbone Taxonomy
Tutorials
- KDD 2019 tutorial: Constructing and Mining Heterogeneous Information Networks from Massive Text
- VLDB 2019 tutorial: TextCube: Automated Construction and Multidimensional Exploration
Papers
Surveys & Reports
Hypernymy Discovery & Lexical Entailment
- Zheng Yu, Haixun Wang, Xuemin Lin, Min Wang: Learning Term Embeddings for Hypernymy Identification IJCAI 2015: 1390-1397
- Vered Shwartz, Yoav Goldberg, Ido Dagan: Improving Hypernymy Detection with an Integrated Path-based and Distributional Method [code] ACL (1) 2016
- Vered Shwartz, Ido Dagan: LexNET -- Integrated Path-based and Distributional Method for the Identification of Semantic Relations [code] CogALex@COLING 2016: 80-85
- Alexander Panchenko, Stefano Faralli, Eugen Ruppert, Steffen Remus, Hubert Naets, Cédrick Fairon, Simone Paolo Ponzetto, Chris Biemann: TAXI at SemEval-2016 Task 13: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling SemEval@NAACL-HLT 2016: 1320-1327
- Stephen Roller, Katrin Erk: Relations such as Hypernymy: Identifying and Exploiting Hearst Patterns in Distributional Vectors for Lexical Entailment [code] EMNLP 2016: 2163-2172
- Anh Tuan Luu, Yi Tay, Siu Cheung Hui, See-Kiong Ng: Learning Term Embeddings for Taxonomic Relation Identification Using Dynamic Weighting Neural Network EMNLP 2016: 403-413
- Enrico Santus, Vered Shwartz, Dominik Schlechtweg: Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection [code] EACL 2017: 65-75
- Kim Anh Nguyen, Maximilian Köper, Sabine Schulte im Walde, Ngoc Thang Vu: Hierarchical Embeddings for Hypernymy Detection and Directionality (HyperVec) [code] EMNLP 2017: 233-243
- Haw-Shiuan Chang, ZiYun Wang, Luke Vilnis, Andrew McCallum: Distributional Inclusion Vector Embedding for Unsupervised Hypernymy Detection NAACL 2018: 485-495
- Ivan Vulić, Nikola Mrkšić: Specialising Word Vectors for Lexical Entailment [code] NAACL 2018: 1134-1145
- Wenpeng Yin, Dan Roth: Term Definitions Help Hypernymy Detection *SEM@NAACL-HLT 2018: 203-213
- Stephen Roller, Douwe Kiela, Maximilian Nickel: Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora [code] ACL (2) 2018: 358-363
- Chengyu Wang, Xiaofeng He, Aoying Zhou: Improving Hypernymy Prediction via Taxonomy Enhanced Adversarial Learning [code] AAAI 2019: 7128-7135
- Chengyu Wang, Yan Fan, Xiaofeng He, Aoying Zhou: A Family of Fuzzy Orthogonal Projection Models for Monolingual and Cross-lingual Hypernymy Prediction [code] WWW 2019: 1965-1976
- Yu Shi, Jiaming Shen, Yuchen Li, Naijing Zhang, Xinwei He, Zhengzhi Lou, Qi Zhu, Matthew Walker, Myunghwan Kim, Jiawei Han: Discovering Hypernymy in Text-Rich Heterogeneous Information Network by Exploiting Context Granularity [code] CIKM 2019: 599-608
Instance-based Taxonomy Construction
- Rion Snow, Daniel Jurafsky, Andrew Y. Ng: Semantic Taxonomy Induction from Heterogenous Evidence ACL 2006
- Simone Paolo Ponzetto, Michael Strube: Deriving a Large Scale Taxonomy from Wikipedia AAAI 2007: 1440-1445
- Hui Yang, Jamie Callan: A Metric-based Framework for Automatic Taxonomy Induction ACL 2009: 271-279
- Zornitsa Kozareva, Eduard H. Hovy: A Semi-Supervised Method to Learn and Construct Taxonomies using the Web EMNLP 2010: 1110-1118
- Gerard de Melo, Gerhard Weikum: MENTA: Inducing Multilingual Taxonomies from Wikipedia CIKM 2010: 1099-1108
- Roberto Navigli, Paola Velardi, Stefano Faralli: A Graph-based Algorithm for Inducing Lexical Taxonomies from Scratch IJCAI 2011: 1872-1877
- Wentao Wu, Hongsong Li, Haixun Wang, Kenny Qili Zhu: Probase: A Probabilistic Taxonomy for Text Understanding SIGMOD 2012: 481-492
- Ndapandula Nakashole, Gerhard Weikum, Fabian M. Suchanek: PATTY: A Taxonomy of Relational Patterns with Semantic Types EMNLP-CoNLL 2012: 1135-1145
- Ruiji Fu, Jiang Guo, Bing Qin, Wanxiang Che, Haifeng Wang, Ting Liu: Learning Semantic Hierarchies via Word Embeddings ACL (1) 2014: 1199-1209
- Tiziano Flati, Daniele Vannella, Tommaso Pasini, Roberto Navigli: Two Is Bigger (and Better) Than One -- the Wikipedia Bitaxonomy Project ACL (1) 2014: 945-955
- Mohit Bansal, David Burkett, Gerard de Melo, Dan Klein: Structured Learning for Taxonomy Induction with Belief Propagation ACL (1) 2014: 1041-1051
- Anh Tuan Luu, Jung-jae Kim, See-Kiong Ng: Taxonomy Construction Using Syntactic Contextual Evidence EMNLP 2014: 810-819
- Yuchen Zhang, Amr Ahmed, Vanja Josifovski, Alexander J. Smola: Taxonomy Discovery for Personalized Recommendation WSDM 2014: 243-252
- Anh Tuan Luu, Jung-jae Kim, See-Kiong Ng: Incorporating Trustiness and Collective Synonym and Contrastive Evidence into Taxonomy Construction EMNLP 2015
- Tobias Kötter, Stephan Günnemann, Michael R. Berthold, Christos Faloutsos: Automatic Taxonomy Extraction from Bipartite Graphs ICDM 2015: 221-230
- Georgeta Bordea, Paul Buitelaar, Stefano Faralli, Roberto Navigli: Taxonomy Extraction Evaluation (TExEval) SemEval@NAACL-HLT 2015: 902-910
- Hao Zhang, Zhiting Hu, Yuntian Deng, Mrinmaya Sachan, Zhicheng Yan, Eric P. Xing: Learning Concept Taxonomies from Multi-modal Data ACL (1) 2016
- Amit Gupta, Francesco Piccinno, Mikhail Kozhevnikov, Marius Pasca, Daniele Pighin: Revisiting Taxonomy Induction over Wikipedia COLING 2016: 2300-2309
- Vered Shwartz, Yoav Goldberg, Ido Dagan: Improving Hypernymy Detection with an Integrated Path-based and Distributional Method ACL (1) 2016
- Chengyu Wang, Xiaofeng He, Aoying Zhou: A Short Survey on Taxonomy Learning from Text Corpora EMNLP 2017: 1190-1203
- Jiaming Shen, Zeqiu Wu, Dongming Lei, Jingbo Shang, Xiang Ren, Jiawei Han: SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble ECML/PKDD (1) 2017: 288-304
- Amit Gupta, Rémi Lebret, Hamza Harkous, Karl Aberer: Taxonomy Induction using Hypernym Subsequences CIKM 2017: 1329-1338
- Anne Cocos, Marianna Apidianaki, Chris Callison-Burch: Comparing Constraints for Taxonomic Organization NAACL-HLT 2018: 323-333
- Jiaming Shen, Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, Jiawei Han: HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion [code] KDD 2018: 2180-2189
- Yuning Mao, Xiang Ren, Jiaming Shen, Xiaotao Gu, Jiawei Han: End-to-End Reinforcement Learning for Automatic Taxonomy Induction [code] ACL (1) 2018: 2462-2472
- Matt Le, Stephen Roller, Laetitia Papaxanthos, Douwe Kiela, Maximilian Nickel: Inferring Concept Hierarchies from Text Corpora via Hyperbolic Embeddings ACL (1) 2019: 3231-3241
Clustering-based Taxonomy Construction
- Yangqiu Song, Shixia Liu, Xueqing Liu, Haixun Wang: Automatic Taxonomy Construction from Keywords KDD 2012: 1433-1441
- Ndapandula Nakashole, Gerhard Weikum, Fabian M. Suchanek: PATTY: A Taxonomy of Relational Patterns with Semantic Types EMNLP-CoNLL 2012: 1135-1145
- Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, Jiawei Han: A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy KDD 2013: 437-445
- Enrico Santus, Alessandro Lenci, Qin Lu, Sabine Schulte im Walde: Chasing Hypernyms in Vector Spaces with Entropy EACL 2014: 38-42
- Anh Tuan Luu, Jung-jae Kim, See-Kiong Ng: Taxonomy Construction Using Syntactic Contextual Evidence EMNLP 2014: 810-819
- Doug Downey, Chandra Bhagavatula, Yi Yang: Efficient Methods for Inferring Large Sparse Topic Hierarchies ACL (1) 2015: 774-784
- Alexander Panchenko, Stefano Faralli, Eugen Ruppert, Steffen Remus, Hubert Naets, Cédrick Fairon, Simone Paolo Ponzetto, Chris Biemann: TAXI at SemEval-2016 Task 13: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling SemEval@NAACL-HLT 2016: 1320-1327
- Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian M. Sadler, Michelle Vanni, Jiawei Han: TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering [code] KDD 2018: 2701-2709
- Haodong Bai, Frank Z. Xing, Erik Cambria, Win-Bin Huang: Business Taxonomy Construction Using Concept-Level Hierarchical Clustering CoRR abs/1906.09694 (2019)
- Dmitry Frolov, Susana Nascimento, Trevor I. Fenner, Boris G. Mirkin: Using Taxonomy Tree to Generalize a Fuzzy Thematic Cluster FUZZ-IEEE 2019: 1-6
Taxonomy Expansion
- Wei Shen, Jianyong Wang, Ping Luo, Min Wang: A Graph-based Approach for Ontology Population with Named Entities CIKM 2012: 345-354
- Jingjing Wang, Changsung Kang, Yi Chang, Jiawei Han: A Hierarchical Dirichlet Model for Taxonomy Expansion for Search Engines WWW 2014: 961-970
- David Jurgens, Mohammad Taher Pilehvar: Reserating the awesometastic: An automatic extension of the WordNet taxonomy for novel terms NAACL 2015: 1459-1465
- Luis Espinosa Anke, Francesco Ronzano, Horacio Saggion: TALN at SemEval-2016 Task 14: Semantic Taxonomy Enrichment Via Sense-Based Embeddings SemEval@NAACL-HLT 2016: 1332-1336
- Rami Aly, Shantanu Acharya, Alexander Ossa, Arne Köhn, Chris Biemann, Alexander Panchenko: Every Child Should Have Parents: A Taxonomy Refinement Algorithm Based on Hyperbolic Term Embeddings [code] ACL (1) 2019: 4811-4817
Taxonomy Applications
- Cai-Nicolas Ziegler, Georg Lausen, Lars Schmidt-Thieme: Taxonomy-driven Computation of Product Recommendations CIKM 2004: 406-415
- Bhargav Kanagal, Amr Ahmed, Sandeep Pandey, Vanja Josifovski, Jeffrey Yuan, Lluis Garcia Pueyo: Supercharging Recommender Systems using Taxonomies for Learning User Purchase Behavior PVLDB 5(10): 956-967 (2012)
- Yuchen Zhang, Amr Ahmed, Vanja Josifovski, Alexander J. Smola: Taxonomy Discovery for Personalized Recommendation WSDM 2014: 243-252
- Taxonomy-Aware Multi-Hop Reasoning Networks for Sequential Recommendation [code] WSDM 2019: 573-581
- Bang Liu, Weidong Guo, Di Niu, Chaoyue Wang, Shunnan Xu, Jinghong Lin, Kunfeng Lai, Yu Xu: A User-Centered Concept Mining System for Query and Document Understanding at Tencent [code] KDD 2019: 1831-1841
- Junheng Hao, Muhao Chen, Wenchao Yu, Yizhou Sun, Wei Wang: Universal Representation Learning of Knowledge Bases by Jointly Embedding Instances and Ontological Concepts [code] KDD 2019: 1709-1719
- Jingyue Gao, Yuanduo He, Yasha Wang, Xiting Wang, Jiangtao Wang, Guangju Peng, Xu Chu: STAR: Spatio-Temporal Taxonomy-Aware Tag Recommendation for Citizen Complaints [code] CIKM 2019: 1903-1912
- Davide Taibi, Valentina Lenarduzzi, Claus Pahl: Microservices Anti Patterns: A Taxonomy CoRR abs/1908.04101 (2019)
- Wanling Cai, Li Chen: Towards a Taxonomy of User Feedback Intents for Conversational Recommendations RecSys 2019: 51-55
- Rulei Yu, Lei Shi: A User-based Taxonomy for Deep Learning Visualization Visual Informatics 2(3): 147-154
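Joint Taxonomy Construction and Application
Domain Experts
1. Jiawei Han: Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign, Fellow of IEEE and ACM, and Director of the Information Network Academic Research Center in the United States. He has served as program committee chair of leading international conferences including KDD, SDM, and ICDM, and he founded the journal ACM TKDD and served as its editor-in-chief. He has published more than 600 papers in data mining, databases, and information networks. Jiawei Han's homepage, Jiawei Han on dblp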