自然语言处理 | 使用Spacy 进行自然语言处理（二） - 专知

会员服务 ·

0

自然语言处理 | 使用Spacy 进行自然语言处理（二）

2018 年 8 月 27 日 机器学习和数学

上次我们简单介绍了Spacy，学习了它的安装以及实体识别等基本的方法。今天我继续给大家介绍一下它的其他功能如何操作，主要有词性还原，词性标注，名词块识别，依存分析等内容。废话不多说，直接看代码。

import en_core_web_sm
parser = en_core_web_sm.load()
sentences = "There is an art, it says, or rather, a knack to flying." \
"The knack lies in learning how to throw yourself at the ground and miss." \
"In the beginning the Universe was created. This has made a lot of people " \
"very angry and been widely regarded as a bad move."
print("解析文本中包含的句子：")
sents = [sent for sent in parser(sentences).sents]
for x in sents:
print(x)
"""
There is an art, it says, or rather, a knack to flying.
The knack lies in learning how to throw yourself at the ground and miss.
In the beginning the Universe was created.
This has made a lot of people very angry and been widely regarded as a bad move.
"""
print("- * -"*20)
# 分词
print()
tokens = [token for token in sents[0] if len(token) > 1]
print(tokens)
print("- * -"*20)
# 词性还原
lemma_tokens = [token.lemma_ for token in sents[0] if len(token) > 1]
print(lemma_tokens)
print("- * -"*20)
# 简化版的词性标注
pos_tokens = [token.pos_ for token in sents[0] if len(token) > 1]
print(pos_tokens)
print("- * -"*20)
# 词性标注的细节版
tag_tokens = [token.tag_ for token in sents[0] if len(token) > 1]
print(tag_tokens)
print("- * -"*20)
# 依存分析
dep_tokens = [token.dep_ for token in sents[0] if len(token) > 1]
print(dep_tokens)
print("- * -"*20)
print("名词块分析")
doc = parser(u"Autonomous cars shift insurance liability toward manufacturers")
# 获取名词块文本
chunk_text = [chunk.text for chunk in doc.noun_chunks]
print(chunk_text)
print("- * -"*20)
# 获取名词块根结点的文本
chunk_root_text = [chunk.root.text for chunk in doc.noun_chunks]
print(chunk_root_text)
print("- * -"*20)
# 依存分析
chunk_root_dep_ = [chunk.root.dep_ for chunk in doc.noun_chunks]
print(chunk_root_dep_)
print("- * -"*20)
#
chunk_root_head_text = [chunk.root.head.text for chunk in doc.noun_chunks]
print(chunk_root_head_text)
print("- * -"*20)

最后给大家附上一个句法依存分析的结果解释的资料，是斯坦福自然语言处理的一个依存句法分析的解释文档

链接：https://nlp.stanford.edu/software/dependencies_manual.pdf

如果下载不下来，可以微信和我要。

百度文库有中文版：https://wenku.baidu.com/view/1e92891dbceb19e8b8f6bae5.html

登录查看更多

10

相关内容

spaCy

【实用书】Python文本分析第二版，688页pdf带你入门自然语言处理

【实用书】Python文本分析第二版，688页pdf带你入门自然语言处理

专知会员服务

162+阅读 · 2020年5月15日

【论文推荐】自然语言处理与查询扩展综述，Natural Language Processing and Query Expansion

【论文推荐】自然语言处理与查询扩展综述，Natural Language Processing and Query Expansion

专知会员服务

44+阅读 · 2020年5月3日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

深度学习自然语言处理概述，216页ppt，Jindřich Helcl

深度学习自然语言处理概述，216页ppt，Jindřich Helcl

专知会员服务

216+阅读 · 2020年4月26日

【教程】自然语言处理中的迁移学习原理，41 页PPT

【教程】自然语言处理中的迁移学习原理，41 页PPT

专知会员服务

96+阅读 · 2020年2月8日

【实战电子书+代码】自然语言处理的实战，545页pdf，使用Python理解、分析和生成文本

【实战电子书+代码】自然语言处理的实战，545页pdf，使用Python理解、分析和生成文本

专知会员服务

266+阅读 · 2019年12月28日

【电子书】自然语言处理（Natural Language Processing）587页PDF免费下载

【电子书】自然语言处理（Natural Language Processing）587页PDF免费下载

专知会员服务

67+阅读 · 2019年10月30日

【下载】Python自然语言处理实战书籍和代码《Natural Language Processing in Action》

【下载】Python自然语言处理实战书籍和代码《Natural Language Processing in Action》

专知会员服务

80+阅读 · 2019年10月27日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

学习自然语言处理路线图

学习自然语言处理路线图

专知会员服务

140+阅读 · 2019年9月24日

使用 Python 构建可扩展的社交媒体情感分析服务 | Linux 中国

使用 Python 构建可扩展的社交媒体情感分析服务 | Linux 中国

Linux中国

3+阅读 · 2019年5月18日

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python程序员

18+阅读 · 2019年3月28日

R语言自然语言处理：词性标注与命名实体识别

R语言自然语言处理：词性标注与命名实体识别

R语言中文社区

7+阅读 · 2019年3月5日

Python自然语言处理工具NLTK学习导引及相关资料

Python自然语言处理工具NLTK学习导引及相关资料

AINLP

5+阅读 · 2019年1月28日

自然语言处理NLP快速入门

自然语言处理NLP快速入门

专知

20+阅读 · 2018年10月8日

自然语言处理 | 使用Spacy 进行自然语言处理

自然语言处理 | 使用Spacy 进行自然语言处理

机器学习和数学

19+阅读 · 2018年8月22日

在Python中使用SpaCy进行文本分类

在Python中使用SpaCy进行文本分类

专知

24+阅读 · 2018年5月8日

教你用Python进行自然语言处理（附代码）

教你用Python进行自然语言处理（附代码）

数据派THU

6+阅读 · 2018年3月28日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

NLP自然语言处理（二）——基础文本分析

NLP自然语言处理（二）——基础文本分析

乐享数据DataScientists

12+阅读 · 2017年2月7日

Pre-trained Models for Natural Language Processing: A Survey

Arxiv

113+阅读 · 2020年3月18日

OmniNet: A unified architecture for multi-modal multi-task learning

OmniNet: A unified architecture for multi-modal multi-task learning

Arxiv

6+阅读 · 2019年7月17日

Language Modeling with Deep Transformers

Arxiv

6+阅读 · 2019年7月11日

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Arxiv

21+阅读 · 2019年2月4日

Dialogue Natural Language Inference

Arxiv

7+阅读 · 2018年11月1日

Neural Approaches to Conversational AI

Arxiv

26+阅读 · 2018年9月21日

Notes on Deep Learning for NLP

Arxiv

22+阅读 · 2018年8月30日

A Tidy Data Model for Natural Language Processing using cleanNLP

Arxiv

4+阅读 · 2018年5月3日

PEYMA: A Tagged Corpus for Persian Named Entities

Arxiv

5+阅读 · 2018年1月30日

Analyzing Language Learned by an Active Question Answering Agent

Arxiv

6+阅读 · 2018年1月23日

VIP会员

相关主题

自然语言处理

词元分析器

相关VIP内容

【实用书】Python文本分析第二版，688页pdf带你入门自然语言处理

【实用书】Python文本分析第二版，688页pdf带你入门自然语言处理

专知会员服务

162+阅读 · 2020年5月15日

【论文推荐】自然语言处理与查询扩展综述，Natural Language Processing and Query Expansion

【论文推荐】自然语言处理与查询扩展综述，Natural Language Processing and Query Expansion

专知会员服务

44+阅读 · 2020年5月3日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

深度学习自然语言处理概述，216页ppt，Jindřich Helcl

深度学习自然语言处理概述，216页ppt，Jindřich Helcl

专知会员服务

216+阅读 · 2020年4月26日

【教程】自然语言处理中的迁移学习原理，41 页PPT

【教程】自然语言处理中的迁移学习原理，41 页PPT

专知会员服务

96+阅读 · 2020年2月8日

【实战电子书+代码】自然语言处理的实战，545页pdf，使用Python理解、分析和生成文本

【实战电子书+代码】自然语言处理的实战，545页pdf，使用Python理解、分析和生成文本

专知会员服务

266+阅读 · 2019年12月28日

【电子书】自然语言处理（Natural Language Processing）587页PDF免费下载

【电子书】自然语言处理（Natural Language Processing）587页PDF免费下载

专知会员服务

67+阅读 · 2019年10月30日

【下载】Python自然语言处理实战书籍和代码《Natural Language Processing in Action》

【下载】Python自然语言处理实战书籍和代码《Natural Language Processing in Action》

专知会员服务

80+阅读 · 2019年10月27日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

学习自然语言处理路线图

学习自然语言处理路线图

专知会员服务

140+阅读 · 2019年9月24日

热门VIP内容

开通专知VIP会员享更多权益服务

美军小型无人机项目

无人机蜂群——作为执行非常规战争的创新工具 | 2025最新文献

不确定环境下无人机与无人地面车辆编队的地下勘探规划算法 | 122页

接纳无人机多样性：西方军事在无人机战争中适应的五个挑战 | 28页报告

相关资讯

使用 Python 构建可扩展的社交媒体情感分析服务 | Linux 中国

使用 Python 构建可扩展的社交媒体情感分析服务 | Linux 中国

Linux中国

3+阅读 · 2019年5月18日

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python自然语言处理: 使用SpaCycle库进行标记化、词干提取和词形还原

Python程序员

18+阅读 · 2019年3月28日

R语言自然语言处理：词性标注与命名实体识别

R语言自然语言处理：词性标注与命名实体识别

R语言中文社区

7+阅读 · 2019年3月5日

Python自然语言处理工具NLTK学习导引及相关资料

Python自然语言处理工具NLTK学习导引及相关资料

AINLP

5+阅读 · 2019年1月28日

自然语言处理NLP快速入门

自然语言处理NLP快速入门

专知

20+阅读 · 2018年10月8日

自然语言处理 | 使用Spacy 进行自然语言处理

自然语言处理 | 使用Spacy 进行自然语言处理

机器学习和数学

19+阅读 · 2018年8月22日

在Python中使用SpaCy进行文本分类

在Python中使用SpaCy进行文本分类

专知

24+阅读 · 2018年5月8日

教你用Python进行自然语言处理（附代码）

教你用Python进行自然语言处理（附代码）

数据派THU

6+阅读 · 2018年3月28日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

NLP自然语言处理（二）——基础文本分析

NLP自然语言处理（二）——基础文本分析

乐享数据DataScientists

12+阅读 · 2017年2月7日

相关论文

Pre-trained Models for Natural Language Processing: A Survey

Arxiv

113+阅读 · 2020年3月18日

OmniNet: A unified architecture for multi-modal multi-task learning

OmniNet: A unified architecture for multi-modal multi-task learning

Arxiv

6+阅读 · 2019年7月17日

Language Modeling with Deep Transformers

Arxiv

6+阅读 · 2019年7月11日

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Arxiv

21+阅读 · 2019年2月4日

Dialogue Natural Language Inference

Arxiv

7+阅读 · 2018年11月1日

Neural Approaches to Conversational AI

Arxiv

26+阅读 · 2018年9月21日

Notes on Deep Learning for NLP

Arxiv

22+阅读 · 2018年8月30日

A Tidy Data Model for Natural Language Processing using cleanNLP

Arxiv

4+阅读 · 2018年5月3日

PEYMA: A Tagged Corpus for Persian Named Entities

Arxiv

5+阅读 · 2018年1月30日

Analyzing Language Learned by an Active Question Answering Agent

Arxiv

6+阅读 · 2018年1月23日

大家都在搜

2025最新文献

NTU博士论文

久别重逢话双塔

精排模型-从MLP到行为序列：DIN、DIEN、MIMN、SIM、DSIN

微信扫码咨询专知VIP会员