【2020新书】Python大数据处理，Mastering Large Datasets with Python，311页pdf - 专知VIP

会员服务 ·

43

Python · 大数据处理 ·

2020 年 2 月 1 日

【2020新书】Python大数据处理，Mastering Large Datasets with Python，311页pdf

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

https://www.manning.com/books/mastering-large-datasets-with-python

现代数据科学解决方案需要简洁、易于阅读和可伸缩。在《用Python掌握大型数据集》一书中，作者J.T. Wolohan向您介绍了如何使用Python编码的功能影响方法来处理小型项目并对其进行扩展。您将探索有助于清晰性和可伸缩性的方法和内置Python工具，比如高性能并行方法，以及支持高数据吞吐量的分布式技术。本实用教程中丰富的实践练习将为任何大型数据科学项目锁定这些基本技能。

对这项技术

当应用于大量文件或分布式数据集时，在笔记本大小的数据上运行良好的编程技术可能会变慢，甚至完全失败。通过掌握强大的map和reduce范型，以及支持它的基于python的工具，您可以编写以数据为中心的应用程序，这些应用程序可以有效地扩展，而不需要在需求发生变化时重写代码库。

关于这本书

使用Python掌握大型数据集教会您编写可以处理任何大小的数据集的代码。您将从笔记本大小的数据集开始，这些数据集通过将大任务分解为可以同时运行的小任务来教会您并行化数据分析。然后将这些程序扩展到云服务器集群上的工业级数据集。有了map和reduce范型，您将探索像Hadoop和PySpark这样的工具来有效地处理大量的分布式数据集，使用机器学习加速决策制定，并使用AWS S3简化数据存储。

里面有什么

对map和reduce范例的介绍
并行化与多处理模块框架
分布式计算的Hadoop和Spark
运行AWS作业来处理大型数据集

成为VIP会员查看完整内容

Mastering Large Datasets with Python.pdf

相关内容

Python

Python是一种面向对象的解释型计算机程序设计语言，在设计中注重代码的可读性，同时也是一种功能强大的通用型语言。

【干货书】Python机器学习导论，340页pdf数据科学家指南

专知会员服务

175+阅读 · 2020年6月4日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

Python导论，476页pdf，现代Python计算

Python导论，476页pdf，现代Python计算

专知会员服务

264+阅读 · 2020年5月17日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【经典书】《算法精解：C语言描述》，562页pdf，Mastering Algorithms with C

【经典书】《算法精解：C语言描述》，562页pdf，Mastering Algorithms with C

专知会员服务

106+阅读 · 2020年4月25日

【实用书】掌握Python数据分析，282页pdf，Mastering Python Data Analysis

【实用书】掌握Python数据分析，282页pdf，Mastering Python Data Analysis

专知会员服务

103+阅读 · 2020年4月22日

【新书】Python机器学习实战，545页pdf，Practical Machine Learning with Python

【新书】Python机器学习实战，545页pdf，Practical Machine Learning with Python

专知会员服务

310+阅读 · 2020年2月26日

【新书】用Python3六步掌握机器学习第二版，469页pdf，Mastering Machine Learning

【新书】用Python3六步掌握机器学习第二版，469页pdf，Mastering Machine Learning

专知会员服务

223+阅读 · 2020年2月2日

【2020新书】Python大数据处理，Mastering Large Datasets with Python

【2020新书】Python大数据处理，Mastering Large Datasets with Python

专知会员服务

54+阅读 · 2020年2月2日

【书籍推荐】简洁的Python编程（Clean Python），附274页pdf

【书籍推荐】简洁的Python编程（Clean Python），附274页pdf

专知会员服务

183+阅读 · 2020年1月1日

【干货书】Python机器学习导论，340页pdf数据科学家指南

【干货书】Python机器学习导论，340页pdf数据科学家指南

专知

97+阅读 · 2020年6月4日

【Manning2020新书】R/mlr机器学习，513页pdf，Machine Learning with R

【Manning2020新书】R/mlr机器学习，513页pdf，Machine Learning with R

专知

69+阅读 · 2020年3月7日

Python 杠上 Java、C/C++，赢面有几成？

Python 杠上 Java、C/C++，赢面有几成？

CSDN

6+阅读 · 2018年4月12日

这几本Python新书特别赞

这几本Python新书特别赞

图灵教育

21+阅读 · 2018年3月1日

Python & 机器学习之项目实践 | 赠书

Python & 机器学习之项目实践 | 赠书

人工智能头条

14+阅读 · 2017年12月26日

小学生开始学Python,最接近AI的编程语言:安利一波Python书单

小学生开始学Python,最接近AI的编程语言:安利一波Python书单

程序人生

13+阅读 · 2017年12月24日

【入门】数据分析六部曲

【入门】数据分析六部曲

36大数据

18+阅读 · 2017年12月6日

【下载】Scikit-learn作者新书《Python机器学习导论》, 教程+代码手把手带你实践机器学习算法

【下载】Scikit-learn作者新书《Python机器学习导论》, 教程+代码手把手带你实践机器学习算法

专知

72+阅读 · 2017年12月4日

Python NLP 入门教程

Python NLP 入门教程

大数据技术

20+阅读 · 2017年10月24日

Caffe 深度学习框架上手教程

Caffe 深度学习框架上手教程

黑龙江大学自然语言处理实验室

14+阅读 · 2016年6月12日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

GREASE: A Generative Model for Relevance Search over Knowledge Graphs

Arxiv

4+阅读 · 2019年10月11日

Multi-task learning to improve natural language understanding

Arxiv

4+阅读 · 2018年12月17日

Meta-Transfer Learning for Few-Shot Learning

Meta-Transfer Learning for Few-Shot Learning

Arxiv

8+阅读 · 2018年12月6日

Training Generative Adversarial Networks Via Turing Test

Training Generative Adversarial Networks Via Turing Test

Arxiv

3+阅读 · 2018年10月25日

Scaling Neural Machine Translation

Arxiv

3+阅读 · 2018年6月1日

Universal Language Model Fine-tuning for Text Classification

Arxiv

3+阅读 · 2018年5月23日

The Web as a Knowledge-base for Answering Complex Questions

Arxiv

5+阅读 · 2018年3月18日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

Investigations on Knowledge Base Embedding for Relation Prediction and Extraction

Arxiv

8+阅读 · 2018年2月6日

VIP会员

相关主题

大数据处理

相关VIP内容

【干货书】Python机器学习导论，340页pdf数据科学家指南

专知会员服务

175+阅读 · 2020年6月4日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

Python导论，476页pdf，现代Python计算

Python导论，476页pdf，现代Python计算

专知会员服务

264+阅读 · 2020年5月17日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【经典书】《算法精解：C语言描述》，562页pdf，Mastering Algorithms with C

【经典书】《算法精解：C语言描述》，562页pdf，Mastering Algorithms with C

专知会员服务

106+阅读 · 2020年4月25日

【实用书】掌握Python数据分析，282页pdf，Mastering Python Data Analysis

【实用书】掌握Python数据分析，282页pdf，Mastering Python Data Analysis

专知会员服务

103+阅读 · 2020年4月22日

【新书】Python机器学习实战，545页pdf，Practical Machine Learning with Python

【新书】Python机器学习实战，545页pdf，Practical Machine Learning with Python

专知会员服务

310+阅读 · 2020年2月26日

【新书】用Python3六步掌握机器学习第二版，469页pdf，Mastering Machine Learning

【新书】用Python3六步掌握机器学习第二版，469页pdf，Mastering Machine Learning

专知会员服务

223+阅读 · 2020年2月2日

【2020新书】Python大数据处理，Mastering Large Datasets with Python

【2020新书】Python大数据处理，Mastering Large Datasets with Python

专知会员服务

54+阅读 · 2020年2月2日

【书籍推荐】简洁的Python编程（Clean Python），附274页pdf

【书籍推荐】简洁的Python编程（Clean Python），附274页pdf

专知会员服务

183+阅读 · 2020年1月1日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

【干货书】Python机器学习导论，340页pdf数据科学家指南

【干货书】Python机器学习导论，340页pdf数据科学家指南

专知

97+阅读 · 2020年6月4日

【Manning2020新书】R/mlr机器学习，513页pdf，Machine Learning with R

【Manning2020新书】R/mlr机器学习，513页pdf，Machine Learning with R

专知

69+阅读 · 2020年3月7日

Python 杠上 Java、C/C++，赢面有几成？

Python 杠上 Java、C/C++，赢面有几成？

CSDN

6+阅读 · 2018年4月12日

这几本Python新书特别赞

这几本Python新书特别赞

图灵教育

21+阅读 · 2018年3月1日

Python & 机器学习之项目实践 | 赠书

Python & 机器学习之项目实践 | 赠书

人工智能头条

14+阅读 · 2017年12月26日

小学生开始学Python,最接近AI的编程语言:安利一波Python书单

小学生开始学Python,最接近AI的编程语言:安利一波Python书单

程序人生

13+阅读 · 2017年12月24日

【入门】数据分析六部曲

【入门】数据分析六部曲

36大数据

18+阅读 · 2017年12月6日

【下载】Scikit-learn作者新书《Python机器学习导论》, 教程+代码手把手带你实践机器学习算法

【下载】Scikit-learn作者新书《Python机器学习导论》, 教程+代码手把手带你实践机器学习算法

专知

72+阅读 · 2017年12月4日

Python NLP 入门教程

Python NLP 入门教程

大数据技术

20+阅读 · 2017年10月24日

Caffe 深度学习框架上手教程

Caffe 深度学习框架上手教程

黑龙江大学自然语言处理实验室

14+阅读 · 2016年6月12日

相关论文

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

GREASE: A Generative Model for Relevance Search over Knowledge Graphs

Arxiv

4+阅读 · 2019年10月11日

Multi-task learning to improve natural language understanding

Arxiv

4+阅读 · 2018年12月17日

Meta-Transfer Learning for Few-Shot Learning

Meta-Transfer Learning for Few-Shot Learning

Arxiv

8+阅读 · 2018年12月6日

Training Generative Adversarial Networks Via Turing Test

Training Generative Adversarial Networks Via Turing Test

Arxiv

3+阅读 · 2018年10月25日

Scaling Neural Machine Translation

Arxiv

3+阅读 · 2018年6月1日

Universal Language Model Fine-tuning for Text Classification

Arxiv

3+阅读 · 2018年5月23日

The Web as a Knowledge-base for Answering Complex Questions

Arxiv

5+阅读 · 2018年3月18日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

Investigations on Knowledge Base Embedding for Relation Prediction and Extraction

Arxiv

8+阅读 · 2018年2月6日

微信扫码咨询专知VIP会员