转自:爱可可-爱生活
There are lots of machine learning ready datasets available to use for fun or practice on Kaggle's Public Datasets platform. Here is a short list of some of our favorites that we've already had the chance to review. They're all (mostly) cleaned and ready for analysis!
Indian Liver Patient Records
Synthetic Financial Data for Fraud Detection
Business and Industry Reports
Can You Predict Product Backorders?
Exoplanet Hunting in Deep Space
Adult Census Income
Iris Species
Fall Detection Data from China
Biomechanical Features of Orthopedic Patients
Video Game Sales with Ratings
NYC Property Sales
Gas Sensor Array Under Dynamic Gas Mixtures
The Enron Email Dataset
Ubuntu Dialogue Corpus
Old Newspapers: A cleaned subset of HC Corpora newspapers
Speech Accent Archive
Blog Authorship Corpus
Cryptocurrency Historical Prices
Exoplanet Hunting in Deep Space
YouTube Faces with Facial Keypoints
Fashion MNIST
Seattle Police Department 911 Incident Response
Baltimore 911 Calls
Crimes in Chicago
Philadelphia Crime Data
London Crime
Iowa Liquor Sales
Seattle Library Checkout Records
链接:
https://www.kaggle.com/annavictoria/ml-friendly-public-datasets/
原文链接:
https://m.weibo.cn/1402400261/4175682756483267