We believe that "all men are created equal". With the rise of the police shootings reported by media, more people in the U.S. think that police use excessive force during law enforcement, especially to a specific group of people. We want to apply multidimensional statistical analysis to reveal more facts than the monotone mainstream media. Our paper has three parts. First, we proposed a new method to quantify fatal police shooting news reporting deviation of mainstream media, which includes CNN, FOX, ABC, and NBC. Second, we analyzed the most comprehensive US fatal police shooting dataset from Washington Post. We used FP-growth to reveal the frequent patterns and DBSCAN clustering to find fatal shooting hotspots. We brought multi-attributes (social economics, demographics, political tendency, education, gun ownership rate, police training hours, etc.) to reveal connections under the iceberg. We found that the police shooting rate of a state depends on many variables. The top four most relevant attributes were state joined year, state land area, gun ownership rate, and violent crime rate. Third, we proposed four regression models to predict police shooting rates at the state level. The best model Kstar could predict the fatal police shooting rate with about 88.53% correlation coefficient. We also proposed classification models, including Gradient Boosting Machine, Multi-class Classifier, Logistic Regression, and Naive Bayes Classifier, to predict the race of fatal police shooting victims. Our classification models show no significant evidence to conclude that racial discrimination happened during fatal police shootings recorded by the WP dataset.
翻译:我们认为,“所有男人都是平等的”。第二,我们分析了华盛顿邮报上最全面的美国致命的警察射击数据集。我们用FP(FP)来揭示频繁的射击模式和DBSCAN(DBSCAN)集群,以寻找致命的射击热点。我们想用多层次的统计分析来揭示更多事实,而不是单调主流媒体。我们的文件有三部分。首先,我们提出了一种新的方法来量化致命的警察枪击事件,包括CNN、FOX、ABC和NBC。第二,我们分析了华盛顿邮报上最全面的美国致命的警察射击数据集。我们使用FP(FP)来揭示频繁的射击模式和DBSCAN(D)的集群,以寻找致命的射击热点。我们带来了多层次的统计分析(社会经济学、人口统计、政治趋势、教育、枪支拥有率、警察培训时间等)来揭示冰山下的联系。我们发现警察射击率率取决于许多变量。我们发现,头四个最相关的属性是州合并的年份、州级、州级、州级、枪支拥有率、暴力犯罪率。第三,我们提出了四个回归模型来预测警察射击射击射击率比率, 以及州级(我们最致命的BOIL)的模型预测。