This paper outlines a grammar of data analysis, as distinct from grammars of data manipulation, in which the primitives are metrics and dimensions. We describe a Python implementation of this grammar called Meterstick, which is agnostic to the underlying data source, which may be a DataFrame or a SQL database.
翻译:本文概述了一种数据分析的语法,该语法区别于数据操作的语法,其基本元素是度量和维度。我们描述了一种名为Meterstick的Python实现,该实现不依赖于底层数据源,数据源可以是DataFrame或SQL数据库。