With the expansion of electronic publishing, a new dynamics of scientific articles dissemination was initiated. Nowadays, many works are widely disseminated even before publication, in the form of preprints. Another important new element concerns the views of published articles. Thanks to the availability of respective data by some journals, such as PLoS ONE, it became possible to develop investigations on how scientific works are viewed along time, often before the first citations appear. This provides the main theme of the present work. More specifically, our research was motivated by preliminary observations that the view profiles along time tend to present a piecewise linear nature. A methodology was then delineated in order to identify the main segments in the view profiles, which allowed several related measurements to be derived. In particular, we focused on the inclination and length of each subsequent segment. Basic statistics indicated that the inclination can vary substantially along subsequent segments, while the segment lengths resulted more stable. Complementary joint statistics analysis, considering pairwise correlations, provided further information about the properties of the views. In order to better understand the view profiles, we performed respective multivariate statistical analysis, including principal component analysis and hierarchical clustering. The results suggest that a portion of the polygonal views are organized into clusters or groups. These groups were characterized in terms of prototypes indicating the relative increase or decrease along subsequent segments. Four respective distinct models were then developed for representing the observed segments. It was found that models incorporating joint dependencies between the properties of the segments provided the most accurate results among the considered alternatives.
翻译:随着电子出版的扩大,科学文章传播的新动态得以启动。如今,许多作品甚至在出版前以预印本的形式广泛散发。另一个重要的新内容涉及出版文章的意见。由于PLOS one等一些期刊提供各自数据,因此有可能对科学作品如何随时间看进行调查,通常是在第一次引用之前。这提供了目前工作的主要主题。更具体地说,我们的研究的动机是初步观察,即观点简介随着时间的推移往往呈现出片面直线性质。随后,我们界定了一种方法,以确定观点简介的主要部分,从而可以得出若干相关的计量。特别是,我们侧重于随后各部分的倾向和长度。基本统计数据表明,随着随后各部分的长度,趋势可能有很大差异,而各部分的长度则比较稳定。考虑到对各种观点的关联性提供了进一步的信息。为了更好地了解观点简介,我们进行了不同的多变统计分析,包括主要组成部分分析和分级组合。结果表明,在其后各部分中,将各自不同部分的对比性观点分为不同的类别或不同类别。提供了随后不同类别之间形成不同类别。这些组合的模型表明,在其后不同类别中形成不同类别中的不同类别中的不同类别。