ALL libraries (COBIB.SI union bibliographic/catalogue database)
  • Clustering of distributions : a case of patent citations
    Kejžar, Nataša, 1976- ; Korenjak-Černe, Simona ; Batagelj, Vladimir
    Often the data units are described with discrete distributions (work describedwith citation distribution over time, population pyramid described asage-sex distribution etc.).When the set of such ... units is very large, appropriate clustering methods can reveal the typical patterns hidden in the data. In this paper we present an adapted leaders method combined with a compatible adapted agglomerative hierarchical method that are based on relative error measure between a unit and the corresponding cluster representative-leader. The proposed approach is illustrated on citation distributions derived from the data set of US patents from 1980 to 1999. Thesenew methods were developed because clustering of units, described with distributions, with classical k-means method reveals patterns with single highpeaks which correspond to a single year. These patterns prevail over otherdistribution shapes also present in the data. Compared with centers in k-means method, clustersć representatives obtained with the proposed new methods better detect typical distribution shapes of units. The obtained main cluster types for different sets of units show three main patterns: patents with early or late peak of importance to the community, and patents where the importance is slowly increasing throughout the time period.
    Source: Journal of classification. - ISSN 0176-4268 (Vol. 28, no. 2, 2011, str. 156-183)
    Type of material - article, component part
    Publish date - 2011
    Language - english
    COBISS.SI-ID - 28528345
    DOI

source: Journal of classification. - ISSN 0176-4268 (Vol. 28, no. 2, 2011, str. 156-183)
loading ...
loading ...
loading ...