Page 236 - Becoming Metric Wise

P. 236

227
Indicators

propose the notion of merit, to be calculated as follows. We are given a
(homogeneous) group of N articles, for instance all publications by
research group G during the latest 5 years. Now, the evaluator must make
two choices: the first is the indicator to be used, e.g., average number of
received citations; total number of received citations, h-index, .. ., and
secondly a reference set. Next one determines the indicator value for
group G. In a following step one determines a random set of N articles
from the reference set and determines the indicator value for this random
set. This procedure is repeated 1000 or more times. The merit of G is its
percentile value among these 1000 or more indicator results. Clearly, a
good choice of reference sets makes it possible to compare the merit of
different research groups in a meaningful way.

7.11 TIME SERIES OF INDICATORS

In this Section, based on (Liu & Rousseau, 2008) we provide precise defi-
nitions of time series of indicators. Such definitions are necessary to avoid
possible confusion as different time series may lead to different conclu-
sions. We provide a general scheme for time series, using the terms
sequence and series interchangeably. We only discuss time series for which
terms—individual elements in the series—are obtained as sums (occasion-
ally sums consisting of one element). Time series for which terms are
averages can easily be derived from these.

7.11.1 The General Framework

We assume that the focus is on one set of articles. This set can be a scien-
tist’s research record, one specific journal, the set of all journals in one
particular field, or even all journals in a database. For this set we intend to
calculate a time series of indicators based on publication and citation data.
As a first example we consider how two simple time series can be
derived from a given one. Assume we know the numbers of citations
c 0 , c 1 , .. ., c 10 received by a set of articles published in the year Y 0 , where
c 0 is received in the year Y 0 , c 1 in the year Y 0 1 1 and so on. During
an investigation one may be interested in a subsequence such as
s 5 (c k ) k51,...,10 (we are not interested in the number of citations received
during the publication year). Similarly, we might be interested in the

P 1 P 2 P 10
cumulative sequence S 5 j50 j ; j50 j ; :::; j50 j .
c
c
c
Next we come to the main point of this section starting from a
general matrix of data, not a simple sequence. Consider a p-c matrix

231 232 233 234 235 236 237 238 239 240 241