Preferential attachment

Preferential attachment

Preferential attachment is a name given by scientists to a class of processes in which some quantity, typically some form of wealth or credit, is distributed among a number of individuals or objects according to how much they already have, so that those who are already wealthy receive more than those who are not. "Preferential attachment" is only the most recent of many names that have been given to such processes. They are also referred to under the names "Yule process", "Gibrat's law", "cumulative advantage", "the rich get richer", and, less correctly, the "Matthew effect". The principal reason for scientific interest in preferential attachment is that it can, under suitable circumstances, generate power law distributions of wealth.

Definition

A preferential attachment process is a stochastic urn process, meaning a process in which discrete units of wealth, usually called "balls", are added in a random or partly random fashion to a set of objects or containers, usually called "urns". A preferential attachment process is an urn process in which additional balls are added continuously to the system and are distributed among the urns as an increasing function of the number of balls the urns already have. In the most commonly studied examples, the number of urns also increases continuously, although this is not a necessary condition for preferential attachment and examples have been studied with constant or even decreasing numbers of urns.

A classic example of a preferential attachment process is the growth in the number of species per genus in some higher taxon of biotic organismscite journal
last = Yule
first = G. U.
title = A Mathematical Theory of Evolution, based on the Conclusions of Dr. J. C. Willis, F.R.S.
journal = Philosophical Transactions of the Royal Society of London, Ser. B
volume = 213
pages = 21–87
date = 1925
] . New genera ("urns") are added to a taxon whenever a newly appearing species is considered sufficiently different from its predecessors that it does not belong in any of the current genera. New species ("balls") are added as old ones speciate (i.e., split in two) and, assuming that new species belong to the same genus as their parent (except for those that start new genera), the probability that a species is added to a new genus will be proportional to the number of species the genus already has. This process, first studied by Yule, is a "linear" preferential attachment process, since the rate at which genera accrue new species is linear in the number they already have.

Linear preferential attachment processes in which the number of urns increases are known to produce a distribution of balls over the urns following the so-called Yule distribution. In the most general form of the process, balls are added to the system at an overall rate of "m" new species for each new urn. Each newly created urn starts out with "k"0 balls and further balls are added to urns at a rate proportional to the number "k" that they already have plus a constant "a" > −"k"0. With these definitions, the fraction "P"("k") of urns having "k" balls in the limit of long time is given by [cite journal
last = Newman
first = M. E. J.
title = Power laws, Pareto distributions and Zipf's law
journal = Contemporary Physics
volume = 46
pages = 323–351
date = 2005
url = http://arxiv.org/abs/cond-mat/0412004
]

:P(k) = {mathrm{B}(k+a,gamma)overmathrm{B}(k_0+a,gamma-1)},

for "k" ≥ "k"0 (and zero otherwise), where B("x","y") is the Euler beta function:

:mathrm{B}(x,y) = {Gamma(x)Gamma(y)overGamma(x+y)},

with Γ("x") being the standard gamma function, and

:gamma = 2 + {k_0 + aover m}.

The beta function behaves asymptotically as B("x","y") ~ "x"−"y" for large "x" and fixed "y", which implies that for large values of "k" we have

:P(k) sim k^{-gamma}.

In other words, the preferential attachment process generates a "long-tailed" distribution following a Pareto distribution or power law in its tail. This is the primary reason for the historical interest in preferential attachment: the species distribution and many other phenomena are observed empirically to follow power laws and the preferential attachment process is a leading candidate mechanism to explain this behavior. Preferential attachment is considered a possible candidate for, among other things, the distribution of the sizes of citiescite journal
last = Simon
first = H. A.
title = On a class of skew distribution functions
journal = Biometrika
volume = 42
pages = 425-440
date = 1955
] , the wealth of extremely wealthy individuals, the number of citations received by learned publicationscite journal
last = Price
first = D. J. de S.
title = A general theory of bibliometric and other cumulative advantage processes
journal = J. Amer. Soc. Inform. Sci.
volume = 27
pages = 292-306
date = 1976
url = http://garfield.library.upenn.edu/price/pricetheory1976.pdf
] , and the number of links to pages on the world wide webcite journal
last = Barabási
first = A.-L.
coauthors = R. Albert
title = Emergence of scaling in random networks
journal = Science
volume = 286
pages = 509-512
date = 1999
url = http://arxiv.org/abs/cond-mat/9910332
] .

The general model described here includes many other specific models as special cases. In the species/genus example above, for instance, each genus starts out with a single species ("k"0 = 1) and gains new species in direct proportion to the number it already has ("a" = 0), and hence "P"("k") = B("k","γ")/B("k"0,"γ"−1) with "γ" = 2 + 1/"m". Similarly the Price model for scientific citations corresponds to the case "k"0 = 0, "a" = 1 and the widely studied Barabási-Albert model corresponds to "k"0 = "m", "a" = 0.

Preferential attachment is sometimes referred to as the Matthew effect, but the two are not precisely equivalent. The Matthew effect, first discussed by Robert Merton [cite journal
last = Merton
first = Robert K.
title = The Matthew effect in science
journal = Science
volume = 159
pages = 56-63
date = 1968
] , is named for a passage in the biblical Gospel of Matthew: "For unto every one that hath shall be given, and he shall have abundance: but from him that hath not shall be taken away even that which he hath." (Matthew , King James Version.) The preferential attachment process does not incorporate the taking away part. An urn process that includes both the giving and the taking away would produce a log-normal distribution rather than a power law. This point may be moot, however, since the scientific insight behind the Matthew effect is in any case entirely different. Qualitatively it is intended to describe not a mechanical multiplicative effect like preferential attachment but a specific human behavior in which people are more likely to give credit to the famous than to the little known. The classic example of the Matthew effect is a scientific discovery made simultaneously by two different people, one well known and the other little known. It is claimed that under these circumstances people tend more often to credit the discovery to the well-known scientist. Thus the real-world phenomenon the Matthew effect is intended to describe is quite distinct from (though certainly related to) preferential attachment.

History

The first rigorous consideration of preferential attachment seems to be that of Yule in 1925, who used it to explain the power-law distribution of the number of species per genus of flowering plants. The process is sometimes called a "Yule process" in his honor. Yule was able to show that the process gave rise to a distribution with a power-law tail, but the details of his proof are, by today's standards, contorted and difficult, since the modern tools of stochastic process theory did not yet exist and he was forced to use more cumbersome methods of proof.

Most modern treatments of preferential attachment make use of the master equation method, whose use in this context was pioneered by Simon in 1955, in work on the distribution of sizes of cities and other phenomena.

The first application of preferential attachment to learned citations was given by Price in 1976. (He referred to the process as a "cumulative advantage" process.) His was also the first application of the process to the growth of a network, producing what would now be called a scale-free network. It is in the context of network growth that the process is most frequently studied today. Price also promoted preferential attachment as a possible explanation for power laws in many other phenomena, including Lotka's law of scientific productivity and Bradford's law of journal use.

The application of preferential attachment to the growth of the world wide web was proposed by Barabási and Albert in 1999. Barabási and Albert also coined the name "preferential attachment" by which the process is best known today and suggested that the process might apply to the growth of other networks as well.

ee also

* Stochastic processes
* Power law
* Yule–Simon distribution
* Simon model
* Complex network
* BA model

References


Wikimedia Foundation. 2010.

Игры ⚽ Нужно сделать НИР?

Look at other dictionaries:

  • BA model — The Barabási–Albert (BA) model is an algorithm for generating random scale free networks using a preferential attachment mechanism. Scale free networks are widely observed in natural and man made systems, including the Internet, the world wide… …   Wikipedia

  • Scale-free network — A scale free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction P ( k ) of nodes in the network having k connections to other nodes goes for large values of k as P ( k ) k − γ where… …   Wikipedia

  • Generalized scale-free model — There has been a burst of activity in the modeling of scale free complex networks. The recipe of Barabási and AlbertBarabási, A. L. and R. Albert, Science 286, 509(1999).] has been followed by several variations and generalizationsR. Albert, and… …   Wikipedia

  • Yule–Simon distribution — Probability distribution name =Yule–Simon type =mass pdf Yule–Simon PMF on a log log scale. (Note that the function is only defined at integer values of k. The connecting lines do not indicate continuity.) cdf Yule–Simon CMF. (Note that the… …   Wikipedia

  • Albert-László Barabási — (born March 30, 1967) is a Romanian born Hungarian scientist. He is the former Emil T. Hofmann professor at the University of Notre Dame and current Distinguished Professor and Director of Northeastern University s Center for Network Science and… …   Wikipedia

  • Wealth concentration — Wealth Concentration, also known as wealth condensation, is a process by which, in certain conditions, newly created wealth tends to become concentrated in the possession of already wealthy individuals or entities, a form of preferential… …   Wikipedia

  • Small-world network — In mathematics and physics, a small world network is a type of mathematical graph in which most nodes are not neighbors of one another, but most nodes can be reached from every other by a small number of hops or steps. A small world network,… …   Wikipedia

  • Simon model — MotivationAiming to account for the wide range of empirical distributions following a power law, Herbert SimonSimon, H. A., 1955, Biometrika 42, 425.] proposed a class of stochastic models that results in a power law distribution function. It… …   Wikipedia

  • Copying mechanism — In the study of scale free networks, a copying mechanism is a process by which such a network can form and grow, by means of repeated steps in which nodes are duplicated with mutations from existing nodes. Several variations of copying mechanisms …   Wikipedia

  • Modelo Barabási–Albert — Red de 1000 nodos generada con el modelo de Modelo de Barabási–Albert En teoría de redes se denomina Modelo de Barabási–Albert (es posible encontrarlo en la literatura abreviadamente como modelo BA) como un algoritmo empleado para generar redes… …   Wikipedia Español

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”