Discussion on the paper by Handcock, Raftery and Tantrum.

Research output: Contribution to journal › Comment/debate

Published

  • T. A. B. Snijders
  • T. Robinson
  • A. C. Atkinson
  • M. Riani
  • I. C. Gormley
  • T. B. Murphy
  • T. Sweeting
  • D. S. Leslie
  • N. T. Longford
  • J. T. Kent
  • T. Lawrance
  • E. M. Airoldi
  • J. Besag
  • D. Blei
  • S. E. Fienberg
  • R. Breiger
  • C. T. Butts
  • P. Doreian
  • V. Batagelj
  • A. Ferligoj
  • D. Draper
  • M. A. J. Van Duijn
  • K. Faust
  • M. Petrescu-Prahova
  • J. J. Forster
  • A. Gelman
  • S. M. Goodreau
  • Katharina Tatjana Gruenberg
  • C. Hennig
  • P. D. Hoff
  • D. R. Hunter
  • D. Husmeier
  • C. Glasbey
  • D. Krackhardt
  • J. Kuha
  • A. Skrondal
  • A. Lawson
  • T. F. Liao
  • B. Mendes
  • D. Draper
  • G. Reinert
  • S. Richardson
  • A. Lewin
  • D. M. Titterington
  • S. Wasserman
  • A. V. Werhli
  • P. Ghazal
Journal publication date: 03/2007
Journal: Journal of the Royal Statistical Society Series A
Journal number: 2
Volume: 170
Pages: 322 - 354
Original language: English

Abstract

Network models are widely used to represent relations between interacting units or actors. Network data often exhibit transitivity, meaning that two actors that have ties to a third actor are more likely to be tied than actors that do not, homophily by attributes of the actors or dyads, and clustering. Interest often focuses on finding clusters of actors or ties, and the number of groups in the data is typically unknown. We propose a new model, the latent position cluster model, under which the probability of a tie between two actors depends on the distance between them in an unobserved Euclidean 'social space', and the actors' locations in the latent social space arise from a mixture of distributions, each corresponding to a cluster. We propose two estimation methods: a two-stage maximum likelihood method and a fully Bayesian method that uses Markov chain Monte Carlo sampling. The former is quicker and simpler, but the latter performs better. We also propose a Bayesian way of determining the number of clusters that are present by using approximate conditional Bayes factors. Our model represents transitivity, homophily by attributes and clustering simultaneously and does not require the number of clusters to be known. The model makes it easy to simulate realistic networks with clustering, which are potentially useful as inputs to models of more complex systems of which the network is part, such as epidemic models of infectious disease. We apply the model to two networks of social relations. A free software package in the R statistical language, latentnet, is available to analyse data by using the model.
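
The generative idea in the abstract (latent positions drawn from a mixture, with tie probability depending on distance in the latent space) can be illustrated with a minimal simulation sketch. This is not the latentnet implementation. The abstract does not state the link function or the mixture components, so the logistic link (log-odds of a tie equal to an intercept minus the Euclidean distance), the spherical Gaussian components and the equal mixing weights are assumptions made here for illustration. The function name and its parameters are hypothetical.

```python
import math
import random


def simulate_latent_cluster_network(n_actors, centers, spread, intercept, seed=0):
    """Simulate an undirected network from a latent position cluster sketch.

    Assumptions (not taken from the abstract): equal mixing weights,
    spherical Gaussian clusters, and a logistic link on distance.
    """
    rng = random.Random(seed)

    # Assign each actor to a cluster uniformly at random.
    labels = [rng.randrange(len(centers)) for _ in range(n_actors)]

    # Draw each latent position from a spherical Gaussian around its cluster centre.
    positions = [
        tuple(rng.gauss(coord, spread) for coord in centers[k]) for k in labels
    ]

    adjacency = [[0] * n_actors for _ in range(n_actors)]
    for i in range(n_actors):
        for j in range(i + 1, n_actors):
            dist = math.dist(positions[i], positions[j])
            # Assumed link: log-odds of a tie = intercept - distance,
            # so nearby actors are more likely to be tied.
            p_tie = 1.0 / (1.0 + math.exp(-(intercept - dist)))
            if rng.random() < p_tie:
                adjacency[i][j] = adjacency[j][i] = 1
    return labels, positions, adjacency


if __name__ == "__main__":
    labels, positions, adjacency = simulate_latent_cluster_network(
        n_actors=40,
        centers=[(0.0, 0.0), (10.0, 10.0)],
        spread=0.5,
        intercept=2.0,
        seed=1,
    )
    n_ties = sum(sum(row) for row in adjacency) // 2
    print("actors:", len(labels), "ties:", n_ties)
```

Because actors in the same cluster sit close together in the latent space, ties within a cluster are far more likely than ties between clusters. Transitivity arises for the same reason: two actors that are both near a third are also near each other.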