Search

Scholarly Works (57 results)

Sort By:

Show:

Article

A Relational Event Model for Social Action, with Application to the World Trade Center Disaster

Butts, Carter T.

Other Recent Work (2006)

Interpersonal interaction over short time scales is frequently understood in terms of actions, which can be thought of as discrete events in which one individual emits a behavior directed at one or more other entities in his or her environment (possibly including him or herself). Here, we introduce a highly flexible framework for modeling actions within social settings, which permits likelihood-based inference for behavioral mechanisms with complex dependence. The utility of the framework is illustrated via an application to dynamic modeling of responder radio communications during the early hours of the World Trade Center disaster.

Cover page: A Relational Event Model for Social Action, with Application to the World Trade Center Disaster

Article

Cycle Census Statistics for Exponential Random Graph Models*

Butts, Carter T.

Other Recent Work (2006)

Exponential family models for random graphs (ERGs, also known as p∗ models) are an increasingly popular tool for the analysis of social networks. ERGs allow for the parameterization of complex dependence among edges within a likelihood-based framework, and are often used to model local influences on global structure. This paper introduces a family of cycle statistics, which allow for the modeling of long-range dependence within ERGs. These statistics are shown to arise from a family of partial conditional dependence assumptions based on an extended form of reciprocity, here called reciprocal path dependence. Algorithms for computing cycle statistic changescores and the cycle census are provided, as are analytical expressions for the first and approximate second moments of the cycle census under a Bernoulli null model. An illustrative application of ERG modeling using cycle statistics is also provided.

Cover page: Cycle Census Statistics for Exponential Random Graph Models*

Article

Predictability of Large-scale Spatially Embedded Networks

Butts, Carter T.

Other Recent Work (2002)

Although it is well-known that there is a relationship between socio-physical dis- tance and edge probability in interpersonal networks, the predictive power of such distances for total network structure has not been established. Here, it is shown that upper bounds on the marginal edge probabilities for farflung dyads can be used to place a lower bound on the predictive power of distance, and one such bound is de- rived. Application of this bound to the special case of uniformly placed vertices on the plane suggests that only modest constraints are required for distance effects to dominate at large physical scales.

Cover page: Predictability of Large-scale Spatially Embedded Networks

Article

California Exodus? A Network Model of Population Redistribution in the United States

UC Irvine Previously Published Works (2023)

Motivated by debates about California's net migration loss, we employ valued exponential-family random graph models to analyze the inter-county migration flow networks in the United States. We introduce a protocol that visualizes the complex effects of potential underlying mechanisms, and perform in silico knockout experiments to quantify their contribution to the California Exodus. We find that racial dynamics contribute to the California Exodus, urbanization ameliorates it, and political climate and housing costs have little impact. Moreover, the severity of the California Exodus depends on how one measures it, and California is not the state with the most substantial population loss. The paper demonstrates how generative statistical models can provide mechanistic insights beyond simple hypothesis-testing.

Cover page: California Exodus? A Network Model of Population Redistribution in the United States

Creative Commons 'BY' version 4.0 license

Article

Rooted America: Immobility and Segregation of the Intercounty Migration Network

UC Irvine Previously Published Works (2022)

Despite the popular narrative that the United States is a "land of mobility," the country may have become a "rooted America" after a decades-long decline in migration rates. This article interrogates the lingering question about the social forces that limit migration, with an empirical focus on internal migration in the United States. We propose a systemic, network model of migration flows, combining demographic, economic, political, and geographic factors and network dependence structures that reflect the internal dynamics of migration systems. Using valued temporal exponential-family random graph models, we model the network of intercounty migration flows from 2011 to 2015. Our analysis reveals a pattern of segmented immobility, where fewer people migrate between counties with dissimilar political contexts, levels of urbanization, and racial compositions. Probing our model using "knockout experiments" suggests one would have observed approximately 4.6 million (27 percent) more intercounty migrants each year were the segmented immobility mechanisms inoperative. This article offers a systemic view of internal migration and reveals the social and political cleavages that underlie geographic immobility in the United States.

Cover page: Rooted America: Immobility and Segregation of the Intercounty Migration Network

Article
Peer Reviewed

Rooted America: Immobility and Segregation of the Intercounty Migration Network

UC Irvine Previously Published Works (2023)

Despite the popular narrative that the United States is a “land of mobility,” the country may have become a “rooted America” after a decades-long decline in migration rates. This article interrogates the lingering question about the social forces that limit migration, with an empirical focus on internal migration in the United States. We propose a systemic, network model of migration flows, combining demographic, economic, political, and geographic factors and network dependence structures that reflect the internal dynamics of migration systems. Using valued temporal exponential-family random graph models, we model the network of intercounty migration flows from 2011 to 2015. Our analysis reveals a pattern of segmented immobility, where fewer people migrate between counties with dissimilar political contexts, levels of urbanization, and racial compositions. Probing our model using “knockout experiments” suggests one would have observed approximately 4.6 million (27 percent) more intercounty migrants each year were the segmented immobility mechanisms inoperative. This article offers a systemic view of internal migration and reveals the social and political cleavages that underlie geographic immobility in the United States.

Thesis
Peer Reviewed

Novel Applications of Statistical Network Models for HIV Research

Lee, Francis
Advisor(s): Butts, Carter T

UC Irvine Electronic Theses and Dissertations (2020)

Statistical network models have been shown to be of particular relevance for understanding various phenomena; one of the richest areas for research is understanding the spread of sexually transmitted infections such as HIV. With the advent of new epidemiological protocols designed to prevent spread and foundational work analyzing the impact of network structure on diffusion of contagion, network analysis is poised to tackle various questions relating to the spread of HIV amongst vulnerable populations. Chapter 1 presents a methodological development integrating Goffman's conception of stigma within the exponential random graph modeling framework. Various properties are explored under simulation and as a test case, this is used to quantify the level of behavioral stigma in an adolescent friendship network based on gender. Chapter 2 utilizes the development from Chapter 1 to quantify the level of HIV stigma in informal social networks of young black men who have sex with men (from a structural perspective). This is then linked to a framework for understanding the consequences of network perturbations in the resultant network structure (i.e. the impacts of "coming out HIV positive" on the network) and its subsequent effects on HIV diffusion. Chapter 3 focuses on issues of data collection in informal social networks, specifically resolving conflicting self-reports on relationships within the context of informant accuracy and network inference. The tools developed in this dissertation are far-reaching and can provide insight to the study of populations at risk or other social environments beyond HIV.

Cover page: Novel Applications of Statistical Network Models for HIV Research

Creative Commons 'BY-NC-SA' version 4.0 license

Article

Parameter Estimation Procedures for Exponential-Family Random Graph Models on Count-Valued Networks: A Comparative Simulation Study

UC Irvine Previously Published Works (2021)

The exponential-family random graph models (ERGMs) have emerged as an important framework for modeling social networks for a wide variety of relational types. ERGMs for valued networks are less well-developed than their unvalued counterparts, and pose particular computational challenges. Network data with edge values on the non-negative integers (count-valued networks) is an important such case, with examples ranging from the magnitude of migration and trade flows between places to the frequency of interactions and encounters between individuals. Here, we propose an efficient parallelable subsampled maximum pseudo-likelihood estimation (MPLE) scheme for count-valued ERGMs, and compare its performance with existing Contrastive Divergence (CD) and Monte Carlo Maximum Likelihood Estimation (MCMLE) approaches via a simulation study based on migration flow networks in two U.S. states. Our results suggest that edge value variance is a key factor in method performance, while network size mainly influences their relative merits in computational time. For small-variance networks, all methods perform well in point estimations while CD greatly overestimates uncertainties, and MPLE underestimates them for dependence terms; all methods have fast estimation for small networks, but CD and subsampled multi-core MPLE provides speed advantages as network size increases. For large-variance networks, both MPLE and MCMLE offer high-quality estimates of coefficients and their uncertainty, but MPLE is significantly faster than MCMLE; MPLE is also a better seeding method for MCMLE than CD, as the latter makes MCMLE more prone to convergence failure.

Cover page: Parameter Estimation Procedures for Exponential-Family Random Graph Models on Count-Valued Networks: A Comparative Simulation Study

Thesis
Peer Reviewed

More Than The Sum of Their Parts: Coordination In Dynamic Social Networks

Livas, Selena
Advisor(s): Butts, Carter T

UC Irvine Electronic Theses and Dissertations (2023)

This dissertation investigates coordination as a key component of social systems, from a network of international environmental governance to a localized response to disaster. Chapter 2 is a study of international environmental agreement (IEA) co-ratification. I focus specifically on mixing effects and how these demonstrate a shift in the global configuration of cooperative behavior within this context. Chapter 3 dives deeper into this network, looking more closely at the structural factors influencing ratification of IEAs, including factors at the agreement level. The goal of this chapter was to better understand the formation of new ratification ties over time, while making several methodological contributions as well. Finally, Chapter 4 is a study of dynamic communication patterns across 17 localized first responder networks in the midst of a disaster. We utilized a recently developed tool for relational event model simulation to study the resilience of these networks as they reorganized in the face of varied disruption. This dissertation pushes for the view of social systems as interconnected and specifically demonstrates the advantages of studying coordination from this perspective. I hope it spurs future research on these topics, especially in the realm of environmental degradation as it is ever more often encompassing both regulation and disaster response.

Cover page: More Than The Sum of Their Parts: Coordination In Dynamic Social Networks

Thesis
Peer Reviewed

Advances in Exponential-family Random Graph Models: Computation, Model Selection, and Methodology

Yin, Fan
Advisor(s): Butts, Carter T

UC Irvine Electronic Theses and Dissertations (2020)

Networks (graphs) are broadly used to represent relations between entities in a wide range of scientific fields. Exponential-family random graph models (ERGMs) provide a highly general way of specifying distributions on graphs, allowing the complex dependence structure of edges in a network to be specified in terms of local structural properties. This thesis addresses problems related to three lines of inquiry for ERGMs: faster Bayesian inference algorithms; comparison of newly proposed and traditional model selection techniques; and methodological innovation for modeling ensembles of networks.

In Chapter 2 of this dissertation, we present a highly parallel algorithm that enables fast Bayesian inference on ERGMs. The impetus for this work comes from the facts that conducting Bayesian inference for ERGMs is challenging because of the intractability of both the likelihood and posterior normalizing factor and auxiliary-variable based Markov Chain Monte Carlo (MCMC) methods for this problem are asymptotically exact but computationally demanding. We propose a kernel-based approximate Bayesian computation algorithm for fitting ERGMs, which is easily parallelizable. Through empirical comparisons against the state-of-the-art approximate exchange algorithm, we show that the proposed algorithm yields comparable accuracy to the state-of-the-art MCMC approach, the approximate exchange algorithm (Caimo and Friel, 2011), while cutting the wallclock runtime by half with 5 cores, and by 80\% with 30 cores.

In Chapter 3 of this dissertation, we carry out simulation studies to compare newly proposed and traditional model selection techniques. This work is driven by the importance of understanding the strengths and weaknesses of those model selection techniques for ERGMs that are currently available, including Akaike information criterion (Akaike, 1973), Bayesian information criterion (Schwarz, 1978), Held-Out Predictive Evaluation (HOPE) (Yin et al., 2019), Bayes factors (Raftery, 1995) and graphical goodness of fit (Hunter et al., 2008). In particular, we focus on the first three techniques, as the calculation of Bayes factor for ERGMs relies on reversible jump Markov chain Monte Carlo algorithm extension of the approximate exchange algorithm (Caimo and Friel, 2013), which is hard to implement and tune; the graphical goodness of fit is more suitable for checking whether a model is adequate rather than comparing competing models. The simulation studies are carried out under two scenarios, closed-M (under which the true model is among the set of candidate models) and open-M (under which the true model is not among the set of candidate models), and we evaluate the performance of model selection techniques from various aspects covering the model selection accuracy, predictive deviance and prediction accuracy of edge variables.

In Chapter 4 of this dissertation, we propose a novel methodology that can be used for modeling the generative processes of ensembles of networks. The motivation of this work is that ensembles of networks arise in many scientific fields, but there are few statistical tools for inferring their generative processes, particularly in the presence of both dyadic dependence and cross-graph heterogeneity. To fill in this gap, we propose characterizing network ensembles via finite mixtures of exponential family random graph models, a framework for parametric statistical modeling of graphs that has been successful in explicitly modeling the complex stochastic processes that govern the structure of edges in a network. Our proposed methodology can also be used for applications such as model-based clustering of ensembles of networks and density estimation for complex graph distributions. We develop a Metropolis-within-Gibbs algorithm to conduct fully Bayesian inference and adapt a version of deviance information criterion for missing data models to choose the number of latent heterogeneous generative mechanisms. Simulation studies show that the proposed procedure can recover the true number of latent heterogeneous generative processes and corresponding parameters. We demonstrate the utility of the proposed approach using an ensemble of political co-voting networks among U.S. Senators and an ensemble of advice-seeking networks among school teachers.

Cover page: Advances in Exponential-family Random Graph Models: Computation, Model Selection, and Methodology