Award Abstract # 1830547
ATD: Collaborative Research: Spatio-Temporal Data Analysis with Dynamic Network Models

NSF Org: DMS
Division Of Mathematical Sciences
Recipient: OHIO STATE UNIVERSITY, THE
Initial Amendment Date: August 6, 2018
Latest Amendment Date: July 31, 2019
Award Number: 1830547
Award Instrument: Continuing Grant
Program Manager: Leland Jameson
DMS
 Division Of Mathematical Sciences
MPS
 Direct For Mathematical & Physical Scien
Start Date: August 1, 2018
End Date: December 31, 2021 (Estimated)
Total Intended Award Amount: $141,685.00
Total Awarded Amount to Date: $141,685.00
Funds Obligated to Date: FY 2018 = $35,954.00
FY 2019 = $105,731.00
History of Investigator:
  • Subhadeep Paul (Principal Investigator)
    paul.963@osu.edu
  • James Wilson (Co-Principal Investigator)
Recipient Sponsored Research Office: Ohio State University
1960 KENNY RD
COLUMBUS
OH  US  43210-1016
(614)688-8735
Sponsor Congressional District: 03
Primary Place of Performance: Ohio State University
1958 Neil Ave
Columbus
OH  US  43210-1016
Primary Place of Performance
Congressional District:
03
Unique Entity Identifier (UEI): DLWBSLWAJWR1
Parent UEI: MN4MDDMN8529
NSF Program(s): ATD-Algorithms for Threat Dete
Primary Program Source: 01001819RB NSF RESEARCH & RELATED ACTIVIT
01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 6877
Program Element Code(s): 046Y00, S10000
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.049

ABSTRACT

Modeling and analyzing spatially-determined and time-varying (spatiotemporal) interactions is at the forefront of research in many scientific and engineering disciplines, including the social and behavioral sciences, transportation, healthcare, economics, and epidemiology. This project represents spatiotemporal interactions of entities as a dynamic complex network and aims to develop statistically-principled methods for modeling, analyzing, and monitoring the dynamic interactions. The methods developed in this work will provide scalable solutions for problems relevant to threat detection, including understanding spreading of diseases and viruses through human proximity networks, understanding human migration patterns through geo-tagged social media data, and monitoring multi-modal urban mobility networks through video footage and sensor logs in a smart city. Graduate and undergraduate students will be trained in interdisciplinary data science through involvement in the research. New data structures, models, and algorithms for manipulating and analyzing spatiotemporal networks will be implemented in the widely-used NetworkX Python package.

The project aims to advance the field of spatiotemporal network analysis by developing new models and methods for representing, monitoring, and predicting spatiotemporal interactions. The research introduces new problem formulations, new analytical methods, and new algorithmic techniques for implementation. This project has three primary aims. First, the project will develop a dynamic embedding model in a latent hyperbolic space to represent spatiotemporal networks. This model enables tracking topological changes both at the network level and at the level of pairs of entities over time. Next, the project will investigate a network surveillance framework based on a multi-resolution exponential random graph model to monitor complex spatiotemporal systems for real-time anomalies and threats. Third, the project will develop a multivariate point process on collections of actors in a spatiotemporal network to model timestamped directed events across different regions in space. This project seeks to create an integrated framework for simultaneously monitoring systematic risk and detecting imminent threat to a system using multi-modal network monitoring techniques. The techniques under development will be utilized to monitor complex systems arising from massive spatiotemporal data accumulation, including data on human contacts through physical proximity, social media data, and event data such as homicides in city neighborhoods and conflicts between countries. The fundamental results derived in this work will guide research in modeling and inference on dynamic networks and will serve as a benchmark for future work.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

(Showing: 1 - 10 of 12)
Yu, Lisha and Zwetsloot, Inez Maria and Stevens, Nathaniel Tyler and Wilson, James David and Tsui, Kwok Leung "Monitoring dynamic networks: A simulation?based strategy for comparing monitoring methods and a comparative study" Quality and Reliability Engineering International , v.38 , 2021 https://doi.org/10.1002/qre.2944 Citation Details
Houghton, Isabel A. and Wilson, James D. "El Niño Detection Via Unsupervised Clustering of Argo Temperature Profiles" Journal of Geophysical Research: Oceans , v.125 , 2020 https://doi.org/10.1029/2019JC015947 Citation Details
Wilson, James D. and Cranmer, Skyler and Lu, Zhong-Lin "A Hierarchical Latent Space Network Model for Population Studies of Functional Connectivity" Computational Brain & Behavior , v.3 , 2020 https://doi.org/10.1007/s42113-020-00080-0 Citation Details
Kent, Daniel and Wilson, James D. and Cranmer, Skyler J. "A Permutation-Based Changepoint Technique for Monitoring Effect Sizes" Political Analysis , 2021 https://doi.org/10.1017/pan.2020.44 Citation Details
Paul, Subhadeep and Chen, Yuguo "Spectral and matrix factorization methods for consistent community detection in multi-layer networks" The Annals of Statistics , v.48 , 2020 https://doi.org/10.1214/18-AOS1800 Citation Details
Wilson, James D. and Baybay, Melanie and Sankar, Rishi and Stillman, Paul and Popa, Abbie M. "Analysis of population functional connectivity data via multilayer network embeddings" Network Science , v.9 , 2021 https://doi.org/10.1017/nws.2020.39 Citation Details
Arastuie, Makan and Paul, Subhadeep and Xu, Kevin S. "CHIP: A Hawkes process model for continuous-time networks with scalable and consistent estimation" Advances in neural information processing systems , v.33 , 2020 Citation Details
Lee, Jihui and Li, Gen and Wilson, James D. "Varying-coefficient models for dynamic networks" Computational Statistics & Data Analysis , 2020 10.1016/j.csda.2020.107052 Citation Details
Paul, Subhadeep and Chen, Yuguo "A random effects stochastic block model for joint community detection in multiple networks with applications to neuroimaging" Annals of Applied Statistics , v.14 , 2020 10.1214/20-AOAS1339 Citation Details
Wilson, James D. and Stevens, Nathaniel T. and Woodall, William H. "Modeling and detecting change in temporal networks via the degree corrected stochastic block model" Quality and Reliability Engineering International , v.35 , 2019 https://doi.org/10.1002/qre.2520 Citation Details
Stillman, Paul E. and Wilson, James D. and Denny, Matthew J. and Desmarais, Bruce A. and Cranmer, Skyler J. and Lu, Zhong-Lin "A consistent organizational structure across multiple functional subnetworks of the human brain" NeuroImage , v.197 , 2019 10.1016/j.neuroimage.2019.03.036 Citation Details
(Showing: 1 - 10 of 12)

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Supported by this grant, we developed statistical models, methods and theories for analysis of network connected spatiotemporal data. Our developments include:

(1)   The Community Hawkes Independent Pairs (CHIP) model for analyzing relational events data. We developed a highly scalable estimation approach that can easily fit networks with hundreds of thousands of nodes and millions of events, and yet has rigorous statistical performance guarantees.

(2)   The multivariate community Hawkes model improves the CHIP model and is capable of modeling realistic dependence patterns that are commonly observed in relational events data. However, fitting this model is not as computationally efficient as CHIP.

(3)   The latent space Hawkes model which produces interpretable results and fit many salient features of several real data sets well.

(4)   A joint latent space model for a social network and multivariate attributes that allows researchers to study student’s responses collected from a survey and their interactions with their peers simultaneously.

(5)   A method for estimating causal network influence from observational data adjusting for latent homophily. Identifying the influence network-connected neighbors exert on individuals' outcomes is a fundamental scientific problem associated with network-linked data and our methods represent a significant advancement in this literature.

(6)   Models and a statistical hypothesis testing framework for detecting and evaluating the small-world property that is widely observed in complex networks. This framework makes the widely applied small world property statistically more rigorous.

(7)   Statistical model, estimation, and hypothesis testing methods for community structure in a population of networks for analyzing data from multi-subject fMRI neuroimaging studies. These methods enable neuroscientists to assess differences in functional organization of brain networks between a population of patients and healthy controls.

(8)   Methods for estimating the latent positions of a network generated from the hyperbolic latent space model with community structure.

(9)   A generative model for correlation networks, known as the correlation generalized exponential random graph model, which characterizes the topological structure of a correlation network using an exponential model on summary statistics of the observed graph.

(10)       A network monitoring technique which combines the use of random graph modeling and statistical process monitoring (SPM) techniques.

 

The overarching aim of this project was to develop dynamic network models and algorithms for representing, analyzing, monitoring, and predicting complex interactions from spatiotemporal data. These developments together represent important contributions in many statistical problems on analysis of spatiotemporal data.  We have also engaged actively in collaborating with researchers from Psychology, Education, Neuroscience,  and Microbiology. We have applied the models and methods developed in this project for various application problems from those fields.  

The project has supported the research of 4 PhD students in Statistics and 1 PhD student in Psychology. In addition, 1 MS student and 1 Undergraduate student were involved in the project. Three of the PhD students were partially supported by funds from this project. The PI has taught a graduate special topics course on statistical inference in networks in Spring 2020 that has incorporated knowledge gained from this project.

The project team has disseminated the results to the communities of interest through invited talks at several conferences including the 2018 and 2020 CMStatistics conferences, the 2021 Network conference, the 2019 IISA conference, 2019 IEEE Data Science workshop, and the 2019 ABS conference.

The PI has organized sessions at the 2020 CMStatistics conference, and the 2019 IEEE Data Science Workshop in Minneapolis, MN. The Project team has also attended the ATD conferences in 2018, 2019, and 2020. The PI has presented in the special topic contributed session organized on behalf of the ATD program at the 2021 JSM conference.

The results of the research from this project has been published in several statistical methodology journals including the Annals of Applied Statistics, Annals of Statistics, Sankhya, machine learning conferences, including NeurIPS, as well as several interdisciplinary journals. A large number of articles with the results are also posted in the preprint server Arxiv. 

 


Last Modified: 04/30/2022
Modified by: Subhadeep Paul

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page