( )

Time: Thursday, August, 6th, 2020, 10:00 AM – 11:50 AM Eastern Daylight Time (EDT)

Place: Virtual Joint Statistical Meeting 2020 (

Organizer(s): Chul Moon,, Southern Methodist University

Chair(s): Hengrui Luo,, The Ohio State University

10:05 AM         Solution manifold and Its Statistical Applications

Speaker. Yen-Chi Chen, University of Washington

Abstract. A solution manifold is the collection of points in a d-dimensional space satisfying a system of s equations with s<d. Solution manifolds occur in several statistical problems including hypothesis testing, curved-exponential families, constrained mixture models, partial identifications, and nonparametric set estimation. We analyze solution manifolds both theoretically and algorithmically. In terms of theory, we derive five useful results: the smoothness theorem, the stability theorem (which implies the consistency of a plug-in estimator), the convergence of a gradient flow, the local center manifold theorem and the convergence of the gradient descent algorithm. To numerically approximate a solution manifold, we propose a Monte Carlo gradient descent algorithm. In the case of likelihood inference, we design a manifold constraint maximization procedure to find the maximum likelihood estimator on the manifold. We also develop a method to approximate a posterior distribution defined on a solution manifold.

10:25 AM         Persistent Topological Descriptors for Functional Brain Network

Speaker. Hyunnam Ryu, University of Georgia; Nicole Lazar, University of Georgia

Abstract. We compare the topological features of functional brain networks. In general, functional brain networks are dealt with in an elementwise manner based on the connectivity matrix as part of network data analysis. This tends to ignore the higher-order topology of the network, which can have significant implications. In recent studies, researchers have been interested in topological data analysis. Persistent homology is known to be useful for studying dynamic topological invariants hidden in complex data obtained from topological space. Analysis using persistent homology not only captures topological features that could be overlooked in the network data analysis but also addresses threshold selection problems commonly found in network data analysis.

We use persistent homology to compare the topological features of brain networks. We construct a brain network from the fMRI time series BOLD signal and calculate the persistent homology through the weighted brain network. Also, we compare the summarized topological features of different subject groups by calculating the persistence landscape.

10:45 AM         Uncovering the Holes in the Universe with Topological Data Analysis

Speaker. Jessi Cisewski-Kehe, Yale University

Abstract. The large-scale structure (LSS) of the Universe is a spatially complex web of matter that is difficult to analyze without losing potentially important information, but can help to constrain the underlying cosmological model that describes the Universe. Topological Data Analysis (TDA) is especially suitable for such weblike data and we have used this framework to visualize, define, and do inference on known (i.e., voids) and new (i.e., filament loops) cosmological structures. 

During this talk, I will discuss how TDA can be used to uncover cosmological structures.  The features on a persistence diagram represent homology group generators (connected components, loops, voids, etc.), which are not uniquely defined back in the dataset. However, having a way to visualize the generators in the dataset can be useful to better understand the data and to possibly determine the physical meaning of the structure. This led to a new procedure called “Significant Cosmic Holes in Universe” (SCHU) for defining representations of homology group generators in a cosmological survey, such as the Sloan Digital Sky Survey galaxy survey.  Cosmological voids correspond to the second homology group generators, and we also define a new class of voids based on the first homology group generators, which we call filament loops.

Persistence diagrams can also be used in hypothesis tests in order to make statistical comparisons between complicated spatial structures such as LSS.  I will present some developments using a two-sample hypothesis testing framework to distinguish LSS under different cosmological assumptions (e.g., cold dark matter vs. warm dark matter).

11:05 AM         Confidence Band for Persistent Homology

Speaker. Jisu Kim, INRIA

Abstract. Topological Data Analysis generally refers to utilizing topological features from data. For this talk, I will focus on persistent homology, which quantifies the salient topological features of data. I will present how the confidence band can be computed for determining the significance of the topological features in the persistent homology, based on the bootstrap procedure. First, I will present how the confidence band can be computed for the persistent homology of KDEs (kernel density estimators) computed on a grid. In practice, however, calculating the persistent homology of KDEs on d-dimensional Euclidean spaces requires to approximate the ambient space to a grid, which could be computationally inefficient when the dimension of the ambient space is high or topological features are in different scales. Hence, I will consider the persistent homology of KDE filtrations on Rips complexes as an alternative. I will describe how to construct an asymptotic confidence set for the persistent homology based on the bootstrap procedure. Unlike existing procedures, this method does not heavily rely on grid-approximations, scales to higher dimensions, and is adaptive to heterogeneous topological features.

11:25 AM         Discussant: Chul Moon, Southern Methodist University

11:45 AM         Floor Discussion and Follow-ups

Everyone is welcomed to register for Joint Statistical Meeting (JSM) to join our virtual session!

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.