Seminars 2018 ‒ SMAT ‐ EPFL

Dr. Fadoua Balabdaoui

ETHZ
Thursday, October 25, 2018
Time 14:15 – Room CM09
Title: Least squares estimation of a completely monotone pmf: From Analysis to Statistics

Abstract

We consider the class of completely monotone probability mass functions (pmf) from a statistical perspective. An element in this class is known to be a mixture of geometric pmfs, a consequence of the celebrated Hausdorff Theorem. We show that the complete monotone least squares estimator exists, is strongly consistent and converges weakly to the truth at $\sqrt n$-rate. Furthermore, we fully describe its limit distribution as the unique solution of a well-posed minimization problem. Through a simulation study we assess the performance of the method under different scenarios. A bootstrap testing procedure can be constructed to check to the validity of the model. As expected, deciding between complete and $k$-monotonicity is hard for large integers $k$. To formalize the link between these classes, we give a condition under which a sequence of $k$-monotone pmfs converges to a completely monotone pmf as $k \to \infty$, providing thereby an explicit characterization of this limit.

This work has been jointly done with Gabriella de Fournas-Labrosse.
Prof. Holger Rootzén

Chalmers
Friday, November 2, 2018
Time 15:15 – Room MA10
Title: Human life is unlimited — but short

Abstract

Does the human lifespan have an impenetrable biological upper limit which ultimately will stop further increase in life lengths? Answers to this question are important for our understanding of the ageing process, and for the organization of society, and have led to intense controversies. Demographic data for humans have been interpreted as showing existence of a limit close to the age, 122.45 years, of the longest living documented human, Jeanne Calment, or even as indication of a decreasing limit, but also as evidence that a limit does not exist. This talk uses EVS, extreme value statistics, to study what data says about human mortality after age 110. We show that in north America, western Europe, and Japan the yearly probability of dying after age 110 is constant and about 50% per year. Hence there is no finite limit to the human lifespan. Still, given the present stage of biotechnology, it is unlikely that during the next 25 years anyone will live longer than 128 years in these countries. Data, remarkably, show little difference in mortality after age 110 between men and women, between earlier and later periods, between ages, or between persons with different lifestyles or genetic backgrounds. These results can help testing biological theories of aging and aid early confirmation of success of efforts to find a cure for ageing. This is continuing work, and we have just received new data from Italy, and hope to have results from the analysis of it ready by the time of the talk.

Joint work with Dmitrii Zholud
Dr. Thordis L. Thorarinsdottir

Norwegian Computing Center
Thursday, November 8, 2018
Time 14:15 – Room CM 0 12
Title: Spatial hierarchical modelling with a large number of potential covariates

Abstract

A common problem in spatial statistics consists of estimating the marginal distribution of a phenomena at any location within a region. The (usually two or three) parameters of the marginal distribution are then assumed to depend on a set of covariates and, potentially, a spatially structured random effect. In addition, it is often of interest to incorporate a model averaging component to assess model uncertainty in the effect of the proposed covariates. We discuss how inference for such models can be performed in a Bayesian setting in a general and an efficient manner without the need for user-specified tuning parameters in the Bayesian inference algorithm. This is demonstrated in two applications where the three-parameter generalized extreme value (GEV) distribution with latent Gaussian fields is used for spatial modelling of extreme hourly precipitation and for regional flood frequency analysis in Norway.
Ass. Prof. Gilles Stupfler

University of Nottingham
Friday, November 23, 2018
Time 14:15 – Room MA 30
Title: Asymmetric least squares techniques for extreme risk estimation

Abstract

Financial and actuarial risk assessment is typically based on the computation of a single quantile (or Value-at-Risk). One drawback of quantiles is that they only take into account the frequency of an extreme event, and in particular do not give an idea of what the typical magnitude of such an event would be. Another issue is that they do not induce a coherent risk measure, which is a serious concern in actuarial and financial applications. In this talk, I will explain how, starting from the formulation of a quantile as the solution of an optimisation problem, one may come up with two alternative families of risk measures, called expectiles and extremiles. I will give a broad overview of their properties, as well as of their estimation at extreme levels in heavy-tailed models, and explain why they constitute sensible alternatives for risk assessment using some real data applications.

This is based on joint work with Abdelaati Daouia, Irène Gijbels and Stéphane Girard.
Stefano Rizzelli

Università Bocconi
Friday, December 7, 2018
Time 15:15 – Room MA 10
Title: Bayesian inference for multivariate extremes

Abstract

Multivariate extreme value theory provides the probabilistic framework for modelling the extremal behavior of a set of random variables. Different characterizations of multivariate extreme events are available. We focus on max-stable distributions. In its general formulation, this is a multivariate semi-parametric class of distributions, which makes the Bayesian approach particularly appealing for simultaneous inference about the marginal parameters and the extremal dependence structure. The latter can be equivalently represented through Pickands dependence functions and angular probability measures. An elegant way of modelling both representations is via a nonparametric Bayesian approach based on Bernstein polynomials, as shown by Marcon, Padoan and Antoniano [Electron. J. Stat. 10 (2016) 3310-3337] in the bivariate case. We expand their approach to higher dimensional cases and extend prior specification to include the parameters of the univariate marginal distributions. We investigate the asymptotic properties of our procedure, e.g. the contraction of the full posterior distribution at the true marginal parameters and dependence functions. We conclude by discussing further potential extensions.
Statistics seminar organised by UNIL

Jonas Peters University of Copenhagen
Tuesday, February 20, 2018
Time 12:15 to 13:15 – Internef – 237
Title: Invariant Causal Prediction

Abstract

Why are we interested in the causal structure of a process? In classical prediction tasks as regression, for example, it seems that no causal knowledge is required. In many situations, however, we want to understand how a system reacts under interventions, e.g., in gene knock-out experiments. Here, causal models become important because they are usually considered invariant under those changes. A causal prediction uses only direct causes of the target variable as predictors; it remains valid even if we intervene on predictor variables or change the whole experimental setting. In this talk, we show how we can exploit this invariance principle to estimate causal structure from data. We apply the methodology to data sets from biology, epidemiology, and finance.
The talk does not require any knowledge about causal concepts.
Dr. Łukasz Kidziński

Stanford University
Friday, February 23, 2018
Time 15:15 – Room MA10
Title: Sparse longitudinal modeling using matrix factorization

Abstract

A common problem in clinical practice is to predict disease progression from sparse observations of individual patients. The classical approach to modeling this kind of data relies on a mixed-effect model where time is considered as both a fixed effect (a population trajectory) and a random effect (an individual trajectory). In our work, we map the problem to a matrix completion framework and solve it using matrix factorization techniques. The proposed approach does not require assumptions of the mixed-effect model and it can be naturally extended to multivariate measurements
Dr. Giacomo Zanella

Università Bocconi, Milano
Friday, March 16, 2018
Time 15:15 – Room MA10
Title: Optimization and complexity of the Gibbs Sampler for multilevel Gaussian models

Abstract

We study the convergence properties of the Gibbs Sampler in the context of Bayesian hierarchical linear models with nested and crossed-effects structures. We develop a novel methodology based on multi-grid decompositions to derive analytic expressions for the convergence rates of the algorithm. In the nested context, our work gives a rather complete understanding of the Gibbs Sampler behavior for models with arbitrary depth, leading to simple and easy-to-implement guidelines to optimize algorithmic implementations. In the context of crossed-effect models, where classical strategies to speed-up convergence are not applicable, we show that the convergence of commonly implemented Gibbs Sampler strategies deteriorates as the data-size increases. This results in super-linear computational complexity (potentially even quadratic) in the number of data-points. Leveraging the insight provided by the multi-grid analysis, we design a simple collapsed Gibbs Sampler whose complexity matches the one of nested scenarios. The implications for scalable Bayesian inferences on large multilevel models are discussed.
Joint work with Omiros Papaspiliopoulos and Gareth Roberts
Ass. Prof. Ben Shaby

Penn State University
Friday, March 23, 2018
Time 15:15 – Room MA10
Title: Max-Infinitely Divisible Models for Spatial Extremes Using Random Effects

Abstract

Rare events can have crippling effects on economies, infrastructure, and human health and wellbeing. Their outsized impacts make extreme events critical to understand, yet their defining characteristic, rareness, means that precious little information is available to study them. Extremes of environmental processes are inherently spatial in structure, as a given event necessarily occurs over a particular spatial extent at a particular collection of locations. Characterizing their probabilistic structure therefore requires moving well beyond the well-understood models that describe marginal extremal behavior at a single location. Rather, stochastic process models are needed to describe joint tail event across space. Distinguishing between the subtly different dependence characteristics implied by current families of stochastic process models for spatial extremes is difficult or impossible based on exploratory analysis of data that is by definition scarce. Furthermore, different choices of extremal dependence classes have large consequences in the analysis they produce. We present stochastic models for extreme events in space that are 1) flexible enough to transition across different classes of extremal dependence, and 2) permit inference through likelihood functions that can be computed for large datasets. It will accomplish these modeling goals by representing stochastic dependence relationships conditionally, which will induce desirable tail dependence properties and allow efficient inference through Markov chain Monte Carlo. We develop models for spatial extremes using max-infinitely divisible processes, a generalization of the limiting max-stable class of processes which has received a great deal of attention. This work extends previous family of max-stable models based on a conditional hierarchical representation to the more flexible max-id class, thus accommodating a wider variety of extremal dependence characteristics while retaining the structure that makes it computationally attractive.
Dr. Quentin Berthet

University of Cambridge
Friday, April 13, 2018
Time 15:15 – Room MA10
Title: Optimal Link Prediction with Matrix Logistic Regression

Abstract

We consider the problem of link prediction, based on partial observation of a large network and on covariates associated to its vertices. The generative model is formulated as matrix logistic regression. The performance of the model is analysed in a high-dimensional regime under structural assumption. The minimax rate for the Frobenius norm risk is established and a combinatorial estimator based on the penalised maximum likelihood approach is shown to achieve it. Furthermore, it is shown that this rate cannot be attained by any algorithm computable in polynomial time, under a computational complexity assumption, and we will present the tools needed to establish these fundamental limits, and other problems where they appear. Joint work with Nicolai Baldin
Ass. Prof. David Bolin

Chalmers University of Technology
Friday, April 27, 2018
Time 15:15 – Room MA10
Title: A Bayesian General Linear Modeling Approach to Cortical Surface fMRI Data Analysis

Abstract

Cortical surface fMRI (cs-fMRI) has recently grown in popularity versus traditional volumetric fMRI, as it allows for more meaningful spatial smoothing and is more compatible with the common assumptions of isotropy and stationarity in Bayesian spatial models. However, as no Bayesian spatial model has been proposed for cs-fMRI data, most analyses continue to employ the classical, voxel-wise general linear model (GLM). Here, we propose a Bayesian GLM for cs-fMRI, which employs a class of spatial processes based on stochastic partial differential equations to model latent activation fields. Bayesian inference is performed using integrated nested Laplacian approximations (INLA), which is a computationally efficient alternative to Markov Chain Monte Carlo. To identify regions of activation, we propose an excursions set method based on the joint posterior distribution of the latent fields, which eliminates the need for multiple comparisons correction. Finally, we address a gap in the existing literature by proposing a Bayesian approach for multi-subject analysis. The methods are validated and compared to the classical GLM through simulation studies and a motor task fMRI study from the Human Connectome Project. The proposed Bayesian approach results in smoother activation estimates, more accurate false positive control, and increased power to detect truly active regions.
Prof. Philippe Rigollet

MIT
Thursday, May 24, 2018
Time 15:15 – Room MA10
Title: Learning determinantal point processes

Abstract

Determinantal Point Processes (DPPs) are a family of probabilistic models that have a repulsive behavior, and lend themselves naturally to many tasks in machine learning (such as recommendation systems) where returning a diverse set of objects is important. While there are fast algorithms for sampling, marginalization and conditioning, much less is known about learning the parameters of a DPP. In this talk, I will present recent results related to this problem, specifically:
– Rates of convergence for the maximum likelihood estimator: by studying the local and global geometry of the expected log-likelihood function we are able to establish rates of convergence for the MLE and give a complete characterization of the cases where these are parametric. We also give a partial description of the critical points for the expected log-likelihood.
– Optimal rates of convergence for this problem: these are achievable by the method of moments and are governed by a combinatorial parameter, which we call the cycle sparsity.
– A fast combinatorial algorithm to implement the method of moments efficiently.

Co-authors: Victor-Emmanuel Brunel (MIT), Ankur Moitra (MIT), John Urschel (MIT)
Ass. Prof. Patrick Rubin-Delanchy

University of Bristol
Friday, June 1st, 2018
Time 15:15 – Room MA10
Title: The generalised random dot product graph: a statistical model underpinning spectral embedding

Abstract

Finding a statistical framework under which to perform inference about graph-valued data has proved to be surprisingly challenging, considering the wealth of prior work in the fields of (broader) Mathematics and Computer Science. In this talk, a probabilistic model is presented that allows more refined analysis of spectral embedding and clustering as statistical estimation procedures, and which has several other advantages including generality (e.g. the mixed membership and standard stochastic block models are special cases), scalability (e.g. by some arguments requiring computation of only the first few singular vectors of the adjacency matrix), and interpretability (e.g. mixtures of connectivity behaviours are represented as convex combinations in latent space). Corresponding to this canonical statistical interpretation of spectral embedding is an indefinite orthogonal group that describes the identifiability limitations on the latent positions defined by the model. This group, which is most famously relevant to the theory of special relativity, can consist of transformations that affect inter-point distances, with worrying implications for spectral clustering. All such issues are resolved by simple statistical insights on the effect of linear transformations on volumes and Gaussian mixture models, confirming a more generally recognised rule-of-thumb in data science: Gaussian clustering should be preferred over K-means. Methodology and ideas are illustrated with cyber-security applications.

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract

Abstract