Seminars 2014 ‒ SMAT ‐ EPFL

Prof. Terry Speed
Walter and Eliza Hall Institute of Medical Research and UC Berkeley
Friday, March 7, 2014
Time 15:00 sharp – Room CE2

Title: Removing unwanted variation: from principal components to random effects

Abstract

Ordinary least-squares is a venerable tool for the analysis of scientific data originating in the work of A-M. Legendre and C. F. Gauss around 1800. Gauss used the method extensively in astronomy and geodesy. Generalized least squares is more recent, originating with A. C. Aitken in 1934, though weighted least squares was widely used long before that. At around the same time (1933) H. Hotelling introduced principal components analysis to psychology. Its modern form is the singular value decomposition. In 1907, motivated by social science, G. U. Yule presented a new notation and derived some identities for linear regression and correlation. Random effects models date back to astronomical work in the mid-19th century, but it was through the work of C. R. Henderson and others in animal science in the 1950s that their connexion with generalized least squares was firmly made. These are the diverse origins of our story, which concerns the removal of unwanted variation in high-dimensional genomic and other “omic” data using negative controls. We start with a linear model that Gauss would recognize, with ordinary least squares in mind, but we add unobserved terms to deal with unwanted variation. A singular value decomposition, one of Yule’s identities, and negative control measurements (here genes) permit the identification of our model. In a surprising twist, our initial solution turns out to be equivalent to a form of generalized least squares. This is the starting point for much of our recent work. In this talk I will try to explain how a rather eclectic mix of familiar statistical ideas can combine with equally familiar notions from biology (negative and positive controls) to give a useful new set of tools for omic data analysis. Other statisticians have come close to the same endpoint from a different perspectives, including Bayesian, sparse linear and random effects models. .
Prof. Jonathan Tawn
Lancaster University
Thursday, March 20, 2014
Time 15:15 – Room MEB331

Title: Extreme Value Theory: An Impact Case Study for International Shipping Standards

Abstract

Research on extreme value methods proved critical in determining the conclusions of the UK High Court’s investigation of the sinking of the M.V. Derbyshire (the UK’s largest ship lost at sea) and identified that design standards for hatch covers of ocean-going carriers needed to be increased by 35%. This new level was then set as a new worldwide mandatory standard. This talk describes my involvement in this work starting with the estimation of the probability of the M.V. Derbyshire having sunk from a structural failure, resulting from large wave impacts on the ship, for each of a range of possible sea-state and vessel conditions; through experiences of presenting evidence to the High Court; and subsequent work aimed at setting new design standards for ocean-going carriers.
Dr. Marco Oesting
INRA Paris
Thursday, April 3, 2014
15:15 – MA 10

Title: Conditional Modeling of Extreme Wind Gusts by Bivariate Brown-Resnick Processes

Abstract

In order to incorporate the dependence between the spatial random fields of observed and forecasted maximal wind gusts, we propose to model them jointly by a bivariate Brown-Resnick process. As there is a one-to-one correspondence between bivariate Brown-Resnick processes and pseudo cross-variograms, stationary Brown-Resnick processes can be characterized by properties of the underlying pseudo cross-variogram. We particularly focus on the investigation of their asymptotic behavior and introduce a flexible parametric model both being interesting in classical geostatistics on their own. The model is applied to real observation and forecast data for 110 stations in Northern Germany. The resulting post-processed forecasts are verified. This is joint work with Martin Schlather (Universität Mannheim) and Petra Friederichs (Universität Bonn).
Dr. Heather Battey
University of Bristol
Friday, April 4, 2014
15:15 – MA 12

Title: Smooth projected density estimation

Abstract

In this talk, I will introduce a new family of procedures, smooth projection estimators, for multidimensional density estimation. These estimators are defined by a projection of a nonparametric pilot estimate onto a finite mixture class. The projection step yields a succinct parametric representation, whilst the nonparametric step allows one to exploit structure (e.g. conditional independencies) that may be detected by other means. Although structural constraints are not preserved through the projection, exploitation of the structural information is shown to be worthwhile. I will discuss the sense in which the estimator is consistent, and its ability to achieve a faster rate of convergence than the pilot estimator upon which it is based.
Dr. Susan Wei
University of North Carolina, Chapel Hill
Thursday, April 10, 2014
15:15 – MA 10

Title: Latent Supervised Learning

Abstract

Machine learning is a branch of artificial intelligence concerning the construction of systems that can learn from data. Algorithms in machine learning can be placed along a spectrum according to the type of input available during training. The two main machine learning algorithms, unsupervised and supervised learning, occupy either end of this spectrum. In this talk I will overview some of my recent research on machine learning tasks that fall somewhere in the middle of this spectrum. I will primarily focus on a new machine learning task called latent supervised learning, where the goal is to learn a binary classifier from continuous training labels that serve as surrogates for the unobserved class labels. A specific model is investigated where the surrogate variable arises from a two-component Gaussian mixture with unknown means and variances, and the component membership is determined by a hyperplane in the covariate space. A data-driven sieve maximum likelihood estimator for the hyperplane is proposed, which in turn can be used to estimate the parameters of the Gaussian mixture. Extensions of the framework to survival data and applications to estimating treatment effect heterogeneity will also be discussed.
Prof. Jane L. Hutton
University of Warwick
Friday, May 16, 2014
15:15 – MA 12

Title: Chain Event Graphs for Informative Missingness

Abstract

Chain event graphs (CEGs) extend graphical models to address situations in which, after one variable takes a particular value, possible values of future variables differ from those following alternative values (Thwaites et al 2010). These graphs are a useful framework for modelling discrete processes which exhibit strong asymmetric dependence structures, and are derived from probability trees by merging the vertices in the trees together whose associated conditional probabilities are the same. We exploit this framework to develop new classes of models where missingness is influential and data are unlikely to be missing at random (Barclay et al 2014). Context-specific symmetries are captured by the CEG. As models can be scored efficiently and in closed form, standard Bayesian selection methods can be used to search over a range of models. The selected maximum a posteriori model can be easily read back to the client in a graphically transparent way. The efficacy of our methods are illustrated using survival of people with cerebral palsy, and a longitudinal study from birth to age 25 of children in New Zealand, analysing their hospital admissions aged 18-25 years with respect to family functioning, education, and substance abuse aged 16-18 years. P Thwaites, JQ Smith, and E Riccomagno (2010) “Causal Analysis with Chain Event Graphs” Artificial Intelligence, 174, 889-909. LM Barclay, JL Hutton and JQ Smith, (2014) “Chain Event Graphs for Informed Missingness”, Bayesian Analysis, Vol. 9, 53-76.
Dr. Axel Gandy
Imperial College London
Friday, May 23, 2014
15:15 – MA 12

Title: Implementing (Multiple) Monte Carlo Tests

Abstract

Consider Monte Carlo tests, e.g. bootstrap tests or permutation tests. Naive implementations can lead to decisions that depend mainly on the simulation error and not on the observed data. This talk will present algorithms that solve this problem: for individual Monte Carlo tests as well as for multiple Monte Carlo tests with multiplicity correction such as the Benjamini & Hochberg False Discovery Rate (FDR) procedure. The key property of the presented algorithms is that, with arbitrarily high probability, the same decisions as the original procedure with the ideal p-values is reached.
Prof. Elena Kulinskaya
University of East Anglia, UK
Friday, May 23, 2014
16:15 – MA 12

Title: Random means biased?

Abstract

Random effects model (REM) in meta-analysis incorporates heterogeneity of effect measures across studies. We were interested in combining odds ratios from K 2×2 contingency tables. The standard (additive) REM is the random intercept model in 1-way ANOVA for log-odds ratios. Alternatively, heterogeneity can be induced via intra-cluster correlation, say assuming beta-binomial distributions. This (multiplicative) model is convenient for defining REM in conjunction with the Mantel-Haenzsel approach. Our method of estimating intra-class correlation (assumed constant across studies) is based on profiling the modified Breslow-Day test. Coverage of resulting confidence intervals is compared to standard methods through simulation. Unexpectedly, we found that the standard methods are very biased in the multiplicative REM, and our new method is very biased in the standard REM. The explanation lies in the general (but new to us) fact that any function of a random variable is biased under REM. This is a general concern in Generalised Linear Mixed Models. The question on what exactly is random under REM is a difficult question for a frequentist… (joint work with Ilyas Bakbergenuly)

Prof. Victor Panaretos [Public Inaugural Lecture]
Ecole Polytechnique Fédérale de Lausanne
Monday, May 26, 2014
17:15 – Room CM2

Sums of Squares from Pythagoras to Hilbert

Prof. Claudia Klüppelberg
Technische Universität München
Friday, 19 September, 2014
15:15 – MA11

Title: Semiparametric estimation for max-stable space-time processes

Abstract

Max-stable space-time processes have been developed to study extremal dependence in space-time data. We propose a semiparametric estimation procedure based on a closed form expression of the extremogram to estimate the parameters in a max-stable space-time process. We show asymptotic properties of the resulting parameter estimates and propose bootstrap procedures to obtain asymptotically correct confidence intervals. A simulation study shows that the proposed procedure works well for moderate sample sizes. Finally, we apply this estimation procedure to fitting a max-stable model to radar rainfall measurements in a region in Florida. This is joint work with Richard Davis and Christina Steinkohl.
Dr. Raphael Huser
KAUST
Thursday, 25 September, 2014
15:15 – MA10

Title: Modelling of non-stationarity in spatial extremes

Abstract

Max-stable processes are natural models for spatial extremes, because they provide suitable asymptotic approximations to the distribution of maxima of random fields. In the recent past, several parametric families of stationary max-stable models have been developed, and fitted to various types of data. However, a recurrent problem is the modelling of non-stationarity. While it is fairly straightforward to build non-stationary models for marginal distributions, it is much less obvious to model non-stationarity in the dependence structure of extremal data, and there have been very few attempts to address this important issue so far. In my talk, I will discuss non-stationarity modelling in max-stable processes and show how inference can be performed using pairwise likelihoods. If time allows, I will also illustrate the methodology with an application to environmental data.
Prof. Richard Olshen
Stanford University
Friday, 10 October, 2014
15:15 – CE 105

Title: Successive normalization/standardization of rectangular arrays

Abstract

When each subject in a study provides a vector of numbers/features for analysis, and one wants to standardize, then for each coordinate of the resulting rectangular array one may subtract the mean by subject and divide by the standard deviation by subject. Each subject’s data then has mean 0 and standard deviation 1. Subsequently, one may so standardize by row, then by column, and so on. Data from expression arrays and protein arrays often come as such rectangular arrays, where typically column denotes “subject” and the other some measure of “gene.” When analyzing these data one may ask that subjects and genes “be on the same footing.” Thus, there may be a need to standardize successively rows and columns of the matrix. I investigate the convergence, including rates of convergence, of this successive approach to standardization, which colleague Bala Rajaratnam and I learned from Bradley Efron. Limit matrices exist on a Borel set of full measure; these limits have row and column means 0, row and column standard deviations 1. We have studied implementation on simulated data and data that arose in cardiology. The procedure can be shown not to work with simultaneous standardization, first subtracting off means for rows and columns, and then division of resulting numbers by the product of standard deviations. Results make contact with previous work on large deviations of Lipschitz functions of Gaussian vectors, with alternating conditional expectations, and with von Neumann’s algorithm for the distance between two closed, convex subsets of a Hilbert space. New insights regarding inference are enabled.

Efforts have been joint not only with Rajaratnam, but also with many others (who will be mentioned during my presentation).
Joint seminar of numerical analysis / statistics
Dr. David Ginsbourger
IMSV, Bern Universität
Friday, 17 October, 2014
15:15 – MA 31

Title: Gaussian random field models for the adaptive design of costly experiments

Abstract

Gaussian random field models have become commonplace in the design and analysis of costly experiments. Thanks to convenient properties of associated conditional distributions, Gaussian field models not only allow approximating deterministic functions based on scarce evaluation results, but can also be used as a basis for evaluation strategies dedicated to optimization, inversion, uncertainty quantification, probability of failure estimation, and more. After an introduction to Gaussian random field modelling and some of its popular applications in adaptive design of deterministic experiments, we will focus on two recent contributions. First, results on covariance-driven pathwise invariances of random fields will be presented. Simulation and prediction examples will illustrate how Gaussian field models can incorporate a number of structural priors such as group invariances, harmonicity, or sparsity. Second, results on infill sampling criteria for sequential uncertainty reduction will be discussed, with application to an excursion set estimation problem from safety engineering.

Dr. Djalel Eddine Meskaldji
EPFL
Friday, 21 November, 2014
15:15 – MA11

Title: The control of the scaled false discovery rate, a flexible and comprehensive error control and a powerful theoretical tool in multiple testing

Abstract

A large variety of error control rates to limit the declaration of false effects as being real has been proposed in the field of multiple hypotheses testing or multiple comparisons. Given the large number of papers written over the last ten years on error control in high dimensional testing, it would be worthwhile to consider a single comprehensive technique that allows a user flexibility in error control when dealing with big-data. We describe a new and comprehensive family of error rates that contains and generalizes most existing proposals. It offers the scientist a broad choice on how to properly control for discovering false findings. We also propose a corresponding family of control procedures that guarantees the control of the new error rates under different assumptions on the p-values. We show the interest of introducing this comprehensive error rate to obtain new interesting theoretical results on assumption weakening, relation between different error rates and on asymptotic control. We also discuss some particular choices of error rates that bridge the gap between two well known control error metrics: FWER and FDR. The comprehensive family and the corresponding control theorems open new perspectives in the field of multiple testing.
Prof. Alastair Young
Imperial College London
Thursday, 27 November, 2014
16:15 – MA10

Title: The formal relationship between analytic and simulation approaches to parametric inference

Abstract

Two routes most commonly proposed for accurate inference on a scalar interest parameter in the presence of a (possibly high-dimensional) nuisance parameter are parametric simulation (`bootstrap’) methods, and analytic procedures based on normal approximation to adjusted forms of the signed root likelihood ratio statistic. Both methods yield, under some null hypothesis of interest, p-values which are uniformly distributed to error of third-order in the available sample size. But, given a specific inference problem, what is the formal relationship between p-values calculated by the two approaches? We elucidate the extent to which the two methodologies actually give the same inference.