Archive for time zones

model uncertainty and missing data: an objective BAyesian perspective

Posted in Books, Statistics, Travel, University life on September 16, 2025 by xi'an

My Spanish and objective Bayesian friends Gonzalo García-Donato, María Eugenia Castellanos, Stefano Cabras, Alicia Quirós, and Anabel Forte wrote a fairly exciting paper in BA that is open to discussion (for a few more days), to be discussed on 05 November (4:00 PM UTC | 11:00 AM EST | 5:00 PM CET).

The interplay between missing data and model uncertainty—two classic statistical problems—leads to primary questions that we formally address from an objective Bayesian perspective. For the general regression problem, we discuss the probabilistic justification of Rubin’s rules applied to the usual components of Bayesian variable selection, arguing that prior predictive marginals should be central to the pursued methodology. In the regression settings, we explore the conditions of prior distributions that make the missing data mechanism ignorable, provided that it is missing at random or completely at random. Moreover, when comparing multiple linear models, we provide a complete methodology for dealing with special cases, such as variable selection or uncertainty regarding model errors. In numerous simulation experiments, we demonstrate that our method outperforms or equals others, in consistently producing results close to those obtained using the full dataset. In general, the difference increases with the percentage of missing data and the correlation between the variables used for imputation.

The so-called Rubin’s identity is simply the representation of the posterior probability of a model γ given the observed data x⁰, p(γ|x⁰), as the integral of the posterior probability of that model given both observed and latent data, p(γ|x⁰, x¹), against the marginal of the latent x¹ given the observed x⁰. Since this marginal involves the probabilities p(γ|x⁰), this representation is not directly useful for a numerical implementation.
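For readers who prefer symbols, the identity described above can be written as follows. This is only a transcription of the verbal statement, with γ′ as a hypothetical summation index over models and the superscripts 0 and 1 labelling observed and latent data (not powers); the paper's own notation may differ.

```latex
\[
p(\gamma \mid x^{0})
  = \int p(\gamma \mid x^{0}, x^{1})\, p(x^{1} \mid x^{0})\, \mathrm{d}x^{1},
\qquad
p(x^{1} \mid x^{0})
  = \sum_{\gamma'} p(x^{1} \mid x^{0}, \gamma')\, p(\gamma' \mid x^{0}).
\]
```

The second equality makes the circularity visible: the marginal of the latent data is itself a mixture weighted by the very posterior probabilities one is trying to compute.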

In this paper, missingness relates to some entries of either the covariates or the response variate. Which is less common but more realistic, especially if some covariates do not contribute to the response. (The missingness mechanism does not matter if the data is missing at random, à la Rubin.) The computational solution (p9) is rather standard, simulating the missing variables given the observed variables. In my opinion, the elephant in the room is the super-delicate selection of a prior distribution on the missing covariates, as methinks this considerably impacts the actual value of the Bayes factor, hence the selection of the surviving model. (As a side remark, we are credited in Celeux et al. (2006) with having “extended DIC for missing data models or when missing data were present”, but our point was instead to stress the arbitrariness of the very definition of DIC in such contexts.)
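As a rough illustration of the simulation idea mentioned above (and not the authors' actual algorithm), here is a minimal Python sketch that averages completed-data posterior model probabilities over imputations. It assumes that draws of the missing entries from their distribution given the observed data are already available, and that for each completed dataset the log marginal likelihood of every model has been computed elsewhere. The function names and the uniform default prior over models are hypothetical.

```python
import math

def model_posteriors(log_marginals, prior=None):
    """Posterior model probabilities for ONE completed dataset.

    log_marginals[j] is the log marginal likelihood of model j
    (assumed to be computed elsewhere); prior defaults to uniform.
    """
    k = len(log_marginals)
    prior = prior or [1.0 / k] * k
    logs = [lm + math.log(p) for lm, p in zip(log_marginals, prior)]
    top = max(logs)  # subtract the max for numerical stability
    weights = [math.exp(v - top) for v in logs]
    total = sum(weights)
    return [w / total for w in weights]

def average_over_imputations(log_marginals_per_draw, prior=None):
    """Monte Carlo average of completed-data posterior probabilities.

    Each entry of log_marginals_per_draw corresponds to one imputed
    dataset, i.e. one simulated value of the missing entries.
    """
    draws = [model_posteriors(lm, prior) for lm in log_marginals_per_draw]
    k = len(draws[0])
    return [sum(d[j] for d in draws) / len(draws) for j in range(k)]

# Toy input: two models, two imputations (made-up numbers).
toy = [[0.0, 0.0], [math.log(3.0), 0.0]]
print(average_over_imputations(toy))
```

The averaging step is where the prior on the missing covariates enters, since it drives both the imputations and the completed-data marginal likelihoods.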

“The standard Bayesian method for addressing the absence of prior information uses improper distributions. In estimation problems (the model is fixed), the impropriety of priors does not imply any additional difficulty as long as the posterior is proper” (p9)

The authors point out the well-known difficulty with improper priors but still resort to improper priors on the parameters shared by all models—which I dispute as being adequate, despite the arguments put forward on p15, right Haar measure or not—, while sticking to proper priors on the model-dependent parameters. Which unsurprisingly become Zellner’s g-priors. Or rather g’-priors, although the discussion seems to resolve into the (model-free) factor g’ being equal to 1 as for the g-priors. Again a strong term in the derivation of the Bayes factor.

Blackwell-Rosenbluth awards 2022

Posted in pictures, Statistics, Travel, University life on November 23, 2022 by xi'an

Here are the winners of the j-ISBA Blackwell-Rosenbluth awards 2022, split between those based in the time zones UTC-12 to UTC-1 (aka the Americas):

and those based in the time zones UTC+0 to UTC+13 (aka the Americasᶜ):

Congrats!!! They will all present their webinar on 28 or 29 November at 1pm UTC (Coordinated Universal Time).

Microsoft cares!

Posted in Travel, University life on August 2, 2022 by xi'an

2021 Whova Meeting of the International Society for Bayesian Analysis

Posted in pictures, Statistics, Travel, University life on May 5, 2021 by xi'an

The website for the upcoming ISBA 2021 meeting is now operational and open to all! The program is ready, as well, with short courses starting on 23 June. And the main event on 28 June, with very long days, from 5:15am till 9:30pm in (US) Eastern Time (EDT, ie UTC-04:00, CEST-06:00, IST-9:30, CDT-11:00, JST-13:00, AEST-14:00). The number of registered participants is currently above 1700, which shows the positive side of having a free on-line event since everyone interested (with an Internet connection!) can participate. On the negative side, namely the limited human interactions and the challenge of staying focussed 24/5, a solution stands in creating local clusters where a group could attend the sessions together. Provided local health policies allow. I am still working on gathering at CIRM, Marseille, if the centre reopens on 27 June. And am happy to broadcast any initiative to this effect.
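To see what those very long days mean elsewhere, here is a small Python sketch converting the 5:15am and 9:30pm Eastern boundaries of 28 June 2021 into a few other zones. The offsets are hard-coded assumptions valid for late June 2021 (EDT at UTC-04:00, CEST at UTC+02:00, IST at UTC+05:30, JST at UTC+09:00, AEST at UTC+10:00) rather than looked up in a time zone database, and the names `ZONES` and `local_times` are made up for the illustration.

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC offsets (assumed, valid for late June 2021; no DST database is used).
ZONES = {
    "EDT": timezone(timedelta(hours=-4), "EDT"),
    "UTC": timezone.utc,
    "CEST": timezone(timedelta(hours=2), "CEST"),
    "IST": timezone(timedelta(hours=5, minutes=30), "IST"),
    "JST": timezone(timedelta(hours=9), "JST"),
    "AEST": timezone(timedelta(hours=10), "AEST"),
}

def local_times(moment):
    """Render one timezone-aware instant in every zone of ZONES."""
    return {
        name: moment.astimezone(tz).strftime("%Y-%m-%d %H:%M")
        for name, tz in ZONES.items()
    }

# Daily boundaries of the main event, as stated in Eastern Time.
start = datetime(2021, 6, 28, 5, 15, tzinfo=ZONES["EDT"])
end = datetime(2021, 6, 28, 21, 30, tzinfo=ZONES["EDT"])

for label, moment in (("start", start), ("end", end)):
    print(label, local_times(moment))
```

Under these assumptions, a participant in Tokyo or Sydney would see each day spill over into the following calendar date.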