
2025 Joint Statistical Meetings
2025-08-04



and more!


Suppose \mathbf{Z}\sim \text{CN}_p\left(\mathbf{0}, \tau^2\boldsymbol \Omega, \tau^2\boldsymbol \Lambda\right) and the scale parameter \tau has the density h(\tau).
The scale mixture of complex Gaussians of \mathbf{Z} has its marginal density f(\hbox{\bf z}) = \int_{0}^{\infty} CN_p\left(\mathbf{0}, \tau^2\boldsymbol \Omega, \tau^2\boldsymbol \Lambda\right) h(\tau) \, d\tau.
Complex multivariate normal-gamma \mathbf{Z}\mid \tau^2 \sim \text{CN}_p\left(\mathbf{0}, \tau^2\boldsymbol \Omega, \tau^2\boldsymbol \Lambda\right) and \tau^2 \sim \text{Ga}(\alpha, \beta)
Complex multivariate Laplace with \tau^2 \sim \text{Ga}(1, 1) = \text{Exp}(1)
Complex group Lasso with \tau^2 \sim \text{Ga}\left(\frac{1+2p}{2}, \frac{\lambda^2}{4}\right)
Group Lasso (Xu and Ghosh (2015), Bai and Ghosh (2021)) with \tau^2 \sim \text{Ga}\left(\frac{1+2p}{2}, \frac{\lambda^2}{4}\right), \boldsymbol \Omega= \mathbf{I} and \boldsymbol \Lambda= \mathbf{0}
Complex multivariate generalized double Pareto (GDP) \mathbf{Z}\mid \tau^2 \sim \text{CN}_p\left(\mathbf{0}, \tau^2\boldsymbol \Omega, \tau^2\boldsymbol \Lambda\right) and \tau^2 \sim \text{Ga}\left(\frac{1}{2}+p, \frac{\lambda^2}{4}\right) and \lambda \sim \text{Ga}\left(\alpha, \eta\right)
The three have heavier tails.
Laplace has the most pronounced peak and decays at the fastest rate.
GDP has the fattest tails.
\begin{align*} \mathbf{y}&= \mathbf{X}\boldsymbol \beta+ \boldsymbol{\epsilon}, ~~ \boldsymbol{\epsilon}\sim \text{CN}(\mathbf{0}, 2\sigma^2\mathbf{I}_n, 2\sigma^2\rho\mathbf{I}_n),\\ \boldsymbol \beta&\sim \text{CN}(0, 2\sigma^2D_{\tau}, 0), ~~ \tau_j^2 \sim \text{Ga}\left(\frac{1 + 2}{2}, \frac{\lambda^2}{2}\right), ~~ \lambda^2 \sim \text{Ga} (r, \delta), ~~ \sigma^2 \sim \text{IG}(a, b), \end{align*} where D_{\tau} = \text{diag}(\tau_1^2, \dots, \tau_p^2).
M-Cplx: \boldsymbol{\epsilon}\sim CN(\mathbf{0}, 2\sigma^2\mathbf{I}_n, 2\sigma^2\rho\mathbf{I}_n) and \beta_j \sim CN(0, 2\sigma^2\tau_j^2, 0)
M-Re-Im: \boldsymbol{\epsilon}_{a} \sim N(\mathbf{0}, \sigma^2\mathbf{I}_n), \beta_{a, j} \sim N(0, \sigma^2\tau_j^2), a = re, im.
With p coefficients, the first three are non-zero and the rest are zero: \boldsymbol \beta_{a} = (3, 1.5, 2, 0, \dots, 0), a = re, im.
p = 10, 50, 200
n = 40, 200, 1000
Simulate 100 data replicates.
Check whether credible interval for \boldsymbol \beta includes zero or not, then use F-score to evaluate performance.
M-Cplx is more consistent and robust than M-Re-Im across different n and p.
Less variation as n increases
Cauchy

Bayesian Lasso

GDP

Use mean squared error (MSE) to measure the inference performance on coefficients.
M-Cplx performs better M-Re-Im in terms of MSE.
For non-zero \betas, M-Re-Im distributions are more right-skewed with larger variation.
The estimation gap between M-Cplx and M-Re-Im shrinks as n get large.
Cauchy

Bayesian Lasso

Use mean squared prediction error (MSPE) to assess posterior predictive accuracy for 100 test data sets.
The median MSPE is obtained from the 100 data sets, and quantify uncertainty by 1000 bootstrapped samples
M-Cplx has better out-of-sample prediction than M-Re-Im in terms of MSPE.
MSPE gets large with as p increases.
Cauchy

Bayesian Lasso

Activation is viewed as variable selection and done by inclusion of zero of credible intervals.
The strength of M-Cplx is measured by \sqrt{\beta_{re}^2 + \beta_{im}^2} \in (0, \infty) and that of M-Mag is the \beta from the real-valued model.
CV-GDP

CV-Bayesian Lasso

MO-GDP

MO-Bayesian Lasso

CV-GDP

CV-Bayesian Lasso

MO-GDP

MO-Bayesian Lasso

Either complex or real-valued, GDP shrinks coefficients more when their intensity is small.
Strength by MO models is weaker.
The NON-spatial NON-temporal CV models could perform as good as sophisticated spatiotemporal real-valued MO models (Yu et al., 2018, 2023).
We are working on selecting the best activation via information criteria such as WAIC and DIC.
One credible level cannot serve for all data and prior types.
Using Gaussian likelihood for M-Mag in WAIC/DIC is distorted since magnitude is positive and closer to Rician distribution.
Extend the scale mixture of Gaussians from the real-valued domain into the complex-valued domain, deriving the general form of scale mixtures.
Demonstrate how the complex-valued scale mixtures can be used as a shrinkage prior in Bayesian regression.
Contribute to CV-fMRI activation studies.
Developing R package cplxrv (https://github.com/chenghanyustats/cplxrv) for simulating complex-valued random variables, and fitting complex Bayesian shrinkage regression.
Future work include complex-valued horseshoes and global-local shrinkage priors, and spatiotemporal modeling.