Bayesian Modeling of Complex-valued fMRI 🧠

Spatiotemporal modeling via kernel convolution

Dr. Cheng-Han Yu
Mathematical and Statistical Sciences, Marquette University

Statistics group, KAUST, March 21, 2022

(Task-based) Functional Magnetic Resonance Imaging?

fMRI is a noninvasive neuroimaging method that measures the blood-oxygen level-dependent (BOLD) signals in the brain.
- Large-dimensional
- Low signal-to-noise ratios (SNRs)
- Sophisticated spatio-temporal dependence of voxels.

Source: Martha Skup (2010)

Why We Need Complex-valued Models

Raw fMRI data are complex-valued (CV) after Fourier transform and inverse Fourier transform image reconstruction.
Most fMRI studies use magnitude-only (MO) data and phase information is discarded.

Source: Lindquist (2008) and Adali et al. (2011)

Complex-valued fMRI (CV-fMRI) Data

Why Bayesian Models of CV-fMRI

Sophisticated Bayesian spatiotemporal models have been developed for MO-fMRI (Zhang, Guindani, et al., 2016).

... the most common software packages for fMRI analysis can result in false-positive rates of up to 70% --- Eklund et al. (PNAS 2016)

Can we use the full information in CV-fMRI data while keeping the advantages of the Bayesian approach?

Start with the most common objective: detecting brain activations in task-based fMRI.

Source: https://blog.applysci.com

Goal: Computationally Efficient Models for CV-fMRI

We propose computationally efficient models and algorithms that
- jointly analyze real and imaginary parts of CV-fMRI data
- better detect activation at the voxel level
- infer brain connectivity (in preparation)

Proposed models:
- Complex-valued Expectation-Maximization Variable Selection with Autoregressive Processes (CV-EMVS-AR)
- (Multi-subject) Bayesian Spatiotemporal Model via Kernel Convolution and Autoregressive Processes (CV-KC-AR)

Background: Rowe-Logan Constant Phase Model

From Rowe and Logan (2004), for time $t = 1 : T$ , voxel $v = 1 : V$ and $p$ tasks, $\begin{aligned} y_{t}^{v} & = ρ_{t}^{v} \cos (ϕ^{v}) + i ρ_{t}^{v} \sin (ϕ^{v}) + η_{t}^{v}, \\ ρ_{t}^{v} & = β_{0}^{v} + β_{1}^{v} x_{1, t} + \dots + β_{p}^{v} x_{p, t} \end{aligned}$

$\begin{bmatrix} y_{1}^{v} \\ ⋮ \\ y_{T}^{v} \end{bmatrix} = \begin{bmatrix} 1 & x_{1, 1} & \dots & x_{p, 1} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & x_{1, T} & \dots & x_{p, T} \end{bmatrix} \begin{bmatrix} β_{0}^{v} \cos (ϕ^{v}) + i β_{0}^{v} \sin (ϕ^{v}) \\ ⋮ \\ \underbrace{β_{p}^{v} \cos (ϕ^{v})}_{γ_{Re}^{v}} + i \underbrace{β_{p}^{v} \sin (ϕ^{v})}_{γ_{Im}^{v}} \end{bmatrix} + \begin{bmatrix} η_{1}^{v} \\ ⋮ \\ η_{T}^{v} \end{bmatrix}$

$y^{v} = X γ_{Re}^{v} + i X γ_{Im}^{v} + η^{v}$
$y^{v} = X γ^{v} + η^{v}$, $η^{v} \sim CN_{T} (0, Γ_{v}, C_{v})$, with $Γ_{v} \in \mathbb{C}^{T \times T}$ and $C_{v} \in \mathbb{C}^{T \times T}$.
$η^{v}$ is circular normal if $C_{v} = 0$.
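
The constant-phase model above can be illustrated with a small simulation. This is a minimal sketch, not the authors' code: the function name `simulate_constant_phase`, the toy on/off regressor, and the assumption of iid circular noise with equal variance in the real and imaginary parts are all illustrative choices.

```python
import numpy as np

def simulate_constant_phase(X, beta, phi, sigma, rng):
    """Simulate one voxel's complex time series under a constant-phase model.

    X     : (T, p+1) real design matrix whose first column is all ones
    beta  : (p+1,) real magnitude coefficients
    phi   : constant phase in radians
    sigma : std. dev. of the real and of the imaginary noise part
            (iid circular noise is an assumption of this sketch)
    """
    rho = X @ beta  # magnitude rho_t = beta_0 + beta_1 x_{1,t} + ...
    noise = sigma * (rng.standard_normal(len(rho))
                     + 1j * rng.standard_normal(len(rho)))
    # rho*cos(phi) + i*rho*sin(phi) + eta, written with exp(i*phi)
    return rho * np.exp(1j * phi) + noise

rng = np.random.default_rng(0)
T = 8
x = np.tile([0.0, 1.0], T // 2)          # toy on/off task regressor
X = np.column_stack([np.ones(T), x])
beta = np.array([10.0, 2.0])
phi = np.pi / 4

# With sigma = 0 the data equal X @ gamma, where gamma = beta * exp(i*phi)
y = simulate_constant_phase(X, beta, phi, sigma=0.0, rng=rng)
gamma = beta * np.exp(1j * phi)
```

In the noise-free case every observation has phase `phi`, which is what "constant phase" refers to; the task only changes the magnitude.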

Background: Real-valued Representation

$y^{v} = X γ^{v} + η^{v}$, $η^{v} \sim CN_{T} (0, Γ_{v}, C_{v})$

or equivalently, $y_{r}^{v} = X^{r} γ_{r}^{v} + η_{r}^{v}$, where $η_{r}^{v} \sim N_{2 T} (0, Σ_{v})$
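
One way to see the equivalence is to build the real-valued quantities explicitly. The sketch below assumes a particular stacking (real parts first, then imaginary parts) and a real design matrix; the helper name `to_real_representation` is hypothetical and the slides do not specify the ordering.

```python
import numpy as np

def to_real_representation(y, X):
    """Map a complex regression y = X gamma to a real one of length 2T.

    Assumes X is real and stacks real parts over imaginary parts.
    """
    y_r = np.concatenate([y.real, y.imag])        # length 2T
    zeros = np.zeros_like(X)
    X_r = np.block([[X, zeros], [zeros, X]])      # (2T, 2p), block-diagonal
    return y_r, X_r

rng = np.random.default_rng(1)
T, p = 20, 3
X = rng.standard_normal((T, p))
gamma = rng.standard_normal(p) + 1j * rng.standard_normal(p)
y = X @ gamma                                     # noise-free, for checking only

y_r, X_r = to_real_representation(y, X)
gamma_r = np.concatenate([gamma.real, gamma.imag])

# Ordinary least squares on the stacked system recovers (Re gamma, Im gamma)
gamma_r_hat, *_ = np.linalg.lstsq(X_r, y_r, rcond=None)
```

The noise covariance $Σ_{v}$ of the stacked vector is determined by $Γ_{v}$ and $C_{v}$; it is left out here to keep the example short.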

Source: Lindquist (2008)

Brain Activation as Variable Selection

$y^{v} = X γ^{v} + η^{v}$
$γ_{j}^{v} \neq 0$ iff voxel $v$ at task $j$ is activated (Xia, Liang, and Wang (2009); Zhang, Guindani, and Vannucci (2015)).
Complex normal spike-and-slab prior: $γ_{j}^{v} \sim (1 - ψ_{j}^{v}) \underbrace{CN (0, ω_{0}, λ_{0})}_{\text{spike}} + ψ_{j}^{v} \underbrace{CN (0, ω_{1}, λ_{1})}_{\text{slab}}$, with $ω_{0} < ω_{1} \in \mathbb{C}$, $λ_{0} < λ_{1} \in \mathbb{C}$.

Non-active voxel: $ψ_{j}^{v} = 0 \Rightarrow γ_{j}^{v} = 0$
Active voxel: $ψ_{j}^{v} = 1 \Rightarrow γ_{j}^{v} \neq 0$
Activation is inferred by borrowing information across voxels through a Bernoulli prior on $ψ_{j}^{v}$ with a common probability of activation for all voxels: $ψ_{j}^{v} \sim \text{Bernoulli} (θ_{j}^{v} = θ_{j})$, i.e., $\Pr (ψ_{j}^{v} = 1 ∣ θ_{j}) = θ_{j}$.

CV-EMVS (Yu, Prado, Ombao, and Rowe, 2018)

$\begin{aligned} y^{v} & = X γ^{v} + η^{v}, \quad η^{v} \sim CN_{T} (0, 2 σ_{v}^{2} I, 0), \quad v = 1 : V, \\ γ_{j}^{v} ∣ ψ_{j}^{v} & \overset{\text{indep}}{\sim} (1 - ψ_{j}^{v}) CN_{1} (0, σ_{v}^{2} ω_{0}, σ_{v}^{2} λ_{0}) + ψ_{j}^{v} CN_{1} (0, σ_{v}^{2} ω_{1}, σ_{v}^{2} λ_{1}), \\ ω_{0} & \ll ω_{1}, \quad λ_{0} \ll λ_{1}, \quad j = 1 : p, \\ ψ_{j}^{v} ∣ θ_{j} & \overset{\text{iid}}{\sim} \text{Bernoulli} (θ_{j}), \quad θ_{j} \overset{\text{iid}}{\sim} \text{Beta} (a_{θ}, b_{θ}), \quad σ_{v}^{2} \sim IG (a_{σ}, b_{σ}) \end{aligned}$

We determine jointly whether the real and imaginary parts of $γ_{j}^{v}$ are zero: $γ_{j}^{v} \neq 0$ if $\Pr (ψ_{j}^{v} = 1 ∣ γ^{*}, θ^{*}, σ^{*}, y) > δ$.
- E-step: derive
  $\Pr (ψ_{j}^{v} = 1 ∣ γ^{(l)}, θ^{(l)}, σ^{(l)}, y)$
- M-step: obtain the maximum a posteriori estimates
  $(γ^{(l + 1)}, θ^{(l + 1)}, σ^{(l + 1)}) = \arg\max_{γ, θ, σ} E_{ψ | \cdot} [\log π (γ, ψ, θ, σ ∣ y) ∣ γ^{(l)}, θ^{(l)}, σ^{(l)}, y]$
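
For intuition, the E-step probability can be written out with Bayes' rule in the circular case ($λ_{0} = λ_{1} = 0$). The sketch below is an illustration under that assumption only; the function name `inclusion_probability`, the hyperparameter values, and the threshold $δ = 0.5$ are hypothetical, not taken from the paper.

```python
import numpy as np

def inclusion_probability(gamma, theta, sigma2, omega0, omega1):
    """Pr(psi = 1 | gamma, theta, sigma2) under a circular complex normal
    spike-and-slab prior: spike CN(0, sigma2*omega0, 0), slab CN(0, sigma2*omega1, 0).
    """
    def log_cn(g, var):
        # log density of a circular CN(0, var): -log(pi*var) - |g|^2 / var
        return -np.log(np.pi * var) - np.abs(g) ** 2 / var

    log_slab = np.log(theta) + log_cn(gamma, sigma2 * omega1)
    log_spike = np.log1p(-theta) + log_cn(gamma, sigma2 * omega0)
    # slab / (slab + spike) = 1 / (1 + exp(log_spike - log_slab)), computed stably
    return np.exp(-np.logaddexp(0.0, log_spike - log_slab))

# One coefficient at zero and one far from zero
gamma = np.array([0.0, 2.0 + 2.0j])
prob = inclusion_probability(gamma, theta=0.5, sigma2=1.0, omega0=0.01, omega1=1.0)
active = prob > 0.5   # delta = 0.5 is an illustrative threshold
```

Because the probability depends on $|γ_{j}^{v}|$, the real and imaginary parts are judged together rather than separately.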

Under a circular normal prior on $γ_{j}^{v}$, i.e., $λ_{0} = λ_{1} = 0$,
$\begin{aligned} {\hat{γ}}_{R e}^{v} & = (X^{'} X + 2 D_{v})^{- 1} (X^{'} y^{v})_{R e}, \\ {\hat{γ}}_{I m}^{v} & = (X^{'} X + 2 D_{v})^{- 1} (X^{'} y^{v})_{I m} . \end{aligned}$

Activation and Strength Maps: Low SNR

The CV model outperforms the MO models in both activation detection and estimated activation strength.
CV: CV-EMVS
MO: MO-EMVS
ALA: Adaptive Lasso

Human CV-fMRI (Karaman, Bruce, and Rowe, 2015)

Unilateral finger tapping data of dimension 96 x 96 x 510.
KBR-CV: false positives outside the brain area.
KBR-MO: low detection power.
DeTeCT-ING: Nonlinear model using the MR signal equation.

CV-EMVS with AR Noise

$\begin{array}{rcl} y_{t}^{v} & = & γ_{1}^{v} + γ_{2}^{v} t / T + γ_{3}^{v} x_{t} + η_{t}^{v}, \\ η_{t}^{v} & = & φ_{v} η_{t - 1}^{v} + ζ_{t}^{v}, \quad ζ_{t}^{v} \overset{\text{iid}}{\sim} CN_{1} (0, 2 σ_{v}^{2}, 0), \\ φ_{v} & \sim & \text{Uniform} (- 1, 1) . \end{array}$
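
The noise process can be simulated directly from its definition. This is a rough sketch: the function name `simulate_cv_ar1` is hypothetical, and starting the series from the stationary distribution is an assumption made here for convenience.

```python
import numpy as np

def simulate_cv_ar1(T, phi, sigma2, rng):
    """Simulate eta_t = phi * eta_{t-1} + zeta_t with zeta_t ~ CN(0, 2*sigma2, 0)."""
    # CN(0, 2*sigma2, 0): independent real and imaginary parts, each N(0, sigma2)
    zeta = np.sqrt(sigma2) * (rng.standard_normal(T)
                              + 1j * rng.standard_normal(T))
    eta = np.empty(T, dtype=complex)
    # Stationary start (an assumption of this sketch; requires |phi| < 1)
    eta[0] = zeta[0] / np.sqrt(1.0 - phi ** 2)
    for t in range(1, T):
        eta[t] = phi * eta[t - 1] + zeta[t]
    return eta

rng = np.random.default_rng(0)
eta = simulate_cv_ar1(T=20000, phi=0.5, sigma2=1.0, rng=rng)

# Empirical lag-1 autocorrelation of the real part, which should be near phi
lag1 = np.corrcoef(eta.real[:-1], eta.real[1:])[0, 1]
```

The same real coefficient $φ_{v}$ drives both the real and imaginary parts, so each part is itself a real AR(1) series.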

General Bayesian spatio-temporal model

CV-EMVS-AR greatly improves detection but does not explicitly model the spatial dependence across voxels.
The execution of biological tasks involves populations of neurons spanning several voxels, rather than a single voxel.
We propose Bayesian variable selection tools coupled with a spatial kernel convolution (KC) structure and temporal autoregressive processes for CV-fMRI data (Yu, Prado, Ombao, and Rowe, 2022).

Kernel Convolution: Definition

Given a kernel $k (z; ϕ), z \in S$ and a white noise process $w (u), u \in S$ , the kernel convolution is $S (z) = \int_{S} k (z - u; ϕ) w (u) d u .$
In practice, for sites $u_{1}, \dots, u_{D}$, the process is defined as $S (z) = \sum_{d = 1}^{D} k (z - u_{d}; ϕ) w (u_{d}) .$
The spatial process $S (z)$ governs the spatial dependence of voxels in the image, and affects the probability of voxels being activated.
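
The discrete sum is a matrix-vector product between a $V \times D$ kernel matrix and the $D$ latent effects. The sketch below uses a Gaussian kernel of the form $\exp (- ‖ z - u ‖^{2} / (2 ϕ))$; the image size, the site grid, and the name `kernel_convolution` are illustrative assumptions.

```python
import numpy as np

def kernel_convolution(Z, U, w, phi):
    """S(z_v) = sum_d k(z_v - u_d; phi) * w(u_d) with a Gaussian kernel.

    Z : (V, 2) voxel coordinates
    U : (D, 2) site coordinates
    w : (D,) latent effects at the sites
    """
    sq_dist = ((Z[:, None, :] - U[None, :, :]) ** 2).sum(axis=-1)  # (V, D)
    K = np.exp(-sq_dist / (2.0 * phi))                             # kernel matrix
    return K @ w                                                   # (V,)

# A 20 x 20 image (V = 400 voxels) driven by a 4 x 4 grid of sites (D = 16)
side = 20
grid = np.stack(np.meshgrid(np.arange(side), np.arange(side), indexing="ij"), axis=-1)
Z = grid.reshape(-1, 2).astype(float)
g = np.linspace(2, 17, 4)
U = np.stack(np.meshgrid(g, g, indexing="ij"), axis=-1).reshape(-1, 2)

rng = np.random.default_rng(0)
w = rng.standard_normal(len(U))
S = kernel_convolution(Z, U, w, phi=4.0)
```

Here 16 latent values determine the spatial effect at all 400 voxels, which is the source of the dimension reduction.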

Kernel Convolution: Properties

Dimension reduction: A small number $D$ of parameters $w (u_{1}), \dots, w (u_{D})$ governs the entire process $S (z)$, which may be evaluated at a very large number of locations $z_{1}, \dots, z_{V}$ ($V \gg D$).
$S = (S (z_{1}), \dots, S (z_{V}))^{'}$ is the vector of voxel-level spatial effects.
$w = (w (u_{1}), \dots, w (u_{D}))^{'}$ is the $D$-dimensional latent spatial effect formed by the selected $D$ sites.

Kernel convolution (KC) vs. Gaussian process (GP)

KC:
- Select $D$ sites to be representative of the image.
- Compute the voxel-level spatial effects $S_{(j)} = (S_{j}^{1}, \dots, S_{j}^{V})$ via kernel convolution using the latent spatial process formed by the selected $D$ sites.

GP:
- Parcellate the image into $G$ clusters of voxels.
- Compute the region-level spatial effects $S_{(j)} = (S_{j}^{1}, \dots, S_{j}^{G})$ based on a Gaussian process and the "location" of the $G$ regions (Bezener et al. 2018).

In general $D \neq G$, but we use $D = G$ for comparison.

Advantages of KC over GP

KC:
- Circumvents the need to assign each voxel to a group.

GP:
- Needs the location of each region to be defined and computed.
- Detection performance is sensitive to the shape of the regions.

Complex-valued Bayesian Spatiotemporal Model

Likelihood: With the indicators $ψ_{j}^{v}$ such that $γ_{j}^{v} \neq 0$ if $ψ_{j}^{v} = 1$ and $γ_{j}^{v} = 0$ if $ψ_{j}^{v} = 0$, $\begin{aligned} y^{v} & = X (ψ^{v}) γ^{v} (ψ^{v}) + η^{v}, \\ η^{v} & \sim CN_{T} (0, 2 σ_{v}^{2} Λ_{v}, 0), \end{aligned}$

where $Λ_{v}$ is the AR(1) correlation matrix.

For computational efficiency, the AR coefficient is replaced by an empirical Bayes estimate ${\hat{ρ}}_{v}$, which gives ${\hat{Λ}}_{v}$.

Complex-valued g-prior on $γ^{v}$: $\begin{aligned} γ^{v} (ψ^{v}) ∣ ψ^{v}, σ_{v}^{2} & \overset{\text{ind}}{\sim} CN_{p} ({\hat{γ}}^{v} (ψ^{v}), 2 T σ_{v}^{2} (X^{'} (ψ^{v}) {\hat{Λ}}_{v}^{- 1} X (ψ^{v}))^{- 1}, 0) \\ {\hat{γ}}^{v} (ψ^{v}) & = (X^{'} (ψ^{v}) {\hat{Λ}}_{v}^{- 1} X (ψ^{v}))^{- 1} X^{'} (ψ^{v}) y^{v} \end{aligned}$
Integrate $γ^{v}$ out for faster computation.
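
The AR(1) correlation matrix has entries $ρ^{| s - t |}$, and its inverse is tridiagonal in closed form, which is one reason plugging in ${\hat{ρ}}_{v}$ keeps the computation cheap. The sketch below illustrates both matrices; the function names are hypothetical and this is not the authors' implementation.

```python
import numpy as np

def ar1_correlation(T, rho):
    """AR(1) correlation matrix with entries rho**|s - t|."""
    idx = np.arange(T)
    return rho ** np.abs(idx[:, None] - idx[None, :])

def ar1_precision(T, rho):
    """Closed-form tridiagonal inverse of the AR(1) correlation matrix (T >= 2)."""
    P = np.zeros((T, T))
    i = np.arange(T)
    P[i, i] = 1.0 + rho ** 2          # interior diagonal entries
    P[0, 0] = P[-1, -1] = 1.0         # first and last diagonal entries
    P[i[:-1], i[1:]] = -rho           # superdiagonal
    P[i[1:], i[:-1]] = -rho           # subdiagonal
    return P / (1.0 - rho ** 2)

Lam = ar1_correlation(6, 0.5)
Lam_inv = ar1_precision(6, 0.5)
```

With the closed form, ${\hat{Λ}}_{v}^{- 1}$ never has to be obtained by a general matrix inversion.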

Spatial Priors

KC prior:
$π (ψ_{(j)} | S_{(j)}) = \prod_{v = 1}^{V} π (ψ_{j}^{v} | S_{j}^{v})$
$ψ_{j}^{v} | S_{j}^{v} \overset{\text{ind}}{\sim} \text{Bernoulli} (\frac{1}{1 + e^{- (a_{j}^{d} + S_{j}^{v})}})$
$S_{j}^{v} = \sum_{d = 1}^{D} k (z_{v} - s_{d}; ϕ^{d}) w_{j}^{d}$
$w_{j}^{d} ∣ τ_{j}^{2} \overset{\text{ind}}{\sim} N (0, τ_{j}^{2})$
$τ_{j}^{2} \overset{\text{iid}}{\sim} IG (a_{τ}, b_{τ})$
$ϕ^{d} \overset{\text{iid}}{\sim} Ga (a_{ϕ}, b_{ϕ})$

GP prior:
$π (ψ_{(j)} | S_{(j)}) = \prod_{g = 1}^{G} \prod_{v \in R_{g}} π (ψ_{j}^{v} | S_{j}^{g})$
$ψ_{j}^{v} | S_{j}^{g} \overset{\text{ind}}{\sim} \text{Bernoulli} (\frac{1}{1 + e^{- (a_{j}^{g} + S_{j}^{g})}})$
$S_{(j)} | δ_{j}^{2}, r_{j} \overset{\text{ind}}{\sim} N (0, δ_{j}^{2} Γ_{j})$
$Γ_{j} (i, k) = \exp (- \frac{‖ s_{i} - s_{k} ‖}{r_{j}})$
$π (δ_{j}^{2}) \propto δ_{j}^{- 2}$
$r_{j} \sim Ga (a_{j}, b_{j})$

Choice of Kernels

Gaussian kernel: $k (z_{v} - u_{d}; ϕ) = \exp (- \frac{‖ z_{v} - u_{d} ‖^{2}}{2 ϕ})$
Bezier kernel: $k (z_{v} - u_{d}; ν, ϕ) = \begin{cases} (1 - \frac{‖ z_{v} - u_{d} ‖^{2}}{ϕ^{2}})^{ν}, & ‖ z_{v} - u_{d} ‖ < ϕ \\ 0, & \text{otherwise} \end{cases}$, where $ν$ is the smoothness parameter and $ϕ$ is the range parameter.

The Bezier kernel has compact support, which suits neighboring spatial dependence:
- it avoids unrealistically relating every pair of voxels, however far apart
- it lets the model learn the neighborhood structure of a given voxel by learning $ϕ$
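
The compact support is easy to see numerically. Below is a minimal sketch of the Bezier kernel as a function of the distance $‖ z_{v} - u_{d} ‖$; the function name `bezier_kernel` and the evaluation points are illustrative.

```python
import numpy as np

def bezier_kernel(dist, nu, phi):
    """Bezier kernel: (1 - d^2 / phi^2)^nu for d < phi, and 0 otherwise.

    dist : distance(s) between a voxel and a site
    nu   : smoothness parameter (nu > 0)
    phi  : range parameter; the kernel vanishes at distances >= phi
    """
    dist = np.asarray(dist, dtype=float)
    # Clip so that the base is never negative before raising it to the power nu
    base = np.clip(1.0 - (dist / phi) ** 2, 0.0, None)
    return np.where(dist < phi, base ** nu, 0.0)

d = np.array([0.0, 1.5, 3.0, 5.0])
k = bezier_kernel(d, nu=2.0, phi=3.0)   # weights at increasing distances
```

Voxels at distance $ϕ$ or more from a site get weight exactly zero, so the kernel matrix is sparse when $ϕ$ is small relative to the image.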

Bezier Kernel: $ν = 2$ ; $ϕ = 2, 4, 6$

The plots show what the Bezier kernel looks like for different parameter values.

Bezier Kernel: $ν = 0.5, 2, 5$ ; $ϕ = 3$

Simulated data with AR coefficient 0.5

Estimate $\Pr (ψ_{j}^{v} = 1 | y)$ by the proportion of MCMC iterations in which $ψ_{j}^{v} = 1$, i.e., the number of 1s of $ψ_{j}^{v}$ in the posterior sample divided by the number of MCMC iterations.
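
This estimate is a column mean over the stored indicator draws. The sketch below uses a tiny made-up sample of four iterations and three voxels; the function name and the 0.5 cutoff are illustrative assumptions.

```python
import numpy as np

def posterior_inclusion_probability(psi_samples):
    """psi_samples: (n_iter, V) array of 0/1 draws; returns (V,) proportions of 1s."""
    return np.asarray(psi_samples, dtype=float).mean(axis=0)

# Toy posterior sample: 4 MCMC iterations (rows) for 3 voxels (columns)
psi_samples = np.array([[1, 0, 1],
                        [1, 0, 0],
                        [1, 0, 1],
                        [1, 1, 1]])
prob = posterior_inclusion_probability(psi_samples)
active = prob > 0.5   # illustrative cutoff for declaring a voxel active
```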

Temporal vs. Non-temporal

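| Model | Sensitivity | Specificity | Precision | Accuracy | F1 | MCC |
| --- | --- | --- | --- | --- | --- | --- |
| CV-KC-AR | 0.92 | 0.99 | 0.99 | 0.99 | 0.95 | 0.95 |
| CV-GP-AR | 0.82 | 0.99 | 0.97 | 0.97 | 0.89 | 0.88 |
| CV-EMVS-AR | 0.96 | 0.99 | 0.93 | 0.98 | 0.94 | 0.93 |
| CV-KC | 1.00 | 0.79 | 0.47 | 0.82 | 0.63 | 0.61 |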
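
For reference, the six columns follow from the voxel-level confusion counts in the standard way. The sketch below computes them on a made-up eight-voxel example; it is not the code used to produce the table.

```python
import numpy as np

def classification_metrics(truth, detected):
    """Standard detection metrics from true and detected activation indicators."""
    truth = np.asarray(truth, dtype=bool)
    detected = np.asarray(detected, dtype=bool)
    tp = float(np.sum(truth & detected))     # active and detected
    tn = float(np.sum(~truth & ~detected))   # inactive and not detected
    fp = float(np.sum(~truth & detected))    # inactive but detected
    fn = float(np.sum(truth & ~detected))    # active but missed
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    mcc = (tp * tn - fp * fn) / np.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {"sensitivity": sensitivity, "specificity": specificity,
            "precision": precision, "accuracy": accuracy, "f1": f1, "mcc": mcc}

# Toy example: 3 truly active voxels out of 8, one missed and one false positive
truth = [1, 1, 1, 0, 0, 0, 0, 0]
detected = [1, 1, 0, 1, 0, 0, 0, 0]
m = classification_metrics(truth, detected)
```

A row such as CV-KC, with perfect sensitivity but low precision, corresponds to many false positives among the detected voxels.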

Simulated data with AR coefficient 0

Spatial vs. Non-spatial

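| Model | Sensitivity | Specificity | Precision | Accuracy | F1 | MCC |
| --- | --- | --- | --- | --- | --- | --- |
| CV-KC | 0.79 | 0.99 | 0.99 | 0.97 | 0.87 | 0.86 |
| CV-GP | 0.58 | 0.99 | 0.99 | 0.93 | 0.73 | 0.72 |
| CV-EMVS | 0.66 | 0.99 | 0.92 | 0.94 | 0.77 | 0.75 |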

KC produces a finer and more reasonable latent spatial effect map

$ψ^{v} | S^{v} \overset{\text{ind}}{\sim} \text{Bernoulli} (\frac{1}{1 + e^{- S^{v}}})$

$S^{v} = \sum_{d = 1}^{D} k (z_{v} - s_{d}; ϕ) w^{d}$

GP forces voxels in the same region to share the same effect

$ψ_{j}^{v} | S_{j}^{g} \overset{\text{ind}}{\sim} \text{Bernoulli} (\frac{1}{1 + e^{- S_{j}^{g}}})$

KC is Less Sensitive to Dimension Reduction

It matters because ...

Computing Time

Average computing time over 10 runs of the MCMC algorithms, in seconds per 1000 MCMC iterations.

| Model | $16$ | $25$ | $100$ |
| --- | --- | --- | --- |
| CV-KC | 0.51 | 0.59 | 1.48 |
| CV-GP | 0.30 | 0.41 | 3.36 |

The example shows that the KC model can reach similar activation-detection performance using only about 15% to 20% of the GP model's computing time per 1000 MCMC iterations (compare 0.51 and 0.59 for CV-KC with 3.36 for CV-GP).

CV vs. MO

The CV model outperforms the MO model, especially when the signal is noisy.
The MO models improve more when an explicit spatial prior is included.

MO-GP Has a Region Probability Inflation

If a spatial region contains truly activated voxels, the nonactivated voxels in that region will also have a relatively high probability of activation, which increases false positives.

Spatial Models Encourage Activation in Clusters

Zoom In

Multi-resolution

$D$ coarse-resolution sites (white dots) and $H$ fine-resolution sites (green dots)

$S^{v} = \sum_{d = 1}^{D} k_{c} (z_{v} - s_{d}; ϕ_{c}) w^{d} + \sum_{h = 1}^{H} k_{f} (z_{v} - s_{h}; ϕ_{f}) b^{h}$

Take-home Message

Complex-valued modeling improves activation detection.
CV-EMVS and CV-KC-AR are computationally efficient.
Spatial models encourage activation in clusters.
The MO models need a sophisticated spatial structure to perform as well as the CV models.
The KC models are flexible and outperform the baseline GP models.
- Any valid kernel can be used.
- They avoid problems caused by the shape of regions.
- How a voxel is influenced by its neighboring voxels is model-based.
- They are less affected by dimension reduction, leading to fast computation.

References

[1] M. Karaman, I. P. Bruce, and D. B. Rowe. "Incorporating relaxivities to more accurately reconstruct MR images". In: Magnetic Resonance Imaging 33 (2015), pp. 374-384.

[2] D. B. Rowe and B. R. Logan. "A complex way to compute fMRI activation". In: NeuroImage 23 (2004), pp. 1078-1092.

[3] J. Xia, F. Liang, and Y. Wang. "FMRI analysis through Bayesian variable selection with a spatial prior". In: Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro. Boston, MA, USA, 2009, pp. 714-717.

[4] C. Yu, R. Prado, H. Ombao, et al. "A Bayesian variable selection approach yields improved detection of brain activation from complex-valued fMRI". In: Journal of the American Statistical Association: Applications and Case Studies 113 (2018), pp. 1395-1410.

[5] C. Yu, R. Prado, H. Ombao, et al. "Bayesian spatiotemporal modeling on complex-valued fMRI signals via kernel convolutions". In: Biometrics (2022), pp. 1-13.

[6] L. Zhang, M. Guindani, and M. Vannucci. "Bayesian models for fMRI data analysis". In: WIREs Computational Statistics 7 (2015), pp. 21-41.

[7] L. Zhang, M. Guindani, F. Versace, et al. "A spatio-temporal non-parametric Bayesian model of multi-subject fMRI data". In: Annals of Applied Statistics (2016).
