Publications
For a full list, see my Google Scholar profile.
Selected Publications
2026

ICASSP 2026
A multi-resolution generative model for medical image segmentation with only ~2.6M parameters, achieving state-of-the-art performance and strong cross-domain generalization. We combine a Haar wavelet-based encoder (ℰ_wave) with a compact mask tokenizer (ℰ_tiny) to eliminate reliance on large pretrained vision foundation models such as SD-VAE. A latent-alignment loss (ℒ_align) ensures that LMM produces codes consistent with the decoding space of 𝒟_tiny.
In Preparation
Survey (In Preparation)
A comprehensive survey unifying theoretical perspectives on diffusion models (autoencoders, flows, energy-based models, and score-based methods) and their applications in imaging and inverse problems. The survey analyzes methods for segmentation, restoration, super-resolution, and generation (e.g., ControlNet, SDSeg, StableSR, ADD), highlighting strategies for fine-tuning, cross-modality learning, and latent-space optimization.
