Robust Differential Composition And Variability Analysis For Multisample Cell Omics

Robust Differential Composition And Variability Analysis For Multisample Cell Omics


Author(s): Stefano Mangiola, Alex Schulze, Marie Trussart, Enrique Zozaya, Mengyao Ma, Zijie Guo, Alan Rubin, Terry Speed, Heejung Shim, Anthony Papenfuss

Affiliation(s): WEHI

Twitter: @steman_research

Cell omics such as single-cell genomics, proteomics and microbiomics allow the characterisation of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to unveiling markers of disease progression such as cancer and pathogen infection. For cell omic data, no method for differential variability analysis exists, and methods for differential composition analysis only take a few fundamental data properties into account. Here we introduce sccomp, a generalised method for differential composition and variability analyses able to jointly model data count distribution, compositionality, group-specific variability and proportion mean-variability association, with awareness against outliers. Sccomp is an extensive analysis framework that allows realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean-variability association is ubiquitous across technologies showing the inadequacy of the very popular Dirichlet-multinomial modelling and provide mandatory principles for differential variability analysis. We show that sccomp accurately fits experimental data, with a 50% incremental improvement over state-of-the-art algorithms. Using sccomp, we identified novel differential constraints and composition in the microenvironment of primary breast cancer.

On YouTube: