This is a Python3 implementation of the Popalign framework (this package will not run with Python2).
The PopAlign package allows users to analyze and compare single-cell sequencing data across hundreds of experimental samples. The Popalign framework builds probabilistic Gaussian mixture models of single-cell data within a compressed low-dimensional space, and uses statistical metrics to compare the models. Each GMM dissects complex populations into constituent subpopulations, which can then be individually compared. The framework can :
- Rank samples based on shifts across the entire population
- Align subpopulations across samples based on minimum divergence
- Find subpopulation specific transcriptional and abundance changes
- Automatically discover differentially expressed genes based on L1 error in the distributions and
- Query data using a common universal model
Install the package with pip
in a terminal. This will install the required dependencies.
pip install git+https://github.com/thomsonlab/popalign.git
Update the package with:
pip install --upgrade git+https://github.com/thomsonlab/popalign.git
Import the package in python
:
import popalign as PA
You can clone this package directly using the git clone
command in a terminal:
git clone https://github.com/thomsonlab/popalign.git
In the package folder, install the dependencies from the terminal with:
pip install -r requirements.txt
A Google Colabs notebook tutorial is now available in the 'tutorials/' folder and can also be accessed at the following link: https://colab.research.google.com/drive/1t5W_r9FzQCBHe4IL5hCABda5VGAdeTWS?usp=sharing
Full documentation of the PopAlign package is still under construction. Not all features currently implemented have been completely documented or highlighted in the tutorial but will be covered in future versions.
Please cite the following paper:
@article {ChenPopalign,
author = {Chen, Sisi and Park, Jong H. and Tsou, Tiffany and Rivaud, Paul and Charles, Emeric and Haliburton, John and Pichiorri, Flavia and Thomson, Matt},
title = {Dissecting heterogeneous cell populations across drug and disease conditions with PopAlign},
elocation-id = {421354},
year = {2020},
doi = {10.1101/421354},
publisher = {Cold Spring Harbor Laboratory},
URL = {https://www.biorxiv.org/content/early/2020/03/31/421354},
eprint = {https://www.biorxiv.org/content/early/2020/03/31/421354.full.pdf},
journal = {bioRxiv}
}