Skip to content

caravagnalab/DeConveil

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

95 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DeConveil

pypi version

Introduction

The goal of DeConveil is the extension of Differential Gene Expression testing by accounting for genome aneuploidy. This computational framework extends traditional DGE analysis by integrating DNA Copy Number Variation (CNV) data. This approach adjusts for dosage effects and categorizes genes as dosage-sensitive (DSG), dosage-insensitive (DIG), and dosage-compensated (DCG), separating the expression changes caused by CNVs from other alterations in transcriptional regulation. To perform this gene separation we need to carry out DGE testing using both PyDESeq2 (CN-naive) and DeConveil (CN-aware) methods.

You can download the results of our analysis from deconveilCaseStudies

Installation

Pre-required installations before running DeConveil

Python libraries are required to be installed: pydeseq2

pip install pydeseq2

DeConveil can be installed from PyPI using pip:

pip install DeConveil

DeConveil can also be installed from Bioconda with conda:

conda install -c bioconda deconveil

Data

Input data

DeConveil requires the following input matrices:

- matched mRNA read counts (normal and tumor samples) and absolute CN values (for normal diploid samples we assign CN=2), structured as NxG matrix, where N represents the number of samples and G represents the number of genes;

- a design matrix structured as an N × F matrix, where N is the number of samples and F is the number of features or covariates.

Example of CN data for a given gene g: CN = [1, 2, 3, 4, 5, 6].

An example of the input data can be found in the test_deconveil Jupyter Notebook.

Output data

res_CNnaive.csv (for PyDESeq2 method) and res_CNaware.csv (for DeConveil) data frames reporting log2FC and p.adjust values for both methods.

These data frames are further processed to separate gene groups using define_gene_groups() function included in DeConveil framework.

A tutorial of the analysis workflow is available in test_deconveil.ipynb

Citation

If you use DeConveil, cite:

K. Davydzenka, G. Caravagna, G. Sanguinetti. Extending differential gene expression testing to handle genome aneuploidy in cancer. bioRxiv preprint, 2025.

Copyright and contacts

Katsiaryna Davydzenka, Cancer Data Science (CDS) Laboratory.

About

Extension of Differential Gene Expression testing to handle genome aneuploidy in cancer

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published