DeepInsulated: De novo design of insulated cis-regulatory elements based on deep learning-predicted fitness landscape
The code for official implementation of "De novo design of insulated cis-regulatory elements based on deep learning-predicted fitness landscape"
This codebase provides:
- The implementation of prediction model of insulated dataset.
- The implementation of de novo design of insulated promoters.
- The implementation of sequence space visualization and kernel visualization.
Precise control of gene activity is crucial in bioengineering applications. Despite tremendous efforts in cis-regulatory sequence activity prediction and reverse engineering, the host cellular context-dependent effects have long been neglected, leading to persistent difficulties in accurately modeling the regulatory process. Here, we introduce an insulated design strategy for purifying and modeling host-independent transcriptional activity. By incorporating heterologous paired cis- and trans-regulatory modules into an orthogonal host cell, we have created a controllable transcriptional regulatory system. A deep learning-based algorithm combined with experimental data purification process enabled us to de novo design full-length transcriptional promoter sequences based on host-independent activity landscape.
Remarkably, the landscape accurately depicted the transcriptional activity of the insulated system, which can generate desired sequence- and function-diversified _cis-_regulatory sequences for three different trans-RNA polymerase, and their activities are precisely predictable in both bacterial (Escherichia Coli) and mammalian (Chinese hamster ovary) cell lines. We expected that generative design strategy could be expanded to other complex cis-regulatory elements by combining the deep learning-based algorithm and construction of paired cis- and trans-regulatory modules in an orthogonal host cell.
Env Requirements:
- MAC OS, Linux or Windows.
- Python 3.6.
- Tensorflow 1.10.0 + keras 2.1.0
- CUDA 9.0 if you need train deep learning model with gpu.
Steps of using DeepInsulated:
-
Install Python ref to Download Python
-
Install the code in virtualenv to keep your environment clean:
pip install virtualenv # or pip install -i https://pypi.tuna.tsinghua.edu.cn/simple virtualenv virtualenv --python=python3 Deepinsulated cd Deepinsulated source ./bin/activateOptional: After use, shutdown virtual environment with
deactivatepip install virtualenv # or pip install -i https://pypi.tuna.tsinghua.edu.cn/simple virtualenv virtualenv --python=python3 Deepinsulated cd Deepinsulated .\Scripts\activate.batOptional: Shutdown virtual environment with
.\Scripts\deactivate.bat -
Install Git, this step is optional if you does not install DeepInsulated by git clone. Clone the source codes with git.
git clone https://github.com/HaochenW/deepinsulated.git -
or, download the source codes and extract files and put it in the virtual environment directory you defined.
-
after 2/3, the directory of DeepInsulated should have the following structure:
DeepInsulated deepinsulated |------- | |-------data |... |-------prediction |... |-------design |... |-------visualize |... |-------weight |... |------- |... |... -
After the extraction, download all dependencies with the following commend.
cd deepinsulated pip install -r requirements.txtTo boost download speeds, you could setup pip mirror such as
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple -r requirements.txt # or pip install -i https://mirrors.aliyun.com/pypi/simple -r requirements.txt -
Run DeepInsulated with python and enjoy it with following steps:
```
cd prediction
python predict_parallel.py
```
```
cd design
python de_novo_design.py
```
```
cd design
python semi_rational_design.py
```
```
cd visualize
python kernel_visual_small.py
```
Then Using Matlab to run kernel_visual.m
Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.
@article{,
title={De novo design of insulated _cis_-regulatory elements based on deep learning-predicted fitness landscape},
author={Haochen Wang, Yanhui Xiang, Ziming Liu, Wen Yin, Boyan Li, Long Qian, Xiaowo Wang, Chunbo Lou},
journal={},
year={2024}
}
For academic use, this project is licensed under the MIT License - see the LICENSE file for details For commercial use, please contact the authors.
