TransSE is an innovative deep learning framework designed for predicting super enhancers (SEs) and typical enhancers (TEs) from DNA sequences using transfer learning. By leveraging data from both human and mouse genomes, TransSE learns generalizable features and patterns conserved across species, demonstrating robust cross-species transferability and improved performance in SE identification.
- Advanced Architecture: Utilizes a convolutional neural network to capture complex sequence patterns
- Transfer Learning: Leverages knowledge from human and mouse datasets to SE prediction accuracy
- Cross-Species Applicability: Achieves high predictive performance on both human and mouse SEs
- Robust Transferability: Exhibits strong performance when applied across species
- Integrated Analysis: Enables motif enrichment analysis and SNP prioritization within predicted SEs
- User-Friendly Interface: Offers a web server for accessible SE and TE predictions
git clone https://github.com/Yin-Shen/TransSE.git
cd TransSE
tensorflow>=2.4.0
keras>=2.4.0
pandas>=1.2.0
numpy>=1.19.0
scipy>=1.5.0
biopython>=1.78
matplotlib>=3.3.0
seaborn>=0.11.0
plotly>=4.14.0
scikit-learn>=0.24.0
jupyter>=1.0.0
tqdm>=4.50.0
Place your human training, validation, and test data in the data/ directory Place your mouse training, validation, and test data in the data/ directory
python transse.py --human_train data/human_train.txt --human_val data/human_val.txt --human_test data/human_test.txt --mouse_train data/mouse_train.txt --mouse_val data/mouse_val.txt --mouse_test data/mouse_test.txt
python evaluate.py