Download

Stand-alone Version

The stand-alone version of PhaBOX for large-scale inputs can be downloaded via https://github.com/KennthShang/PhaBOX.

Please noted that the local version of PhaBOX will not generate the visualization files. However, all the intermediate files, such as the network files and significant protein alignments will still provided as outputs.

If you used the database mentioned below, please cite:

Jiayu Shang, Cheng Peng, Herui Liao, Xubo Tang, Yanni Sun, PhaBOX: a web server for identifying and characterizing phage contigs in metagenomic data, Bioinformatics Advances, Volume 3, Issue 1, 2023, vbad101, https://doi.org/10.1093/bioadv/vbad101
Viral Dataset

Below, we provided the scource of the training and test data for user who may want to use for study. Because some of the benchmark datasets are curated by other research groups. We will listed the name of the paper and the link to the dataset. All the data are public and we are grateful for their contributions to our study.

Because some datasets are very large in size, only the accession and the label are given in CSV format. In this case, there are some useful websites/tools that may help you to download:

Lifestyle Database
Virus-host Database

Our paper: From genomic signals to prediction tools: a critical feature analysis and rigorous benchmark for phage–host prediction. https://doi.org/10.1093/bib/bbaf626

Protein Annotation Database

Released information and annotation of PVPs and non-PVPs

Released information and annotation of all genes

Protein cluster database

The protein cluster database and the annotation of the proteins are provided for user who may want to further analysis the alignment results.

Other useful resources

Our paper: From genomic signals to prediction tools: a critical feature analysis and rigorous benchmark for phage–host prediction. https://doi.org/10.1093/bib/bbaf626