-
Notifications
You must be signed in to change notification settings - Fork 4
7 SourceCode Installation
Users should check out the Conda installation instructions for more information about downloading MetaSanity to their system.
MetaSanity is available for download as a source code version. The source code works with existing installations of the program dependencies, and thus is most useful for users working on systems on which most or all are already installed.
Install MetaSanity by using its install Python3 script. Download the script from this repository.
wget https://raw.githubusercontent.com/cjneely10/MetaSanity/master/install.py -O install.py
Note
Users are strongly advised to use the default version of this software. However, users who wish to use the source code may install MetaSanity using python3 install.py -s sourcecode_installation.
MetaSanity relies on a series of databases to compute its evaluation and annotation pipelines.
The script download-data.py in the program package is available to easily download all needed database files into the MetaSanity installation.
Run the script to download all required databases. For example, python3 download-data.py. This will download and extract ~46GB of database files into the MetaSanity directory.
usage: download-data.py [-h] [-d DATA]
Download required MetaSanity data
Select from: gtdbtk,checkm,kofamscan,peptidase,virsorter
optional arguments:
-h, --help show this help message and exit
-d DATA, --data DATA Comma-separated list (no spaces) of databases to download, default allUsers may elect only to download a subset of these databases. For example, python3 download-data.py -d kofamscan,peptidase,virsorter will only download databases that are used in the FuncSanity pipeline. Similarly, python3 download-data.py -d gtdbtk,checkm will only download databases that are used in the PhyloSanity pipeline. Database availability is subject to the discretion of each program's hosting institution.
The install.py script configures the MetaSanity installation based on download location and installation type. Variables in MetaSanity.py are updated to reflect these values.
Confirm that the variable DOWNLOAD_DIRECTORY on line 29 in the MetaSanity.py script is the correct location of the program installation. For example, DOWNLOAD_DIRECTORY = "/correct/path/to/MetaSanity".
Users who have database files stored in directories not generated by the download-data.py script, or who have external BioMetaDB installations, should update these values as needed in MetaSanity.py. If the download-data.py script was used to download data to a directory other than the default MetaSanity values, update the databases directory location on line 50.
- Python ≥ 3.5
sudo apt-get -y install python3.5 && sudo apt-get -y install python3-venv python3-pip && pip3 install --upgrade pip
- Python 2
sudo apt-get -y install python2.7 && sudo apt-get -y install python-pip && pip install --upgrade pip
- Python packages
- Cython
- argparse
- configparser
- luigi
- pandas
- BioMetaDB
sudo pip3 install Cython argparse configparser luigi pandas
- External programs and their dependencies
-
PhyloSanity
-
CheckM
-
HMMER,prodigal, andpplacerare required to be on the user's system path. Uses Python2.7 to run. - Use
checkm data setRooton precalulated data files, available through theCheckMwebsite. pip2.7 install numpy scipy checkm-genome- Given the python requirement, this program is best contained within a separate python environment.
-
-
GTDBtk
-
Prodigal,HMMER,pplacer,FastANI,FastTreeare required to be on the system path. Uses Python2.7 to run. - From website - "GTDB-Tk requires an environmental variable named GTDBTK_DATA_PATH to be set to the directory containing the data downloaded from https://data.ace.uq.edu.au/public/gtdbtk/."
pip2.7 install dendropy future matplotlib numpy scipy gtdbtk- Given the python requirement, this program is best contained within a separate python environment.
-
-
FastANI
- Clone the program from Github and follow the directions within its
INSTALL.txtfile to compile.
- Clone the program from Github and follow the directions within its
-
CheckM
-
FuncSanity
-
diamond
- Download and extract the binary from the Github page
- Prodigal
sudo apt-get install prodigal
-
kofamscan
- This program requires database files and program binaries to be configured. This github, along with the program's README files, explains how to configure
kofamscan.
- This program requires database files and program binaries to be configured. This github, along with the program's README files, explains how to configure
-
BioData/KEGGDecoder
- Clone or download the
BioDatarepository.
- Clone or download the
-
PROKKA
sudo apt-get install libdatetime-perl libxml-simple-perl libdigest-md5-perl git default-jre bioperlsudo cpan Bio::Perl-
git clone https://github.com/tseemann/prokka.git $HOME/prokka- Can install in a different place than
$HOMEif desired.
- Can install in a different place than
$HOME/prokka/bin/prokka --setupdb- RNAmmer may optionally be installed
-
VirSorter
-
MetaSanity uses the Docker v1.0.5 version of
virsorter. - Download and extract the required databases that are listed on the program's website.
- Run
docker pull simroux:virsorter:v1.0.5.
-
MetaSanity uses the Docker v1.0.5 version of
-
psortb
- The commandline version of
psortbis used in MetaSanity. sudo docker pull brinkmanlab/psortb_commandline:1.0.2- Download the run script and change its file permissions
wget https://raw.githubusercontent.com/brinkmanlab/psortb_commandline_docker/master/psortb && chmod +x psortb
- The commandline version of
- hmmer
sudo apt-get install hmmer
- Required Data (automatically downloaded in the
download-data.pyscript):- dbCAN2 hmm profiles, available here
- MEROPS hmm profiles, available here
merops-as-pfams.txt
-
diamond
-
PhyloSanity
Important note for Source Code usage:
MetaSanity uses config files to run its analysis. Within the config file for your run, change the values of the PATH variables to reflect your system's installation. Otherwise, using the SourceCode is identical to using the Docker installation. See the full example for more information.
Prior to running your MetaSanity.py script, ensure ALL of the following:
- The
DOWNLOAD_DIRECTORYis the correct location of the MetaSanity directory (line 29). - The
VERSIONisv1.2.0(line 31).
If you are using any of the recommended programs listed above make sure that their paths are correct (lines 38-42)
If you installed databases in a separate location than in the MetaSanity installation directory, ensure that lines 50-60 are valid.
Ensure that line 34 is the location of the pipedm.py script.
Your installation is complete! See the full example to test your installation and to learn how to use MetaSanity! Learn more about PhyloSanity and FuncSanity on their wiki pages!