HADDOCK2.4 manual - Installation Instructions
- Installing HADDOCK
Installing HADDOCK
Downloading HADDOCK
To obtain HADDOCK2.4 fill the HADDOCK license form ).
Downloading CNS
The other required piece of software to run HADDOCK is its computational engine, CNS (Crystallography and NMR System – https://cns-online.org ). CNS is freely available for non-profit organisations. In order to get access to all features of HADDOCK you will need to recompile CNS using the additional files provided in the HADDOCK distribution in the cns1.3
directory. Compilation of CNS might be non-trivial. We are providing some instructions here. Consult also for some guidance the related entry in the HADDOCK forum
Python2.7
python version 2.7 is required in order to run HADDOCK2.4. Importantly, python2 (pointing to python2.7) should be existing on your system.
As python2.7 is end of life, one way to install it is to use miniconda on your system.
Biopython 1.72 (only required to use the coarse-graining option in HADDOCK2.4)
Biopython version 1.72 is required in order to be able to convert the atomistic models into coarse grained models (see coarse graining PDB files for docking.
DSSP (only required to use the coarse-graining option in HADDOCK2.4)
DSSP is required to define the secondary structure, an information required to select the proper backbone parameter for the Martini coarse grained model (see coarse graining PDB files for docking.
Recommended auxiliary software
FreeSASA
In order to identify surface-accessible residues to define restraints for HADDOCK we can make use of NACCESS freely available to non-profit users, or its open-source software alternative FreeSASA. We will here make use of FreeSASA. Following the download and installation instructions from the FreeSASA website. The direct download command is:
wget https://github.com/mittinatten/freesasa/archive/refs/tags/2.0.3.tar.gz
If running into problems you might want to disable json
and xml
support. Here we will assume you save the tar archive under the software
directory in your home directory:
tar xvfz freesasa-2.0.3.tar.gz cd freesasa-2.0.3 ./configure \-\-disable-json \-\-disable-xml \-\-prefix ~/software make make install
HADDOCK-tools
HADDOCK-tools** is collection of HADDOCK-related scripts freely available from our GitHub repository. To install it:
cd ~/software git clone https://github.com/haddocking/haddock-tools
In case git is not installed on your system, go the GitHub site given in the command and download directly the archive.
MolProbity
MolProbity is a structure validation software suite developed in the Richardson lab at Duke University. In the context of HADDOCK we are making use of MolProbity to define the protonation state of Histidine residues using the reduce
application. An pre-compiled executable can be freely downloaded from the MolProbity GitHub website. You can directly download the reduce
executable for Linux or OSX.
Put the executable for example in ~software/bin
or some other software installation directory in your path,
rename it to reduce
if needed and make sure it is executable (e.g. chmod +x ~/software/bin/reduce
).
PDB-tools
PDB-tools is a useful collection of Python scripts for the manipulation (renumbering, changing chain and segIDs…) of PDB files is freely available from our GitHub repository. To install it:
cd ~/software git clone https://github.com/haddocking/pdb-tools
In case git is not installed on your system, go the GitHub site given in the command and download directly the archive.
ProFit
ProFit is designed to be the ultimate protein least squares fitting program. Some of the provided analysis tools in HADDOCK make use of Profit. Profit can be obtained free of charge for both non-profit and commercial users. The latter should notify the authors that they are using it. For information and download see the ProFit webpage.
PyMol
PyMol is a visualisation software. It is used throughout our tutorials.
Configuring HADDOCK
After having downloaded HADDOCK, unpack the archive under the installation directory with the following command:
tar xvfz haddock2.4.tgz
In case you installed miniconda, create a HADDOCK2.4 environment with the following command (withing the haddock2.4 directory):
conda env create -f requirements.yml
Then activate the haddock2.4 environment with:
Edit then a configuration file specific to your system.
This configuration file should contain the following information:
CNSTMP
defining the location of your CNS executable
QUEUETMP
defining the submission command for running the jobs (e.g. either via csh
or through a specific command submitting to your local batch system)
NUMJOB
defining the number of concurrent jobs executed (or submitted).
QUEUESUB
defining the HADDOCK python script used to run the jobs (the default QueueSubmit_concat.py should do in most cases).
And example configuration file for running on local resources assuming a 4 core system would be:
set CNSTMP=/home/software/cns/cns_solve-1.31-UU-Linux64bits.exe set QUEUETMP=/bin/csh set NUMJOB=4 set QUEUESUB=QueueSubmit_concat.py
For submitting to a batch system instead you might want to use a wrapper script. An example for torque can be found here.
In order to configure HADDOCK, call the install.csh
script with as argument the configuration script you just created:
./install.csh my-config-file
There is one more file that should be manually edited to define the number of models to concatenate within one job (useful when submitting to a batch system to ensure jobs are not too short in queue). Depending on the size of your system, a typical run time for rigid body docking would be a few tens of seconds per model written to disk (which effectively correspond to 10 docking trials internally), and a few minutes per model for the flexible refinement and water refinement. But this can increase a lot depending on the complexity of your system and the number of molecules to dock.
To define the number of concatenated models edit Haddock/Main/MHaddock.py located in the haddock2.4 installation directory and change the values as required, e.g.:
jobconcat["0"] = 5 jobconcat["1"] = 1 jobconcat["2"] = 1
If running on local system, change all values to 1.
OneOne last command to source the HADDOCK environment (under bash) (for csh, replace .sh
by .csh
):
cd ~/software/haddock2.4 source ./haddock_configure.sh
At this stage you should be ready to use HADDOCK2.4!