PDB Traceback client
Traces back PDB files from a HADDOCK run directory.
Given an input run directory, haddock3-traceback traces back each model to the initial input molecules used, providing the rank of each intermediate model.
USAGE:
haddock3-traceback -r <run_dir>
- haddock.clis.cli_traceback.get_ori_names(n: int, pdbfile: PDBFile, max_topo_len: int) tuple[list, int] [source]
Get the original name(s) of the PDB file.
- Parameters:
n (int) – Step number.
pdbfile (PDBFile) – PDBFile object.
max_topo_len (int) – Maximum length of the topologies found so far.
- Returns:
ori_names (list) – List of original names.
max_topo_len (int) – Maximum length of the topologies found so far.
- haddock.clis.cli_traceback.get_steps_without_pdbs(run_dir, all_steps)[source]
Get the modules that do not produce PDB files.
- Parameters:
run_dir (str or pathlib.Path) – Path to the run directory.
all_steps (list) – List of all the steps in the run directory.
- Returns:
steps_without_pdbs (list) – List of steps that did not produce PDB files.
- haddock.clis.cli_traceback.main(run_dir: str | Path, offline: bool = False) None [source]
Traceback CLI.
- Parameters:
run_dir (str or Path) – Path to the original run directory.
- haddock.clis.cli_traceback.order_traceback_df(df_output, sel_step)[source]
Order the traceback dataframe. Each step is ordered by rank.
- Parameters:
df_output (pandas.DataFrame) – Dataframe containing the traceback data.
sel_step (list) – List of selected steps.
- Returns:
df_output (pandas.DataFrame) – Dataframe containing the ordered traceback data.
- haddock.clis.cli_traceback.subset_traceback(traceback_df: DataFrame, cons_filename: Path) DataFrame [source]
Generate a subset the traceback dataframe with the top 40 models.
- Parameters:
traceback_df (pandas.DataFrame) – Dataframe containing the traceback data.
cons_filename (pathlib.Path) – name of the consensus file.
- Returns:
rank_data_subset (pandas.DataFrame) – Dataframe containing the subset of the traceback data.
- haddock.clis.cli_traceback.traceback_dataframe(data_dict: dict, rank_dict: dict, sel_step: list, max_topo_len: int) DataFrame [source]
Create traceback dataframe by combining together ranks and data.
- Parameters:
data_dict (dict) – Dictionary containing the data to be traced back.
rank_dict (dict) – Dictionary containing the ranks of the data to be traced back.
sel_step (list) – List of selected steps.
max_topo_len (int) – Maximum length of the topologies.
- Returns:
df_ord (pandas.DataFrame) – Dataframe containing the traceback data.