In summary, for a given sequence, ModPipe:
There are two steps. An initialization step sets up the ModPipe filesystem and the identifier for the given sequence, or for each of a set of given sequences. Then ModPipe can be run for each sequence.
While there are a number of command-line arguments to ModPipe, ModPipe relies primarily on the configuration file to specify input and output file locations and run parameters. The command line options are also described.
There is an example file included in the ModPipe distribution in the demo subdirectory.
To run the demo, change into the demo directory and then set the DEMO environment variable to the current directory, e.g. with something like:
export DEMO=`pwd`
Next, run the following command (this assumes that the MODPIPE environment variable points to your copy of ModPipe) to populate the ModPipe filesystem:
${MODPIPE}/main/AddSeqMP.py --conf_file modpipe.conf --sequence_file test.fsa
This command will add the sequences to the ModPipe filesystem in the data directory and create a file with unique sequence identifiers, test.unq. (The file should contain two such identifiers.) Next, run:
${MODPIPE}/main/ModPipe.pl --conf_file modpipe.conf \
--sequence_id 97e075794f588a59e8a0fb8a945814b1MLGIKIKP \
--hits_mode 1100
This command will run ModPipe on one of the two sequences in the filesystem and produce 13 fold assignments and alignments (6 from Sequence-Sequence search and 7 from Profile-Sequence search). The fold assignments (hits) are then filtered to remove redundancy, resulting in 8 selected hits. Finally, one model is built for each selected hit.
Note
For the sake of speed, the example does not calculate profile-profile alignments, and only builds a single model per alignment.
Data about generated models will be stored in *.mod files in each sequence directory. For instance, if you look in data/97e/97e075794f588a59e8a0fb8a945814b1MLGIKIKP/sequence/ there will be a file called 97e075794f588a59e8a0fb8a945814b1MLGIKIKP.mod. There will also be a .hit file that contains all the fold assignments. See File formats for full information on the contents of these files.