The nuclear pore complex (NPC) is a multiprotein assembly that serves as the sole mediator of nucleocytoplasmic exchange in eukaryotic cells. Here, we use an integrative approach to determine the structure of an essential component of the yeast NPC, the ~600 kDa heptameric Nup84 complex. The configuration of the subunit structures was determined by satisfaction of spatial restraints derived from a diverse set of negative stain EM and protein domain mapping data. Phenotypic data was mapped onto the complex, allowing us to find regions that stabilize the NPC's interaction with the NE membrane and connect the complex to the rest of the NPC. We suggest a scenario for the evolution of the Nup84 complex through a series of gene duplication and loss events. This work demonstrates that integrative approaches based on low resolution data can generate functionally informative structures at intermediate resolution.
The commands given here are for running in the Nup84 directory of an IMP build (that is, imppy.sh is two levels up).
input_models.txt: a list of input models and their pdb file names, plus a which subsequence of which protein each model representsinput/structures: the 14 input model pdb filesnup84_class_average.jpg: contains a 2d class average of particles from negative stain em located in input/emcoarse_restraints_set1, coarse_restraints_set2, coarse_restraints_set3, refined_restraints located in input/restraints are the restraint data input files.The following steps generate one model. They should be repeated many times to generate an ensemble of models. When %1% occurs in an argument, it should be replaced by a number or something similar to differentiate the various runs.
First, generate a coarse structural solution
Second, refine the model
Once you have set of refined models, generate em2d scores for all of them
Next, filter out all except the top 10% of the structures based on the em2d scores
Next, proceed with the final refinement of each of the filtered structures
Finally, generate 7 localization density maps from the final set of structures (one per protein). Any of the info files from the last step will do.
Author(s): Jeremy Phillips
Version: SVN.r14091
License: LGPL. This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
Testable: No.
Parallelizeable: Yes.
Last known good IMP version: None
Publications: