Process All Files In Directory
Jump to navigation
Jump to search
Explanation
For a given directory with PDB files in it, the following code will output, for each PDB, the bound disulfide bond lengths like this:
1alk.pdb
A CYS 168 SG
A CYS 178 SG
1.975
A CYS 286 SG
A CYS 336 SG
1.995
B CYS 168 SG
B CYS 178 SG
1.996
B CYS 286 SG
B CYS 336 SG
2.032
1btu.pdb
CYS 42 SG
CYS 58 SG
2.039
CYS 136 SG
CYS 201 SG
2.031
CYS 168 SG
CYS 182 SG
2.001
CYS 191 SG
CYS 220 SG
2.019
...
Bound Disulfides
from pymol import cmd
from glob import glob
for file in glob("*.pdb"):
print file
cmd.load(file,'prot')
for a in cmd.index("elem s and bound_to elem s"):
if cmd.select("s1","%s`%d"%a) and \
cmd.select("s2","elem s and bound_to %s`%d"%a):
if cmd.select("(s1|s2) and not ?skip"):
cmd.iterate("s1|s2","print ' ',chain,resn,resi,name")
print ' ',round(cmd.dist("tmp","s1","s2"),3)
cmd.select("skip","s1|s2|?skip")
cmd.delete("all")
All Sulfur Distances
Note that the above is for bonded sulfurs in disulfides. For all intra-cysteine gamma sulfur distances, you'd want to do something more like:
1alk.pdb
A CYS 168 SG
A CYS 178 SG
1.975
A CYS 168 SG
A CYS 286 SG
35.845
A CYS 168 SG
A CYS 336 SG
35.029
A CYS 168 SG
B CYS 168 SG
63.64
A CYS 168 SG
B CYS 178 SG
63.775
A CYS 168 SG
B CYS 286 SG
39.02
A CYS 168 SG
B CYS 336 SG
39.314
1btu.pdb
CYS 42 SG
CYS 58 SG
2.039
CYS 42 SG
CYS 136 SG
from pymol import cmd
from glob import glob
for file in glob("*.pdb"):
print file
cmd.load(file,'prot')
for a in cmd.index("CYS/SG"):
for b in cmd.index("CYS/SG"):
if a[1]<b[1]:
cmd.select("s1","%s`%d"%a)
cmd.select("s2","%s`%d"%b)
if cmd.select("(s1|s2) and not ?skip"):
cmd.iterate("s1|s2","print ' ',chain,resn,resi,name")
print ' ',round(cmd.dist("tmp","s1","s2"),3)
cmd.select("skip","s1|s2|?skip")
cmd.delete("all")