If you take a look at https://www.wwpdb.org/documentation/file-format-content/format33/sect9.html#ATOM

you will see the following:

77 - 78        LString(2)    element      Element symbol, right-justified.

Going by atom name will get you in trouble.  As you stated calcium vs Calpha.  The element symbol comes from the chemical component dictionary.


Ezra



On 5/15/24 6:28 AM, Harry Powell wrote:
Hi folks

I’m sure that this has been answered many times before (I’m sure that when I 
was young I even read it here…), and I *know* that we should all be using 
mmCIF, but I’m using PDB format files generated by a popular Python module and 
I wanted to check the output against a definitive format definition (if that’s 
not tautology).

I noticed this because I was encouraged to try Moorhen and found that a HEM 
(apparently written by this module) did not have the atoms connected with bonds 
in the display.

I’m particularly interested in metal atoms here, and want to be 100% sure that 
I’ve found a calcium, say, and not a C-alpha.

Q: Is it necessary to check columns 77-78 if I really want to be sure?

I’ve read the following, but can’t see anything obvious in “official” PDB 
documentation that what it says here is actually defined anywhere:

Atom names are composed of an atomic (element) symbol right-justified in 
columns 13-14, and trailing identifying characters left-justified in columns 
15-16. A single-character element symbol should not appear in column 13 unless 
the atom name has four characters (for example, see Hydrogen Atoms). Many 
programs simply left-justify all atom names starting in column 13. The 
difference can be seen clearly in a short segment of hemoglobin (entry 3hhb):

Correct:
HETATM 1071 FE   HEM A   1       8.128   7.371 -15.022 24.00 16.74          FE
HETATM 1072  CHA HEM A   1       8.617   7.879 -18.361  6.00 17.74           C
HETATM 1073  CHB HEM A   1      10.356  10.005 -14.319  6.00 18.92           C
HETATM 1074  CHC HEM A   1       8.307   6.456 -11.669  6.00 11.00           C
HETATM 1075  CHD HEM A   1       6.928   4.145 -15.725  6.00 13.25           C

Incorrect:
HETATM 1071 FE   HEM A   1       8.128   7.371 -15.022 24.00 16.74          FE
HETATM 1072 CHA  HEM A   1       8.617   7.879 -18.361  6.00 17.74           C
HETATM 1073 CHB  HEM A   1      10.356  10.005 -14.319  6.00 18.92           C
HETATM 1074 CHC  HEM A   1       8.307   6.456 -11.669  6.00 11.00           C
HETATM 1075 CHD  HEM A   1       6.928   4.145 -15.725  6.00 13.25           C
I’m sure that someone here will say “why don’t you look at *****, it’s 
obvious”, in which case - many thanks!

help

Harry

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

########################################################################

To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1

This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list 
hosted by www.jiscmail.ac.uk, terms & conditions are available at 
https://www.jiscmail.ac.uk/policyandsecurity/

Reply via email to