The PDB file format was invented in 197223 as a human-readable file that would allow researchers to exchange the atomic coordinates in a given protein through a database system. Its fixed-column width format is limited to 80 or 1404 columns, which was based on the width of the computer punch cards that were previously used to exchange the coordinates.5 Through the years the file format has undergone many changes and revisions. The final update to the PDB file format was in November 2012 with version 3.30.6
A typical PDB file describing a protein consists of hundreds to thousands of lines like the following (taken from a file describing the structure of a synthetic collagen-like peptide):
Berman, Helen M.; Kleywegt, Gerard J.; Nakamura, Haruki; Markley, John L. (2014-10-01). "The Protein Data Bank archive as an open data resource". Journal of Computer-Aided Molecular Design. 28 (10): 1009–1014. doi:10.1007/s10822-014-9770-y. ISSN 1573-4951. PMC 4196035. PMID 25062767. https://doi.org/10.1007/s10822-014-9770-y ↩
"wwPDB: File Format". www.wwpdb.org. https://www.wwpdb.org/documentation/file-format ↩
"PROTEIN DATABASE FILE RECORD FORMATS" (PDF). Retrieved 9 June 2024. https://cdn.rcsb.org/wwpdb/docs/documentation/file-format/PDB_format_1972.pdf ↩
Berman, Helen M (2008). "The Protein Data Bank: a historical perspective". Acta Crystallographica. 64 (Pt 1): 88–95. doi:10.1107/S0108767307035623. ISSN 2053-2733. PMID 18156675. https://www.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&tool=sumsearch.org/cite&retmode=ref&cmd=prlinks&id=18156675 ↩
"wwPDB: File Formats and the PDB". Protein Data Bank. Retrieved 2024-01-15. https://www.wwpdb.org/documentation/file-formats-and-the-pdb ↩
"wwPDB Format version 3.3: Coordinate Section". Archived from the original on 2012-02-28. Retrieved 2012-03-23. https://web.archive.org/web/20120228025220/http://www.wwpdb.org/documentation/format33/sect9.html ↩