Skip NavigationSkip to Content

Duplicate entries in the Protein Data Bank: how to detect and handle them

  1. Author:
    Wlodawer,Alexander [ORCID]
    Dauter, Zbigniew [ORCID]
    Rubach, Pawel [ORCID]
    Minor, Wladek [ORCID]
    Jaskolski, Mariusz [ORCID]
    Jiang, Ziqiu [ORCID]
    Jeffcott, William [ORCID]
    Anosova, Olga
    Kurlin, Vitaliy
  2. Author Address

    Center for Structural Biology, Center for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA., Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, USA., Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland., Department of Surgery and Cancer, Imperial College London, London, United Kingdom., Computer Science, University of Liverpool, Liverpool L69 3BX, United Kingdom.,
    1. Year: 2025
    2. Date: Apr 01
    3. Epub Date: 2025 04 01
  1. Journal: Acta Crystallographica. Section D, Structural Biology
  2. Type of Article: Article
  1. Abstract:

    A global analysis of protein crystal structures in the Protein Data Bank (PDB) using a newly developed computational approach reveals many pairs with (nearly) identical main-chain coordinates. Such cases are identified and analyzed, showing that duplication is possible since the PDB does not currently have tools or mechanisms that would detect potentially duplicate submissions. Some duplicated entries represent modeling efforts of ligand binding that masquerade as experimentally determined structures. We propose that duplicate entries should either be obsoleted by the PDB or, as a minimum, marked with a clear `CAVEAT' record that would alert potential users to the presence of such problems. We also suggest that using a tool for verifying the uniqueness of the deposited structure, such as that presented in this work, should become part of the routine validation procedure for new depositions. open access.

    See More

External Sources

  1. DOI: 10.1107/S2059798325001883
  2. PMID: 40056147
  3. PII : S2059798325001883

Library Notes

  1. Fiscal Year: FY2024-2025
NCI at Frederick

You are leaving a government website.

This external link provides additional information that is consistent with the intended purpose of this site. The government cannot attest to the accuracy of a non-federal site.

Linking to a non-federal site does not constitute an endorsement by this institution or any of its employees of the sponsors or the information and products presented on the site. You will be subject to the destination site's privacy policy when you follow the link.

ContinueCancel