No ratings yet

Abstract

Accurate prediction of ligand-bound protein complexes and ranking them by affinity are central problems in drug discovery. While deep learning co-folding methods can help address these challenges, their evaluation has been hampered by the difficulties in assessing independence from training data and insufficiently large test sets. Here we test the ability of co-folding methods to predict the structures of 557 ligands bound to the SARS-CoV-2 NSP3 macrodomain (Mac1) that were determined after the training cut-off dates. AlphaFold3 (AF3), Boltz-2, and Chai-1 each reproduced >50% of the Mac1 ligand poses to better than 2 [A] RMSD of experiment. Despite the potential for co-folding to describe protein conformational changes that stabilize ligand binding, we did not find that common conformational rearrangements, including peptide flip and a large loop opening, were recapitulated by the co-folding prediction. For AF3 and Chai-1, ligand pose prediction confidence weakly, but significantly, tracked experimental potency, while DOCK3.7 energies were only weakly correlated. Boltz-2 affinity predictions showed the strongest correlation with measured potency and, after calibration, achieved lower mean absolute error than a baseline predictor. We next assessed whether co-folding scores could rescore docking hit-lists to distinguish true ligands from non-binders among hundreds of molecules prospectively experimentally tested against AmpC {beta}-lactamase, the dopamine D4 and the {sigma}2 receptors. AF3 ligand pose confidence values did not separate true ligands from high-scoring false-positives as effectively as docking scores or Boltz-2 affinity predictions did. Taken together, the modest, but independent correlations of docking score and co-folding confidence or affinity suggests that integrating physics-based and deep-learning and approaches may help with hit prioritization and subsequent optimization in structure-based ligand discovery.

Links & Resources

Authors

Cite This Paper

Year:2025
Category:biophysics
APA

J., K., J., C. G., W., H. B., M., R. M., O., M., T., T., L., G. R., P., J., J., N. R., R., H. E., U., D. Y., V., S. M. G., E., D. M., R., R., S., G., J., K. N., R., R. A., A., A., K., S. B., S., F. J. (2025). Large scale prospective evaluation of co-folding across 557 Mac1-ligand complexes and three virtual screens. arXiv preprint arXiv:10.64898/2025.12.25.696505.

MLA

Kim, J., Correy, G. J., Hall, B. W., Rachman, M. M., Mailhot, O., Togo, T., Gonciarz, R. L., Jaishankar, P., Neitz, R. J., Hantz, E. R., Doruk, Y. U., Stevens, M. G. V., Diolaiti, M. E., Reid, R., Gopalkrishnan, S., Krogan, N. J., Renslo, A. R., Ashworth, A., Shoichet, B. K., and Fraser, J. S.. "Large scale prospective evaluation of co-folding across 557 Mac1-ligand complexes and three virtual screens." arXiv preprint arXiv:10.64898/2025.12.25.696505 (2025).