J Comput Aided Mol Des. 2021 Nov;35(11):1141-1155. doi: 10.1007/s10822-021-00427-0. Epub 2021 Oct 29.
The goal of the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) challenge is to improve the accuracy of current computational models to estimate free energy of binding, deprotonation, distribution and other associated physical properties that are useful for the design of new pharmaceutical products. New experimental datasets of physicochemical properties provide opportunities for prospective evaluation of computational prediction methods. Here, aqueous pKa and a range of bi-phasic logD values for a variety of pharmaceutical compounds were determined through a streamlined automated process to be utilized in the SAMPL8 physical property challenge. The goal of this paper is to provide an in-depth review of the experimental methods utilized to create a comprehensive data set for the blind prediction challenge. The significance of this work involves the use of high throughput experimentation equipment and instrumentation to produce acid dissociation constants for twenty-three drug molecules, as well as distribution coefficients for eleven of those molecules.