Use of Benford's law in drug discovery data

Drug Discov Today. 2010 May;15(9-10):328-31. doi: 10.1016/j.drudis.2010.03.003. Epub 2010 Mar 16.

Abstract

Benford's law states that the distribution of the first digit of many data sets is not uniform. The first digit of any random number will be 1 almost 30% of the time, and larger digits occur as the first digit with lower and lower frequency, to the point where 9 occurs as a first digit only 5% of the time. Here, we demonstrate that several data sets in the field of drug discovery follow Benford's distribution, whereas 'doctored' data do not. Our findings indicate the applicability of Benford's law in assessing data quality in the field of drug discovery. We also propose a useful index of evaluating data quality based on Benford's law.

MeSH terms

  • Drug Discovery / methods*
  • Drug Discovery / statistics & numerical data*
  • Models, Statistical
  • Quantitative Structure-Activity Relationship
  • Research Design*