Benford's law - PHBS/MLF GitHub Wiki

Background

Benford's Law, also known as the Law of First Digits or the Phenomenon of Significant Digits, is the finding that the first digits (or, to be exact, numerals) of numbers drawn from records of widely varied sources are not uniformly distributed. Instead, the digit 1 is the most frequent, followed by 2, 3, and so on, with the frequency decreasing successively down to 9.

Law

Figure: The distribution of first digits, according to Benford's law. Each bar represents a digit, and the height of the bar is the percentage of numbers that start with that digit. (Source: Wikipedia)
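The bar heights in the figure follow the standard statement of Benford's law, P(d) = log10(1 + 1/d) for a first digit d from 1 to 9. As a minimal sketch, the expected percentages can be reproduced as follows; the function name `benford_probabilities` is chosen here for illustration and does not come from the project.

```python
import math


def benford_probabilities():
    """Expected first-digit probabilities P(d) = log10(1 + 1/d), d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


# Print the expected share of numbers starting with each digit.
for digit, p in benford_probabilities().items():
    print(f"{digit}: {p:.1%}")
```

The digit 1 comes out at about 30.1% and the digit 9 at about 4.6%, and the nine probabilities sum to 1.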

Project goals

Benford's law has been used to detect financial fraud and crime because artificially generated numbers often do not follow it. The goal of this project is to explore how Benford's law and machine learning can be used together to further increase detection accuracy. See the references below.
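One possible starting point, sketched here under assumptions rather than taken from the project, is to measure how far a set of numbers deviates from Benford's law with a Pearson chi-square statistic. Such a score could then serve as one input feature for a machine learning model. The helper names `first_digit` and `benford_chi_square` and the two sample datasets are hypothetical.

```python
import math
from collections import Counter


def first_digit(x):
    """Return the leading non-zero digit of a non-zero int or float."""
    if x == 0:
        raise ValueError("zero has no leading non-zero digit")
    # Scientific notation puts the leading digit in the first position.
    return int(f"{abs(x):.16e}"[0])


def benford_chi_square(values):
    """Pearson chi-square distance between observed first digits and Benford's law."""
    digits = [first_digit(v) for v in values if v != 0]
    n = len(digits)
    if n == 0:
        raise ValueError("need at least one non-zero value")
    counts = Counter(digits)
    stat = 0.0
    for d in range(1, 10):
        expected = n * math.log10(1 + 1 / d)
        stat += (counts.get(d, 0) - expected) ** 2 / expected
    return stat


# Illustrative data only: powers of 2 track Benford's law closely,
# while the integers 100..999 have uniformly distributed first digits.
benford_like = [2 ** k for k in range(1, 200)]
uniform_like = list(range(100, 1000))

print(benford_chi_square(benford_like))
print(benford_chi_square(uniform_like))
```

A larger statistic indicates a stronger departure from the expected first-digit distribution. The statistic grows with sample size, so scores are only comparable between datasets of similar size.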

References

First-digit distribution of the pre-lockdown number of confirmed cases in Chinese provinces, U.S. states, and Italian regions. (Source: Koch, C., & Okamura, K. (2020))