Accounting fraud is the manipulation or misstatement of a company’s financial statements to create an illusion of strong financial health. This type of fraud boosts investor or shareholder confidence—when in fact the company could be in poor financial health with liquidity or solvency issues.
There are several ways of falsifying financial statements to paint a more rosy picture such as overstating revenue and/or assets, not recording expenses, or understating liabilities. The results of these falsifications mislead investors to believe a company is making money or has sufficient liquidity, which also leads to overinflated share prices.
The most famous example of accounting fraud is the Enron scandal where share prices plummeted to a measly few cents after the company’s fraudulent practices were uncovered.
Fortunately, regulators now have an additional tool to fight accounting fraud—artificial intelligence. Using the right financial datasets and models, fraudulent account practices can be accurately detected and reasonably explained.
For example, we can use data entries in balance sheets and profit-and-loss statements as features in a machine learning model to predict accounting fraud such as total current assets, cost of goods sold (COGS), or total debt in current liabilities.
Along with raw data entries in these statements, additional financial ratios such as year-on-year changes in return on assets or book-to-market value are useful machine learning features as well.
The dataset used in the following example was published in the Journal of Accounting Research.
Note that DataRobot also automatically runs Data Quality Assessments on the dataset to identify and remedy potential data quality issues.
Using this dataset, DataRobot’s Automated Machine Learning has built over 100 different machine learning models to identify the best performing model in the project and to recommend the model for deployment. The following is our best model, a Light Gradient Boosted Trees Classifier with Early Stopping, which has an Area-Under-Curve (AUC) score of over 80 percent.
Through Feature Impact, DataRobot also informs us of the top features in the dataset that contributes significantly to the model’s predictive accuracy. The top features are % Soft Assets, Sale of Common and Preferred Stock, Total Receivables, and others. Using this model, we can then make predictions of the risk of accounting fraud for a given company based on the company’s financial statements and characteristics. The following shows how DataRobot’s No-Code Apps allows users to design applications in a code-free fashion, query a given model, and generate predictions.
By some accounts, when executed properly, AI fraud detection systems can reduce fraud by 95 percent while also reducing the associated costs. These results provide significant value for public agencies and regulators. If you’re interested in using AI, tour the DataRobot AI Platform. To request a fraud detection demo shown in this blog post, contact us and we’ll explore a proof of value together.
About the author
Wei Zhong Toh
Customer Facing Data Scientist, DataRobot
Wei Zhong is a Customer Facing Data Scientist (CFDS) at DataRobot, located in Singapore. His areas of expertise include public sector, healthcare, and telecommunications. Wei Zhong holds a MSc in Statistics and a BSc (First Class Honours) in Computational Biology, from the National University ofSingapore.
Meet Wei Zhong Toh
Nina Xing
Customer Facing Data Scientist
Nina is a Customer Facing Data Scientist at DataRobot, based in Singapore. Nina has data science expertise in fintech, insurance, manufacturing, and retail industries. Prior to joining DataRobot, Nina worked in the financial market and specialized in portfolio performance and risk management. Nina is a Chartered Financial Analyst and holds a bachelor’s degree in Mathematical Sciences from Nanyang Technological University in Singapore.
Meet Nina Xing
Clifton Phua
Customer Facing Data Scientist, DataRobot
Clifton is a Customer Facing Data Scientist (CFDS) at DataRobot working in Singapore and leads the Asia Pacific (APAC)’s CFDS team. His vertical domain expertise is in banking, insurance, government; and his horizontal domain expertise is in cybersecurity, fraud detection, and public safety. Clifton’s PhD and Bachelor’s degrees are from Clayton School of Information Technology, Monash University, Australia. In his free time, Clifton volunteers professional services to events, conferences, and journals. Was also part of teams which won some analytics competitions.
Meet Clifton Phua