There has been a tremendous explosion of data available with public sector entities over the last 20 years. This is largely driven by higher deployment of digital devices such as sensor instruments, scanners, meters, smart phones, CCTV cameras, satellites, and drones as well as software applications such as Supervisory Control and Data Acquisition Systems, Geographic Information Systems and biometric ID based applications for citizens.
The resulting granular, variegated and voluminous data has chanced upon significant reduction in the costs of storage and processing of data. These conducive circumstances have resulted in the emergence of Machine Learning (ML) Algorithms. These are computer algorithms, which mimic the techniques used by the human brain to identify patterns, learn to create mental models and solve problems. These algorithms use a high volume of input data to train and learn, to create an output data model. This output data model is then used on test data to deliver the required results.
For example, one can envisage an ML algorithm, which is intended to work as an image classifier for defects in construction i.e., the ML is required to deliver the result of whether an image contains a construction defect such as a crack in concrete or a bent iron beam or not. In order to deliver this result, the ML algorithm would have to use a high volume of construction images, which are clearly labelled as either defective or defect-free, as input data to train and learn to identify the different types of defects. This learning would then result in creation of an output data model. The output data model would then be in a position to consume new images, the test data, which are not labelled and classify them as either defective or defect-free. The accuracy of the output data model would depend on the volume and quality of the input training data.
Such ML algorithms would find ready-use cases in Government Audit. For the high volume of structured data being maintained by audited entities, the application of analytical tools such as IDEA and SQL would suffice. But for the combination of high volume of unstructured data such as images and videos of documents and objects and structured data being maintained, the application of ML algorithms in Audit would appear to be the logical solution.
ML algorithms may be applied to solve the following types of basic problems-
i. | Classification | Categorization of transactions as Risky/ Non-Risky |
ii. | Clustering | Identifying transactions having similar characteristics |
iii. | Association | Identifying correlations between transactions or actions |
iv. | Summarization | Computing the impact of certain category of transactions |
v. | Network analysis | Identification of related entities across multiple levels of connections |
vi. | Deviation detection | Identification of transactions which are significantly different from the norm |
vii. | Prediction | Estimation of quantity/ materiality of transactions |
viii. | Image Analytics | Analysis of data trends and patterns visually |
ML algorithms enable detection of sophisticated cases of non-compliant actions / potential frauds from the test data,even though it may not be clear to the Auditors a priori, as to how the data fields may be internally correlated.
In terms of the types of Audit assignments in which ML algorithms could potentially be applied, the following are some illustrative examples where these algorithms can be applied for Image Analytics, with the images obtained from satellites, drones or CCTV cameras-
The use of ML algorithms during Audit is poised at a very interesting stage, with a large scope for innovation and delivery of insights and results that would not be feasible through human analysis alone. It is for this reason that SAIs around the world are deeply engaged in identifying and developing strong uses for ML algorithms.
Forty years ago, recognising the knowledge driven nature of our Institution, the quarterly publication of the “Journal of Management and Training” was initiated....
The need for enhancing the value and impact of CAG’s audit has been an enduring concern...
In an era where technology intertwines with governance, the shift towards online portals for delivering public services has been monumental...
The democratization of cutting-edge technologies such as Geographic Information Systems (GIS) and Remote Sensing stands as a huge shift, empowering...
Social programmes have evolved over the decades, aiming for increased coverage, higher entitlements, and better design by using technology for targeting...
Human beings thrive in diverse ecosystems but often simplify these ecosystems over time. This simplification occurs through targets...
In the vast landscape of artificial intelligence, few innovations have captured the imagination of the humans and sparked as much excitement as...
It was on 14th December 2022 when our posting orders came while I was still attached as an Assistant Director in the CRA Wing in the Headquarters Office, New Delhi....
Water: The word itself is an indication of life on the planet. About 60% of the human body contains water...