Integrating Machine Learning into Automated Accounting Transaction Classification: Architecture, Algorithms, and Performance Evaluation
Disha Patel , Senior Accounts Manager New York, USAAbstract
This article conducts a comparative analysis of the efficiency of various machine learning algorithms in addressing the task of classifying accounting transactions — a component ensuring the accuracy of financial reporting and enhancing operational efficiency. The aim of this study is to analyze different machine learning algorithms for the task of automated classification of accounting entries. The methodological basis of the research includes an extensive review of specialized literature, where the architectures of models such as logistic regression, support vector machine (SVM), random forest and gradient boosting are analyzed, as well as promising neural network solutions employing natural language processing (NLP) technologies. As a result of the experiment, a comparative analysis is presented according to key metrics (accuracy, recall, F1-score) and a hybrid architecture is proposed, combining an NLP module based on the BERT model and a gradient boosting classifier, which demonstrates the best results when processing transactions with complex textual descriptions. The scientific novelty of the work lies in the description of a conceptual model for selecting the optimal algorithm depending on the characteristics of the original data set and in substantiating the advantages of the proposed hybrid architecture, which integrates natural language processing methods for extracting semantic features and ensemble algorithms for final classification. In conclusion it is emphasized that the implementation of intelligent classification automation not only minimizes the influence of the human factor but also transforms the role of the accountant from a data entry operator into a strategic analyst. The obtained data are of interest to researchers in financial engineering and artificial intelligence, practicing accountants and auditors, as well as developers of software products for the automation of financial flow management.
Keywords
machine learning, classification of transactions, accounting, artificial intelligence, natural language processing, automation, gradient boosting, deep learning, financial technologies, categorical data
References
Gartner. Gartner identifies the top 10 strategic technology trends for 2024. https://www.gartner.com/en/newsroom/press-releases/2023-10-16-gartner-identifies-the-top-10-strategic-technology-trends-for-2024 (date of request: 05/08/2025).
The future of accounting: Trends impacting ACCA and CMA professionals. (n.d.). Alight Motion Mod Pro. Retrieved from (date of request: 05/10/2025).
Dong, H., Liu, R., & Tham, A. W. (2024). Accuracy comparison between five machine learning algorithms for financial risk evaluation. Journal of Risk and Financial Management, 17(2). https://doi.org/10.3390/jrfm17020050
Virro, H., Järveoja, J., Küttim, M., & Järvet, A. (2022). Random forest-based modeling of stream nutrients at national level in a data-scarce region. Science of The Total Environment, 840. https://doi.org/10.1016/j.scitotenv.2022.156613
Kotios, D., Arampatzis, A., Dimou, A., & Doulamis, A. (2022). Deep learning enhancing banking services: A hybrid transaction classification and cash flow prediction approach. Journal of Big Data, 9 (1).
Liapis, C. M., & Kotsiantis, S. (2023). Temporal convolutional networks and BERT-based multi-label emotion analysis for financial forecasting. Information, 14(11). https://doi.org/10.3390/info14110596
Daroń, M., & Górska, M. (2023). Enterprises development in context of artificial intelligence usage in main processes. Procedia Computer Science, 225, 2214–2223. https://doi.org/10.1016/j.procs.2023.10.212
Guo, Z., Schlichtkrull, M., & Vlachos, A. (2022). A survey on automated fact-checking. Transactions of the Association for Computational Linguistics, 10, 178–206. https://doi.org/10.1162/tacl_a_00454
Cha, G. W., Moon, H. J., & Kim, Y. C. (2021). Comparison of random forest and gradient boosting machine models for predicting demolition waste based on small datasets and categorical variables. International Journal of Environmental Research and Public Health, 18(16), 1–16. https://doi.org/10.3390/ijerph18168530
Talaat, A. S. (2023). Sentiment analysis classification system using hybrid BERT models. Journal of Big Data, 10(1), 1–18.
Monteiro, A., & Cepêda, C. (2021). Accounting information systems: Scientific production and trends in research. Systems, 9(3), 5–20. https://doi.org/10.3390/systems9030067
PwC. (2023). PwC's Global Artificial Intelligence Study: Sizing the prize. https://www.pwc.com/gx/en/issues/data-and-analytics/publications/artificial-intelligence-study.html (date of request: 05/19/2025).
Stancu, M. S., & Dutescu, A. (2021). The impact of the artificial intelligence on the accounting profession, a literature’s assessment. Proceedings of the International Conference on Business Excellence, 15(1), 749–758.
Article Statistics
Copyright License
Copyright (c) 2025 Disha Patel

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain the copyright of their manuscripts, and all Open Access articles are disseminated under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which licenses unrestricted use, distribution, and reproduction in any medium, provided that the original work is appropriately cited. The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.