RETRIEVAL-AUGMENTED GENERATION WITH SMALL LLMS FOR KNOWLEDGE-DRIVEN DECISION AUTOMATION IN ENTERPRISE SERVICE PLATFORMS

Main Article Content

Siva Hemanth Kolla

Abstract

Enterprise service platforms connect various knowledge artifacts and office applications in organizations to enable automation of routine decision-making. During this automation, service requests are expressed as domain-independent knowledge queries to capture gaps in knowledge related to governance, operations, risk management, customer service, and other enterprise aspects, and stored in a knowledge repository. Retrieval-augmented generation driven by small-scale pre-trained transformers offers an ideal means to automate responses to such queries because information retrieval and text-to-text generation can be achieved using state-of-the-art—if not better—large language models without incurring the high inference costs associated with their larger counterparts. A system architecture providing this functionality is presented, together with an exploration of the elements of the knowledge-retrieval phase. Empirical evaluation of the effectiveness of the retrieval step shows that it satisfies the requirements of a diverse set of queries.


Deployments of enterprise service platforms within organizations have shown that a significant proportion of service requests relate to knowledge gaps in domains such as governance, operations, risk management, customer service, and so on. Efforts to support automation of these decision-making tasks attempt to address such requests by posing knowledge-retrieval queries for the pertinent answers. Cross-domain databases, policy repositories, internal and external knowledge bases, and other such information collections serve as knowledge sources. To support these requests, retrieval-augmented generation leverages a combination of information retrieval and large language models.

Downloads

Download data is not yet available.

Article Details

How to Cite
Kolla, S. H. (2024). RETRIEVAL-AUGMENTED GENERATION WITH SMALL LLMS FOR KNOWLEDGE-DRIVEN DECISION AUTOMATION IN ENTERPRISE SERVICE PLATFORMS. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 15(3), 476–486. https://doi.org/10.61841/turcomat.v15i3.15497
Section
Articles

References

[1] Adomavicius, G., & Tuzhilin, A. (2005). Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6), 734–749.

[2] Meda, R. (2024). Agentic AI in Multi-Tiered Paint Supply Chains: A Case Study on Efficiency and Responsiveness. Journal of Compu-tational Analysis and Applications (JoCAAA), 33(08), 3994-4015.

[3] Albahri, A. S., Al-Obaidi, J. R., Zaidan, A. A., Zaidan, B. B., Hamid, R. A., Albahri, O. S., & Rashid, N. A. (2021). Multi-biometric systems: A state of the art survey and research directions. IEEE Access, 9, 104050–104095.

[4] Sheelam, G. K. (2024). Towards Autonomic Wireless Systems: Integrating Agentic AI with Advanced Semiconductor Technologies in Telecommunications. American Online Journal of Science and Engineering (AOJSE)(ISSN: 3067-1140), 2(1).

[5] Aldasoro, I., Gambacorta, L., Giudici, P., & Leach, T. (2024). The rise of artificial intelligence in financial services. BIS Quarterly Review, 1, 45–58.

[6] Kummari, D. N., & Burugulla, J. K. R. (2023). Decision Support Systems for Government Auditing: The Role of AI in Ensuring Transparency and Compliance. International Journal of Finance (IJFIN)-ABDC Journal Quality List, 36(6), 493-532.

[7] Anagnostopoulos, I. (2022). Artificial intelligence in financial services: A critical review of applications and challenges. Journal of Financial Regulation and Compliance, 30(2), 195–210.

[8] Anderson, C., Domingos, P., & Weld, D. S. (2002). Relational Markov models and their application to adaptive web navigation. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 143–152). ACM.

[9] Ramesh Inala. (2023). Big Data Architectures for Modernizing Customer Master Systems in Group Insurance and Retirement Planning. Educational Administration: Theory and Practice, 29(4), 5493–5505. https://doi.org/10.53555/kuey.v29i4.10424

[10] Arik, S. Ö., & Pfister, T. (2021). TabNet: Attentive interpretable tabular learning. In Proceedings of the AAAI Conference on Artificial Intelligence (pp. 6679–6687).

[11] A Scalable Web Platform for AI-AugmentedSoftware Deployment in Automotive Edge Devices via Cloud Services. (2024). American Advanced Journal for Emerging Disciplinaries (AAJED) ISSN: 3067-4190, 2(1). https://aajed.com/index.php/aajed/article/view/12

[12] Attia, P. M., & Dayan, P. (2022). Neural systems for decision making. Nature Reviews Neuroscience, 23(9), 561–576.

[13] Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. arXiv.

[14] Nagabhyru, K. C. (2024). Data Engineering in the Age of Large Language Models: Transforming Data Access, Curation, and Enterprise Interpretation. Computer Fraud and Security.

[15] Banerjee, S., & Pedersen, T. (2003). The design, implementation, and use of the Ngram Statistics Package. In Proceedings of CICLing 2003 (pp. 370–381). Springer.

[16] Aitha, A. R. (2024). Generative AI-Powered Fraud Detection in Workers' Compensation: A DevOps-Based Multi-Cloud Architecture Leveraging, Deep Learning, and Explainable AI. Deep Learning, and Explainable AI (July 26, 2024).

[17] Basel Committee on Banking Supervision. (2023). Principles for the management of counterparty credit risk. Bank for International Settlements.

[18] Bast, H., Buchhold, B., & Haussmann, E. (2016). Semantic search on text and knowledge bases. Foundations and Trends in Information Retrieval, 10(2–3), 119–271.

[19] Vajpayee, A., Khan, S., Gottimukkala, V. R. R., Sharma, D., & Seshasai, S. J. (2025). Digital Financial Literacy 4.0: Consumer Readiness for AI-Driven Fintech and Blockchain Ecosystems. International Insurance Law Review, 33(S5), 963-973.

[20] Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. (2021). On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (pp. 610–623). ACM.

[21] Bengio, Y., Lecun, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444.

[22] Segireddy, A. R. (2024). Machine Learning-Driven Anomaly Detection in CI/CD Pipelines for Financial Applications. Journal of Computational Analysis and Applications, 33(8).

[23] Bholat, D., Gharbawi, M., & Thew, O. (2023). Machine learning, big data, and financial stability. Financial Stability Review, 27, 33–49.

[24] Bhutani, N., Zheng, X., & Jagadish, H. V. (2020). Learning to inject knowledge into language generation. In Proceedings of EMNLP 2020 (pp. 3469–3479). ACL.

[25] Amistapuram, K. (2024). Generative AI in Insurance: Automating Claims Documentation and Customer Communication. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 15(3), 461–475.

[26] Bird, S., Klein, E., & Loper, E. (2009). Natural language processing with Python. O’Reilly Media.

[27] Yandamuri, U. S. AI-Driven Decision Support Systems for Operational Optimization in Hospitality Technology.

[28] Rongali, S. K. (2024). Federated and Generative AI Models for Secure, Cross-Institutional Healthcare Data Interoperability. Journal of Neonatal Surgery, 13(1), 1683-1694.

[29] Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.

[30] Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1–7), 107–117.

[31] Varri, D. B. S. (2023). Advanced Threat Intelligence Modeling for Proactive Cyber Defense Systems. Available at SSRN 5774926.

[32] Buckley, C., & Voorhees, E. M. (2004). Retrieval evaluation with incomplete information. In Proceedings of SIGIR 2004 (pp. 25–32). ACM.

[33] Carbonell, J., & Goldstein, J. (1998). The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of SIGIR 1998 (pp. 335–336). ACM.

[34] Nagubandi, A. R. (2023). Advanced Multi-Agent AI Systems for Autonomous Reconciliation Across Enterprise Multi-Counterparty Derivatives, Collateral, and Accounting Platforms. International Journal of Finance (IJFIN)-ABDC Journal Quality List, 36(6), 653-674.

[35] Chen, J., Chen, H., Wu, S., Yu, H., & Xiong, C. (2022). In-context learning with retrieval augmentation. In Proceedings of EMNLP 2022 (pp. 4569–4582). ACL.

[36] Guntupalli, R. (2024). Enhancing Cloud Security with AI: A Deep Learning Approach to Identify and Prevent Cyberattacks in Multi-Tenant Environments. Available at SSRN 5329132.

Similar Articles

<< < 17 18 19 20 21 22 23 24 25 26 > >> 

You may also start an advanced similarity search for this article.