Empowering farmers with artificial intelligence: a retrieval-augmented generation based large language model advisory framework

Shreeram Sawant; Rahul Nair; Siddharth Hariharan

doi:10.4081/jae.2026.1908

Authors

Shreeram Sawant

sawantshreeram2122@ternaengg.ac.in

https://orcid.org/0009-0002-4344-6056

Computer Engineering, Terna Engineering College, Navi Mumbai, Maharashtra, India.

Rahul Nair

https://orcid.org/0009-0003-4243-072X

Computer Engineering, Terna Engineering College, Navi Mumbai, Maharashtra, India.

Siddharth Hariharan

https://orcid.org/0000-0002-1890-162X

Computer Engineering, Terna Engineering College, Navi Mumbai, Maharashtra, India.

This study presents a retrieval augmented generation (RAG) based system designed to provide farmers with expert agricultural advisory services. The framework delivers context aware guidance on critical practices such as crop cultivation, pest and disease management, fertilizer application, and other agronomic practices, and compares the performance of four large language models (LLMs) in generating these recommendations. The system processes package of practices (PoP) documents for five major crops maize, ragi, sweet potato, cotton, and groundnut through semantic chunking and embedding using Amazon Titan via BedrockEmbeddings. Vector representations are indexed in ChromaDB to enable efficient similarity search for query-relevant content retrieval. Upon receiving user queries, the system retrieves the most semantically similar document chunks and incorporates them into structured prompts. Four LLMs such as Llama3.1, Mistral, Phi3, and Qwen2.5 were evaluated for their effectiveness in generating accurate agricultural recommendations. Performance was evaluated across multiple dimensions. Relevance and retrieval were assessed using precision@K, recall@K, mean reciprocal rank (MRR), and normalized discounted cumulative gain (NDCG). Lexical overlap was measured with the bilingual evaluation understudy (BLEU) and recall-oriented understudy for gisting evaluation (ROUGE-1, ROUGE-2, ROUGE-L) metrics. Semantic quality was analyzed using Bidirectional Encoder Representations from transformers score (BERTScore) precision, recall, F1, semantic similarity and faithfulness to capture contextual alignment between generated and reference responses. Source attribution was assessed through the attribution score, while efficiency was measured using retrieval time, generation time, and total time. Overall, mistral and Qwen2.5 achieved the highest performance, demonstrating superior relevance, semantic quality, and efficiency. This evaluation highlights which LLMs perform best for the agricultural domain and illustrates the potential of knowledge-grounded AI systems to democratize agricultural expertise, particularly in regions with limited access to traditional advisory services.

Downloads

Download data is not yet available.

Citations

A, S., Krishnan, A.G., V, G. 2024. Leveraging technology to empower millet farmers a retrieval-augmented generation approach with large language models. Proc. 5th IEEE Global Conf. Advancement in Technology (GCAT), Bangalore; pp. 1-7.

Acharya, D.B., Kuppan, K., Divya, B. 2025. Agentic AI: autonomous intelligence for complex goals - A comprehensive survey. IEEE Access 13:18912-18936.

Arslan, M., Ghanema, H., Munawarb, S., Cruza, C. 2024. A survey on RAG with LLMs. Procedia Comput. Sci. 246:3781-3790.

Balpande, M., Mahajan, K., Bhandarkar, J., Borse, G., Badjat, S. 2024. AI powered agriculture optimization chatbot using RAG and GenAI. Proc. IEEE Silchar Subsection Conf. (SILCON 2024), Agartala; pp. 1-6.

Dhanabalan, T., Sathish, A. 2018. Transforming Indian industries through artificial intelligence and robotics in industry 4.0. Int. J. Mech. Eng. Technol. 9:835-845.

Government of Kerala, Directorate of Economics and Statistics, EARAS Division. 2020. Agricultural Statistics 2018-19. Available from: https://ecostat.kerala.gov.in/storage/publications/239.pdf

Hu, R., Liu, S., Qi, P., Liu, J., Li, F. 2025. ICCA-RAG: intelligent customs clearance assistant using retrieval-augmented generation (RAG). IEEE Access 13:39711-39726.

Irican, B.B., Sivri, M., Kokach, V., Kocacinar, B., Akbulut, F.P. 2024. QBot: domain-specific chatbots with retrieval-augmented generation and vector embedding for complex documentation queries. Proc. Innovations in Intelligent Systems and Applications Conf. (ASYU), Ankara; pp. 1-6.

K S, N.P., S, S., T N, T., Yuvraaj, Y., D A, V. 2023. Conversational chatbot builder – smarter virtual assistance with domain specific AI. Proc. 4th Int. Conf. Emerging Technology (INCET), Belgaum; pp. 1-4.

Kar, R., Haldar, R. 2016. Applying chatbots to the internet of things: opportunities and architectural elements. arXiv:1611.03799.

Khanifar, J. 2025. Evaluating AI-generated responses from different chatbots to soil science-related questions. Soil Adv. 3:100034.

Kim, M., Kim, D., Park, Y., Jeong, D. 2024. Development of an expert chatbot for digital forensics using RAG model implementation. Proc. Int. Conf. Platform Technology and Service (PlatCon), Jeju; pp. 182-187.

Legashev, L., Shukhman, A., Badikov, V., Kurynov, V. 2025. Using large language models for goal-oriented dialogue systems. Appl. Sci. 15:4687.

Mathebula, M., Modupe, A., Marivate, V. 2024. Fine-tuning retrieval-augmented generation with an auto-regressive language model for sentiment analysis in financial reviews. Appl. Sci. 14:10782.

Meng, W., Li, Y., Chen, L., Dong, Z. 2025. Using the retrieval-augmented generation to improve the question-answering system in human health risk assessment: the development and application. Electronics 14:386.

P, K., M, H., Hayagreevan, V. 2025. Development of interactive assistance for academic preparation using large language models. Proc. Int. Conf. Computational, Communication and Information Technology (ICCCIT), Indore; pp. 265-269.

Saha, B., Saha, U., Zubair Malik, M. 2024. QuIM-RAG: advancing retrieval-augmented generation with inverted question matching for enhanced QA performance. IEEE Access 12:185401-185410.

V, N., G. A, S., S, G., M, K., A, M., S, T. 2024. AgriBot: An integrated chatbot platform for precision agriculture and farmer support using deep learning techniques. Proc. Int. Conf. Power, Energy, Control and Transmission Systems (ICPECTS), Chennai; pp. 1-6.

Wilkho, R.S., Chang, S., Gharaibeh, N.G. 2023. FF-BERT: A BERT-based ensemble for automated classification of web-based text on flash flood events. Adv. Eng. Inform. 59:102293.

Zhou, B., Zou, L., Mostafavi, A., Lin, A., Yang, M., Gharaibeh, N., et al. 2022. VictimFinder: harvesting rescue requests in disaster response from social media with BERT. Comput. Environ. Urban Syst. 95:101824.

Xiong, J., Pan, L., Liu, Y., Zhu, L., Zhang, L., Tan, S. 2025. Enhancing plant protection knowledge with large language models: a fine-tuned question-answering system using LoRA. Appl. Sci. 15:3850.

Yin, S., Xi, Y., Zhang, X., Sun, C., Mao, Q. 2025. Foundation models in agriculture: a comprehensive review. Agriculture 15:847.

Zafarmomen, N., Samadi, V. 2025. Can large language models effectively reason about adverse weather conditions? Environ. Model. Softw. 188:106421.

Zhang, W., Zhang, J. 2025. Hallucination mitigation for retrieval-augmented large language models: a review. Mathematics 13:856.

CRediT authorship contribution

Shreeram Sawant contributed to conception and design of the RAG-based LLM advisory framework, analysis and interpretation of experimental results, drafting of the original manuscript, and critical revision for important intellectual content. Rahul Nair contributed to conception and design of the system architecture, analysis and interpretation of performance data, drafting of methodology sections, and critical revision for important intellectual content. Siddharth Hariharan contributed to conception and design of the research approach, analysis and interpretation of results, critical revision of the manuscript for important intellectual content. All authors provided final approval of the version to be published and agreed to be accountable for all aspects of the work.

How to Cite

“Empowering farmers with artificial intelligence: a retrieval-augmented generation based large language model advisory framework” (2026) Journal of Agricultural Engineering [Preprint]. doi:10.4081/jae.2026.1908.

Download Citation

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Current Issue

Empowering farmers with artificial intelligence: a retrieval-augmented generation based large language model advisory framework

Authors

Downloads

Citations

CRediT authorship contribution

How to Cite

Download Citation

authors

reviewers

indexing

Keywords