Generative AI: information retrieval and query answering

Generative AI (GenAI) has received significant attention since the launch of ChatGPT by OpenAI. Many business units, small and large, want to leverage it for their business needs. The widespread adoption of such GenAI offerings by the industry has led to newer ones being added at a rapid pace.

Many organizations have structured data stored in sources such as databases (DB) and spreadsheets. Retrieval of information from databases is generally performed by writing DB queries. 

Search techniques are commonly employed to retrieve information from unstructured data sources such as documents. Users input keywords or terms, and the system displays the matching parts of the documents for them to read and interpret. This can be time-consuming, especially when dealing with large volumes of documents, and manually summarizing the results is subjective and prone to error.

The rise … and rise … of Large Language Models

Chat applications are vital for responding to user queries and cater to business users who seek answers from both structured and unstructured data sources. Basic chatbots rely on predefined questions and answers, requiring human intervention for unanswered queries. Since scaling chat operations with human help isn't always feasible, GenAI emerges as a scalable and expedient alternative.

Large Language Models (LLMs), trained on vast corpora of text, are effective for many Natural Language Processing (NLP) tasks. LLMs can capture the semantic meaning of text and use it to perform searches. They are also trained to perform tasks such as text summarization and converting text to DB queries. Hence, LLMs can provide effective responses to queries over both structured and unstructured sources.
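
As a concrete illustration, here is a minimal sketch of embedding-based semantic search. It assumes the open-source sentence-transformers library and the public all-MiniLM-L6-v2 model purely as examples, not as choices prescribed by this article; the documents and query are hypothetical.

```python
# A minimal sketch of embedding-based semantic search, assuming the
# sentence-transformers library and the public all-MiniLM-L6-v2 model.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")

# Hypothetical snippets standing in for business documents.
documents = [
    "Invoices are processed within five business days.",
    "Employees accrue 1.5 leave days per month.",
    "Refunds are issued to the original payment method.",
]

query = "How long does invoice processing take?"

# Embed the query and the documents, then rank by cosine similarity.
doc_embeddings = model.encode(documents, convert_to_tensor=True)
query_embedding = model.encode(query, convert_to_tensor=True)
scores = util.cos_sim(query_embedding, doc_embeddings)[0]

best = int(scores.argmax())
print(f"Best match: {documents[best]} (score={float(scores[best]):.3f})")
```

Because the comparison is done on embeddings rather than keywords, the first document is retrieved even though it shares few exact terms with the query.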

LLMs are trained on proprietary or public data, enabling them to generalize effectively to unseen data. However, in the context of structured data, the generated DB query must correspond to the business schema, and the summary must follow basic business guidelines. Supplying such additional information alongside the user query, commonly termed "prompt engineering," guides LLMs to produce the desired outputs.

User queries on structured data

Structured data is stored in DBs such as relational (SQL) databases and graph databases like Neo4j. Publicly available source-code repositories contain DB queries, which LLMs are trained on. They can therefore generate DB queries from user text, enabling text-based interactions with DBs.

Off-the-shelf LLMs lack knowledge of a business application's DB schema, which must therefore be provided for them to generate pertinent DB queries. For example, if numerous database tables exist, their schemas should accompany the user query. Given this information, LLMs can generate database queries that combine information from multiple tables.

While off-the-shelf LLMs excel at generating DB queries, they struggle with complex DB schemas. For example, with numerous tables, inaccurate results may arise from incorrect table choices. Similarly, complex queries spanning multiple tables may degrade performance. This can be addressed by efficient prompt design, as sketched below, or by fine-tuning the LLM.
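
To illustrate schema-aware prompt design, here is a minimal sketch that supplies hypothetical table schemas alongside the user question. It assumes the OpenAI Python client and the gpt-4o-mini model as illustrative choices; any capable LLM endpoint could be substituted.

```python
# A minimal sketch of schema-aware text-to-SQL prompting, assuming the
# OpenAI Python client; the table schemas and question are hypothetical.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

SCHEMA = """
CREATE TABLE customers (id INT PRIMARY KEY, name TEXT, region TEXT);
CREATE TABLE orders (id INT PRIMARY KEY,
                     customer_id INT REFERENCES customers(id),
                     amount DECIMAL, order_date DATE);
"""

question = "What is the total order amount per region in 2023?"

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system",
         "content": "You translate questions into SQL. Use only the tables "
                    f"in this schema:\n{SCHEMA}\nReturn a single SQL query."},
        {"role": "user", "content": question},
    ],
)
print(response.choices[0].message.content)
```

Constraining the model to the supplied schema is what keeps the generated SQL executable against the actual tables, including joins across them.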

User queries on unstructured data

Business users seek answers from various documents, such as user manuals, policy documents, reports, and knowledge articles, across formats like Word, PDF, PPT, and spreadsheets. Information scattered across these documents must be drawn into a cohesive summary in response to user queries. LLMs can be used to effectively locate and summarize pertinent information from these documents.

There are two tasks the LLM needs to perform, both of which it can handle inherently, as outlined in the next section:

  • Locate relevant information
  • Summarize found data

LLM: the options for efficient query answering

In business applications, user queries rely on proprietary and/or public data. Strategies for optimizing LLMs vary depending on the use case, the volume of data, timelines, and the available budget. You can adopt a suitable strategy by weighing the pros and cons of each.

Below are the suggested strategies to choose from:

  1. Retrieval Augmented Generation (RAG): LLMs can retrieve relevant information from sources and generate a response to the user query. RAG approaches typically use LLMs to retrieve data:
  • For structured sources like databases, the DB schema, including table details, is supplied with the user query. This ensures that the generated DB query matches the schema and can be executed readily.
  • For unstructured documents, the content is indexed and stored for semantic retrieval, often chunked at the section, page, paragraph, or sentence level, with embeddings (vectors) computed and stored accordingly. When a user query is processed, its embedding is generated and compared with those of the document chunks, and the matching ones are retrieved. Choosing the right embedding LLM is crucial to avoid missing key information needed to answer queries.

Once retrieved, the information can be summarized to respond to the user query. Irrelevant retrieved information is handled gracefully because summarization is contextual, as the sketch below shows. RAG, a common strategy, uses pre-trained models and eliminates the need for additional training. It can be easily scaled to new data and offers short development cycles, saving time and cost.
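
Putting the pieces together, here is a minimal end-to-end RAG sketch: document chunks are embedded, the top matches for a query are retrieved by cosine similarity, and an LLM summarizes them into an answer. The sentence-transformers retriever, the OpenAI client, and the chunks themselves are all illustrative assumptions.

```python
# A minimal RAG sketch, assuming sentence-transformers for retrieval and
# the OpenAI client for generation; chunks and query are hypothetical.
from openai import OpenAI
from sentence_transformers import SentenceTransformer, util

retriever = SentenceTransformer("all-MiniLM-L6-v2")
client = OpenAI()

# In practice these chunks come from splitting real documents at the
# section, page, paragraph, or sentence level.
chunks = [
    "Warranty claims must be filed within 90 days of purchase.",
    "The device supports Bluetooth 5.0 and Wi-Fi 6.",
    "Claims are processed by the regional service centre.",
]
chunk_embeddings = retriever.encode(chunks, convert_to_tensor=True)

query = "How do I file a warranty claim?"
query_embedding = retriever.encode(query, convert_to_tensor=True)

# Retrieve the top-2 chunks by cosine similarity.
scores = util.cos_sim(query_embedding, chunk_embeddings)[0]
top_chunks = [chunks[int(i)] for i in scores.argsort(descending=True)[:2]]

# Summarize the retrieved context into an answer; an irrelevant chunk is
# tolerated because the model answers in the context of the query.
completion = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "Answer using only the given context."},
        {"role": "user",
         "content": f"Context:\n{chr(10).join(top_chunks)}\n\nQuestion: {query}"},
    ],
)
print(completion.choices[0].message.content)
```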

  2. Fine-tuning an LLM for the business application: Fine-tuning is an option for customizing an LLM for a complex or domain-specific problem. A pre-trained LLM can usually be fine-tuned for a specific task to achieve better performance than an off-the-shelf LLM. For example, LLMs can be fine-tuned to convert a user query into a SQL query relevant to the business's tables and/or data.

Fine-tuned models require regular monitoring, retraining, and redeployment to stay current with new data. Choosing between RAG and fine-tuning depends on the tradeoffs specific to the business application.
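
As an illustration of what fine-tuning data looks like, here is a minimal sketch that writes hypothetical question-to-SQL pairs into the JSONL chat format accepted by OpenAI's fine-tuning API; other fine-tuning services use similar formats, and the example pairs are invented for illustration.

```python
# A minimal sketch of preparing text-to-SQL fine-tuning examples in the
# JSONL chat format used by OpenAI's fine-tuning API; pairs are hypothetical.
import json

examples = [
    ("List customers in the EMEA region.",
     "SELECT name FROM customers WHERE region = 'EMEA';"),
    ("Total order amount in 2023?",
     "SELECT SUM(amount) FROM orders WHERE order_date BETWEEN "
     "'2023-01-01' AND '2023-12-31';"),
]

with open("finetune_train.jsonl", "w") as f:
    for question, sql in examples:
        record = {
            "messages": [
                {"role": "system", "content": "Translate questions to SQL."},
                {"role": "user", "content": question},
                {"role": "assistant", "content": sql},
            ]
        }
        f.write(json.dumps(record) + "\n")
```

A few hundred to a few thousand such pairs, drawn from the business's own schema and query patterns, typically make up a fine-tuning set.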

  3. Training an LLM on the data to be queried: Consider training an LLM on your specific data for higher accuracy, especially when that data differs significantly from the data used for pre-training. For unstructured sources, the LLM then responds directly to user queries, much like ChatGPT. With structured sources, there is no need to provide the DB schema along with the user query.

LLMs such as GPT-4 require significant compute power, such as Graphics Processing Units (GPUs), to train, which is expensive. Preparing clean data with minimal noise and bias for training also requires significant effort.

With millions, or even billions, of parameters, LLMs risk overfitting or underfitting the data used for training. This requires careful data preparation, training, and evaluation. Regular retraining is also needed to keep LLMs up to date. In the context of documents, training on the entire text corpus of available documents will suffice, while a few hundred or a few thousand example pairs are needed for user-query-to-DB-query conversion, depending on the LLM architecture.
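
For a sense of the mechanics (though not the scale), here is a minimal sketch of training a small causal language model on a toy document corpus. It assumes Hugging Face transformers and datasets with the small public distilgpt2 model; a production effort would involve far larger models, corpora, and GPU budgets, as noted above.

```python
# A minimal sketch of continued training of a small causal LM on a toy
# corpus, assuming Hugging Face transformers/datasets and distilgpt2.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("distilgpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token
model = AutoModelForCausalLM.from_pretrained("distilgpt2")

# Toy corpus standing in for the business documents.
corpus = ["Warranty claims must be filed within 90 days of purchase.",
          "Claims are processed by the regional service centre."]
dataset = Dataset.from_dict({"text": corpus}).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1,
                           per_device_train_batch_size=2),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```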

Transforming business applications with evolving LLMs

Given the vast potential of LLMs to retrieve information from structured and unstructured sources, we are seeing the development of applications where users can query these sources in natural language to find the details they need. With rapid progress in the field, more advanced models offering higher performance will be released regularly.

We've discussed three strategies — RAG, fine-tuning, and training — that can be adopted. The choice will depend on the business application and the performance that is required. With regular advances being made in LLMs, information retrieval using them will become more accurate and efficient.

About the author:

Vishnu Vardhan Makkapati is an Associate Director at EY Global Delivery Services India LLP.

He leads the Client Technology AI Center of Excellence in India. He holds more than 35 patents, with many others pending.

Vishnu has published several papers in reputed international conferences and journals.

The views reflected in this article are the views of the author and do not necessarily reflect the views of the global EY organization or its member firms.

