The Next Wave in Generative AI: Harnessing the Power of Agents

Terms of use

Terms of Use

The use of this site and the content contained therein is governed by the Terms of Use. When you use this site you acknowledge that you have read the Terms of Use and that you accept and will be bound by the terms hereof and such terms as may be modified from time to time.

All text, graphics, audio, design and other works on the site are the copyrighted works of nasscom unless otherwise indicated. All rights reserved.
Content on the site is for personal use only and may be downloaded provided the material is kept intact and there is no violation of the copyrights, trademarks, and other proprietary rights. Any alteration of the material or use of the material contained in the site for any other purpose is a violation of the copyright of nasscom and / or its affiliates or associates or of its third-party information providers. This material cannot be copied, reproduced, republished, uploaded, posted, transmitted or distributed in any way for non-personal use without obtaining the prior permission from nasscom.
The nasscom Members login is for the reference of only registered nasscom Member Companies.
nasscom reserves the right to modify the terms of use of any service without any liability. nasscom reserves the right to take all measures necessary to prevent access to any service or termination of service if the terms of use are not complied with or are contravened or there is any violation of copyright, trademark or other proprietary right.
From time to time nasscom may supplement these terms of use with additional terms pertaining to specific content (additional terms). Such additional terms are hereby incorporated by reference into these Terms of Use.

Disclaimer

The Company information provided on the nasscom web site is as per data collected by companies. nasscom is not liable on the authenticity of such data.
nasscom has exercised due diligence in checking the correctness and authenticity of the information contained in the site, but nasscom or any of its affiliates or associates or employees shall not be in any way responsible for any loss or damage that may arise to any person from any inadvertent error in the information contained in this site. The information from or through this site is provided "as is" and all warranties express or implied of any kind, regarding any matter pertaining to any service or channel, including without limitation the implied warranties of merchantability, fitness for a particular purpose, and non-infringement are disclaimed. nasscom and its affiliates and associates shall not be liable, at any time, for any failure of performance, error, omission, interruption, deletion, defect, delay in operation or transmission, computer virus, communications line failure, theft or destruction or unauthorised access to, alteration of, or use of information contained on the site. No representations, warranties or guarantees whatsoever are made as to the accuracy, adequacy, reliability, completeness, suitability or applicability of the information to a particular situation.
nasscom or its affiliates or associates or its employees do not provide any judgments or warranty in respect of the authenticity or correctness of the content of other services or sites to which links are provided. A link to another service or site is not an endorsement of any products or services on such site or the site.
The content provided is for information purposes alone and does not substitute for specific advice whether investment, legal, taxation or otherwise. nasscom disclaims all liability for damages caused by use of content on the site.
All responsibility and liability for any damages caused by downloading of any data is disclaimed.
nasscom reserves the right to modify, suspend / cancel, or discontinue any or all sections, or service at any time without notice.

For any grievances under the Information Technology Act 2000, please get in touch with Grievance Officer, Mr. Anirban Mandal at data-query@nasscom.in.

New

See all

No notification found.

The Next Wave in Generative AI: Harnessing the Power of Agents

Xoriant

@xoriant

August 14, 2024

AI Data Science & AI Community

3785

Authored by: Suresh Bansal, Technical Manager - Xoriant

The journey of Artificial Intelligence (AI) and Machine Learning (ML) has been transformative. It all began when we shifted from manual coding to training computers with data. In the early days, AI could only handle specific tasks like classification and object identification—functions for which they were explicitly trained.

But everything changed at the end of 2022 with the launch of ChatGPT by OpenAI. This groundbreaking tool could generate content and perform a wide range of tasks, quickly capturing the attention of millions worldwide. As noted in Gartner's 2023 Hype Cycle for AI, Generative AI has reached the "peak of inflated expectations" and is expected to hit the "Plateau of Productivity" within the next 5 to 10 years.

Overcoming Challenges and Limitations

Reaching the Plateau of Productivity, according to Gartner, means that AI will become widely adopted, with its benefits well-defined and clear guidelines for implementation. To get there, we must first address the current limitations of AI technology and explore how agents can help overcome these challenges.

While today’s large language models (LLMs) excel at tasks like generating emails, writing essays, and conducting sentiment analysis, they still struggle with complex tasks, such as intricate math calculations or multi-step problem-solving. Additionally, LLMs have other notable limitations:

Hallucinations or misleading outputs
Technical constraints like limited context length and memory
Bias in outputs
Potential for toxic or harmful speech
Limited knowledge (e.g., ChatGPT 3.5's knowledge cutoff is September 2021)

Interestingly, these challenges are not so different from those we humans face. We, too, are prone to mistakes, bias, limited memory, and occasionally harmful responses. To manage these shortcomings, we typically:

Seek information online and use tools like Excel and Word.
Revise our work multiple times to correct errors and improve quality.
Seek feedback from peers and mentors and incorporate their insights.
Collaborate in teams to achieve better results.

By applying similar strategies, we can improve the outputs from LLMs, leading us to the concept of Generative AI Agents.

What are Generative AI Agents?

Generative AI Agents are designed to overcome many of the limitations of current LLMs by executing complex tasks that standalone models cannot handle. For example, if you want to identify the top three companies by revenue from a dataset, an agent would:

Retrieve revenue data for all companies.
Sort the companies by revenue.
Return the top three companies.

To accomplish this, agents combine LLMs with key components such as planning, memory, and tools:

Planning: The agent outlines and executes a plan using an LLM.
Memory: The agent retains information while performing multiple steps, allowing it to process complex tasks.
Tools: Agents use various tools to perform specific tasks, which are discussed in more detail below.

Generative-AI-Agents-Xoriant

Key Features of Generative AI Agents

Generative AI agents are designed to:

Plan and execute tasks
Reflect on outcomes
Use tools to achieve specified goals
Operate with minimal human intervention

Examples of such agents include website builders, data analysts who provide insights from Excel sheets, and travel agents planning trips based on user inputs.

The Role of Tools in Generative AI Agents

Tools are critical for agents, enabling them to perform their tasks effectively. In the realm of generative AI, tools allow an LLM agent to interact with external environments and applications, such as internet searches, code interpreters, and math engines. These tools can access databases, knowledge bases, and external models.

For instance, a travel agent would need tools to search and book flights, as well as search the internet. Other tools could include:

Entity Extraction: Extract specific information from unstructured documents.
Chat DB: Retrieve information from a database without needing SQL knowledge.
Knowledge Bot: Uses Retrieval-Augmented Generation (RAG) to answer questions based on a custom knowledge repository.
Internet Search: Fetches content from search engines based on user queries.
Summarization: Provides summaries of large documents tailored to specific personas.
Program Execution: Executes Python code to solve specific problems.
Wikipedia Search: Retrieves content from Wikipedia based on user queries.
Comparison: Answers comparative questions, like performance metrics or product recommendations.

Tools-Generative-AI-Agents-Xoriant

Agentic Design Patterns

To perform complex tasks, agents must orchestrate these tools effectively. Based on lectures by Andrew NG, several agentic design patterns have emerged:

Reflection: The LLM evaluates its own work to improve it.
Tool Use: The LLM utilizes tools like web searches or code execution to gather information and process data.
Planning: The LLM devises a multi-step plan to achieve a goal and then executes it.
Multi-Agent Collaboration: Multiple AI agents collaborate, dividing tasks and debating ideas to find better solutions.

While the first two patterns yield predictable outcomes, the latter two are still in the experimental phase.

The LLM Agent Framework

Building on the understanding of agents, tools, and design patterns, a variation of the planning pattern emerges. This framework involves defining a task or goal and then iteratively planning and executing the next action, followed by a feedback loop.

An LLM agent consists of core components:

Brain/LLM: Acts as the coordinator.
Memory (Vector DB): Stores intermediate steps and results.
- Short-term memory: Holds context information within the context window.
- Long-term memory: An external vector store providing relevant contextual information.
Tools/Internet: Enable the agent to perform tasks like web searches or program execution.
Policy: Ensures trust by design, preventing the processing of toxic inputs.

Flow-Narrative-Generative-AI-Agents-Xoriant

A Future with Intelligent Agents

The future of generative AI lies in the collaboration between intelligent agents and humans. Imagine a world where doctors, designers, and customer service representatives are supported by agents that enhance their capabilities. The possibilities are endless, from scientific discoveries to artistic creations.

For businesses, integrating generative AI agents into their operations offers a strategic advantage, unlocking new levels of efficiency, personalization, and problem-solving. These agents won't replace human ingenuity; they'll empower it, shaping a future rich with innovation and progress.

About Author:

Suresh Bansal is a Technical Manager at Xoriant with expertise in Generative AI and technologies such as Vector DB, LLM, Hugging Face, Llama Index, Lang Chain, Azure, and AWS. With experience in pre-sales and sales, he has exceled at creating compelling technical proposals and ensuring client success. Suresh has worked with clients from the US, UK, Japan, and Singapore, achieved advanced-level partnerships with AWS, and presented research recommendations to C-level leadership.

Generative AI Gen AI in Tech Services Gen AI

Disclaimer

That the contents of third-party articles/blogs published here on the website, and the interpretation of all information in the article/blogs such as data, maps, numbers, opinions etc. displayed in the article/blogs and views or the opinions expressed within the content are solely of the author's; and do not reflect the opinions and beliefs of NASSCOM or its affiliates in any manner. NASSCOM does not take any liability w.r.t. content in any manner and will not be liable in any manner whatsoever for any kind of liability arising out of any act, error or omission. The contents of third-party article/blogs published, are provided solely as convenience; and the presence of these articles/blogs should not, under any circumstances, be considered as an endorsement of the contents by NASSCOM in any manner; and if you chose to access these articles/blogs , you do so at your own risk.

Xoriant

Xoriant is a Silicon Valley-headquartered digital product engineering, software development, and technology services firm with offices in the USA,UK, Ireland, Mexico, Canada and Asia. From startups to the Fortune 100, we deliver innovative solutions, accelerating time to market and ensuring our clients' competitiveness in industries like BFSI, High Tech, Healthcare, Manufacturing and Retail. Across all our technology focus areas-digital product engineering, DevOps, cloud, infrastructure, and security, big data and analytics, data engineering, management and governance -every solution we develop benefits from our product engineering pedigree. It also includes successful methodologies, framework components, and accelerators for rapidly solving important client challenges. For 30 years and counting, we have taken great pride in our long-lasting, deep relationships with our clients.

Decoding AI Studios: Making AI Accessible for Business

Janhvi Juyal

@juyal janhvi

26 Aug 2025

Data Science & AI Community Emerging Tech AI Industry Trends

In the last six to nine months, we’ve seen a proliferation of business-friendly AI studios, coming after the wave of developer-friendly AI platforms that emerged with GenAI. AI studios have started gaining traction by offering simplified, guided…

Intelligent Audit Models: Enabling AI-Ready, Digitally Resilient Data Centers

SPNX Consulti..

@SPNX

25 Aug 2025

Cyber Security & Privacy Data Privacy Threat Intelligence Digital Transformation AI IT Services

AI AS THE DEFINING FORCE OF GOVERNANCE Artificial intelligence is no longer confined to chatbots, automation scripts, or headline-grabbing innovations. A quieter, yet more profound revolution is underway in how data centers the invisible backbone…

Fine-Tuning in the Age of GPT-5: What's Changing?

Cyfuture.AI

@cyfutureai

14 Aug 2025

The AI landscape just witnessed its most significant inflection point since the launch of ChatGPT. OpenAI's GPT-5, released in August 2025, isn't just another incremental improvement—it's a paradigm shift that's fundamentally rewriting the rules of…

The Logical Evolution. Traditional AI -> Gen AI -> Agentic AI

jayantsethi74..

@jayantsethi7474

14 Aug 2025

The evolution from Traditional AI to Gen AI and now to Agentic AI marks a significant progression in Tech automation for organizations. The gradual adoption of these technologies is crucial, emphasizing the importance of a strong business case…

Agentic AI Is Here, And Looks Like It Will Stay

CSM Tech

@csmtechnologies

13 Aug 2025

Recent developments in artificial intelligence have shifted focus from generative AI to a more sophisticated paradigm known as "agentic AI." This emerging technological framework merges the adaptability of large language models (LLMs) with the…

What Exactly Are Multi-Modal AI Agents?

Sparkout Tech

@sparkouttechmarketing

13 Aug 2025

In the rapidly evolving landscape of artificial intelligence, a new and transformative technology is emerging: the multi-modal AI agent. While many of us are familiar with single-modal AI systems—like a chatbot that only understands text or a voice…

Topics In Demand

Notification

New

The Next Wave in Generative AI: Harnessing the Power of Agents

Overcoming Challenges and Limitations

What are Generative AI Agents?

Key Features of Generative AI Agents

The Role of Tools in Generative AI Agents

Agentic Design Patterns

The LLM Agent Framework

A Future with Intelligent Agents

Share this blog

Related blogs