WHY LLMS MAY (YET) FALL SHORT OF SAVING THE WORLD

Terms of use

Terms of Use

The use of this site and the content contained therein is governed by the Terms of Use. When you use this site you acknowledge that you have read the Terms of Use and that you accept and will be bound by the terms hereof and such terms as may be modified from time to time.

All text, graphics, audio, design and other works on the site are the copyrighted works of nasscom unless otherwise indicated. All rights reserved.
Content on the site is for personal use only and may be downloaded provided the material is kept intact and there is no violation of the copyrights, trademarks, and other proprietary rights. Any alteration of the material or use of the material contained in the site for any other purpose is a violation of the copyright of nasscom and / or its affiliates or associates or of its third-party information providers. This material cannot be copied, reproduced, republished, uploaded, posted, transmitted or distributed in any way for non-personal use without obtaining the prior permission from nasscom.
The nasscom Members login is for the reference of only registered nasscom Member Companies.
nasscom reserves the right to modify the terms of use of any service without any liability. nasscom reserves the right to take all measures necessary to prevent access to any service or termination of service if the terms of use are not complied with or are contravened or there is any violation of copyright, trademark or other proprietary right.
From time to time nasscom may supplement these terms of use with additional terms pertaining to specific content (additional terms). Such additional terms are hereby incorporated by reference into these Terms of Use.

Disclaimer

The Company information provided on the nasscom web site is as per data collected by companies. nasscom is not liable on the authenticity of such data.
nasscom has exercised due diligence in checking the correctness and authenticity of the information contained in the site, but nasscom or any of its affiliates or associates or employees shall not be in any way responsible for any loss or damage that may arise to any person from any inadvertent error in the information contained in this site. The information from or through this site is provided "as is" and all warranties express or implied of any kind, regarding any matter pertaining to any service or channel, including without limitation the implied warranties of merchantability, fitness for a particular purpose, and non-infringement are disclaimed. nasscom and its affiliates and associates shall not be liable, at any time, for any failure of performance, error, omission, interruption, deletion, defect, delay in operation or transmission, computer virus, communications line failure, theft or destruction or unauthorised access to, alteration of, or use of information contained on the site. No representations, warranties or guarantees whatsoever are made as to the accuracy, adequacy, reliability, completeness, suitability or applicability of the information to a particular situation.
nasscom or its affiliates or associates or its employees do not provide any judgments or warranty in respect of the authenticity or correctness of the content of other services or sites to which links are provided. A link to another service or site is not an endorsement of any products or services on such site or the site.
The content provided is for information purposes alone and does not substitute for specific advice whether investment, legal, taxation or otherwise. nasscom disclaims all liability for damages caused by use of content on the site.
All responsibility and liability for any damages caused by downloading of any data is disclaimed.
nasscom reserves the right to modify, suspend / cancel, or discontinue any or all sections, or service at any time without notice.

For any grievances under the Information Technology Act 2000, please get in touch with Grievance Officer, Mr. Anirban Mandal at data-query@nasscom.in.

New

See all

No notification found.

WHY LLMS MAY (YET) FALL SHORT OF SAVING THE WORLD

L&T Technology Services

@L&T Technology Services

May 16, 2024

Emerging Tech Data Privacy Engineering Research & Design

Large Language Models (LLMs) took the world by storm after OpenAI launched its generative pretraining transformer (GPT) engine and debuted ChatGPT in November 2022. In just two months, ChatGPT achieved a significant milestone of 100 million weekly active users, garnering the attention of business and technology leaders across industries.

While a growing body of users today are eager to integrate LLMs into their operations, the technology, in its current avatar, still requires considerable research and development for optimal performance. A recent survey of 150 senior executives from 29 countries revealed that 58% of companies are experimenting with LLMs, and the number only looks set to grow even further – underscoring the need for an accelerated development paradigm.

In a short period, LLMs today have seen broad applications across segments – from customer service automation to test automation and validation. However, the underlying systems, including Natural Language Processing (NLP), continue to face a range of limitations. We explore the boundaries here and try to identify what the future holds for us.

Beyond the Hype: Exploring the Limitations of LLMs

While LLMs have undeniably captured the imagination of businesses and users worldwide, they are not without some critical limitations. These include:

Data-embedded Biases and Prejudices

LLMs are designed to create a language that feels natural to humans but not necessarily to provide accurate information. This can result in biases and incorrect results if the model is trained on skewed data, resulting in a tendency to “hallucinate” – generate convincing yet factually incorrect output.

Organizations, therefore, need to ensure their models are trained on unbiased data and verify LLM predictions against actual enterprise data.

An example of this was observed in Google’s AI chatbot, Bard, which mistakenly included non-existent discoveries by the James Webb Space Telescope in its responses. This error significantly impacted Google’s stock value, causing a $100 billion loss after being highlighted in a live demonstration. In another instance, ChatGPT was used in a legal case to cite legal precedents that did not exist, highlighting the risks of relying on LLM-generated information without proper verification.

Data Security and Privacy

LLMs learn from vast amounts of data, which can include private or confidential information like personal details, trade secrets, or intellectual property particulars. Consequently, these models might inadvertently expose or leak such information during text generation or processing. For instance, a well-known South Korean electronics company experienced data leaks when an engineer leveraged ChatGPT to correct errors in chip code. In another incident, a different employee copied the defect detection code into ChatGPT.

These cases underline the risk: if sensitive information is shared with a public LLM, it could be incorporated into its training data and become retrievable with specific prompts. Security experts caution against this danger and advise careful consideration of the information shared with LLMs. In terms of safeguarding data, deploying Llama on-premises presents a more secure option compared to using GPT on OpenAI's cloud service.

Prompt Injections

Prompt injection is a cybersecurity concern where hackers strategically manipulate inputs to influence the responses or actions of LLMs. For instance, cybercriminals who subtly alter queries in customer service chatbots can input normal-looking questions but embed commands that trick chatbots into revealing sensitive user data. This is known as direct prompt injection, where the attacker directly modifies the model’s prompts to access otherwise unauthorized data.

On the other hand, in indirect prompt injection, the hacker could insert malicious code into a document. When an LLM processes this document, perhaps to summarize its contents, the hidden code could mislead the LLM into generating false or harmful information.

The risks with prompt injection range from unauthorized data leaks to manipulating automated decisions – highlighting the importance of safeguarding LLMs against such vulnerabilities.

Development and Training Cost

While public LLMs have several disadvantages, establishing a self-hosted LLM introduces its own challenges, primarily financial. Developing and training LLMs, like GPT-3, which cost OpenAI over $4.6 million, require significant data and computing power, making it an expensive investment for any business.

Moreover, deploying and maintaining a self-hosted LLM involves more than just the initial investment in specialized hardware and software, which can amount to around $60,000 over five years for basic setups and up to $95,000 for scalable options. The often-prohibitive costs would also include outlays for hiring a team of data scientists and support staff, building an appropriate operating environment for the LLM, and covering ongoing maintenance expenses.

Environmental Impact

Datacenters, essential for housing the servers needed for language processing models, consume a vast amount of energy and contribute considerably to carbon emissions. Models like ChatGPT have a significant environmental impact, with an estimated annual carbon dioxide emission of 8.4 tons.

Another study by the University of California highlighted the water footprint of AI models. It reported that training Microsoft’s GPT-3 model led to the consumption of about 700,000 liters of freshwater in datacenters. This quantity is on par with the water required to produce hundreds of cars. The training process generates considerable heat, necessitating large amounts of freshwater for cooling purposes.

As language models grow larger, finding ways to reduce their environmental impact will therefore become crucial for sustainable advancement. However, it is important to note that the environmental and sustainability challenges we face are not exclusive to Large Language Models (LLMs), but are prevalent across the cloud computing technologies landscape.

Collaborative Efforts to Strengthen LLMs: Addressing Flaws and Mitigating Challenges

The rapid growth and accelerating adoption of LLMs signal a transformative shift across industries and segments. It is advised to leverage LLMs cautiously in crucial projects, ensuring they undergo expert scrutiny. The models, however, remain ideal for creative tasks that do not demand the same level of focus as in mission-critical assignments.

As we move forward, it is essential to balance innovation with ethical considerations, ensuring that LLMs are developed and used in ways that benefit society while aiding businesses. Our journey toward overcoming these limitations, therefore, has to be a collective effort involving developers, users, and policymakers. The narrative should include reasons behind the creation of LLMs, examine their current status, and chart a course for their future development and integration.

By tackling these challenges head on, we can harness the full potential of LLMs to create more informed, equitable, and sustainable solutions for the road ahead.

Large Language Models GENERATIVE PRETRAINING TRANSFORMER Natural Language Processing LIMITATIONS OF LLMS DATA SECURITY AND PRIVACY PROMPT INJECTIONS

Disclaimer

That the contents of third-party articles/blogs published here on the website, and the interpretation of all information in the article/blogs such as data, maps, numbers, opinions etc. displayed in the article/blogs and views or the opinions expressed within the content are solely of the author's; and do not reflect the opinions and beliefs of NASSCOM or its affiliates in any manner. NASSCOM does not take any liability w.r.t. content in any manner and will not be liable in any manner whatsoever for any kind of liability arising out of any act, error or omission. The contents of third-party article/blogs published, are provided solely as convenience; and the presence of these articles/blogs should not, under any circumstances, be considered as an endorsement of the contents by NASSCOM in any manner; and if you chose to access these articles/blogs , you do so at your own risk.

L&T Technology Services

ER&D

L&T Technology Services

AI Driven Portfolio Management: Redefining Investment Strategies in the Digital Age

alina1

@alina1

04 Sep 2025

Blockchain

The investment landscape is undergoing a seismic shift. Traditional portfolio management, once rooted in historical data and manual decision-making, is giving way to more dynamic, technology-driven strategies. At the forefront of this evolution is…

Generative AI or Agentic AI? Understanding the Right Fit for Business Growth

Inspirisys So..

@Inspirisys

03 Sep 2025

Emerging Tech AI

Over the past few years, artificial intelligence has moved from the margins of experimentation to the center of daily operations for professionals across industries. Its influence is no longer confined to back-end automation or technical teams. From…

GCCs' AI ambition outpaces its educational foundation

Sneha Sharma

@snsharma

03 Sep 2025

GCC AI Talent & Skills

India's tech landscape is undergoing a massive transformation, with Global Capability Centers (GCCs) leading the charge. These centres are now at the heart of the country's AI boom, creating new tech jobs. But there's a problem, a severe talent…

The Future of Market Research and Strategy: AI, Big Data & Beyond

Tanya Gupta

@tanyagupta

01 Sep 2025

Analytics Big Data Analytics Sales & Marketing

In today's fast-changing business world, accurate market research and strong strategies are significant. Consumer priorities are changing rapidly, digital changes are again reforming industries, and competition is really high. Organisations are…

AI in Automated Number Plate Recognition: How Machine Learning Improves Accuracy

iProgrammer S..

@iProgrammer

01 Sep 2025

AI Inside AI

The way cities move, watch, and protect themselves has shifted significantly over the past decade. From jammed highways filled with cars to filled parking garages and vulnerable business districts, manual watching has just become unsustainable.…

How Machine Learning Improves User Experience in Mobile Apps

Infowind Tech..

@Infowind

01 Sep 2025

Mobile & Web Development Machine Learning

In today’s digital-first world, user experience (UX) is the biggest factor that defines the success of a mobile app. With millions of apps competing for user attention, offering personalized, seamless, and engaging interactions is no longer optional…

Topics In Demand

Notification

New

WHY LLMS MAY (YET) FALL SHORT OF SAVING THE WORLD

Beyond the Hype: Exploring the Limitations of LLMs

Collaborative Efforts to Strengthen LLMs: Addressing Flaws and Mitigating Challenges

ER&D

Share this blog

Related blogs