Have you ever felt the frustration of getting an LLM response that's almost what you need, but still requires a lot of manual work to transform it into something usable? You're not alone. We've all been there: you ask a language model a question and it spits out a string of text. It looks promising, but the real work is just beginning. You have to manually parse that text, extract the relevant information, and then shape it into a format that your application can understand. It's a tedious and error-prone process, especially when dealing with complex queries or multiple responses.
If you've tried using JSON prompts, you might think you've found a solution. While JSON provides a structured format, it's still a string that needs to be parsed into usable objects. It's better than raw text, but it's far from ideal. Wouldn't it be great if you could get direct, structured responses from your LLM?
Imagine receiving Python objects – dictionaries, lists, or custom classes – straight from your language model, ready to be used in your application. No more string manipulation, no more parsing headaches. With LangChain's `with_structured_output`, this is now possible. In this blog, I'll show you how to use `with_structured_output` to simplify your workflow and eliminate the need for manual parsing. You'll be able to focus on building better applications, faster.
What is `with_structured_output`?
At its core, `with_structured_output` is a feature in LangChain that allows you to receive structured data directly from your LLM as objects. Instead of the typical string-based responses, which often require extra parsing and manipulation, LangChain's `with_structured_output` ensures that the data comes back in a usable, object-based format, without you having to write additional code.
In simple terms, `with_structured_output` transforms LLM responses from raw text into structured objects (such as dictionaries, lists, or custom classes). This means you don't need to write additional code to convert the output into something meaningful. The data is ready for use right out of the box.
For example, let’s say you’re building an application that needs to extract certain fields like a person’s name, age, and address.
Without `with_structured_output`:
You might get a string response that you have to parse manually and convert into something structured, like a dictionary or a class instance.
# The LLM returns a string that needs parsing
llm_response = "Name: John, Age: 25, Address: 123 Main St"
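To make the pain concrete, here is a hypothetical manual parser for that string (the helper name and format assumptions are mine, not from the post). Note that even when it works, every value comes back as a string:

```python
# Hypothetical manual parsing of the raw LLM string
llm_response = "Name: John, Age: 25, Address: 123 Main St"

def parse_person(text: str) -> dict:
    # Split on commas, then split each piece on its first colon
    fields = {}
    for part in text.split(", "):
        key, _, value = part.partition(": ")
        fields[key.lower()] = value
    return fields

person = parse_person(llm_response)
print(person)  # {'name': 'John', 'age': '25', 'address': '123 Main St'}
```

One extra comma, or a reworded label from the model, and this parser silently breaks. That fragility is exactly what the rest of this post addresses.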
With `with_structured_output`:
When using `with_structured_output`, the LLM returns the data directly as an instance of a Person class, like so:
# The LLM will return a Person object directly:
llm_response = Person(name="John", age=25, address="123 Main St")
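The Person class itself isn't shown above; a minimal Pydantic model consistent with that example (an assumption on my part, though the post does define its schemas with Pydantic later) would be:

```python
from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    address: str

# What with_structured_output would hand back: a typed object, not a string
llm_response = Person(name="John", age=25, address="123 Main St")
print(llm_response.age)  # already an int: 25
```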
Why is it better than a JSON response?
Many developers use JSON prompts to structure LLM responses. For instance, they might ask the model to return output in a specific JSON format by including an example in the prompt. While this approach works, it has some major limitations:
- Parsing hassles: JSON outputs are still strings. You need to parse them into Python objects (e.g., dictionaries or classes) before they can be used, adding extra steps and complexity.
- Error-prone: LLMs may occasionally generate invalid JSON due to their probabilistic nature. Missing commas, unmatched brackets, or malformed structures can break your parsing code.
- Inconsistent keys: Without strict enforcement, the keys in JSON outputs might vary slightly (e.g., first_name vs. firstname), leading to errors in your application.
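The second point is easy to reproduce with nothing but the standard library; a single trailing comma, which LLMs do emit from time to time, makes `json.loads` raise:

```python
import json

# Almost-valid JSON from a model: one trailing comma breaks the parse
raw = '{"name": "John", "age": 25,}'

parse_failed = False
try:
    person = json.loads(raw)
except json.JSONDecodeError:
    parse_failed = True  # the app now needs fallback or retry logic

print(parse_failed)  # True
```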
LangChain's `with_structured_output` takes JSON prompts to the next level. Instead of generating raw strings, it uses Python's native class structure to ensure that the output is returned as valid objects, eliminating the need for manual parsing or validation.
Here's why LangChain's `with_structured_output` is a game-changer:
- Direct object output: The LLM directly returns an object (e.g., an instance of a class like Person), ready to use. No parsing required.
- Error-free: You avoid issues with malformed JSON. The structure is predefined, and the LLM adheres to it.
- Easier debugging: Working with objects is easier to debug and more intuitive compared to string-based JSON.
- Clean code: Your application logic becomes cleaner because you’re directly working with objects instead of processing and converting strings.
The bottom line is that JSON prompts are a good workaround, but `with_structured_output` is a more robust, reliable, and developer-friendly solution for getting structured, object-based data directly from your LLM.
How does `with_structured_output` work?
LangChain's `with_structured_output` feature leverages schema-based validation to ensure that the output from the LLM matches a predefined structure. Instead of returning a free-form string or even loosely formatted JSON, the model adheres to a strict schema and directly returns a Python object.
Example: Workout assistant
Imagine an AI that specializes in creating a workout plan. A user asks for a specific plan, and the model provides the response as a structured Python object, perfect for immediate use in applications like workout assistants or workout websites.
With `with_structured_output`, the process is seamless, ensuring that the response is accurate and follows a predefined structure. Let's break it down into three simple steps:
Step 1 - Define the Schema
The first step is to define a schema for the expected structured output. This is done by creating a Python class using Pydantic, which serves as the blueprint for the data. For a workout plan assistant, the schema might look like this:
from pydantic import BaseModel, Field
from typing import List

class WorkoutPlan(BaseModel):
    name: str = Field(description="Name of the workout plan")
    duration_weeks: int = Field(description="Duration of the plan in weeks")
    workouts: List[str] = Field(description="List of workouts for each day")
    goals: List[str] = Field(description="Fitness goals for the plan")
    equipment_needed: List[str] = Field(description="List of required equipment")
This schema serves two purposes:
- It tells the LLM the structure you want.
- It ensures the data returned is valid and adheres to this structure.
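The validation half can be demonstrated without any LLM call at all. This sketch re-declares the same schema and shows Pydantic rejecting data that doesn't match it (the invalid `"four"` value is a made-up example):

```python
from pydantic import BaseModel, Field, ValidationError
from typing import List

class WorkoutPlan(BaseModel):
    name: str = Field(description="Name of the workout plan")
    duration_weeks: int = Field(description="Duration of the plan in weeks")
    workouts: List[str] = Field(description="List of workouts for each day")
    goals: List[str] = Field(description="Fitness goals for the plan")
    equipment_needed: List[str] = Field(description="List of required equipment")

# Conforming data builds a typed object
plan = WorkoutPlan(
    name="Demo plan", duration_weeks=4,
    workouts=["Day 1: Cardio"], goals=["Lose weight"], equipment_needed=[],
)

# Non-conforming data is rejected up front, not deep inside your app
rejected = False
try:
    WorkoutPlan(name="Demo plan", duration_weeks="four",
                workouts=[], goals=[], equipment_needed=[])
except ValidationError:
    rejected = True
```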
Step 2 - Set up `with_structured_output`
Next, we use LangChain's `with_structured_output` to bind the schema to the LLM. This ensures that the LLM returns a WorkoutPlan object directly, without needing to embed JSON examples in the prompt.
from langchain_openai import ChatOpenAI
from langchain_core.prompts import PromptTemplate

# Initialize the LLM
model = ChatOpenAI(model="gpt-3.5-turbo-0125", temperature=0)

# Enable structured output using the WorkoutPlan schema
structured_llm = model.with_structured_output(WorkoutPlan)

# Define the prompt template
prompt = """
You are a fitness expert knowledgeable in creating workout plans. Your task is to:
1. Provide a personalized workout plan depending on the user's needs.
2. Do not answer questions unrelated to workouts.
Question: {question}
"""

# Create a chain by piping the prompt into the structured LLM
prompt_template = PromptTemplate(template=prompt, input_variables=["question"])
chain = prompt_template | structured_llm
This setup ensures the model understands both the task and the required output format.
Step 3 - Run the chain
Now, let’s ask the AI for a specific workout plan.
# User query
question = "Can you create a 4-week workout plan for weight loss?"
# Invoke the chain to get the structured response
workout_plan: WorkoutPlan = chain.invoke({"question": question})
Output
The response from the LLM is directly a WorkoutPlan object:
print(workout_plan)
"""
Output:
WorkoutPlan(
    name="4-Week Weight Loss Plan",
    duration_weeks=4,
    workouts=[
        "Day 1: Full Body Strength Training",
        "Day 2: 30-minute Cardio",
        "Day 3: HIIT Workout",
        "Day 4: Rest",
        "Day 5: Lower Body Focus",
        "Day 6: 45-minute Jogging",
        "Day 7: Rest",
    ],
    goals=[
        "Lose weight",
        "Increase endurance",
        "Build muscle",
    ],
    equipment_needed=[
        "Dumbbells",
        "Exercise mat",
        "Resistance bands",
        "Running shoes",
    ],
)
"""
This example illustrates how to create a structured response for a fitness workout plan, allowing for easy integration into various applications focused on health and fitness.
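Because workout_plan is a plain Python object, downstream code simply reads attributes. The sketch below constructs the same object by hand (no API call) to show what that integration looks like; `model_dump` assumes Pydantic v2:

```python
from typing import List
from pydantic import BaseModel

class WorkoutPlan(BaseModel):
    name: str
    duration_weeks: int
    workouts: List[str]
    goals: List[str]
    equipment_needed: List[str]

# Stand-in for the object returned by chain.invoke above
workout_plan = WorkoutPlan(
    name="4-Week Weight Loss Plan",
    duration_weeks=4,
    workouts=["Day 1: Full Body Strength Training", "Day 2: 30-minute Cardio"],
    goals=["Lose weight"],
    equipment_needed=["Dumbbells"],
)

# No parsing step: render a summary straight from attributes
summary = (f"{workout_plan.name}: {workout_plan.duration_weeks} weeks, "
           f"{len(workout_plan.workouts)} sessions")

# Or serialize for an API response
payload = workout_plan.model_dump()
```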
Streamline your workflows with confidence
LangChain's `with_structured_output` isn't just a tool; it's a game-changer for developers and businesses looking to streamline their workflows. By eliminating the need to parse raw text or format messy JSON, this feature ensures you get clean, structured data directly from your LLM, saving time, reducing errors, and improving the overall efficiency of your applications.
Ready to leverage the power of structured outputs? The engineers at Opcito are experts in LLM technology and can help you implement `with_structured_output` effectively.