Topics In Demand
Notification
New

No notification found.

Machine Learning Models for Demand Forecasting
Machine Learning Models for Demand Forecasting

550

0

Artificial Intelligence and Machine Learning (AI-ML) are gaining popularity in wider areas – Advanced Driver Assistance Systems (ADAS), Computer Vision, Speech Recognition, Robotics, Fintech, Medical Applications, Supply Chain, Logistics, and many other areas. Every industry is eager to adopt AI-ML in at least one of their practices and automate their systems. Even Supply Chain practices are rapidly transforming and digitalizing by applying AI-ML technologies.

AI-ML concepts can be applied to various components of the supply chain – demand forecast, logistics &transportation, inventory management, production planning, and procurement.

In this article, we will discuss one of the areas of the supply chain, which is an elephant in the room.  However, it can be easily addressed by Machine Learning. It is an essential component of the supply chain; however, many industries are still struggling with it or using rudimentary methods. If performed accurately, many issues which planners face in the later stages can be tackled effectively. This value is used for production planning, inventory management, procurement, and eventually logistics and transportation. It is Demand Forecasting.

Demand Forecasting  

Demand forecasting is the process of using predictive analysis of historical data to estimate and predict customers' future demand for a product or service. Statistical methods of performing forecasting are Simple Moving Average, Holt’s Winter Method, Croston Method, and Syntetos-Boylan Approximation (SBA). Croston and SBA are majorly used for intermittent forecasting.

The Machine Learning approach to forecasting involves –

  1. Time Series Analysis
  2. Regression Modelling
  3. Deep Learning Modelling

 

In this blog, we will focus on Machine Learning Approach for Demand Forecasting. What all data pre-processing steps can be employed? Which models can be built under each approach, their benefits and limitations? How to perform multivariate forecasting under these approaches? What KPI metrices can be used? And how can the model check and autocorrect its output?

Data Pre-processing  

It is an essential approach in any ML project. However, it gets tricky when we are dealing with Time Series Data. We need to pre-process the data ensuring that trend and seasonality are not getting impacted. Following pre-processing can be performed –

  1. Anomaly Detection & Correction It is the identification of rare events, items, or observations that are suspicious because they differ significantly from standard behaviors or patterns. Anomaly in Time-Series data can be tricky as trend and seasonality must be considered while finding anomalies.
  2. Breakpoint Analysis – Breakpoint analysis is a way of looking at demand data to determine when there are shifts or breaks in demand levels. This can eliminate previous historical data which are no more significant.
  3. Seasonality and Trend Calculation – Seasonality and trend can be calculated using seasonal_decompose. This gives an idea about the overall trend and any seasonality.

Time Series Analysis  

ARIMA is a way to perform Time Series Analysis. There are few variants of ARIMA –

  1. ARIMAX – Standard ARIMA with an additional feature to train using exogenous factors.
  2. SARIMA – Seasonal ARIMA
  3. SARIMAX – Seasonal ARIMA with exogenous factors
  4. Auto ARIMA – Automatically tries to find best fitted ARIMA considering seasonality and exogenous factors

Pros  

  1. Accuracy is very high
  2. Easy to implement
  3. Can generate a model with few data points. In a case, a minimum of 12 data points were used to generate this model

Cons  

  1. In a few cases, it fails to identify trends or seasonality
  2. In some cases, it identifies the incorrect trends and extrapolates the data exponentially

Regression Modelling   

Regression models like Random Forest & XGBoost can also be used to forecast demand for the future. In our case, XGBoost has outperformed Random Forest. 24 data points are used as input and the 25th data points as prediction; the entire data set must be broken down into sets of 24+1 values. A minimum number of inputs in the training set should be 24 (if it is monthly data) to identify seasonality.

Pros  

  1. High accuracy
  2. It can automatically identify seasonality and trends

Cons  

  1. Needs more data points at least 30 (in this case) to perform this regression.

Deep Learning Modelling   

ANN and LSTM models can be used for forecasting. LSTM gives better results as compared to ANN. In LSTM, the training set should have 24 inputs to predict next month’s output.  

Pros   

  1. Its accuracy is better than XGBoost Regression

Cons   

  1. It needs several data points

Causal Factor Integration   

We have added causal factors which have a strong correlation with demand data. These causal factors are – Holiday, Discounts Data, Promotions, and Price Change Points.

Auto-Correction Algorithm   

Once the final forecast is generated, the algorithm performs a sanity check of the output. It calculates trends and standard deviation of the output to check if the algorithm is fit properly. If there is any issue in the output, the model calculates the forecast using Statistical Method – Holt’s Winter Algorithm.

Conclusion: We have used MAPE (Mean Absolute Percentage Error) as our KPI. We are adopting these principles can achieve accuracy up to 85% and for products, it is above 90%.

--

About the Author

Author image
Shrestha Priya
Lead Consultant, SCM, ITC Infotech

 Shrestha is a Lead Consultant in Machine Learning and Data Science with 6+ years of technical experience across industries of Semiconductors, ADAS, and Digital Transformation in Logistics and  Supply Chain. He has worked intensively to develop and deploy end-to-end AI solutions, provided solutions, and optimized key metrics using the Model and Data-Centric approach.


That the contents of third-party articles/blogs published here on the website, and the interpretation of all information in the article/blogs such as data, maps, numbers, opinions etc. displayed in the article/blogs and views or the opinions expressed within the content are solely of the author's; and do not reflect the opinions and beliefs of NASSCOM or its affiliates in any manner. NASSCOM does not take any liability w.r.t. content in any manner and will not be liable in any manner whatsoever for any kind of liability arising out of any act, error or omission. The contents of third-party article/blogs published, are provided solely as convenience; and the presence of these articles/blogs should not, under any circumstances, be considered as an endorsement of the contents by NASSCOM in any manner; and if you chose to access these articles/blogs , you do so at your own risk.


ITC Infotech is a leading global technology services and solutions provider, led by Business and Technology Consulting. ITC Infotech provides business-friendly solutions to help clients succeed and be future-ready, by seamlessly bringing together digital expertise, strong industry specific alliances and the unique ability to leverage deep domain expertise from ITC Group businesses. The company provides technology solutions and services to enterprises across industries such as Banking & Financial Services, Healthcare, Manufacturing, Consumer Goods, Travel and Hospitality, through a combination of traditional and newer business models, as a long-term sustainable partner. ITC Infotech is a wholly owned subsidiary of ITC Ltd. ITC is one of India’s leading private sector companies and a diversified conglomerate with businesses spanning Consumer Goods, Hotels, Paperboards and Packaging, Agri Business and Information Technology. For more information, please visit: http://www.itcinfotech.com/

© Copyright nasscom. All Rights Reserved.