Hardware Acceleration of Deep Neural Network Models on FPGA (Part 1 of 2)

June 24, 2021

AI


Artificial Intelligence has become all-pervasive, finding applications in areas that seemed impossible earlier. Deep Learning, a subfield of Machine Learning, has become the state-of-the-art solution to many AI problems due to its high accuracy and efficiency. It enables real-time decision making in applications such as Advanced Driver Assistance Systems (ADAS), robots, autonomous vehicles, industrial automation, aerospace and defense. Accurate decisions and real-time behaviour require a massive amount of data to be processed. Deep Neural Network (DNN) models achieve this by using a large number of neural network layers.

Deep Neural Network

Source: freecodecamp.org

Deep Neural Networks are the state-of-the-art solution for a variety of applications such as computer vision, speech recognition and natural language processing. An Artificial Neural Network is a mathematical construct that ties together a large number of simple elements, called neurons, each of which can make simple mathematical decisions. A shallow neural network has only three layers: an input layer, one hidden layer and an output layer. A neural network becomes a Deep Neural Network (DNN) as the number of hidden layers increases. So, Deep Learning can be considered a class of Artificial Neural Networks composed of many processing layers. These networks are more accurate and keep improving in accuracy as more neuron layers are added. Some important Deep Neural Network models are the Feed-Forward Neural Network, the Recurrent Neural Network (RNN) and the Convolutional Neural Network (CNN).
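
To make the structure concrete, here is a minimal C++ sketch of a shallow feed-forward network: one hidden layer between input and output, each neuron computing a weighted sum followed by a simple non-linear decision (a ReLU). The weights here are placeholders for illustration only; stacking more hidden layers between input and output is what makes such a network "deep".

```cpp
#include <cstddef>
#include <iostream>
#include <vector>

// One fully connected layer: y = activation(W * x + b).
// Weights are stored row-major: weights[o * in_dim + i].
struct DenseLayer {
    std::size_t in_dim, out_dim;
    std::vector<float> weights;  // out_dim * in_dim values
    std::vector<float> biases;   // out_dim values

    std::vector<float> forward(const std::vector<float>& x, bool relu) const {
        std::vector<float> y(out_dim, 0.0f);
        for (std::size_t o = 0; o < out_dim; ++o) {
            float acc = biases[o];
            for (std::size_t i = 0; i < in_dim; ++i)
                acc += weights[o * in_dim + i] * x[i];
            y[o] = (relu && acc < 0.0f) ? 0.0f : acc;  // ReLU non-linearity
        }
        return y;
    }
};

int main() {
    // Shallow network: 3 inputs -> 4 hidden neurons -> 2 outputs.
    // Placeholder weights; a real model would load trained values.
    DenseLayer hidden{3, 4, std::vector<float>(12, 0.1f), std::vector<float>(4, 0.0f)};
    DenseLayer output{4, 2, std::vector<float>(8, 0.2f),  std::vector<float>(2, 0.0f)};

    std::vector<float> x = {1.0f, 2.0f, 3.0f};
    auto h = hidden.forward(x, /*relu=*/true);   // hidden layer with ReLU
    auto y = output.forward(h, /*relu=*/false);  // linear output layer

    for (float v : y) std::cout << v << ' ';
    std::cout << '\n';
}
```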

Hardware Accelerators for Deep Neural Networks

Hardware acceleration is a process in which an application offloads computationally intensive tasks onto specialised hardware to achieve higher efficiency than a software implementation on a CPU alone. To achieve accurate results in real time, better models operating on larger datasets are required, and the time taken for decision making is also an important factor. As new Deep Learning models evolve, model structures become more complex, so a huge number of operations and parameters, as well as more computing resources, are needed. The three main options for hardware accelerators are GPUs, ASICs and FPGAs.

Source: https://ysu.edu/news/ysu-hosts-national-gpu-computing-workshop

GPUs are designed for processing images through massive parallelism, but nowadays they are also used in big data analytics and for accelerating the portions of an application that require high throughput and memory bandwidth. GPUs excel at parallel processing and can provide acceleration wherever the same operations are performed many times in rapid succession. However, GPUs consume a huge amount of power, which is a challenge for DNN applications that need to run on edge devices, especially battery-operated ones. GPUs achieve throughput through their ability to process large input batches, but typically at high latency, so they are not suitable for latency-critical applications.

Source: https://www.eebinc.org/post/the-global-semiconductor-crunch

ASICs are integrated circuits specially designed for one particular purpose or application, and they are highly optimized for that application in terms of power and performance. They have lower I/O bandwidth and limited memory and other computing resources. Although they can attain moderate performance at low power, the downside is that the development time and cost to realize them are high.

Source: https://www.renesas.com/

FPGAs can be used to accelerate a portion of an algorithm by assigning the computationally intensive tasks to programmable logic. They can attain high performance through extensive parallelism while being more energy efficient than GPUs, and they have a shorter time to market and lower costs than ASICs. Another important feature of FPGAs is their reconfigurability, which is not possible with GPUs and ASICs. As deep learning structures advance day by day, reconfigurability is an added advantage.

The following section lists the reasons for considering FPGAs as hardware accelerators.

FPGAs as Hardware Accelerators:

Compared to GPUs, ASICs and FPGAs have lower I/O bandwidth, limited memory and fewer computing resources, but they can attain moderate performance at low power. ASICs are optimized for power and performance, but their cost and development time are higher, and they are not flexible. As an alternative to GPUs and ASICs, FPGA-based accelerators are currently used due to the following advantages:

  • FPGAs offer high performance per watt when compared to GPUs, making them strong candidates for DNN computation and inference.
  • The architecture is customizable and flexible, so only the required resources are used.
  • They provide high throughput with massive parallelism at low latency.
  • FPGAs have on-chip block RAM, which allows faster data transfer compared to off-chip memory.
  • FPGAs are reconfigurable according to the application, which reduces time to market. As new machine learning algorithms evolve, shorter development time and reconfigurability make FPGAs a better option than ASICs.
  • Apart from power efficiency and throughput, the speed of a DNN deployed on an FPGA can be further increased when the inference algorithm uses low numeric precision in its calculations. For example, quantization converts a 32-bit or 64-bit floating-point network model to fixed point, which reduces computation while maintaining reasonable accuracy (see the sketch after this list).
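
As a concrete illustration of the quantization idea in the last bullet, here is a minimal C++ sketch of symmetric linear quantization of floating-point weights to 8-bit values. The per-tensor scaling scheme and the example numbers are illustrative assumptions, not the method of any particular FPGA toolchain:

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <iostream>
#include <vector>

// Symmetric linear quantization: map float weights in [-max|w|, +max|w|]
// onto int8 values in [-127, 127]. real_value ~ scale * quantized_value.
struct QuantizedTensor {
    std::vector<int8_t> values;
    float scale;  // one scale per tensor (per-channel scales are also common)
};

QuantizedTensor quantize(const std::vector<float>& w) {
    float max_abs = 0.0f;
    for (float v : w) max_abs = std::max(max_abs, std::fabs(v));
    float scale = (max_abs > 0.0f) ? max_abs / 127.0f : 1.0f;

    QuantizedTensor q{{}, scale};
    q.values.reserve(w.size());
    for (float v : w) {
        long r = std::lround(v / scale);
        r = std::min(127L, std::max(-127L, r));  // clamp to the int8 range
        q.values.push_back(static_cast<int8_t>(r));
    }
    return q;
}

int main() {
    std::vector<float> weights = {0.50f, -1.20f, 0.03f, 0.90f};
    QuantizedTensor q = quantize(weights);

    // Dequantize to see the rounding error introduced by 8-bit storage.
    for (std::size_t i = 0; i < weights.size(); ++i)
        std::cout << weights[i] << " -> " << int(q.values[i])
                  << " (~" << q.values[i] * q.scale << ")\n";
}
```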

On the other hand, one of the main reasons engineers do not adopt FPGAs is the difficulty of programming them. An FPGA is programmed by describing functionality in a Hardware Description Language (HDL) such as VHDL or Verilog, which is quite different from conventional software programming in languages like C or C++.

To reduce this complexity, High-Level Synthesis (HLS) tools exist that synthesize high-level languages into HDL code. There are also various hardware frameworks, developed by FPGA vendors and third-party companies, for implementing inference on FPGAs; Xilinx and Intel each have their own frameworks to improve performance over the others. Some of these frameworks are OpenCL, Intel's OpenVINO, Xilinx DNNDK and Xilinx Vitis AI, which we will cover in Part 2 of this blog.
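
For a flavour of what HLS code looks like, here is a minimal sketch of an HLS-style C++ kernel: a dot product, the core multiply-accumulate of a fully connected DNN layer. The pragma assumes Xilinx Vitis HLS conventions and simply hints to the tool how to pipeline the loop in hardware; a production kernel would also specify interfaces and memory partitioning.

```cpp
// Dot product of two N-element vectors: the core operation of a
// fully connected layer. An HLS tool compiles this function into RTL
// instead of CPU instructions.
#define N 64

float dot_product(const float a[N], const float b[N]) {
    float acc = 0.0f;

dot_loop:
    for (int i = 0; i < N; ++i) {
        // Vitis-HLS-style hint: start a new loop iteration every clock
        // cycle, overlapping multiplies and adds in the hardware pipeline.
#pragma HLS PIPELINE II=1
        acc += a[i] * b[i];
    }
    return acc;
}
```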

Read Part 2 here


This blog originally appeared on Ignitarium.com's Blog Page.

