Computer Vision – How does it work, and how can enterprises leverage this tech?

July 19, 2021

“Beauty lies in the eyes of the beholder.” This holds true for human beings, but does it hold true for computers as well? Let us find out.

The human eye captures an image, and the brain analyses it to extract meaning.

But have we ever wondered how the brain analyses and understands images? What algorithms run in our brain that let it correctly distinguish an apple from an orange? How does it correctly identify a person and put a name to the face?

The brain is one of the most complex organs of the human body. It has inspired scientists to model how the brain functions in technology, and to build systems that see things beyond human capability. In a way, this is making humans superhuman.

What is the Brain made of?

The human brain is a complex network of roughly 86 billion neurons (often rounded up to 100 billion). Each neuron has many inputs, called dendrites, and one output, called the axon.

[Figure: Brain diagram]

Each neuron acts like a logic gate: it filters its inputs and emits a relevant signal, which then passes to many other neurons, creating a complex network.
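
The logic-gate analogy can be sketched in a few lines of Python. This is a simplified artificial model (a threshold unit), not a description of how biological neurons work; the function name `neuron`, the weights, and the thresholds are assumptions chosen for illustration.

```python
def neuron(inputs, weights, threshold):
    """Fire (return 1) when the weighted sum of the inputs reaches the threshold."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return 1 if total >= threshold else 0


# With equal weights, the threshold alone decides which gate the neuron mimics.
for a in (0, 1):
    for b in (0, 1):
        and_out = neuron([a, b], weights=[1, 1], threshold=2)  # needs both inputs
        or_out = neuron([a, b], weights=[1, 1], threshold=1)   # one input is enough
        print(a, b, "AND:", and_out, "OR:", or_out)
```

Feeding the output of one such unit into the inputs of others is what turns single "gates" into a network.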

So how does it relate to Computer Vision?

Computer Vision deals with Images. Before understanding what Computer Vision does, let us understand Images and the structure of images.

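Each image is a matrix of pixels, where each pixel stores an intensity value. A grayscale image has one value per pixel; a colour image has three, one for each of the red, green, and blue (RGB) channels. In short, an image is a matrix of numerical intensity values.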

[Figure: Matrix of pixels]
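
To make the "matrix of numbers" idea concrete, here is a minimal sketch that stores a tiny 2x2 colour image as nested Python lists and collapses its three channels into one grayscale intensity per pixel. The image values and the helper name `to_grayscale` are made up for illustration; the channel weights are the widely used ITU-R BT.601 luma coefficients.

```python
# A 2x2 colour image: each pixel is an (R, G, B) tuple with values in 0..255.
image = [
    [(255, 0, 0), (0, 255, 0)],      # red,  green
    [(0, 0, 255), (255, 255, 255)],  # blue, white
]


def to_grayscale(rgb_image):
    """Convert every (R, G, B) pixel into a single intensity value."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_image
    ]


gray = to_grayscale(image)
print(gray)  # [[76, 150], [29, 255]]
```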

So just by changing the pixel values, you can transform the image. A common way to do this is to apply an image mask (also called a kernel or filter):

[Figure: Image transformation]
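
As a rough sketch of how a mask transforms an image, the code below slides a 3x3 mask over a small grayscale image and replaces each interior pixel with the weighted sum of its neighbourhood. The function name `apply_mask`, the `divisor` parameter, and the choice to leave border pixels unchanged are assumptions made for this example; image libraries offer several border strategies.

```python
def apply_mask(image, mask, divisor=1):
    """Slide a 3x3 mask over a grayscale image (a list of rows of ints).

    Each interior pixel becomes the weighted sum of its 3x3 neighbourhood,
    integer-divided by `divisor`. Border pixels are copied unchanged.
    """
    height, width = len(image), len(image[0])
    result = [row[:] for row in image]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            total = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    total += image[y + dy][x + dx] * mask[dy + 1][dx + 1]
            result[y][x] = total // divisor
    return result


# A box-blur mask: every neighbour contributes equally to the average.
BOX_BLUR = [
    [1, 1, 1],
    [1, 1, 1],
    [1, 1, 1],
]

# A dark image with one bright pixel in the centre.
image = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 90, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

blurred = apply_mask(image, BOX_BLUR, divisor=9)
for row in blurred:
    print(row)
# [0, 0, 0, 0, 0]
# [0, 10, 10, 10, 0]
# [0, 10, 10, 10, 0]
# [0, 10, 10, 10, 0]
# [0, 0, 0, 0, 0]
```

Swapping in a different mask changes the effect (blurring, sharpening, edge emphasis) without changing the sliding procedure itself.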

How does this transformation help in Computer Vision?

In real life, images come with a lot of noise. Depending on how the image was captured (X-ray, digital camera, ultrasound, CT scan, etc.), different types of noise can hide important details. For example, an X-ray may contain a minute tumour that goes undetected by the human eye; after the image is transformed with suitable filters, the tumour can become detectable. Other image filters serve other vision tasks, such as detecting the edges and corners of objects.
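
The following sketch illustrates both ideas on a toy image: a median filter removes a single noisy pixel, and a simple left/right difference then marks the vertical edge the noise was partly hiding. The pixel values, the function names, and the threshold of 100 are assumptions picked for the example; real medical imaging pipelines are far more involved.

```python
from statistics import median


def median_filter(image):
    """Replace each interior pixel with the median of its 3x3 neighbourhood."""
    height, width = len(image), len(image[0])
    result = [row[:] for row in image]  # border pixels stay unchanged
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            window = [
                image[y + dy][x + dx]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
            ]
            result[y][x] = median(window)
    return result


def vertical_edges(image, threshold=100):
    """Mark pixels whose left and right neighbours differ by more than threshold."""
    marks = []
    for row in image:
        marked_row = [0] * len(row)
        for x in range(1, len(row) - 1):
            if abs(row[x + 1] - row[x - 1]) > threshold:
                marked_row[x] = 1
        marks.append(marked_row)
    return marks


# Dark region on the left, bright region on the right, one noisy pixel.
noisy = [
    [10, 10, 10, 200, 200],
    [10, 10, 10, 200, 200],
    [10, 255, 10, 200, 200],  # 255 is a single "salt" noise pixel
    [10, 10, 10, 200, 200],
    [10, 10, 10, 200, 200],
]

cleaned = median_filter(noisy)
print(vertical_edges(noisy)[2])    # [0, 0, 0, 1, 0]  noise hides part of the edge
print(vertical_edges(cleaned)[2])  # [0, 0, 1, 1, 0]  the full edge is recovered
```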

The techniques mentioned above come under classical image processing. They still hold true alongside modern image processing techniques based on AI/Deep Learning.

What is AI/Deep Learning?

Artificial Intelligence is a big umbrella that covers Classical Machine Learning and Deep Learning (Neural Networks), and overlaps with other areas such as Data Science. The basic idea of AI is to derive the algorithm from data, which means that different data yields a different algorithm. In traditional programming, by contrast, we write the algorithm ourselves from fixed rules.
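
The contrast between fixed rules and learning from data can be sketched with a toy example. Everything here is hypothetical: the "redness" scores, the two datasets, and the very simple learning method (placing the threshold midway between the two class averages) are assumptions chosen to keep the idea visible, not a recommended technique.

```python
def rule_based_is_ripe(redness):
    """Traditional programming: the programmer fixes the rule in advance."""
    return redness > 128


def learn_threshold(examples):
    """Learning from data: derive the threshold from labelled examples.

    `examples` is a list of (redness, is_ripe) pairs. The threshold is placed
    midway between the average ripe and the average unripe redness.
    """
    ripe = [value for value, is_ripe in examples if is_ripe]
    unripe = [value for value, is_ripe in examples if not is_ripe]
    return (sum(ripe) / len(ripe) + sum(unripe) / len(unripe)) / 2


# Two hypothetical datasets, e.g. two different fruit varieties or cameras.
dataset_a = [(200, True), (220, True), (240, True),
             (60, False), (80, False), (100, False)]
dataset_b = [(120, True), (130, True), (140, True),
             (20, False), (30, False), (40, False)]

threshold_a = learn_threshold(dataset_a)
threshold_b = learn_threshold(dataset_b)
print(threshold_a, threshold_b)  # 150.0 80.0

# The fixed rule misjudges a ripe fruit from dataset B; the learned one does not.
print(rule_based_is_ripe(120))   # False
print(120 > threshold_b)         # True
```

The same learning code produced two different decision rules because it was given two different datasets.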

Deep Learning works with billions or even trillions of data points. This is the case with images: each image has a huge number of pixels, and creating smart algorithms requires a huge number of images.


Deep Learning comprises various neural network architectures inspired by biological neurons. Trained on a huge image dataset, these architectures learn patterns (features) from the images and are fine-tuned into an algorithm that can then detect the same patterns in unseen images, for example face patterns for face recognition or object patterns for object detection.
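
As a heavily scaled-down sketch of this learning process, the code below trains a single artificial neuron (a perceptron) on six tiny 2x2 "images" to tell whether the left or the right column is brighter, then applies it to images it has never seen. Real deep learning models stack many layers and train on vastly more data; the dataset, the function names, the learning rate, and the number of epochs here are all assumptions for illustration.

```python
def predict(weights, bias, pixels):
    """Return 1 if the weighted pixel sum plus bias is positive, else 0."""
    total = sum(w * p for w, p in zip(weights, pixels)) + bias
    return 1 if total > 0 else 0


def train(samples, epochs=20, learning_rate=0.1):
    """Perceptron learning rule: nudge the weights whenever a prediction is wrong."""
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for pixels, label in samples:
            error = label - predict(weights, bias, pixels)
            weights = [w + learning_rate * error * p
                       for w, p in zip(weights, pixels)]
            bias += learning_rate * error
    return weights, bias


# Each 2x2 image is flattened to [top-left, top-right, bottom-left, bottom-right].
# Label 1 means "left column is brighter", label 0 means "right column is brighter".
samples = [
    ([1, 0, 1, 0], 1),
    ([0, 1, 0, 1], 0),
    ([1, 0, 0, 0], 1),
    ([0, 1, 0, 0], 0),
    ([0, 0, 1, 0], 1),
    ([0, 0, 0, 1], 0),
]

weights, bias = train(samples)

# Images that were not part of the training data.
print(predict(weights, bias, [0.9, 0.1, 0.8, 0.2]))  # 1 (left column brighter)
print(predict(weights, bias, [0.2, 0.7, 0.1, 0.9]))  # 0 (right column brighter)
```

Nobody told the neuron which pixels matter; the weights that encode "left minus right" emerged from the labelled examples.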

How can Computer Vision be utilized by enterprises?

Computer Vision promises huge opportunities in almost every domain, including Agriculture, Cyber Security, Transportation, Defence, Chip Manufacturing, Life Sciences, and Healthcare.

In Agriculture, vision-based algorithms assess the quality of farm produce, monitor the ripening of fruits and vegetables, detect leaf diseases, and then suggest the right amount of nutrients.

In Cyber Security, algorithms detect mouse movement patterns that indicate a fraudulent user and then flag that user.

In Chip Manufacturing, trained vision models can detect components on a PCB as well as minute lamination defects.

In Healthcare, early-stage detection of tumours can help treat cancer or another life-threatening disease before it advances.

There are numerous other domains where computer vision can be used, and the opportunities are vast. The simplest example is face recognition on our mobile phones: it usually recognizes your face even if you grow a beard or change your expression.

Computer vision algorithms today can readily identify hidden beauties of nature that are not visible to the human eye. Thus, the statement “Beauty lies in the eyes of the beholder” holds true for technology as well.


IVL is a global IT service and software solutions company headquartered in Pune, India. We provide a wide range of software development services to our clients. These include Digital Transformation, Enterprise App Services, Outsourced Product Development, Digital Marketing, UX, and Validation Services. Our goal is to solve the business problems that industries face and to provide them with optimal, effective technology solutions.
