How ‘Object Recognition’ Is Helping the Blind

3 min read · Mar 3, 2020

Woman uses Amazon Echo Show’s “Show and Tell” feature

Digital health technology has improved the way people manage their conditions. If you take prescription medication, you can use dose-tracking apps. If you have heart disease, you can use a wearable, like a Fitbit, to keep an eye on your activity. That’s great, but what about underserved populations like people who are blind or visually impaired?

Technologies with object recognition are proving quite useful. For instance, Amazon’s Echo Show smart display has a “Show and Tell” feature. The feature uses object recognition to identify household pantry items for blind or visually impaired people. It works like this:

1) A person holds up a common pantry item in front of the camera.
2) They say, “Alexa, what am I holding?”
3) Alexa tells them the name of the object.

It sounds simple enough, but there’s way more to it than that.

Object recognition works by identifying objects in images and videos. This is made possible through computer vision and machine learning, a subset of artificial intelligence. Computer vision lets a computer “see” and interpret visual content, and machine learning is how the computer learns to do that.

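With machine learning, a computer uses models and algorithms to learn from data rather than following hand-written rules. In a traditional workflow, it starts with features picked out of each image (manual feature extraction, where a person decides which features matter) and then uses those features to group objects into a specific class.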
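To make that two-step idea concrete, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the tiny grayscale "images," the two hand-picked features (brightness and horizontal contrast), and the nearest-centroid classifier. It is not how Show and Tell or any production system works, just one simple instance of "extract features, then classify."

```python
from statistics import mean

def extract_features(image):
    """Manual feature extraction: we decide which two numbers describe an image."""
    pixels = [p for row in image for p in row]
    brightness = mean(pixels)
    # Average absolute difference between horizontally adjacent pixels
    contrast = mean(
        abs(row[i + 1] - row[i]) for row in image for i in range(len(row) - 1)
    )
    return (brightness, contrast)

def train(labeled_images):
    """Learn one centroid (average feature vector) per class label."""
    by_class = {}
    for image, label in labeled_images:
        by_class.setdefault(label, []).append(extract_features(image))
    return {
        label: tuple(mean(f[i] for f in feats) for i in range(2))
        for label, feats in by_class.items()
    }

def classify(centroids, image):
    """Assign the image to the class whose centroid is closest."""
    features = extract_features(image)

    def squared_distance(label):
        return sum((a - b) ** 2 for a, b in zip(features, centroids[label]))

    return min(centroids, key=squared_distance)

# Hypothetical training data: tiny 2x4 grayscale images with made-up labels
training_data = [
    ([[0, 9, 0, 9], [0, 9, 0, 9]], "striped"),
    ([[1, 8, 1, 8], [1, 8, 1, 8]], "striped"),
    ([[5, 5, 5, 5], [5, 5, 5, 5]], "plain"),
    ([[4, 4, 5, 5], [4, 4, 5, 5]], "plain"),
]
centroids = train(training_data)

# An image the model has not seen before
print(classify(centroids, [[2, 7, 2, 7], [2, 7, 2, 7]]))
```

The new image is not identical to either striped example, but its features land closer to the "striped" centroid, so that is the class it gets.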

As you can see, the computer is able to recognize different versions of one thing and categorize it correctly. Based on what it has learned from data, it can recognize patterns, and ultimately make decisions on its own. Here’s another example:

The two different types of cats are both labeled “cat,” and the model learns from each labeled example. The more cats the computer sees, the better it becomes at recognizing one.

Deep Learning vs. Machine Learning

Deep learning, a subset of machine learning, does this on a massive scale. It follows the same idea but learns the features on its own, and with enough data it can be far more accurate. That’s because deep learning can involve showing a computer thousands (or millions) of images of cats and dogs until the system is able to automatically distinguish the two. Some deep learning models, such as convolutional neural networks (CNNs), make this possible with layered structures loosely inspired by the human brain’s neural networks.
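If you’re curious what the “convolutional” part of a CNN actually does, here is a small sketch of that one operation in plain Python. The image and the kernel values are made up for illustration. In a real CNN the kernel values are learned during training rather than written by hand, and there are many kernels stacked across many layers. Like most deep learning libraries, this slides the kernel without flipping it.

```python
def convolve2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1), summing the products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(kh)
                for j in range(kw)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]

def relu(feature_map):
    """Keep positive responses and zero out the rest."""
    return [[max(0, v) for v in row] for row in feature_map]

# Hypothetical 3x4 grayscale image: dark on the left, bright on the right
image = [
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [0, 0, 9, 9],
]

# Hand-picked kernel that responds to a dark-to-bright vertical edge
vertical_edge = [
    [-1, 1],
    [-1, 1],
]

feature_map = relu(convolve2d(image, vertical_edge))
print(feature_map)  # [[0, 18, 0], [0, 18, 0]]
```

The output is large only in the middle column, which is where the dark region meets the bright one. A trained network builds up from simple detectors like this to more complex patterns.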

Deep learning can take enormous quantities of data to work well. So, if you don’t have a large set of training images, it may be best to use traditional machine learning.

Just remember, no matter how you get there, if your goal is to achieve object recognition, you may be doing more than creating a cool feature. You could also be helping someone see.

References:

https://www.mathworks.com/solutions/image-video-processing/object-recognition.html


Written by Vanessa Martinez


Freelance Software Engineer. Experience in JavaScript, React.js, and Ruby on Rails.
