Skip to main content

Facebook uses AI to help the blind ‘see’ images

Facebook is ready to help blind people ‘see’ images via artificial intelligence. The new feature, called automatic alternate text, works with existing screen reader apps used by blind and visually impaired people. The AI-generated descriptions identify objects and scenes but there is no facial recognition –although we can imagine it’s on the way. So you if you share an image with A visually impaired friend it won’t tell him or her who is in the picture or what everyone is wearing, but it might read “Image may contain: three people, smiling, birthday cake.”

There are more than 246 million people around the world with severe visual impairments and 39 million who are blind, according to Facebook. More than 2 billion photos are shared daily on Facebook, Instagram, Messenger, and WhatsApp. Automatic alternate text can help social media be more inclusive.

Recommended Videos

Facebook’s automatic alternate text introduction follows Twitter’s announcement last week of a 420 image character description field also intended for visually impaired people who use screen readers with mobile devices. With the Twitter app the person who composes the Tweet also writes the description. The Facebook feature automatically attempts to describe the image, with the disclaimer “Image may contain.” Of course, a Facebook post creator has plenty of space to describe images already, while Twitter limits regular text to just 140 characters. In each case blind and visually impaired people get less of a raw deal.

Facebook’s automatic alternate text feature is available now for people who use iOS devices in English in the U.S., U.K., Canada, Australia, and New Zealand. The company plans to support more platforms, languages, and market in the near future.

The accuracy of Facebook’s automatic alternate text feature matters and will likely improve over time.

We imagine that facial recognition is already on the planning board. On the other hand, perhaps it’s better not to attempt to identify people in photos Until the tech is flawless. Imagine if a Facebook screen reader misidentified and called out the wrong names. In some circumstances that could be pretty embarrassing.

Bruce Brown
Bruce Brown Contributing Editor   As a Contributing Editor to the Auto teams at Digital Trends and TheManual.com, Bruce…
ChatGPT o1 vs. o3 vs. 4o: Which should you use?
ChatGPT on a laptop

We're now into the third year of the AI boom, and industry leaders are showing no signs of slowing down, pushing out newer and (presumably) more capable models on a regular basis. ChatGPT, of course, remains the undisputed leader.

But with more than a half-dozen models available from OpenAI alone, figuring out which one to use for your specific project can be a daunting task.
o1

Read more
ChatGPT vs. Perplexity: battle of the AI search engines
Perplexity on Nothing Phone 2a.

The days of Google's undisputed internet search dominance may be coming to an end. The rise of generative AI has ushered in a new means of finding information on the web, with ChatGPT and Perplexity AI leading the way.

Unlike traditional Google searches, these platforms scour the internet for information regarding your query, then synthesize an answer using a conversational tone rather than returning a list of websites where the information can be found. This approach has proven popular with users, even though it's raised some serious concerns with the content creators that these platforms scrape for their data. But which is best for you to actually use? Let's dig into how these two AI tools differ, and which will be the most helpful for your prompts.
Pricing and tiers
Perplexity is available at two price points: free and Pro. The free tier is available to everybody and offers unlimited "Quick" searches, 3 "Pro" searches per day, and access to the standard Perplexity AI model. The Pro plan, which costs $20/month, grants you unlimited Quick searches, 300 Pro searches per day, your choice of AI model (GPT-4o, Claude-3, or LLama 3.1), the ability to upload and analyze unlimited files as well as visualize answers using Playground AI, DALL-E, and SDXL.

Read more
​​OpenAI spills tea on Musk as Meta seeks block on for-profit dreams
A digital image of Elon Musk in front of a stylized background with the Twitter logo repeating.

OpenAI has been on a “Shipmas” product launch spree, launching its highly-awaited Sora video generator and onboarding millions of Apple ecosystem members with the Siri-ChatGPT integration. The company has also expanded its subscription portfolio as it races toward a for-profit status, which is reportedly a hot topic of debate internally.

Not everyone is happy with the AI behemoth abandoning its nonprofit roots, including one of its founding fathers and now rival, Elon Musk. The xAI chief filed a lawsuit against OpenAI earlier this year and has also been consistently taking potshots at the company.

Read more