Skip to main content

HuggingSnap app serves Apple’s best AI tool, with a convenient twist

HuggingSnap recognizing contents on a table.
Nadeem Sarwar / DigitalTrends

Machine learning platform, Hugging Face, has released an iOS app that will make sense of the world around you as seen by your iPhone’s camera. Just point it at a scene, or click a picture, and it will deploy an AI to describe it, identify objects, perform translation, or pull text-based details.

Named HuggingSnap, the app takes a multi-model approach to understanding the scene around you as an input, and it’s now available for free on the App Store. It is powered by SmolVLM2, an open AI model that can handle text, image, and video as input formats.

Recommended Videos

The overarching goal of the app is to let people learn about the objects and scenery around them, including plant and animal recognition. The idea is not too different from Visual Intelligence on iPhones, but HuggingSnap has a crucial leg-up over its Apple rival.

It doesn’t require internet to work

SmolVLM2 running in an iPhone

All it needs is an iPhone running iOS 18 and you’re good to go. The UI of HuggingSnap is not too different from what you get with Visual Intelligence. But there’s a fundamental difference here.

Apple relies on ChatGPT for Visual Intelligence to work. That’s because Siri is currently not capable of acting like a generative AI tool, such as ChatGPT or Google’s Gemini, both of which have their own knowledge bank. Instead, it offloads all such user requests and queries to ChatGPT.

That requires an internet connection since ChatGPT can’t work in offline mode. HuggingSnap, on the other hand, works just fine. Moreover, an offline approach means no user data ever leaves your phone, which is always a welcome change from a privacy perspective. 

What can you do with HuggingSnap?

HuggingSnap identifying perfume bottle.
Nadeem Sarwar / DigitalTrends

HuggingSnap is powered by the SmolVLM2 model developed by Hugging Face. So, what can this model running the show behind this app accomplish? Well, a lot. Aside from answering questions based on what it sees through an iPhone’s camera, it can also process images picked from your phone’s gallery.

For example, show it a picture of any historical monument, and ask it to give you travel suggestions. It can understand the stuff appearing on a graph, or make sense of an electricity bill’s picture and answer queries based on the details it has picked up from the document.

It has a lightweight architecture and is particularly well-suited for on-device applications of AI. On benchmarks, it performs better than Google’s competing open PaliGemma (3B) model and rubs shoulders with Alibaba’s rival Qwen AI model with vision capabilities.

Running HuggingSnap app on iPhone.
Nadeem Sarwar / DigitalTrends

The biggest advantage is that it requires less system resources to run, which is particularly important in the context of smartphones. Interestingly, the popular VLC media player is also using the same SmolVLM2 model to provide video descriptions, letting users search through a video using natural language prompts.

It can also intelligently extract the most important highlight moments from a video. “Designed for efficiency, SmolVLM can answer questions about images, describe visual content, create stories grounded on multiple images, or function as a pure language model without visual inputs,” says the app’s GitHub repository.

Nadeem Sarwar
Nadeem is a tech and science journalist who started reading about cool smartphone tech out of curiosity and soon started…
Cost-cutting strips Pixel 9a of the best Gemini AI features in Pixel 9
Person holds Pixel 9a in hand while sitting in a car.

The Pixel 9a has been officially revealed, and while it's an eye candy, there are some visible cutbacks over the more premium Pixel 9 and 9 Pro series phones. The other cutbacks we don't see include lower RAM than the Pixel 9 phones, which can limit the new mid-ranger's ability to run AI applications, despite running the same Tensor G4 chipset.

Google's decision to limit the RAM to 8GB, compared to the 12GB on the more premium Pixel 9 phones, sacrifices its ability to run certain AI tasks locally. ArsTechnica has reported that as a result of the cost-cutting, Pixel 9a runs an "extra extra small" or XXS variant -- instead of the "extra small" variant on Pixel 9 -- of the Gemini Nano 1.0 model that drives on-device AI functions.

Read more
Does the Google Pixel 9a come with a charger? Here’s what’s in the box
A woman holding a purple Google Pixel 9a.

After much speculation in recent months, the Google Pixel 9a has finally been announced. Google's Pixel A series is an excellent choice for those seeking a reliable Android smartphone at a lower price point, and the latest model follows this trend. While it is undeniably part of the Google Pixel 9 series, it has fewer features than its higher-end counterparts.

One question you might have when considering the Pixel 9a is whether it comes with a charger. We’ve got the answer
The Pixel 9a does not come with a charger.
The short answer is that the Pixel 9a does not have a charger. This has become common practice for most smartphones today, including other models in the Pixel 9 series, like the Pixel 9 Pro. While this may be disappointing, it's not surprising.

Read more
Google Pixel 9a vs. Pixel 8a: should you upgrade?
Google Pixel 9a vs Pixel 8a.

Google has released a new budget phone, the Pixel 9a. How does it compare to its predecessor, the Pixel 8a? We've got the answers, and the changes are significant in some ways. In others, not so much. If you have a Pixel 8a and are considering upgrading, read this first.
Google Pixel 9a: vs. Google Pixel 8a: specs

Google Pixel 9a
Google Pixel 8a

Read more