Softonic
AI

Visual Intelligence: This is all we can do thanks to AI just by taking a photo

The information around us as input for artificial intelligence

Visual Intelligence: This is all we can do thanks to AI just by taking a photo
David Bernal Raspall

David Bernal Raspall

  • December 18, 2024
  • Updated: December 18, 2024 at 12:43 AM

With iOS 18.2 and the arrival of Visual Intelligence —in addition to a new app—, Apple has made a huge leap in how we use the camera on our iPhones. Exclusive to the iPhone 16 and iPhone 16 Pro, it allows us to turn any image into an entry to Apple Intelligence, ChatGPT, and more. One of the easiest ways to interact with our environment and artificial intelligence.

The key to this feature lies in the Camera Control. In addition to a shortcut for taking a picture or video, with a long press, this button takes us to Visual Intelligence, which analyzes the capture and offers us contextual actions and extra responses thanks to the integration of ChatGPT and Google Search.

Immediate information from an image

The process begins when capturing a photo after holding down the Camera Control on our iPhone for a second or two. From there, we access several options that greatly facilitate our most everyday tasks.

If we photograph a restaurant or a store, we can directly obtain their hours, make a reservation, call by phone, or view their menu in a matter of seconds. The same goes for text links: we capture an image of a sign, and Visual Intelligence allows us to go directly to the website that appears in the image. No need to type or copy manually, we just click the link and that’s it.

If we find a long block and want to extract the most important parts, Visual Intelligence offers us a quick summary function. Ideal for articles, reports, or any extensive document. Also, if the text is in another language, it offers us an instant translation —although currently only available in English—. There’s more. We can convert text to voice, which means Siri can read aloud what appears in the image.

From searching for objects to interacting with them

Beyond text and links, Visual Intelligence also understands the objects we capture. If we want to know what something we see is —a monument, a plant, or even a product—, the “Ask” option takes us directly to ChatGPT, which, after an initial introduction describing what we are seeing, is ready to answer all our questions.

If we prefer a more visual search, we can use the “Search” function to send the image directly to Google and find related results. Ideal when we see a product that interests us and we want to locate it in online stores or search for more information.

Visual Intelligence also offers us the possibility to detect email addresses, phone numbers, and addresses in any photo. It then allows us to interact with them immediately: draft an email, make a call, or open an address in Maps without the need to copy or search manually.

The same happens with dates and events. If we photograph something that contains a date, like a poster or an invitation, the system gives us the option to add an event directly to our calendar. All with a couple of taps. And of course, Visual Intelligence doesn’t leave out the basics: reading QR codes.

Apple Support Download

Visual Intelligence completely transforms what we can do with the camera on our iPhone. Thanks to its integration with Siri, ChatGPT —both GPT 4o and o1, if we have the subscription— and, of course, with the entire Apple Intelligence system, we can turn any photo into specific information or actions in a matter of seconds. From obtaining details about a place, to adding events to our calendar or translating texts, everything flows naturally.

Latest Articles

Loading next article