Seeing AI is a free, intelligent camera app developed by Microsoft that harnesses the power of artificial intelligence to narrate the visual world for people who are blind or have low vision.
Launched as an ongoing research project, the app effectively turns the smartphone camera into a set of ‘talking eyes,’ providing crucial information about the user’s surroundings and enabling greater independence in daily life. It operates through various specialized ‘channels,’ each dedicated to a specific task, leveraging different AI models, some running instantly on the device and others using the power of the Microsoft cloud.
The app’s diverse functionalities are designed to tackle a wide range of everyday challenges. The Short Text channel instantly reads text as soon as the camera points to it, ideal for quick reads like signs, menus, or labels. The Document channel provides audio cues to help users correctly capture a printed page (like a letter or book), then reads the content aloud, even preserving the original formatting. Newer features also allow users to “chat” with their documents, asking questions about the content to quickly find specific information.
For navigating purchases, the Product channel scans barcodes, guiding the user with beeps to correctly position the camera and then speaking the product name and package information. Beyond text and products, Seeing AI includes channels for People, which recognize and name saved faces, and describe the age, gender, and estimated expression of others nearby.
It can also recognize Currency notes, identify Colors, and use an audible tone to measure the surrounding Light intensity. The Scene channel offers a detailed, rich description of the environment or a photograph, with the ability to explore the image by touching the screen to hear the location of different objects. By consolidating these powerful features into a single, intuitive application, Seeing AI has significantly enhanced accessibility, moving beyond basic text-to-speech to provide a truly comprehensive visual assistant.
The impact of Seeing AI is profound, as it empowers users to accomplish tasks that were previously difficult or impossible without assistance, from privately reading mail to identifying ingredients while cooking. Its availability on both iOS and Android devices, along with support for numerous languages, has extended its reach globally. By continuously evolving based on community feedback and advancements in AI, Seeing AI stands as a prime example of how thoughtful technological innovation can promote inclusivity and unlock a new level of self-sufficiency for the visually impaired community
