Iris: an offline visual assistant in Brazilian Portuguese powered by Gemma 4

DEV Community·Junior Martins·20 days ago

This is a submission for the Gemma 4 Challenge: Build with Gemma 4.

TL;DR

Iris is an Android visual assistant for blind and low-vision users. It describes what the camera sees, out loud, in Brazilian Portuguese. Gemma 4 multimodal runs 100% on the phone via LiteRT-LM: no cloud, no telemetry, no INTERNET permission. Three intent-specific modes: Continuous, Question, Reading.

What I Built

I built Iris, an offline Android visual assistant for blind and low-vision users. The interaction is intentionally simple: point the phone camera at something, tap the screen, and Iris speaks what it sees in Brazilian Portuguese.

Iris has three modes:

Continuous Mode describes the scene in front of the user.
Question Mode lets the user ask a spoken question about the current camera image.
Reading Mode reads visible text aloud, such as a label, medicine box, sign, or package.

A fourth control, Repeat, replays the last description, which is useful for users who could not catch it all the first time.…
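The three modes and the Repeat control described above can be pictured as a small dispatcher: each mode selects a prompt, the result is spoken, and the last result is cached for replay. What follows is a minimal, language-neutral sketch in Python, not the app's actual Android code. The names `Mode`, `PROMPTS`, `Assistant`, `describe`, and `speak` are hypothetical, the prompt strings are invented for illustration, and `describe` stands in for whatever on-device model call Iris really makes.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Optional


class Mode(Enum):
    CONTINUOUS = "continuous"
    QUESTION = "question"
    READING = "reading"


# Hypothetical prompt templates; the post does not show the real ones.
PROMPTS = {
    Mode.CONTINUOUS: "Describe the scene in Brazilian Portuguese.",
    Mode.QUESTION: "Answer in Brazilian Portuguese: {question}",
    Mode.READING: "Read the visible text aloud in Brazilian Portuguese.",
}


@dataclass
class Assistant:
    # Stand-in for the on-device multimodal call: (image bytes, prompt) -> text.
    describe: Callable[[bytes, str], str]
    # Stand-in for text-to-speech output.
    speak: Callable[[str], None]
    # Cached so that Repeat can replay it without touching the camera or model.
    last_description: Optional[str] = None

    def on_tap(self, mode: Mode, image: bytes, question: Optional[str] = None) -> str:
        """Run one mode against the current camera frame and speak the result."""
        if mode is Mode.QUESTION and not question:
            raise ValueError("Question Mode needs a spoken question")
        prompt = PROMPTS[mode].format(question=question or "")
        text = self.describe(image, prompt)
        self.last_description = text
        self.speak(text)
        return text

    def repeat(self) -> bool:
        """Replay the last description; return False if there is nothing to replay."""
        if self.last_description is None:
            return False
        self.speak(self.last_description)
        return True
```

Keeping the last description on the assistant object means Repeat needs neither a new frame nor a second model call, which matches the replay behavior described above.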
