You open the app, fill fields, select options, and submit. It works but it’s friction. I wanted something simpler. What if a user could just say: _** “Netflix ₹499 monthly” ** …and the system handles everything?_ The Core Idea Instead of forcing users to adapt to the system… Make the system adapt to the user. The pipeline looks like this: Each step reduces ambiguity and moves toward structured data. Step 1 — Handling Voice & Text Input The system doesn’t just rely on one type of input. Users can either: Speak (“Netflix ₹499 monthly”) Type a quick message (just like a notification or note) So the first step is to normalize everything into plain text . If the input is voice, we convert it using a speech-to-text service. If it’s already text, we process it directly. The goal is simple: everything becomes text before any processing begins.…