Building a Full Stack AI Engine From Scratch: The Architecture Behind Cevahir AI

For the last 16 months, I’ve been building an open-source AI infrastructure project called Cevahir AI. The original goal wasn’t simply creating another chatbot or wrapping existing APIs with a new interface. I wanted to explore something much deeper:

What would it look like to build a modular AI engine architecture from the tokenizer layer all the way to reasoning orchestration?

Most AI projects today focus on a single layer of the stack: inference APIs, RAG pipelines, agent wrappers, fine-tuning systems, or prompt engineering workflows. Very few projects attempt to unify tokenizer training, neural architectures, training orchestration, model lifecycle management, reasoning systems, and local inference pipelines under a single engineering structure. Cevahir AI was created to explore exactly that problem.

The project is fully open source and designed as a modular AI infrastructure system capable of running locally and offline.…