Building a cognitive assessment covering matrix reasoning, numerical reasoning, spatial reasoning, and working memory. Questions are AI generated using structured prompt templates. Looking for guidance on: Has anyone built a reliable question bank with LLM generation? What prompt approaches worked and what were the common failure modes? For spatial reasoning β cutting operations, multi-step transformations, cross sections β is there a library that handles boolean solid geometry and face counting computationally? GeoGebra doesn't scale. Any psychometric item pools accessible to independent developers at non-enterprise pricing? Minimum defensible sample size for pre-launch difficulty calibration? submitted by /u/IDontLikeYou7 [link] [comments]