Australia has one of the best open address datasets in the world. The Geocoded National Address File (GNAF) contains 15.8 million addresses β every house, apartment, rural property, and indigenous community in the country β published by the federal government under a free commercial licence. The catch: it ships as a ~1.5 GB zip of pipe-delimited files across eight relational tables, one set per state. If you want to actually search it, you need to parse it, join it, index it, and build a search API on top of it. That's what I did. The result is DingoFind β a hosted Australian address autocomplete API with typo tolerance, reverse geocoding, and ABS boundary enrichment. This post covers the technical architecture: how the data pipeline works, how the search index is built, and how the API achieves sub-10ms p99 latency on a single ARM server. The Data Problem: GNAF's Schema GNAF is a normalised relational dataset.β¦