What is YOLO26 The latest object detection model, released by Ultralytics in January 2026. Compared to the previous-generation YOLO11, CPU inference is up to 43% faster , greatly improving its practicality on edge devices. Its biggest feature is end-to-end inference with no NMS . The Non-Maximum Suppression (NMS) post-processing step that used to be mandatory in YOLO is gone — the model outputs the final detections directly. Model mAP CPU inference Params YOLO26n 40.9 38.9ms 2.5M YOLO26s 48.6 63.3ms 9.2M YOLO26m 53.1 155ms 18.7M Why run it on iPhone Real-time inference : 30+ FPS using the Neural Engine Privacy : data never leaves the device Offline : works without a network Low latency : no server round-trip, so results come back instantly Preparing the CoreML model Option 1: Download a converted model You can grab a converted model from the CoreML-Models repository.…