In 2024, 72% of enterprises deploying LLMs rely on fine-tuned open-source models to cut inference costs by 60% compared to API-only workflows, yet 58% of engineering teams struggle to implement reproducible fine-tuning pipelines with modern tooling. 📡 Hacker News Top Stories Right Now AISLE Discovers 38 CVEs in OpenEMR Healthcare Software (118 points) Localsend: An open-source cross-platform alternative to AirDrop (550 points) BookStack Moves from GitHub to Codeberg (36 points) Microsoft VibeVoice: Open-Source Frontier Voice AI (235 points) Laguna XS.2 and M.1 (44 points) Key Insights PyTorch 2.5’s compiled fine-tuning mode reduces training time by 34% over PyTorch 2.4 for 7B parameter models on A100 GPUs Hugging Face Transformers 4.36+ natively supports PyTorch 2.5’s SDPA attention and gradient checkpointing v2 Fine-tuning a 7B Llama 3 model on 10k instruction pairs costs ~$12 on spot A100 instances vs $210 for GPT-4 Turbo fine-tuning By 2026, 80% of production LLM fine-tuning will use quantized adapters…