Profile-Guided Optimization for Android App Startup: Baseline Profiles, Cloud Profiles, and the Dex Layout Pipeline That Cut Our Cold Start From 1.2s to 380ms

Krystian Wiewiór · Jun 16, 2026 · 1 min read

TAGS: android, kotlin, mobile, architecture, jetpackcompose

Deep dive into how ART's ahead-of-time compilation interacts with Baseline Profiles and cloud-aggregated profiles, covering the DEX layout reordering pipeline,