Related Posts
ai startup development
PostgreSQL Advisory Locks for Distributed Job Scheduling: Replacing Redis and SQS with Native Database Primitives That Scale to 10K Jobs/Minute
Deep dive into pg_try_advisory_xact_lock for leader election and job claiming in multi-instance deployments, covering lock granularity strategies (transactional
· 1 min read
ai startup development
Redis Streams: the event bus that delays Kafka by 2 years
Deep dive into Redis Streams' consumer group semantics (XREADGROUP, XACK, pending entry list), implementing exactly-once processing with idempotency keys in Kto
· 6 min read
ai startup development
KV cache quantization: Llama 3.2 3B in 2 GB on Android
Deep dive into KV cache memory management for on-device LLM inference on Android — covering per-layer INT4/INT8 mixed quantization of key-value caches, grouped-
· 6 min read