Justin Miller
Senior Software Engineer at LanceDB
Justin lives in Los Angeles with his wife Megan and their dog Eddie. In his free time, he enjoys photography and learning about machine learning.
Since February, Justin has joined LanceDB and is working on their feature engineering system Geneva.
As a Principal Platform Engineer at ZEFR, Justin introduced tools like Ray, developed data pipelines integrating NLP and CV embeddings with Qdrant and Snowflake, and implemented cost-saving measures that reduced expenses by reducing resource utilization. He also modernized infrastructure, transitioning services to Kubernetes and streamlining deployments using GitHub Actions and ArgoCD.
With prior roles at GoSpotCheck, ProtectWise, and eHarmony, Justin has extensive experience building scalable systems with Scala, Java, and Python. His projects include Kafka stream processors, Spark and Snowflake data warehouses, and media retrieval/storage services. He has extensive experience mentoring engineers across all levels to strengthen team capabilities.
Talks
Data Con LA 2026
Embeddings at Scale: Lessons from LanceDB and the Lance Format
Storing hundreds of millions of multimodal embeddings exposes the limits of traditional vector stores. How the columnar Lance format handles it, with benchmarks and production pitfalls.