Always-on Ingestion for Data at Scale
Applications of the Apriori Algorithm on Open Data
Brainwashed: Building an IDE for Feature Engineering
Compiling DSLs for Diverse Execution Environments
Data Lake: Re-Birth of Enterprise Data Thinking
Data mining, forecasting, and BI at the RRCC
Data Science ≠ Big Data
Data Science at Whisper: From content quality to personalization
Event Driven Architecture for Web Analytics
Feature Engineering
HBase at Factual: Real time and Batch Uses
How to model anything in Redis
Ideal Platform for Managing Log Data: Search or SQL?
Keynote: Abhi Nemani
Keynote: Alan Gates
Keynote: Karen Lopez
Keynote: Michael Stack
Keynote: Reynold Xin
Large Scale Distinct Count: The HyperLogLog algorithm and its applications
Lessons Learned Designing Data Ingest Systems
Machine Learning on Largish Data
Managing Fast Data as the Front End to Hadoop
Mongoose v/s Waterline: Battle of the ORM
Scalable and High-Performance Analytics with Distributed R and Vertica
Spark after Dark
The AWS Big Data Platform
The Big Data Journey: How Big Data Practices Evolve at Connexity
Transforming into a data driven enterprise using existing skill sets
What's new and next in Apache Tez