Home
BDDLA '16
Schedule
Sessions
Speakers

Data Provenance Support in Spark

Hadoop / Spark / Kafka

Matteo Interlandi

PostDoc at UCLA

Debugging data processing logic in Data-Intensive Scalable Computing (DISC) systems is a difficult and time consuming effort. To aid this effort, we built Titian, a library that enables data provenance tracking data through transformations in Apache Spark.

Tickets

Terms of Service
Privacy Policy
Media Kit