Stream processing with R and Amazon Kinesis

Data Science

This talk presents an original R client library to interact with Amazon Kinesis via a simple daemon to start multiple R sessions on a machine or cluster of nodes to process data from theoretically any number of shards, and will also feature some demo micro-applications streaming dummy credit cards transactions, enriching this data and then triggering other data consumers for various business needs, such as scoring transactions, updating real-time dashboards and messaging customers. Besides the technical details, the motives behind choosing R and Kinesis will be also covered, including a quick overview on the related data infrastructure changes at CARD.