Artisanal Data

Use Case Driven

We have lots of data at Factual. But to solve some of our harder problems, we need to get down and dirty with our data -- to examine, evaluate, and experience it (sometimes even smell it). This talk attempts to re-brand this kind of work with the new, alternative buzzword, "Artisanal Data". I review the "Artisanal Data" technologies and techniques we use at Factual, including how we document experiments so that they get read, evaluate failure modes and judge successes, and keep our annotation data as accurate as possible. With the right statistical precautions, Artisanal Data can use be used to more effectively and emotionally communicate impact of our data.