Tag archive: Apache Crunch

sticky
Jan 9, 2015

Personalization at Spotify using Cassandra

At Spotify we have have over 60 million active users who have […]
sticky
Dec 19, 2014

Solving MapReduce Performance Problems With Sharded Joins

Sometimes the answer to a sluggish data pipeline isn’t more […]
sticky
Nov 27, 2014

Data Processing with Apache Crunch at Spotify

All of our lovely Spotify users generate many terabytes of […]