Tag archive: Data
Sep 5, 2024
Are You a Dalia? How We Created Data Science Personas for Spotify’s Analytics Platform
On Spotify’s Analytics Platform, we’re dedicated to building products that empower data practitioners to...
Aug 28, 2024
Unlocking Insights with High-Quality Dashboards at Scale
We have a lot of dashboards at Spotify. Our Insight teams and analysts from across the company are constantly...
May 28, 2024
Data Platform Explained Part II
Check out Part I, where we started sharing the journey of building a data platform, its building blocks, and the...
May 15, 2024
Fixed-Power Designs: It’s Not IF You Peek, It’s WHAT You Peek at
TL;DR Sometimes we cannot estimate the required sample size needed to power an experiment before starting it....
Apr 2, 2024
Data Platform Explained Part I
As engineers working at Spotify, we frequently find ourselves explaining our robust data platform to fellow...
Jan 5, 2023
What’s a “Listening Personality”?
We did a couple of new things in Wrapped this year, and one of these is a thing called Your Listening...
Mar 14, 2022
Why We Switched Our Data Orchestration Service
TL;DR Within Spotify, we run 20,000 batch data pipelines defined in 1,000+ repositories, owned by 300+ teams...
Feb 16, 2022
Fred Wang: Senior Backend Engineer
As a Senior Backend Engineer at Spotify New York, Fred’s role on 2021 Wrapped involved serving data stories...
Dec 17, 2021
The Audio Aura Story: Mystical to Mathematical
TL;DR For 2021 Wrapped, we were challenged to visually express a user’s audio aura based on how they listened this...
Oct 20, 2021
Changing the Wheels on a Moving Bus — Spotify’s Event Delivery Migration
At Spotify, data rules all. We log a variety of data, from listening history, to results of A/B testing, to...
Mar 1, 2021
2020 Unwrapped: The people behind the numbers
2020 Wrapped is a story of gratitude and resilience. And we’re grateful for the people and teams behind the...
Feb 11, 2021
How Spotify Optimized the Largest Dataflow Job Ever for Wrapped 2020
In this post we’ll discuss how Spotify optimized and sped up elements from our largest Dataflow job, , for...
Sep 21, 2020
Spotify Unwrapped 2019: How We Built an In-App Experience Just for You
As we prepare to launch , we remind ourselves of the challenges we took on and lessons learned to make this...
Sep 3, 2020
Listening Together, Miles Apart
Every second more than 30,000 people around the world press play on the same song on Spotify. Imagine if you...
Jul 22, 2020
Leveraging Mobile Infrastructure with Data-Driven Decisions
TL;DR The pursuit wasn’t always easy, but “putting data first” has helped Spotify dramatically improve the...
May 28, 2020
Spotify Modernizes Client-Side Architecture to Accelerate Service on All Devices
Engineer Carl Engström explains how lifting the 10,000 limit on Liked Songs enabled his team to address a...
Apr 17, 2020
Ann Clifton: Senior Research Scientist
Ann is a Senior Research Scientist and has worked in our New York office for just over a year. She spoke to...
Apr 16, 2020
Introducing the Spotify Podcast Dataset and TREC Challenge 2020
Podcasts are exploding in popularity. Since 2015, we’ve added hundreds of thousands of shows, and users are...
Apr 14, 2020
When Should I Write an Architecture Decision Record
An Architecture Decision Record (ADR) is a document that captures a decision, including the context of how...
Feb 27, 2020
How We Improved Data Discovery for Data Scientists at Spotify
At Spotify, we believe strongly in data-informed decision making. Whether we’re considering a big shift in...
Feb 18, 2020
Spotify Unwrapped: How we brought you a decade of data
The Wrapped campaign is one of Spotify’s largest marketing and social campaigns of the year. It enables our users to see a...
Feb 6, 2020
Techbytes: Hans Dockter discusses Developer Productivity Engineering
Tune in to Hans Dockter, CEO & Founder of Gradle, as he explains the emerging practice of Developer...
Jan 18, 2020
Anna Smith: Staff Engineer
Anna started at our New York office three years ago and has recently been promoted to Staff Engineer. She and...
Dec 9, 2019
Views From The Cloud: A History of Spotify’s Journey to the Cloud, Part 1
Spotify’s Chief Architect, Niklas Gustavsson, was at the heart of the company’s journey to migrate its data...
Nov 12, 2019
Spotify’s Event Delivery – Life in the Cloud
Spotify is a data-informed company, and in such a company Event Delivery is a key component. Every event...
Oct 16, 2019
Techbytes: Handling Big Data at Spotify
Hear from Spotify’s Erin Palmer, as she illustrates the use of Scio with Big Data, discusses why it makes...
May 30, 2019
Scio 0.7: a deep dive
Large-scale data processing is a critical component of Spotify’s business model. It drives music...
Nov 15, 2018
Introducing Chartify: Easier chart creation in Python for data scientists
Have you ever been frustrated with the complicated experience of making charts in Python? We have, so we...
Sep 18, 2018
Scalable User Privacy
At Spotify, we have a complex and diverse data processing ecosystem. Our backend infrastructure handles...
Sep 4, 2018
Introducing cstar: The Spotify Cassandra orchestration tool, now open source
Today, we announce that we are open sourcing cstar, our Cassandra orchestration tool.