What problem are we solving?

We want to be able to perform data analysis on Kubernetes scheduling and autoscaling behaviour.

What steps do we need to take to get there?

What questions do we need to answer?

Where do we get the data from?

How do we get the data out?

What format do we want to store the data in (while we’re doing the analysis? After we’re done?)