Let’s take a look at Bindle’s data warehouse again. It contains:
- production database tables (users, purchases)
- mobile attribution data from Adjust
- web analytics events
- mobile analytics events
- marketing spends from Facebook
- marketing spends from Twitter
The last two live in the same table marketing_spends
but the data comes from two different marketing platforms (Facebook and Twitter).
Let’s think about how this data finds its way into the data warehouse. Does someone run INSERT
queries or import CSVs? That would be a LOT of CSV files every...