Welcome to the Bellroy Data Engineer Application Repo :)
Before you start with the questions, here are some things you need to note:
- Please create a private clone (don't fork) this repo.
- Please complete all parts of the application question, but don't spend more than 20-30m across all of them. (We’re looking for an idea of how you’d approach questions like this, how you think, and the sorts of things you’ll notice. We do want to respect your time, and we don’t want you to spend hours on these. We’re expecting the level of detail you can get out in that time, and won’t judge you harshly against applicants who obviously spent more time than that.)
- If you're not familiar with SQLite it might be worth reviewing ABOUT_SQLITE.md
Given the 4 tables in the sqlite3 db bellroy_question_1.sqlite3
(invoices
, invoice_lines
, products
and orders
), please write a SQL SELECT statement to prepare a dataset for a sales report returning columns month
, style_code
, color_name
, revenue
. (Out of the box, SQLite's date and time functions are not super complicated - month number (01 to 12) is just fine for this report.)
Assume the four rows of data you have for each table in bellroy_question_1.sqlite3
is only a small sample of the data your query will process; submit a SQL query that you expect to work on the whole dataset.
Comment freely on our schema and anything else you notice. (We mention this because we suspect that if you’re the candidate we’re looking for, you’re at least a little horrified by something in this database.)
- Expected answers:
- A SQL SELECT statement that returns the appropriate recordset. The statement will be run in sqlite3.
- Your comments & notes.