In this example we will go through the steps required for easily create visualization reports using Amazon SageMaker Studio Notebooks on a scheduled base, by using the capabilities of Interactive Data Processing and Notebook Jobs.
The example dataset is based on the Electricity Forecasting dataset taken from Kaggle.
In this repo, we will show you:
- How perform distributed data processing by using PySpark in Studio Notebooks with Glue Interactive Session
- Create scheduled executions of the notebook by using SageMaker Scheduled Notebook Job for creating weekly feature and insight extractions directly by using the notebbok
- Visualize results and reports in a SageMaker Studio Notebook