This folder contains hands-on labs introducing some of the essential services that Azure provides for handling big-data workloads and visualizing the results. Learn how to spin up Apache Spark clusters in the cloud using Azure HDInsight, use Apache Hadoop to extract information from large datasets, use Microsoft Power BI to explore and visualize data, and more.
Lab | Scenario | Technology/Language | Cost |
---|---|---|---|
Azure Data Lake | Import data from disparate sources into an Azure Data Lake Store and use Azure Data Lake Analytics to perform federated queries with U-SQL. | Azure Data Lake Azure SQL Database U-SQL |
$$ |
Hadoop on Azure HDInsight | Deploy an Hadoop cluster on Azure and use MapReduce to analyze a text file and Hive to analyze a log file. | Apache Hadoop Azure HDInsight Hive Python |
$$$ |
Spark on Azure HDInsight | Deploy a Spark cluster on Azure and use Jupyter notebooks to analyze food-inspection data from the city of Chicago, build a machine-learning model around it, and visualize the results. | Apache Spark Azure HDInsight Jupyter Python |
$$$ |
Microsoft Power BI | Use Microsoft Power BI to view sales data for a fictitious company and create reports and dashboards containing visualizations of that data. | Microsoft Power BI | $ |