👨🏻‍🏫 Hopsworks Tutorials

Welcome to our collection of tutorials on the fundamentals of Hopsworks and Machine Learning development. The tutorials cover a range of use cases and common subjects in the field, and make it easier to navigate and use models in a production environment with the Hopsworks Feature Store.

⚙️ How to run the tutorials:

To run the tutorials, you need a Hopsworks account. You can create one at app.hopsworks.ai. With a managed account, just run the Jupyter notebook from within Hopsworks.

If you have an app.hopsworks.ai account, you can connect to Hopsworks with the following lines. You will be prompted with a link to your token, which is used to connect to the feature store.

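import hopsworks

# Log in to Hopsworks; this prompts you with a link to your token
project = hopsworks.login()
# Get a handle to the project's feature store
fs = project.get_feature_store()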

In some cases, you may also need to install the Hopsworks package before you can use it. To do so, start your notebook with:

!pip install -U 'hopsworks[python]' --quiet
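If you would rather not reinstall the package every time the notebook runs, the install can be made conditional. The following is a minimal sketch, not part of the tutorials themselves: the helper names `is_installed` and `ensure_hopsworks` are hypothetical, and it assumes that `pip` is available for the interpreter running the notebook.

```python
import importlib.util
import subprocess
import sys


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


def ensure_hopsworks() -> bool:
    """Install hopsworks[python] with pip only if it is missing.

    Hypothetical helper: returns True if an install was run, False otherwise.
    """
    if is_installed("hopsworks"):
        return False
    # Same install command as above, run against the current interpreter
    subprocess.check_call(
        [sys.executable, "-m", "pip", "install", "-U", "hopsworks[python]", "--quiet"]
    )
    return True
```

Calling `ensure_hopsworks()` in the first notebook cell, before `import hopsworks`, would skip the install when the package is already present. Unlike the `-U` command above, this sketch does not upgrade an existing installation.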

✍🏻 Concepts:

Familiarity with Machine Learning and Python development is recommended. For more information, visit the Hopsworks documentation.

🗄️ Table of Contents:

  • QuickStart: Introductory tutorial to get started quickly.

🚀 Real-time AI Systems

⚙️ Batch AI Systems

🔮 LLM AI Systems

🧬 API Examples

🔬 Integrations

  • Airflow GCP: Apache Airflow integration with Google Cloud Platform.
  • AzureSQL: Create an External Feature Group using Azure SQL Database.
  • BigQuery: Create an External Feature Group using BigQuery Storage Connector.
  • Bytewax: Real-time feature computation using Bytewax.
  • DBT with BigQuery: Perform feature engineering in DBT on BigQuery.
  • Federated Offline Query: Execute federated queries across offline data sources.
  • Google Cloud Storage: Create an External Feature Group using GCS Storage Connector.
  • Great Expectations: Introduction to Great Expectations concepts for the Hopsworks MLOps platform.
  • Java: Java-based integrations including Apache Beam and Apache Flink.
  • LangChain: Integration with LangChain for LLM applications.
  • MageAI: Build and operate ML systems with Mage and Hopsworks.
  • Neo4j: Perform anti-money laundering predictions using Neo4j Graph.
  • Polars: Introductory tutorial on using Polars with Hopsworks.
  • PySpark Streaming: Real-time feature computation using PySpark.
  • Redshift: Create an External Feature Group using Redshift Storage Connector.
  • Snowflake: Create an External Feature Group using Snowflake Storage Connector.
  • Weights & Biases: Build machine learning models with Weights & Biases.

📝 Feedback & Comments:

We welcome your input through: