This repository has been archived by the owner on Oct 7, 2024. It is now read-only.

Support for Dynamic Quota in Chat-Models #91

Open
1 task done
g4challenge opened this issue Aug 6, 2024 · 2 comments

@g4challenge

Is there an existing issue for this?

  • I have searched the existing issues

Description

I need to enable dynamic quota for the default GPT model deployments.

New or Affected Resource(s)/Data Source(s)

openai_deployment

Potential Terraform Configuration

deployment = {                                # TODO make sure to update corresponding litellm config
    "chat_model" = {
      name                  = "gpt-4o"
      model_format          = "OpenAI"
      model_name            = "gpt-4o"
      model_version         = "2024-05-13"
      scale_type            = "Standard"
      dynamic_quota_enabled = true
      #capacity             = 120
    },
  }

References

https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/dynamic-quota

@zioproto
Collaborator

zioproto commented Aug 6, 2024

Hello @g4challenge,

TL;DR: waiting on a Terraform provider feature.

To implement this feature in the module, the azurerm provider first needs to support it.

I see there is an open issue hashicorp/terraform-provider-azurerm#23988 and an existing PR hashicorp/terraform-provider-azurerm#25401.

We have to wait for that PR to be merged.
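
For reference, once provider support lands, the underlying resource configuration might look roughly like this. This is a sketch only: the `dynamic_throttling_enabled` argument name is an assumption based on the service API's `dynamicThrottlingEnabled` property and the linked PR, and block names (`sku` vs. the older `scale`) vary across provider versions.

```hcl
# Hypothetical sketch, pending azurerm provider support.
resource "azurerm_cognitive_deployment" "chat_model" {
  name                 = "gpt-4o"
  cognitive_account_id = azurerm_cognitive_account.example.id # assumed account resource

  model {
    format  = "OpenAI"
    name    = "gpt-4o"
    version = "2024-05-13"
  }

  sku {
    name = "Standard" # older provider versions use a scale { type = ... } block
  }

  # Assumed argument name; may differ in the merged PR.
  dynamic_throttling_enabled = true
}
```
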

thanks

@zioproto
Collaborator

zioproto commented Aug 6, 2024

Hello @g4challenge

I also noticed that this feature is in preview. We will add it to the module only once it is promoted to GA.
