-
Notifications
You must be signed in to change notification settings - Fork 86
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: Added section for Azure AI with 3 recommendations (#555)
Co-authored-by: Zach Trocinski <[email protected]> Co-authored-by: Zach Trocinski <[email protected]> Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>
- Loading branch information
1 parent
9a7e000
commit 771fd5e
Showing
7 changed files
with
156 additions
and
51 deletions.
There are no files selected for viewing
Validating CODEOWNERS rules …
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
1 change: 1 addition & 0 deletions
1
azure-resources/CognitiveServices/accounts/kql/0c193899-da60-4a52-b4a0-77d75ac8c5c5.kql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
// cannot-be-validated-with-arg |
1 change: 1 addition & 0 deletions
1
azure-resources/CognitiveServices/accounts/kql/8aa9744b-f302-4b05-9776-51d6dd3d0c3a.kql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
// cannot-be-validated-with-arg |
1 change: 1 addition & 0 deletions
1
azure-resources/CognitiveServices/accounts/kql/ac3add17-013e-41a5-af91-9fefce794a00.kql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
// cannot-be-validated-with-arg |
53 changes: 44 additions & 9 deletions
53
azure-resources/CognitiveServices/accounts/recommendations.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,16 +1,51 @@ | ||
- description: Enable diagnostic logging for Azure AI services and send the data to Log Analytics | ||
aprlGuid: d6d9e18a-9ad2-491e-878d-86d621785453 | ||
- description: Deploy a PAYG instance of the model with provisioned throughput to manage overflow effectively | ||
aprlGuid: 0c193899-da60-4a52-b4a0-77d75ac8c5c5 | ||
recommendationTypeId: null | ||
recommendationControl: MonitoringAndAlerting | ||
recommendationImpact: Low | ||
recommendationControl: HighAvailability | ||
recommendationImpact: High | ||
recommendationResourceType: Microsoft.CognitiveServices/Accounts | ||
recommendationMetadataState: Active | ||
longDescription: | | ||
All Logs and Metrics should be configured. These logs provide rich, frequent data about the operation of a resource that are used for issue identification and debugging. | ||
potentialBenefits: Enhanced monitoring and troubleshooting capabilities | ||
pgVerified: false | ||
Provisioned Throughput offers pre-allocated capacity for consistent workloads, while Pay-as-You-Go charges for actual usage, ideal for variable workloads. During overflow, the Pay-as-You-Go instance manages excess load, ensuring service efficiency | ||
potentialBenefits: PAYG model balances cost and performance and helps scale | ||
pgVerified: true | ||
automationAvailable: false | ||
tags: null | ||
learnMoreLink: | ||
- name: Enable diagnostic logging for Azure AI services | ||
url: "https://learn.microsoft.com/en-us/azure/ai-services/diagnostic-logging" | ||
- name: Learn More | ||
url: "https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/provisioned-throughput" | ||
|
||
- description: Ensure that models are deployed using Global batch for large scale processing | ||
aprlGuid: 8aa9744b-f302-4b05-9776-51d6dd3d0c3a | ||
recommendationTypeId: null | ||
recommendationControl: Scalability | ||
recommendationImpact: High | ||
recommendationResourceType: Microsoft.CognitiveServices/Accounts | ||
recommendationMetadataState: Active | ||
longDescription: | | ||
Global batch efficiently handles large-scale tasks within 24 hours. Submit requests in a single file, with a separate quota to protect online workloads. Key uses: data processing, content generation, document review, customer support automation, data extraction, NLP tasks, and marketing | ||
potentialBenefits: Cost effective faster turnaround for large-scale processing. | ||
pgVerified: true | ||
automationAvailable: false | ||
tags: null | ||
learnMoreLink: | ||
- name: Learn More | ||
url: "https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/deployment-types#global-batch" | ||
|
||
- description: Ensure AOAI models are deployed using Data Zone Standard for data residency requirements | ||
aprlGuid: ac3add17-013e-41a5-af91-9fefce794a00 | ||
recommendationTypeId: null | ||
recommendationControl: Governance | ||
recommendationImpact: High | ||
recommendationResourceType: Microsoft.CognitiveServices/Accounts | ||
recommendationMetadataState: Active | ||
longDescription: | | ||
Data zone deployments route customer traffic to the highest availability data center within the defined data zone, ensuring data at rest remains within the Azure OpenAI resource geography. This approach offers increased quota limits and ensures data processing occurs within the specified data zone | ||
potentialBenefits: Enforce data residency and compliance standards | ||
pgVerified: true | ||
automationAvailable: false | ||
tags: null | ||
learnMoreLink: | ||
- name: Learn More | ||
url: "https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/deployment-types#data-zone-standard" | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
--- | ||
title: Artificial Intelligence (GPT-RAG) | ||
geekdocCollapseSection: true | ||
geekdocHidden: false | ||
--- | ||
|
||
## Dependent Azure Resource Recommendations | ||
|
||
| Recommendation | Provider Namespace | Resource Type | | ||
| :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | :----------------: | :-----------: | | ||
| [Monitor Batch account quota](../../../Azure-Proactive-Resiliency-Library-v2/azure-resources/Batch/batchAccounts/#monitor-batch-account-quota) | Batch | batchAccounts | | ||
| [Create an Azure Batch pool across Availability Zones](../../../Azure-Proactive-Resiliency-Library-v2/azure-resources/Batch/batchAccounts/#create-an-azure-batch-pool-across-availability-zones) | Batch | batchAccounts | | ||
| [Deploy a PAYG instance of the model with provisioned throughput to manage overflow effectively](../../../Azure-Proactive-Resiliency-Library-v2/azure-resources/CognitiveServices/accounts/#deploy-a-PAYG-instance-of-the-model-with-provisioned-throughput-to-manage-overflow-effectively) | CognitiveServices | accounts | | ||
| [Ensure that models are deployed using Global batch for large scale processing](../../../Azure-Proactive-Resiliency-Library-v2/azure-resources/CognitiveServices/accounts/#ensure-that-models-are-deployed-using-global-batch-for-large-scale-processing) | CognitiveServices | accounts | | ||
| [Ensure AOAI models are deployed using Data Zone Standard for data residency requirements](../../../Azure-Proactive-Resiliency-Library-v2/azure-resources/CognitiveServices/accounts/#ensure-aoai-models-are-deployed-using-data-zone-standard-for-data-residency-requirements) | CognitiveServices | accounts | | ||
|
||
<br> | ||
|
||
## General Workload Guidance | ||
|
||
{{< azure-specialized-workloads-recommendationlist name="azure-specialized-workloads-recommendationlist" >}} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters