Optimized the profanity detection function in the toxicity rule to improve latency for inferences with a large number of consecutive repeating characters
The user input (inference) token counts now available in the inference query endpoint response
Maintenance:
Removed the deprecated API response body attributes in BaseDetailsResponse and PIIDetailsResponse. They were deprecated in October 2023.
BaseDetailsResponse.claims
BaseDetailsResponse.pii_results
BaseDetailsResponse.pii_entities
BaseDetailsResponse.toxicity_score
PIIDetailsResponse.pii_results
Removed the Hallucination v1 rule that was deprecated on May 24, 2024. If you have the rule in your active tasks, they will be automatically archived with this Shield upgrade.
Removed the experimental Hallucination v3 rule. It will be reintroduced when the next challenger to replace the v2 rule becomes available. If you have the rule in your active tasks, they will be automatically archived with this Shield upgrade.
Shield can now run on GPUs for Kubernetes deployments, resulting in low latency evaluations. The new GPU based deployment configuration is tested and supported for ASW EKS.
Enhancements:
The API key metadata lookup endpoint now contains the enrolled RBAC role names
Fully automated the process of updating Shield ingress URL from the installers
The task rules cache configuration is now tunable from the installers
Shield can now run on GPUs for ECS deployments, resulting in significantly faster performance.
Product Enhancements:
Users can now see the subcategories of toxicity or toxicity violation types in the UI
The sentence transformer model is now bundled inside the Shield container image instead of downloading it during server startup. This change results in a more reliable and faster server startup.
Improved the performance of the inference query endpoint
Bug Fixes:
Fixed an issue with refreshing the task list in the UI
The delete functionality when managing keyword rules in the UI now works as expected