April 29, 2025 - v2.0.662

by Noriaki Tatsumi

Enhancements:

  • Patched a critical vulnerability in PyTorch
  • Improved the configuration parser for the LLM service connection string

Bug Fix:

  • Users now have the ability to archive disabled task rules via both the API & the UI

April 10, 2025 - v2.0.658

by Noriaki Tatsumi

Enhancements:

  • Optimized the profanity detection function in the toxicity rule to improve latency for inferences with a large number of consecutive repeating characters
  • The user input (inference) token counts now available in the inference query endpoint response

Maintenance:

  • Removed the deprecated API response body attributes in BaseDetailsResponse and PIIDetailsResponse. They were deprecated in October 2023.
    • BaseDetailsResponse.claims
    • BaseDetailsResponse.pii_results
    • BaseDetailsResponse.pii_entities
    • BaseDetailsResponse.toxicity_score
    • PIIDetailsResponse.pii_results
  • Removed the Hallucination v1 rule that was deprecated on May 24, 2024. If you have the rule in your active tasks, they will be automatically archived with this Shield upgrade.
  • Removed the experimental Hallucination v3 rule. It will be reintroduced when the next challenger to replace the v2 rule becomes available. If you have the rule in your active tasks, they will be automatically archived with this Shield upgrade.

New Feature:

  • Shield can now run on GPUs for Kubernetes deployments, resulting in low latency evaluations. The new GPU based deployment configuration is tested and supported for ASW EKS.

Enhancements:

  • The API key metadata lookup endpoint now contains the enrolled RBAC role names
  • Fully automated the process of updating Shield ingress URL from the installers
  • The task rules cache configuration is now tunable from the installers

January 16, 2025 - v2.0.626

by Noriaki Tatsumi

New Feature:

  • Any Shield task can now be assigned as the Arthur Chat firewall via API endpoint call

Enhancements:

  • Optimized the toxicity detection model for significant reduction of speed and compute cost (45+% latency reduction) without compromising accuracy
  • Optimized the prompt injection detection model for reduction of speed and compute cost without compromising accuracy
  • AWS ECS deployments can now configure warm instances on the GPU autoscaling group via CloudFormation for scaling out faster
  • Reduced the instance warmup time for faster dynamic autoscaling
  • OSS library upgrades

New Features:

  • AWS ECS GPU deployments now supports fast dynamic scaling
  • Added the capability for AWS ECS GPU deployments to scale-in and scale-out on a configurable schedule

November 6, 2024 - v2.0.567

by Noriaki Tatsumi

New Feature:

Product Enhancements:

  • Caching is now used for rules lookups to improve the speed of inference validations
  • Concurrency improvement made for GPU deployments
  • The Arthur Auth cluster coordinator now works with self-signed certificates

October 22, 2024 - v2.0.544

by Noriaki Tatsumi

New Feature:

  • Shield can now run on GPUs for ECS deployments, resulting in significantly faster performance.

Product Enhancements:

  • Users can now see the subcategories of toxicity or toxicity violation types in the UI
  • The sentence transformer model is now bundled inside the Shield container image instead of downloading it during server startup. This change results in a more reliable and faster server startup.
  • Improved the performance of the inference query endpoint

Bug Fixes:

  • Fixed an issue with refreshing the task list in the UI
  • The delete functionality when managing keyword rules in the UI now works as expected

October 16, 2024 - v2.0.532

by Noriaki Tatsumi

Bug Fixes:

  • Fixed the problem in Shield that was requiring unnecessary embedded model configuration when Arthur Chat is disabled

October 15, 2024 - v2.0.531

by Noriaki Tatsumi

Product Enhancements:

  • Further restricted what's on the Shield container by building on a distroless image