May 24, 2024 - v2.0.356
11 months ago by Noriaki Tatsumi
Product Enhancements:
- Improved the concurrency of the toxicity rule execution for performance improvement
- Reduced the false positives of partial profanities, accidental profanities, and obscured niche scandalous internet terms in the toxicity rule
- Added the capability for the hallucination v2 rule to return partial results instead of responding with “Unavailable” for the whole inference when it’s not able to successfully evaluate all claims
- Retrained the prompt injection model and updated the input truncation schema for higher accuracy
- Introduced a new experimental version of hallucination rule (not ready for production use)
- The hallucination v1 rule has been deprecated
- UI: It’s now more clear that sorting the inference deep dive table is done according to timestamp
- UI: Added the ability to navigate to specific pages on the tasks page
- UI: Made usability improvements to filtering on the inference deep dive page
Bug Fix:
- Fixed an issue where an inference was labeled as an overall pass but there was at least one rule that failed