Add Amazon Bedrock Guardrails integration #2

bdruth · 2025-04-23T21:11:28Z

Summary

- Add support for Amazon Bedrock Guardrails at both gateway and request levels
- Implement guardrail configuration through environment variables
- Apply guardrails to both standard Bedrock models and custom imported models
- Add example guardrail parameters in API documentation

Test plan

- Verify guardrails are applied correctly with default gateway settings
- Verify request-level guardrails override gateway defaults
- Test with both standard and custom imported models

🤖 Generated with Claude Code

This commit adds the ability to use custom models imported into AWS Bedrock through the OpenAI-compatible API interface.

Key features include:
- User-friendly model IDs that include the model name (e.g., mistral-7b-instruct-id:custom.a1b2c3d4)
- Support for both custom models and imported models
- Detailed documentation in CUSTOM_MODELS_IMPLEMENTATION.md
- Updates to README.md and Usage.md
- Custom models are always enabled by default

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>
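The friendly ID format can be illustrated with a small sketch. It assumes, based only on the single example above, that a friendly ID is the model name followed by `-id:` and the AWS-side identifier; the separator, `CustomModelRef`, `parse_model_id`, and `to_friendly_id` are hypothetical names and are not taken from the PR's code.

```python
from typing import NamedTuple, Optional

# Assumption: "<model-name>-id:<aws-id>", inferred from the one example
# "mistral-7b-instruct-id:custom.a1b2c3d4". The real format may differ.
SEPARATOR = "-id:"


class CustomModelRef(NamedTuple):
    name: Optional[str]  # human-readable part, None for a bare AWS ID
    aws_id: str          # identifier passed on to Bedrock


def parse_model_id(model_id: str) -> CustomModelRef:
    """Split a friendly ID into name and AWS ID; pass bare IDs through."""
    # rpartition splits on the LAST separator, so a model name that itself
    # contains "-id:" still yields the trailing AWS ID.
    name, sep, aws_id = model_id.rpartition(SEPARATOR)
    if not sep:
        return CustomModelRef(None, model_id)
    return CustomModelRef(name, aws_id)


def to_friendly_id(name: str, aws_id: str) -> str:
    """Build the friendly ID that would be echoed back in API responses."""
    return f"{name}{SEPARATOR}{aws_id}"
```

Because the AWS-side identifier stays in the ID, two models with the same name would still get distinct friendly IDs under this assumed scheme.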

This file provides guidance for the Claude Code assistant when working with this repository, including commands and code style guidelines. Updated to use pipx for running ruff.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

- Update schema.py to handle user-friendly custom model IDs
- Update model.py router to preserve user-friendly IDs in responses

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

- Add EcrAccountId parameter to both Lambda and Fargate templates
- Default to the current official account ID (366590864501)
- Use the parameter in ECR repository URLs and IAM policy

This change allows users to easily deploy with custom ECR repositories.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

Enhances custom model streaming support to handle:
1. Missing contentType in stream chunks
2. Support for 'generation' field format used by some models

This ensures that custom imported models can be used with streaming mode.
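A minimal sketch of the kind of chunk handling described here, not the PR's actual code. It assumes each stream event has the shape `{"chunk": {"bytes": ...}}` carrying a JSON payload, and that the text lives in a `generation` field; `chunk_text` and `collect_text` are hypothetical helper names.

```python
import json
from typing import Any, Dict, Iterable


def chunk_text(event: Dict[str, Any]) -> str:
    """Return the text carried by one stream event, or "" if there is none."""
    raw = event.get("chunk", {}).get("bytes", b"")
    if not raw:
        return ""
    # No contentType is consulted: the payload is simply assumed to be JSON,
    # and anything that fails to parse is skipped rather than raising.
    try:
        payload = json.loads(raw)
    except ValueError:  # covers JSONDecodeError and UnicodeDecodeError
        return ""
    if not isinstance(payload, dict):
        return ""
    text = payload.get("generation")
    return text if isinstance(text, str) else ""


def collect_text(events: Iterable[Dict[str, Any]]) -> str:
    """Concatenate the text of every event in a stream."""
    return "".join(chunk_text(event) for event in events)
```

Skipping unparseable chunks keeps a stream alive when a model emits an unexpected frame; a stricter gateway might prefer to log or fail instead.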

Enhances custom model handling to support additional response formats:
1. Add support for 'generation' field in response
2. Add support for 'prompt_token_count' and 'generation_token_count' fields
3. Add debug logging of response keys to help troubleshoot

This ensures that custom imported models can be used with both streaming and non-streaming modes.
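As an illustration of the three points above, the sketch below maps a response body that uses `generation`, `prompt_token_count`, and `generation_token_count` onto an OpenAI-style usage object. The function name `extract_completion` and the shape of the returned dict are assumptions made for the example, not the gateway's actual interface.

```python
import logging
from typing import Any, Dict

logger = logging.getLogger(__name__)


def extract_completion(body: Dict[str, Any]) -> Dict[str, Any]:
    """Pull completion text and token usage out of a parsed response body."""
    # Logging only the keys helps diagnose unknown formats without
    # writing model output to the logs.
    logger.debug("custom model response keys: %s", sorted(body.keys()))

    text = body.get("generation") or ""
    prompt_tokens = int(body.get("prompt_token_count") or 0)
    completion_tokens = int(body.get("generation_token_count") or 0)

    return {
        "text": text,
        "usage": {
            "prompt_tokens": prompt_tokens,
            "completion_tokens": completion_tokens,
            "total_tokens": prompt_tokens + completion_tokens,
        },
    }
```

Missing counters fall back to zero here so that a model which omits usage data still produces a well-formed response.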

These tests verify that:
1. Custom models appear in the models list API
2. Custom models can be invoked successfully with a non-empty response
3. Custom models support streaming mode

The tests require at least one custom model to be set up in the AWS account.

This utility script allows direct testing of custom imported models using boto3:
1. Tests direct invocation with the AWS Bedrock runtime
2. Tests streaming invocation with the AWS Bedrock runtime
3. Parses different response formats used by custom models
4. Extracts complete texts from streaming chunks

Useful for debugging or verifying custom model behavior outside of the gateway API.

- Extract helper function to reduce code duplication
- Simplify custom model listing logic
- Improve model lookup to handle friendly IDs and AWS IDs efficiently
- Add helper methods to extract completion text and usage
- Replace DEBUG blocks with logger.debug() calls
- Streamline validation and response handling

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

- Add gateway-level guardrail configuration options
- Apply guardrails to both standard and custom models
- Support request-level guardrail parameters
- Add documentation in chat.py example request

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>
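The precedence between the two levels can be sketched as follows. This is an illustration under stated assumptions rather than the PR's implementation: the environment variable names and the request parameter names (`guardrail_identifier`, `guardrail_version`) are hypothetical. The output keys `guardrailIdentifier` and `guardrailVersion` follow the names the Bedrock runtime API uses for these settings.

```python
import os
from typing import Any, Dict, Mapping, Optional

# Hypothetical environment variable names for the gateway-level defaults.
ENV_GUARDRAIL_ID = "DEFAULT_GUARDRAIL_IDENTIFIER"
ENV_GUARDRAIL_VERSION = "DEFAULT_GUARDRAIL_VERSION"


def resolve_guardrail(
    request_params: Optional[Mapping[str, Any]] = None,
    env: Mapping[str, str] = os.environ,
) -> Optional[Dict[str, str]]:
    """Pick the guardrail for one request; request-level settings win."""
    request_params = request_params or {}

    if request_params.get("guardrail_identifier"):
        # Identifier and version are treated as a pair, so a request-level
        # identifier is never combined with the gateway's default version.
        identifier = request_params.get("guardrail_identifier")
        version = request_params.get("guardrail_version")
    else:
        identifier = env.get(ENV_GUARDRAIL_ID)
        version = env.get(ENV_GUARDRAIL_VERSION)

    if not identifier or not version:
        return None  # no complete guardrail configured: send without one

    return {"guardrailIdentifier": str(identifier), "guardrailVersion": str(version)}
```

Passing `env` explicitly makes the override behaviour easy to check in a unit test without touching the real process environment.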

msharp9

I was so confused at first since I was just looking for the guardrails stuff, but the majority of this PR is actually you adding the custom names for the custom models.

I am a bit skeptical about how the custom naming is going to work with model versions and whether it will run into name conflicts there, but overall it looks good.

bdruth and others added 10 commits April 16, 2025 11:45

Update model router and schema for custom models

09f6573

- Update schema.py to handle user-friendly custom model IDs
- Update model.py router to preserve user-friendly IDs in responses

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>

Fix streaming for custom imported models

71bf0f2

Enhances custom model streaming support to handle:
1. Missing contentType in stream chunks
2. Support for 'generation' field format used by some models

This ensures that custom imported models can be used with streaming mode.

msharp9 approved these changes Apr 25, 2025
