-
Notifications
You must be signed in to change notification settings - Fork 4.3k
feat(inferenceprofiles): add inference and cross-region inference pro… #35048
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
First review done
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
...ages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/cross-region-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/inference-profile.ts
Outdated
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
...ages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/cross-region-inference-profile.ts
Show resolved
Hide resolved
packages/@aws-cdk/aws-bedrock-alpha/bedrock/inference-profiles/application-inference-profile.ts
Show resolved
Hide resolved
|
Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork). |
|
Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork). |
|
@Mergifyio update |
☑️ Nothing to do, the required conditions are not met
|
|
@Mergifyio requeue |
✅ The queue state of this pull request has been cleaned. It can be re-embarked automatically |
|
@Mergifyio requeue |
✅ The queue state of this pull request has been cleaned. It can be re-embarked automatically |
|
Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork). |
|
Thank you for contributing! Your pull request will be updated from main and then merged automatically (do not update manually, and be sure to allow changes to be pushed to your fork). |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
|
Comments on closed issues and PRs are hard for our team to see. |
Issue # (if applicable)
Closes #.
Reason for this change
This PR introduces comprehensive support for Amazon Bedrock Inference Profiles in the AWS CDK Bedrock Alpha construct library, addressing the need for better cost tracking, model usage optimization, and cross-region inference capabilities.
Description of changes
Application Inference Profiles : Added support for user-defined inference profiles that enable cost tracking and model usage monitoring
Single-region application profiles for basic cost tracking
Multi-region application profiles using cross-region inference profiles
Cross-Region Inference Profiles: Implemented system-defined profiles that enable seamless traffic distribution across multiple AWS regions
Prompt Routers: Added intelligent prompt routing capabilities
Describe any new or updated permissions being added
Implemented
grantProfileUsage()method for proper IAM permission handlingDescription of how you validated changes
Added unit test
Added integ test
And tested it with a cdkApp deployment.
Checklist
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache-2.0 license