feat(ai-proxy): support Google Cloud Vertex #2119
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2119       +/-   ##
===========================================
+ Coverage   35.91%   46.06%   +10.15%
===========================================
  Files          69       81       +12
  Lines       11576    13010     +1434
===========================================
+ Hits         4157     5993     +1836
+ Misses       7104     6671      -433
- Partials      315      346       +31
I fixed a few formatting issues; please pull the latest changes.
	return err
}

err = v.getAccessToken(jwtToken)
Does this call the authentication service on every request? That overhead seems too high.
Ⅰ. Describe what this PR did
Support the Google Cloud Vertex provider.
Ⅱ. Does this pull request fix one issue?
fix: #1697
Ⅳ. Describe how to verify it
docker-compose.yaml
envoy.yaml
Test a non-streaming request:
Test a streaming request:
Ⅴ. Special notes for reviews
Vertex API documentation: https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/inference