Type: Story
Resolution: Unresolved
Needs validation as a user story. Model servers already enforce their own input token limits, so enforcing limits in two places is questionable. Also, as of v1alpha1, TokenRateLimitPolicy already counts input tokens as part of the token usage reported in the response.

As a platform engineer, I want to enforce input token rate limits at the Gateway and HTTPRoute level so that I can prevent excessive usage of expensive LLM APIs before requests reach the model server.
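
To make the story concrete, here is a minimal sketch of what such a policy could look like when attached to a Gateway. It assumes TokenRateLimitPolicy follows the usual Kuadrant policy-attachment shape at the v1alpha1 version mentioned above; the resource names, limit values, and counter expression are illustrative assumptions, not confirmed against the actual CRD.

```yaml
# Sketch only: assumed shape, not a verified v1alpha1 schema.
# A TokenRateLimitPolicy attached to a Gateway to cap input tokens
# before requests reach the model server.
apiVersion: kuadrant.io/v1alpha1
kind: TokenRateLimitPolicy
metadata:
  name: llm-input-token-limit        # hypothetical name
spec:
  targetRef:
    group: gateway.networking.k8s.io
    kind: Gateway                    # per the story, HTTPRoute is the other supported target
    name: external-llm-gateway       # hypothetical Gateway
  limits:
    per-user-input-tokens:
      rates:
        - limit: 100000              # illustrative: input tokens allowed per window
          window: 1h
      counters:
        - expression: auth.identity.userid   # illustrative per-user counter key
```

Note the overlap the comment above raises: if the model server enforces its own input limit and the policy already counts input tokens from the response's usage data, a separate gateway-level input-token limit may duplicate existing controls.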