scenarios/workload-genai/policies/genai-policy.xml

scenarios/workload-genai/policies/genai-policy.xml (19 lines of code) (raw):

<policies> <inbound> <base />  <include-fragment fragment-id="simple-priority-weighted" />   <include-fragment fragment-id="rate-limiting-by-tokens" />  <include-fragment fragment-id="usage-tracking-with-appinsights" /> </inbound> <backend>    <retry condition="@(context.Response.StatusCode == 429)" count="3" interval="1" first-fast-retry="true"> <forward-request buffer-request-body="true" /> </retry>  </backend> <outbound> <base />   </outbound> <on-error> <base /> </on-error> </policies>