This is a complete dealbreaker! I experienced the same issue. I also did not find a way to disable this safety evaluations.
In my case I have an AI agent which uses Grounding with Bing and GPT 4.1. Grounding with Bing is already extremely expensive (USD 35/1000 transactions). But the hidden cost caused by the AI agent are even twice as high.
In my case the cost for the LLM is completely irrelevant compared to the additional cost caused by the pure Microsoft services. In my case this means that the Azure AI Agent cannot be used for the project (which could even mean that Azure will not be used at all).
Service | % of total cost |
---|---|
LLM (GPT 4.1) | 2.5 % |
Grounding with Bing | 30.3 % |
Agent - Safety Evaluations Input/Output tokens (hidden cost) | 67.2 % |
Microsoft should explain why those hidden cost apply and how they can be avoided.