Azure OpenAI Insecure Output Handling

Sep 30, 2025 · Domain: LLM Data Source: Azure OpenAI Data Source: Azure Event Hubs Use Case: Insecure Output Handling Resources: Investigation Guide ·

Share on:

Detects when Azure OpenAI requests result in zero response length, potentially indicating issues in output handling that might lead to security exploits such as data leaks or code execution. This can occur in cases where the API fails to handle outputs correctly under certain input conditions.

Elastic rule (View on GitHub)

 1[metadata]
 2creation_date = "2025/02/25"
 3integration = ["azure_openai"]
 4maturity = "production"
 5updated_date = "2025/09/25"
 6
 7[rule]
 8author = ["Elastic"]
 9description = """
10Detects when Azure OpenAI requests result in zero response length, potentially indicating issues in output handling that
11might lead to security exploits such as data leaks or code execution. This can occur in cases where the API fails to
12handle outputs correctly under certain input conditions.
13"""
14false_positives = ["Queries that are designed to expect empty responses or benign system errors"]
15from = "now-60m"
16interval = "10m"
17language = "esql"
18license = "Elastic License v2"
19name = "Azure OpenAI Insecure Output Handling"
20note = """## Triage and analysis
21
22> **Disclaimer**:
23> This investigation guide was created using generative AI technology and has been reviewed to improve its accuracy and relevance. While every effort has been made to ensure its quality, we recommend validating the content and adapting it to suit your specific environment and operational needs.
24
25### Investigating Azure OpenAI Insecure Output Handling
26
27Azure OpenAI integrates AI capabilities into applications, enabling natural language processing tasks. However, improper output handling can lead to vulnerabilities, such as data leaks or unauthorized code execution. Adversaries might exploit these by crafting inputs that cause the API to mishandle responses. The detection rule identifies anomalies by flagging instances where API responses are unexpectedly empty, suggesting potential misuse or misconfiguration, especially when such events occur frequently.
28
29### Possible investigation steps
30
31- Review the logs for the specific Azure resource name flagged in the alert to understand the context and frequency of zero-length responses.
32- Examine the request lengths associated with the zero-length responses to identify any patterns or anomalies in the input data that might be causing the issue.
33- Check the cloud account ID associated with the alert to determine if there are any known issues or recent changes in configuration that could affect output handling.
34- Investigate the operation name "ChatCompletions_Create" to ensure that the API is being used as intended and that there are no unauthorized or unexpected uses.
35- Assess the overall environment for any recent updates or changes in the Azure OpenAI configuration that might have impacted output handling.
36
37### False positive analysis
38
39- Frequent legitimate requests with zero response length can occur during testing or development phases. To manage this, exclude known test environments or accounts from the detection rule by adding exceptions for specific cloud.account.id or azure.resource.name values.
40- Some applications may intentionally send requests that do not require a response, resulting in zero response length. Identify these applications and adjust the rule to exclude their specific azure.resource.name.
41- Network issues or temporary service disruptions can lead to zero-length responses. Monitor for patterns of such occurrences and consider excluding specific time frames or network segments if they are known to cause false positives.
42- Automated scripts or bots that interact with the API might generate zero-length responses as part of their normal operation. Identify these scripts and exclude their associated identifiers from the rule to prevent false alerts.
43
44### Response and remediation
45
46- Immediately isolate the affected Azure OpenAI resource to prevent further exploitation. This can be done by temporarily disabling the API or restricting access to it.
47- Review and validate the input handling mechanisms of the affected API to ensure they are robust against malformed or malicious inputs that could lead to insecure output handling.
48- Conduct a thorough audit of recent API requests and responses to identify any unauthorized access or data leaks. Pay special attention to requests with zero response length.
49- Implement additional logging and monitoring for the affected API to capture detailed information about requests and responses, which can help in identifying patterns or repeated attempts of exploitation.
50- Notify the security team and relevant stakeholders about the incident, providing them with detailed findings and any potential impact on data security.
51- If unauthorized access or data leakage is confirmed, follow the organization's incident response plan to notify affected parties and comply with any regulatory requirements.
52- Enhance detection capabilities by integrating anomaly detection tools that can identify unusual patterns in API usage, such as frequent zero-length responses, to prevent similar threats in the future.
53"""
54references = ["https://genai.owasp.org/llmrisk/llm02-insecure-output-handling"]
55risk_score = 21
56rule_id = "fb16f9ef-cb03-4234-adc2-44641f3b71ee"
57setup = """## Setup
58
59For more information on streaming events, see the Azure OpenAI documentation:
60
61https://learn.microsoft.com/en-us/azure/azure-monitor/essentials/stream-monitoring-data-event-hubs
62"""
63severity = "low"
64tags = [
65    "Domain: LLM",
66    "Data Source: Azure OpenAI",
67    "Data Source: Azure Event Hubs",
68    "Use Case: Insecure Output Handling",
69    "Resources: Investigation Guide",
70]
71timestamp_override = "event.ingested"
72type = "esql"
73
74query = '''
75from logs-azure_openai.logs-*
76| where
77    azure.open_ai.properties.response_length == 0 and
78    azure.open_ai.result_signature == "200" and
79    azure.open_ai.operation_name == "ChatCompletions_Create"
80| keep
81    azure.open_ai.properties.request_length,
82    azure.open_ai.result_signature,
83    cloud.account.id,
84    azure.resource.name
85| stats
86    Esql.event_count = count(*)
87  by
88    azure.resource.name
89| where
90    Esql.event_count >= 10
91| sort
92    Esql.event_count desc
93'''

Triage and analysis

Disclaimer: This investigation guide was created using generative AI technology and has been reviewed to improve its accuracy and relevance. While every effort has been made to ensure its quality, we recommend validating the content and adapting it to suit your specific environment and operational needs.

Investigating Azure OpenAI Insecure Output Handling

Azure OpenAI integrates AI capabilities into applications, enabling natural language processing tasks. However, improper output handling can lead to vulnerabilities, such as data leaks or unauthorized code execution. Adversaries might exploit these by crafting inputs that cause the API to mishandle responses. The detection rule identifies anomalies by flagging instances where API responses are unexpectedly empty, suggesting potential misuse or misconfiguration, especially when such events occur frequently.

Possible investigation steps

Review the logs for the specific Azure resource name flagged in the alert to understand the context and frequency of zero-length responses.
Examine the request lengths associated with the zero-length responses to identify any patterns or anomalies in the input data that might be causing the issue.
Check the cloud account ID associated with the alert to determine if there are any known issues or recent changes in configuration that could affect output handling.
Investigate the operation name "ChatCompletions_Create" to ensure that the API is being used as intended and that there are no unauthorized or unexpected uses.
Assess the overall environment for any recent updates or changes in the Azure OpenAI configuration that might have impacted output handling.

False positive analysis

Frequent legitimate requests with zero response length can occur during testing or development phases. To manage this, exclude known test environments or accounts from the detection rule by adding exceptions for specific cloud.account.id or azure.resource.name values.
Some applications may intentionally send requests that do not require a response, resulting in zero response length. Identify these applications and adjust the rule to exclude their specific azure.resource.name.
Network issues or temporary service disruptions can lead to zero-length responses. Monitor for patterns of such occurrences and consider excluding specific time frames or network segments if they are known to cause false positives.
Automated scripts or bots that interact with the API might generate zero-length responses as part of their normal operation. Identify these scripts and exclude their associated identifiers from the rule to prevent false alerts.

Response and remediation

Immediately isolate the affected Azure OpenAI resource to prevent further exploitation. This can be done by temporarily disabling the API or restricting access to it.
Review and validate the input handling mechanisms of the affected API to ensure they are robust against malformed or malicious inputs that could lead to insecure output handling.
Conduct a thorough audit of recent API requests and responses to identify any unauthorized access or data leaks. Pay special attention to requests with zero response length.
Implement additional logging and monitoring for the affected API to capture detailed information about requests and responses, which can help in identifying patterns or repeated attempts of exploitation.
Notify the security team and relevant stakeholders about the incident, providing them with detailed findings and any potential impact on data security.
If unauthorized access or data leakage is confirmed, follow the organization's incident response plan to notify affected parties and comply with any regulatory requirements.
Enhance detection capabilities by integrating anomaly detection tools that can identify unusual patterns in API usage, such as frequent zero-length responses, to prevent similar threats in the future.

References

LLM02:2025 Sensitive Information Disclosure

Sensitive information can affect both the LLM and its application context. This includes personal identifiable information (PII), financial details, health records, confidential business data, security credentials, and legal documents. Proprietary models may also have unique training methods and source code considered sensitive, especially in closed or foundation models. LLMs, especially when embedded in applications, risk […]

Read More