Defensive Handling of MIME Parsing Ambiguities in Email Delivery

Internet-Draft	MIME Ambiguity Defense	March 2026
Chen	Expires 13 September 2026	[Page]

Abstract

Email security gateways and endpoint mail clients frequently rely on different MIME parsers, decoders, and error-recovery behavior. An attacker can exploit those differences so that a security control fails to extract or scan an attachment that a downstream client later exposes to a user. This document describes defensive processing guidance for SMTP receivers, mail gateways, and message stores that handle MIME messages with malformed or ambiguous structure.¶

This document provides operational guidance for ingress validation, strict decoding floors, ambiguity detection, multi-view extraction, union scanning, logging, and policy handling. It also defines an optional "MIME-Ambiguity-Results" header field for conveying receiver-generated ambiguity assessments to downstream components inside an administrative domain.¶

1. Introduction

Email attachment defenses often assume that the object scanned by a gateway is the same object that a receiving mail client will later present for download or execution. That assumption is not always true. Divergent handling of malformed or ambiguous MIME can create a gap between the detector-side view and the client-side view of the same message. That gap can be exploited to evade attachment detection.¶

The problem is operational rather than purely theoretical: deployed products differ in how they resolve duplicate or conflicting header fields, how they parse multipart boundaries, and how they decode malformed transfer encodings. This document provides defensive guidance intended to ensure that a receiving system scans at least every attachment view that mainstream clients could plausibly expose, or else blocks or quarantines the message.¶

This document is intentionally scoped to receiver-side defenses. It does not attempt to standardize all client parser behavior, nor does it provide exploit construction guidance.¶

2. Requirements Language and Conventions

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶

"Detector side" means any SMTP receiver, mail gateway, content filter, malware scanner, sandbox, or message store component that parses or scans inbound content before user access. "Client side" means the mail user agent or webmail interface that renders message structure or makes attachments available for download.¶

"Strict parse" means message parsing and decoding that follows Internet Message Format and MIME specifications, including the baseline decoding semantics required by those specifications. "Compatible parse" means a receiver-controlled parsing path used to approximate tolerated client behavior without inventing new semantics beyond what deployed clients are known to expose.¶

"Attachment view" means the set of extracted byte sequences that a given parsing path would make available to a user as attachments, downloadable body parts, or equivalent objects.¶

5. Receiver Processing Model

A receiver implementing this specification SHOULD process inbound messages using the following high-level sequence:¶

Ingress structural validation¶
Strict parsing and extraction¶
Compatible parsing and extraction¶
Construction of a union attachment view¶
Scanning of every extracted object in that union¶
Disposition according to local policy¶
Optional emission of receiver-generated ambiguity results¶

Receivers MAY combine or pipeline these steps internally, but the effective security outcome MUST be equivalent.¶

5.1. Ingress Structural Validation

Before normal delivery, a receiver SHOULD evaluate the message for structural conditions that are highly correlated with parser disagreement. At minimum, implementations SHOULD detect the following classes of conditions:¶

duplicate or conflicting MIME structural header fields, including multiple Content-Type fields with different effective values;¶
control characters, including NUL, in MIME-relevant header field names or values;¶
multipart bodies with absent, empty, or otherwise invalid boundary parameters;¶
use of RFC 2047 encoded-word syntax inside MIME parameter values, such as boundary or filename, where such usage is not permitted;¶
decoding anomalies in transfer encodings that are known to create divergent extraction outcomes; and¶
malformed line folding or abnormal header/body separation that can change message interpretation.¶

A receiver MUST classify each detected condition as either:¶

fatal: the message cannot be trusted to have a single safe interpretation and MUST be rejected or quarantined; or¶
ambiguous: the message might still be processable, but additional extraction and union scanning are required before any delivery decision.¶

Empty multipart boundaries, NUL in MIME-relevant header data, and directly conflicting MIME structural header fields SHOULD be treated as fatal by default.¶

5.2. Strict Parsing and Decoding Floor

A receiver MUST implement at least one strict parsing path grounded in [RFC5322], [RFC2045], [RFC2046], [RFC2047], and [RFC2183].¶

For transfer encodings, a receiver MUST NOT implement a decoding behavior that is weaker than the minimum semantics already required by the MIME specifications. In particular, if a MIME decoding rule requires tolerant handling of certain non-alphabet characters or whitespace, a receiver MUST NOT stop extraction earlier than that specification permits if doing so would produce a narrower scan view than a conformant client could expose.¶

5.3. Compatible Parsing

A receiver SHOULD implement at least one receiver-controlled compatible parsing path to approximate attachment views that common downstream clients may expose in practice. The purpose of the compatible path is defensive coverage, not message repair for end-user fidelity.¶

A compatible path MUST be constrained so that it does not invent new attachment semantics unsupported by realistic client behavior. Compatible parsing SHOULD be derived from observed receiver or client interoperability needs, regression testing, or differential parser analysis.¶

5.4. Union Extraction and Scanning

A receiver that performs both strict and compatible parsing MUST form a union attachment view from all extracted objects. Every object in that union MUST be subject to the same malware detection, content policy, archive expansion, and sandboxing controls that would apply to a normal attachment.¶

If any object in the union is classified as malicious or disallowed, the receiver MUST apply that disposition to the message as a whole, unless local policy instead replaces the object with a safe, auditable sanitization result.¶

If the union attachment view differs from the strict attachment view, the receiver MUST treat the message as ambiguous. Local policy MAY still permit delivery after successful scanning, but the default action SHOULD be quarantine or other restricted handling.¶

5.5. Resource Limits and Abuse Resistance

Because multi-view parsing and scanning can expand resource consumption, implementations MUST enforce limits on message size, extracted object count, nested multipart depth, recursive archive expansion, decoding output size, and processing time. Messages that exceed such limits MUST fail closed, typically by quarantine or rejection.¶

6. Minimum Anomaly Classes

The following anomaly classes form a minimum common vocabulary for receiver implementations. Implementations MAY define additional local classes.¶

dup-content-type: Duplicate or conflicting Content-Type fields or parameter interpretations that could change body-part structure.¶
nul-in-header: NUL or comparable control characters in MIME-relevant header field names or values.¶
empty-boundary: Missing or empty multipart boundary values, or equivalent boundary invalidity causing structure disagreement.¶
invalid-b64-char: Base64 decoding anomalies that alter extraction outcome across implementations.¶
qp-broken-softbreak: Quoted-printable soft line break anomalies that can alter recovered bytes or part delimitation.¶
encoded-word-in-parameter: Use of RFC 2047 encoded-word syntax in MIME parameters where not permitted.¶

Receivers SHOULD log anomaly classes in structured security telemetry even when local policy ultimately delivers the message.¶

8. The MIME-Ambiguity-Results Header Field

This section defines an OPTIONAL receiver-generated header field, MIME-Ambiguity-Results, for use within an administrative domain. The field communicates whether the receiver detected MIME ambiguity and what disposition was applied.¶

This field is not an originator assertion. It MUST be inserted only by trusted receiving infrastructure. Downstream consumers MUST ignore instances that originate outside the local trust boundary.¶

8.1. Syntax

The syntax in this section is described using ABNF [RFC5234]. The FWS, CFWS, and CRLF rules are imported from [RFC5322]. The authserv-id, token, and value rules are imported from [RFC8601].¶

MIME-Ambiguity-Results = "MIME-Ambiguity-Results:" FWS authserv-id
                           *( CFWS ";" CFWS mar-param ) CRLF

mar-param    = mar-result / mar-policy / mar-anomaly / mar-ext
mar-result   = "result=" ( "pass" / "ambiguous" / "fail" )
mar-policy   = "policy=" ( "accept" / "quarantine" /
                            "reject" / "sanitize" )
mar-anomaly  = "anomaly=" anomaly-code
anomaly-code = "dup-content-type" /
               "nul-in-header" /
               "empty-boundary" /
               "invalid-b64-char" /
               "qp-broken-softbreak" /
               "encoded-word-in-parameter" /
               x-anomaly
x-anomaly    = "x-" 1*(ALPHA / DIGIT / "-")
mar-ext      = token ["=" value]

The ALPHA and DIGIT rules are imported from [RFC5234].¶

8.2. Semantics

result indicates the receiver's overall ambiguity assessment. policy indicates the disposition taken by the receiver. anomaly identifies one or more anomaly classes that contributed to the assessment.¶

A receiver SHOULD place this field near other receiver-generated assessment fields. A downstream consumer that uses the field for policy decisions MUST rely only on instances inserted by trusted infrastructure inside the same administrative domain.¶

8.3. Example

MIME-Ambiguity-Results: mx.example.net; result=ambiguous;
  policy=quarantine; anomaly=dup-content-type;
  anomaly=invalid-b64-char

10. Security Considerations

This entire document is about security. The central security property is coverage equivalence: the detector-side scan view must not be narrower than the client-side exposure view.¶

Overly aggressive message repair can itself create security problems. Receivers SHOULD avoid speculative rewriting that changes message structure or attachment semantics in ways not directly justified by local sanitization policy.¶

The MIME-Ambiguity-Results header field is trustworthy only within a local administrative trust boundary. Attackers can forge the field in received messages; therefore downstream consumers MUST ignore untrusted instances.¶

Multi-view parsing increases computational cost and therefore creates a denial-of-service risk. Implementations MUST enforce hard resource limits and fail closed when those limits are exceeded.¶

13. Normative References

[RFC2045]: Freed, N. and N. Borenstein, "Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies", RFC 2045, DOI 10.17487/RFC2045, November 1996, <https://www.rfc-editor.org/info/rfc2045>.
[RFC2046]: Freed, N. and N. Borenstein, "Multipurpose Internet Mail Extensions (MIME) Part Two: Media Types", RFC 2046, DOI 10.17487/RFC2046, November 1996, <https://www.rfc-editor.org/info/rfc2046>.
[RFC2047]: Moore, K., "MIME (Multipurpose Internet Mail Extensions) Part Three: Message Header Extensions for Non-ASCII Text", RFC 2047, DOI 10.17487/RFC2047, November 1996, <https://www.rfc-editor.org/info/rfc2047>.
[RFC2119]: Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997, <https://www.rfc-editor.org/info/rfc2119>.
[RFC2183]: Troost, R., Dorner, S., and K. Moore, Ed., "Communicating Presentation Information in Internet Messages: The Content-Disposition Header Field", RFC 2183, DOI 10.17487/RFC2183, August 1997, <https://www.rfc-editor.org/info/rfc2183>.
[RFC5234]: Crocker, D., Ed. and P. Overell, "Augmented BNF for Syntax Specifications: ABNF", STD 68, RFC 5234, DOI 10.17487/RFC5234, January 2008, <https://www.rfc-editor.org/info/rfc5234>.
[RFC5322]: Resnick, P., Ed., "Internet Message Format", RFC 5322, DOI 10.17487/RFC5322, October 2008, <https://www.rfc-editor.org/info/rfc5322>.
[RFC8174]: Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, May 2017, <https://www.rfc-editor.org/info/rfc8174>.
[RFC8601]: Kucherawy, M., "Message Header Field for Indicating Message Authentication Status", RFC 8601, DOI 10.17487/RFC8601, May 2019, <https://www.rfc-editor.org/info/rfc8601>.

Defensive Handling of MIME Parsing Ambiguities in Email Delivery

Abstract

Status of This Memo

Copyright Notice

Table of Contents

1. Introduction

2. Requirements Language and Conventions

3. Threat Model

4. Defensive Goals

5. Receiver Processing Model

5.1. Ingress Structural Validation

5.2. Strict Parsing and Decoding Floor

5.3. Compatible Parsing

5.4. Union Extraction and Scanning

5.5. Resource Limits and Abuse Resistance

6. Minimum Anomaly Classes

7. SMTP Handling and Disposition

8. The MIME-Ambiguity-Results Header Field

8.1. Syntax

8.2. Semantics

8.3. Example

9. Operational Deployment Guidance

10. Security Considerations

11. Privacy Considerations

12. IANA Considerations

13. Normative References

14. Informative References

Author's Address