<?xml version="1.0" encoding="UTF-8"?>
  <?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
  <!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.39 (Ruby 4.0.5) -->


<!DOCTYPE rfc  [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">

]>


<rfc ipr="trust200902" docName="draft-rswg-rfc7997bis-11" category="info" submissionType="editorial" obsoletes="7997" tocInclude="true" sortRefs="true" symRefs="true">
  <front>
    <title abbrev="Text in RFCs">Text in RFCs</title>

    <author initials="P." surname="Hoffman" fullname="Paul Hoffman">
      <organization></organization>
      <address>
        <email>paul.hoffman@icann.org</email>
      </address>
    </author>

    <date year="2026" month="June" day="01"/>

    
    
    <keyword>Internet-Draft</keyword>

    <abstract>


<?line 42?>

<t>This document sets policy for the inclusion of characters in the definitive versions and publication formats of RFCs.
The policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
This document obsoletes RFC 7997.</t>



    </abstract>



  </front>

  <middle>


<?line 48?>

<section anchor="introduction"><name>Introduction</name>

<t>The early policy for the RFC Series was that RFCs could only contain characters from the ASCII character set.
Later policies, from <xref target="RFC7997"/>, allowed more characters, set the language of the RFC Series to be English, and set the encoding for RFCs of UTF-8.
In the time since <xref target="RFC7997"/> was published, the IETF community has had much more experience of using non-ASCII characters in RFCs.</t>

<t>This document obsoletes <xref target="RFC7997"/>.
This document makes substantial changes to the policies in <xref target="RFC7997"/> based on the positive experience since its publication.</t>

<t>The RFC Publication Center (RPC) is responsible for implementing the policies in this document, as described in <xref target="RFC9720"/>.
The RPC style guides may define which characters authors may use and how they are used.</t>

<section anchor="terminology"><name>Terminology</name>

<t>The term "non-ASCII characters" means characters outside the set that was defined in ASCII.
ASCII is described in <xref target="RFC20"/>.</t>

<t>The term "Unicode characters" means characters defined in <xref target="UnicodeLatest"/>.</t>

<t>"U+ notation" means using the characters "U+" and a hexadecimal number to represent a Unicode code point.
See <xref target="BCP137"/> for more on U+ notation.</t>

<t>More terminology about characters and encoding formats can be found in <xref target="RFC6365"/>.</t>

</section>
</section>
<section anchor="basic-requirements-for-text-in-rfcs"><name>Basic Requirements for Text in RFCs</name>

<t>RFCs should only contain text that can be displayed correctly across a wide range of readers and browsers.
People whose systems do not have the fonts needed to display part of a particular RFC still need to be able to read the definitive versions and publication formats correctly in order to understand and implement the information described in the document.</t>

<t>The ability to use non-ASCII characters in RFCs in a clear and consistent manner will allow the correct display of proper names and improve the ability to describe internationalized protocols.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

</section>
<section anchor="policy-for-text-in-rfcs"><name>Policy for Text in RFCs</name>

<t>English is the required language of RFCs.
However, because non-ASCII characters are often required for instances including proper names and examples, the policy for the RFC Series is that all displayable text is allowed as long as there is a high expectation that readers of an RFC will be able to interpret its text as intended.
Apart from their role in proper names, non-ASCII characters should be used only when they enhance the technical content and accuracy of the document.</t>

<t>There are some Unicode characters that cannot be displayed (such as control and whitespace characters).
Additionally, many characters are also not widely available in the font packages that are available to viewers.
Although HTML and PDF formats render any character that has a glyph, that can involve additional steps in preparation.
The text publication format <xref target="RFC7994"/> of an RFC, as well as excerpts copied from an RFC, are displayed using the font packages supplied by the viewer, which might not include the necessary glyphs.
RFCs need to include descriptions of any such characters.</t>

<t>The preferred method for describing such characters is using the U+ notation from <xref target="BCP137"/> and/or using the character's official name from the Unicode Standard <xref target="UnicodeLatest"/>.
<xref target="BCP137"/> describes the pros and cons of different options for identifying Unicode characters and may help authors decide how to represent the non-ASCII characters in their documents.</t>

<t>Note that this policy only applies to normative or descriptive text; text such as names does not need character description.
Further, some RFC authors might choose to use something other than the U+ notation to describe characters, such as if the RFC already covers a different syntax that the reader will understand from the rest of the RFC.</t>

<t>Characters in an RFC will generally appear in Normalization Form C (NFC) as defined in <xref target="UnicodeLatest"/>.
If the RFC would be more correct and more understandable with particular characters not in NFC, the RPC can use unnormalized text.
In such a case, a note should be included to describe why unnormalized text was used.</t>

<section anchor="names"><name>Names</name>

<t>Authors of RFCs whose names include non-ASCII characters will likely have preferences for how their names are displayed.
If the author's preferred name is in a non-Latin script, they must also provide an alternate name in Latin script.
Authors' preferences for display of their names should be honored.</t>

<t>Company names and geographic names generally do not need Latin alternates, but they can be included at the discretion of the author and the RPC.</t>

</section>
<section anchor="examples"><name>Examples</name>

<t>Where the use of non-ASCII characters is purely part of an example or not otherwise required for correct protocol operation, giving the Unicode code points and Unicode names of the non-ASCII characters is not required, but it can improve the readability of the RFC.
An RFC can use either or both forms, whichever is sensible in the circumstance.
For example, for text that might just say "The value can be followed by a monetary symbol such as ¥ or €", text that says one of the following is likely more beneficial to the reader:</t>

<t><list style="symbols">
  <t>The value can be followed by a monetary symbol such as ¥ (U+00A5) or € (U+20AC)</t>
  <t>The value can be followed by a monetary symbol such as ¥ (YEN SIGN) or € (EURO SIGN)</t>
  <t>The value can be followed by a monetary symbol such as ¥ (U+00A5, YEN SIGN) or € (U+20AC, EURO SIGN)</t>
</list></t>

<t>RFCs may be viewed using only black and white or grayscale, particularly when printed.
Because of this, examples should generally use characters that do not specify a color.
However, some examples might require text with color due to the nature of the examples.
If so, those examples need to also include U+ notation.
For example, "A color display should be able to differentiate 🔴 (U+1F534, LARGE RED CIRCLE), 🟢 (U+1F7E2, LARGE GREEN CIRCLE), and 🔵 (U+1F535, LARGE BLUE CIRCLE)."</t>

</section>
</section>
<section anchor="rfc-publication-language"><name>RFC Publication Language</name>

<t>The RFC publication language is English.</t>

</section>
<section anchor="rfc-publication-encoding"><name>RFC Publication Encoding</name>

<t>The encoding format for the RFC Series is UTF-8 <xref target="STD63"/>.</t>

</section>
<section anchor="iana-considerations"><name>IANA Considerations</name>

<t>This document contains no IANA considerations.</t>

</section>
<section anchor="security-considerations"><name>Security Considerations</name>

<t>Authors and the RPC should cross-check that the used characters match their code point numbers or Unicode character names.</t>

</section>


  </middle>

  <back>


<references title='References' anchor="sec-combined-references">

    <references title='Normative References' anchor="sec-normative-references">



<referencegroup anchor="BCP137" target="https://www.rfc-editor.org/info/bcp137">
  <reference anchor="RFC5137" target="https://www.rfc-editor.org/info/rfc5137">
    <front>
      <title>ASCII Escaping of Unicode Characters</title>
      <author fullname="J. Klensin" initials="J." surname="Klensin"/>
      <date month="February" year="2008"/>
      <abstract>
        <t>There are a number of circumstances in which an escape mechanism is needed in conjunction with a protocol to encode characters that cannot be represented or transmitted directly. With ASCII coding, the traditional escape has been either the decimal or hexadecimal numeric value of the character, written in a variety of different ways. The move to Unicode, where characters occupy two or more octets and may be coded in several different forms, has further complicated the question of escapes. This document discusses some options now in use and discusses considerations for selecting one for use in new IETF protocols, and protocols that are now being internationalized. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
      </abstract>
    </front>
    <seriesInfo name="BCP" value="137"/>
    <seriesInfo name="RFC" value="5137"/>
    <seriesInfo name="DOI" value="10.17487/RFC5137"/>
  </reference>
</referencegroup>
<referencegroup anchor="STD63" target="https://www.rfc-editor.org/info/std63">
  <reference anchor="RFC3629" target="https://www.rfc-editor.org/info/rfc3629">
    <front>
      <title>UTF-8, a transformation format of ISO 10646</title>
      <author fullname="F. Yergeau" initials="F." surname="Yergeau"/>
      <date month="November" year="2003"/>
      <abstract>
        <t>ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.</t>
      </abstract>
    </front>
    <seriesInfo name="STD" value="63"/>
    <seriesInfo name="RFC" value="3629"/>
    <seriesInfo name="DOI" value="10.17487/RFC3629"/>
  </reference>
</referencegroup>
<reference anchor="RFC7997">
  <front>
    <title>The Use of Non-ASCII Characters in RFCs</title>
    <author fullname="H. Flanagan" initials="H." role="editor" surname="Flanagan"/>
    <date month="December" year="2016"/>
    <abstract>
      <t>In order to support the internationalization of protocols and a more diverse Internet community, the RFC Series must evolve to allow for the use of non-ASCII characters in RFCs. While English remains the required language of the Series, the encoding of future RFCs will be in UTF-8, allowing for a broader range of characters than typically used in the English language. This document describes the RFC Editor requirements and gives guidance regarding the use of non-ASCII characters in RFCs.</t>
      <t>This document updates RFC 7322. Please view this document in PDF form to see the full text.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7997"/>
  <seriesInfo name="DOI" value="10.17487/RFC7997"/>
</reference>
<reference anchor="RFC9720">
  <front>
    <title>RFC Formats and Versions</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <date month="January" year="2025"/>
    <abstract>
      <t>In order to improve the readability of RFCs while supporting their archivability, the definitive version of the RFC Series transitioned from plain-text ASCII to XML using the RFCXML vocabulary; different publication versions are rendered from that base document. This document describes how RFCs are published.</t>
      <t>This document obsoletes RFC 7990. This document also updates the stability policy in RFC 9280.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="9720"/>
  <seriesInfo name="DOI" value="10.17487/RFC9720"/>
</reference>

<reference anchor="UnicodeLatest" target="http://www.unicode.org/versions/latest/">
  <front>
    <title>The Unicode Standard</title>
    <author >
      <organization>The Unicode Consortium</organization>
    </author>
    <date year="n.d."/>
  </front>
</reference>


    </references>

    <references title='Informative References' anchor="sec-informative-references">



<reference anchor="RFC20">
  <front>
    <title>ASCII format for network interchange</title>
    <author fullname="V.G. Cerf" initials="V.G." surname="Cerf"/>
    <date month="October" year="1969"/>
  </front>
  <seriesInfo name="STD" value="80"/>
  <seriesInfo name="RFC" value="20"/>
  <seriesInfo name="DOI" value="10.17487/RFC0020"/>
</reference>
<reference anchor="RFC6365">
  <front>
    <title>Terminology Used in Internationalization in the IETF</title>
    <author fullname="P. Hoffman" initials="P." surname="Hoffman"/>
    <author fullname="J. Klensin" initials="J." surname="Klensin"/>
    <date month="September" year="2011"/>
    <abstract>
      <t>This document provides a list of terms used in the IETF when discussing internationalization. The purpose is to help frame discussions of internationalization in the various areas of the IETF and to help introduce the main concepts to IETF participants. This memo documents an Internet Best Current Practice.</t>
    </abstract>
  </front>
  <seriesInfo name="BCP" value="166"/>
  <seriesInfo name="RFC" value="6365"/>
  <seriesInfo name="DOI" value="10.17487/RFC6365"/>
</reference>
<reference anchor="RFC7994">
  <front>
    <title>Requirements for Plain-Text RFCs</title>
    <author fullname="H. Flanagan" initials="H." surname="Flanagan"/>
    <date month="December" year="2016"/>
    <abstract>
      <t>In 2013, after a great deal of community discussion, the decision was made to shift from the plain-text, ASCII-only canonical format for RFCs to XML as the canonical format with more human-readable formats rendered from that XML. The high-level requirements that informed this change were defined in RFC 6949, "RFC Series Format Requirements and Future Development". Plain text remains an important format for many in the IETF community, and it will be one of the publication formats rendered from the XML. This document outlines the rendering requirements for the plain-text RFC publication format. These requirements do not apply to plain-text RFCs published before the format transition.</t>
    </abstract>
  </front>
  <seriesInfo name="RFC" value="7994"/>
  <seriesInfo name="DOI" value="10.17487/RFC7994"/>
</reference>



    </references>

</references>


<?line 144?>

<section anchor="acknowledgements"><name>Acknowledgements</name>

<t>This document is based on <xref target="RFC7997"/> which was authored by Heather Flanagan.</t>

<t>The acknowledgements from <xref target="RFC7997"/> are
to the members of the IAB i18n program,
to the RFC Format Design Team:
Nevil Brownlee, Tony Hansen, Joe
Hildebrand, Paul Hoffman, Ted Lemon, Julian Reschke, Adam Roach,
Alice Russo, Robert Sparks, and Dave Thaler.</t>

<t>Writing this document was greatly helped by contributions from the RFC Series Working Group (RSWG), including:
Brian Carpenter,
Carsten Bormann,
Eliot Lear,
John Klensin,
John Levine,
Martin Dürst,
Martin Thomson,
Pete Resnick,
Rob Sayre, and
Russ Housley.</t>

</section>


  </back>

<!-- ##markdown-source:
H4sIAAAAAAAAA9VZy3IbuRXd91eg5MXYNU1atsfjsbIJRVG2HI2ikuRyZZUC
u0E2om6gB+gWzbhmk2/IfqpSlV2WyS6r5E/yBfmEnHuBfpCSJ5XHIpmFh2yi
L+7j3HMPoMlkkjS6KdWRuFEfG6GNuDqd+0Qul07d7T3MbWZkhaW5k6tm4vxm
PXGr7NXr16+W2k+ePUsS30iT/1KW1mBZ41qVwMiLxC69LVWj/JGg1Umia8e/
++b54eHrw+fJ7eZInJlGOaOayQnZTzLZHGHvlU18u6y099qam20NwyrXjXVa
lklS66NEiMZmR2KrPD566xqnVr7/vq2Gr4lsm8I6vDLBTzCO55dT8dauVpU0
9CgEeCnbcvxUVVKXR6LG42kRHv9UZ9KYqXXrJDHWVbLRd4p8OZ5fPnvxij5d
35x8/YI+IHsUdvz4+tXzQ/r43ujM5upcIi8NPUAcsRSF6n4V15RR6XL+vXcf
/00E9t5dO7eGwtdtFaxJt1bIYdE09dHTp5vNZtqGleT20zvlKKX+ackePEVV
kOxRJPA1eIoPX7/4+uUQyldHSTKZTIRc+sbJrEmSm0J7AYC0lTKN8Krxoral
zrYCJkUDJ7XJypY2FHYlskLSe/CA0EU/52qljaatReeYQOSibpcwA5/wYvDO
kwEC5DSh4Pe2wQ/iWjmtYNnjiWyELEuRa1+XciuXpRINg9rTc7tRuZBeALBr
+j8sOMW/iUKvC6E+1iprwu5syymZk9dwQXJbiI2G+aUSwbRFPAirdgpbwFXe
S3p+anKVT/cy1XcG2yKUTENqK53npUqSR9QWzuZtRk4kHLKSrtz+SOAbGSOn
LInMtmUurMErmTWNRMJH6V85W/H7s+v52dnwC9VwmhA4XdgJhtOw+tOnCOjv
v0/7JFYWiRvspvQ+2y2lWbdyrShle34iW0jcwqxL7YuUy929pQxwqlEUCo/D
wOvvb04n30yTs4CYRldKeMBKjT3i4Bk0vlB5yivPFjeniL2qAP9mKwqsKCRc
brMi+E1lhkdkCtsApdjYWDPZy4nvqHC6D/ihjCNX9mtdyVssAJkRSzagL7Js
1iERTQdlRq7ZCWkpvaIKxkU+tMnI6ZAFwtuoXaYBLJTvy1ETzRUBVDy+upw/
IaQ75Ws0myb4UrJ1VZeK/KUk7HvVjANKCdi58pnTS/jXOU38FoLH5pdz4Zst
TK9bjaXIwTa0uhKbQiP/o+QGdgtrWq8YD4XdkBNbIVEmPEQDJY8eYS65Shtb
2vU2RAkDlTh4qGYHolISZDLayLaNhzccXQAcemXDwZBnHAqbmSbBmn4ozhDl
aPeOhn9079EWnz7tjAC2dvD+SyAvME73dsAjOTuyg4UHnCEwlfoIUsp0BUSZ
tlqiugCUUyAhT7iT/YDgf2oLMpom14r6JkwrYIxKz70AiIx8gEvf0tNmSDiY
DgncKRzcGDcskzTGI3X3yrZmyBmNEY7zkTiWXmfiSn3Xasd48+zDjuJIuPN9
cZ/BmFe5bnGfSPDIbGadA2ljscyc9cTlG6q2o16j/u44nNxeOrvx+DJNLpUF
8IFKC+j5rW9URVCnTIAt7gJaVpb8NEqByinJcVcoA9fwTOBPOmtLybQF8NN4
oBci23Vjgpz4lwffEBpSYF0eKo0E48WGwUC57ho4jt0402FmB8O8d+zkCGO5
1CURJNlEEn6MAen/UmQlRhFvmhGFIGdMc8bAMZ6LPB0CdIPrfcaQrNpZMBhL
Lt957mzM9MiXzu0wWg3HIkv9a8SB9dB+tkT9ZlyDbpxpJxwYmdwcb5M+HFRE
2DJQTEDaplAmUI8yoOksuNWorEAzEXdbw+Fy2rOsha1tN+NGeX0kLoc5vQvu
OPiCTgE+QyvkOxMzTJu3GLEARwoPM/nZ0hBF2hWcGkwxoRvCRsYEDgnGTXov
9aAQgo1PB8r/P9JU/6O1v+HIqSzeVoNMH23eMRiRzA6JPfYkTqTnrRANb4WJ
iUFRy2xs4wnCz3Eg4qYotyn133YfFrL0gcmICYkZ73Cg4exGJiBiA3dlt5L1
CJeY3uvXoQp3Wm2YKmclBnWLUr69+facPbs8Oe1JylFRiBVGXgSLJLykWJfb
ukgH8tbmzpboetlHAdZUtQ/1UyhtnERh0qL296mxl0tfYZT1OGKBslHEQx6w
y4AhJtFaU2sQWvp1bpz8YeDupsW3dV3Su8st/xoSkkYlUwHcDSc59FkAjVHo
PC/dNsSN7DF9dhOhWxo4rm6Y/jmArWAIDIWMJI2UrJSj5q4UyhB6PFIkub33
FjXeEM9osndavlcAqONTmHpAbXxBLq0gAklgoJWGQ8P+KfUhTTPaomPyQHho
Tt8PDwo61yuExno6ZoL5KycxutqSWw+0EBkgxViosu4lJMkhLGP1OJZCXJHP
jLXAHF37UrovbKMCTln3RlZkhpCMBFbu/elf9IWo+Sth9ScBsV03B8LNLf4h
oDAKhiYZgWCanLaOmDMN3EGs2OtjBlpWWFIqcVbTIjiJFFl6i7w29yo+nqU7
R7XonR5OaLIkVia9dcdJHtXGbyHBPnZ5UZG+A2ePpEiPEaS+GR3+kNf5Tt7H
nL9WkA5EY5Rg0hb4/YLyi3EfYjjFNzEXjy9OcXyR/0xPnw0RbTqaDyfVKEcY
PfRg8JzpbqObYqzlRlAJHS4uiDeaeMYhHqMytMZEb6m9UXk+rYb8YpFXoBoy
oEZjJ1JAvlOeTbG9b4zPKMMh6ILAlCSzCIuoFqKCDUjr6OVBzHPGS31LA4EV
bqAWxVqBOi8evnQvFMYs2Sc3wBIcMTATk4SOIpH2RkXwOaA7DZO1an0T5hKJ
PupW5FCWQeGpaMKI8ZvTLtYv7rk6kpVjj4csFxbJ5MzNbVUTvQ7iZ63s2ska
LB4fDjCMBwBu1OBK7yIaZ9k2IZh4COlLGVsDXmVQLfHSa0gW7xqxE2q5iPor
ST6waKAfCVB47WG+oqO+o8r1Rw/TiTjiIXKamWCjvdrVgx30O+0sSCJxc6Vi
re/6UXHvzBiS1T0PmYphfc5HcqPbPKRLx6E/kvpEIJ3cH/PELBBD11tKM7Mh
giUi48Hv4+wlcUzbgePDRUZUNZl2YPMgf8GoeDWmKA3atj9EBkr9FUHSA0UH
NGjvZNmq4RgbhS1GvwRjGNXQTPfbaokMdgT6l9+Te3/7zR8O0pFxWESiTH8D
FmxRnuFy7D/moCVgF8dsvBEK3Eq3reLfd+nx+y8PD2cvn0Tf6Pvzw9n8yX9m
9BeLC3F99uZiMLt4f/Xz8Oi/4W4q7u8QHE/FaKcgp0gALKMg6/Qbz+llCfE2
iGeyhFbfeqh4gGAg+E70145OFaCJ43jS4pppAK07IXWcMnAELdvX9JE3PI46
kC5E/ra0bnSS46ne2wz4i50SuZ4mEL8l8lZ1gADxtK5HUvc+c7G3xKzE/b3Z
TmUyzXazYOd2Z6cnDmbdhpFNB/rszgC9DNBE0n//4bd/pKo8O3354qtUnM+u
3izE1eJEzM+u5ueLJylW/PC7sOLV4nm34s3VAqXt11B1YOlPnaWX3brj8/eL
btn0gM7S+5eZ5/GoPFx1js8G/UEafRYP29OHrCzi3VW8Xd+9yfrMKZjvoiE5
+I888UrrbHYx4z/CYJoFRvX7F8Xx/oqYMSzPdpazmWuFEyWR4b6pbtKPpkdX
Ir7pmoAJgfZemfGBdoRMRIMuC/NxoPV4Z+ipN+7p60Dz8W8SS/QSOTjLbo3d
lCpfh4u7/SDxub+y3rma53MSiZgwBgMZvFWSif0U5ZJr2d1by71N7v3tgeRI
EtuiUjGE0Bdns2Ohn33D53+0e5V266iGp6GsJ8rrtRE3SlZHyYW606U4dnZj
SoVOuLEQCG+lwUBJxTurkre6zNXSIfPpzl8HsZSUgapoer5rS01qFiKuuIWV
WS4rcWVlVqQ4M2sc269aT216ZeFtI67BP7c+NMAJCbCbArQElkg+oP5hEI/z
SplbYyTQBSAdd0L++IZAY7aGE1Mnu0dw/WDdLVl742xbi8dX1x/eoO3626Cj
5NiR33Ppav7jQJrgI13miWPKlTFpsig16OwckjxN3tnCiJ+VNGtN/HaO/BmV
Jt8SoRpx8tc/4/3+601hK48EJZcKpIH0AGW3aYIsiGu5dYozkFBukNbWl2o7
Tf4BPLq5KJoeAAA=

-->

</rfc>

