groff-commit

[groff] 17/28: groff_char(7): Rewrite 8-bit Latin-1 subsection.


From: G. Branden Robinson
Subject: [groff] 17/28: groff_char(7): Rewrite 8-bit Latin-1 subsection.
Date: Tue, 1 Sep 2020 07:43:08 -0400 (EDT)

gbranden pushed a commit to branch master
in repository groff.

commit 3024a3e3c0f460dc83027d1bbc0b29f38eb7e760
Author: G. Branden Robinson <g.branden.robinson@gmail.com>
AuthorDate: Tue Sep 1 20:38:03 2020 +1000

    groff_char(7): Rewrite 8-bit Latin-1 subsection.
    
    * Retitle subsection to "Eight-bit encodings and Latin-1 supplement".
    * Elide references to code point numbers to make discussion
      ISO/EBCDIC-agnostic.
---
 man/groff_char.7.man | 35 +++++++++++++++++++----------------
 1 file changed, 19 insertions(+), 16 deletions(-)

diff --git a/man/groff_char.7.man b/man/groff_char.7.man
index b5612de..a8272b5 100644
--- a/man/groff_char.7.man
+++ b/man/groff_char.7.man
@@ -474,25 +474,29 @@ falling back to basic Latin glyphs only when necessary.
 .
 .
 .\" ====================================================================
-.SS "8-bit character codes 160 to 255"
+.SS "Eight-bit encodings and \%Latin-1 supplement"
 .\" ====================================================================
 .
-Code points in the range 160\[en]255 are for the most part interpreted
-as characters according to the \%Latin-1 (ISO \%8859-1) standard,
-and correspond to those in the same range of the Unicode \[lq]C1
-Controls and \%Latin-1 Supplement\[rq] code chart.
+ISO 646 is a seven-bit code encoding 128 code points;
+eight-bit codes are twice the size.
+.
+ISO 8859-1 and code page 1047 allocated the additional space to what
+Unicode calls \[lq]C1 controls\[rq]
+(control characters)
+and the \[lq]\%Latin-1 supplement\[rq].
+.
+The C1 controls are neither printable nor usable as
+.I groff
+input.
 .
 .
 .P
-Input characters in range 128\[en]159
-(the C1 controls on non-EBCDIC hosts)
-are defined as control characters and not printable;
-nor are two characters in the \%Latin-1 supplement.
+Two non-printing characters in the \%Latin-1 supplement are handled
+specially.
 .
 .
 .TP
-160
-.
+NBSP
 encodes the no-break space,
 and is mapped to
 .BR \[rs]\[ti] ,
@@ -500,8 +504,7 @@ the adjustable non-breaking space escape.
 .
 .
 .TP
-173
-.
+SHY
 encodes the soft hyphen character.
 .
 .I groff
@@ -512,7 +515,7 @@ on input it is mapped to the hyphenation control escape,
 .
 .
 .P
-The remaining characters in the \%Latin-1 supplement print as
+The remaining characters in the \%Latin-1 supplement represent
 themselves.
 .
 Although they can be specified directly with the keyboard on systems
@@ -521,8 +524,8 @@ it is more portable,
 both to other
 .I roff
 systems and to \%UTF-8 environments,
-to use their glyph names;
-see the next section.
+to use their glyph names,
+shown below.
 .
 .P
 .TS
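
The input handling described in the rewritten subsection can be sketched as follows. This is an illustrative approximation only, not groff's implementation: the function name `to_portable_groff` and both tables are hypothetical, the glyph-name table is a small and deliberately incomplete sample, and the input is assumed to be a decoded Unicode string rather than raw ISO 8859-1 or code page 1047 bytes.

```python
# Hypothetical helper, not part of groff: rewrite Latin-1 supplement
# characters in a string as portable groff input.

# The two non-printing characters that the subsection says are handled specially.
SPECIAL = {
    "\u00a0": "\\~",  # NBSP -> adjustable non-breaking space escape
    "\u00ad": "\\%",  # SHY  -> hyphenation control escape
}

# Assumed sample of glyph names; a real table would cover the whole supplement.
GLYPH_NAMES = {
    "\u00a9": "\\[co]",  # copyright sign
    "\u00ae": "\\[rg]",  # registered sign
    "\u00df": "\\[ss]",  # sharp s
    "\u00e4": "\\[:a]",  # a with dieresis
    "\u00e9": "\\['e]",  # e with acute
}


def to_portable_groff(text: str) -> str:
    """Return text with known Latin-1 supplement characters replaced by escapes."""
    out = []
    for ch in text:
        if 0x80 <= ord(ch) <= 0x9F:
            # C1 controls are neither printable nor usable as groff input.
            raise ValueError(f"C1 control U+{ord(ch):04X} is not valid input")
        if ch in SPECIAL:
            out.append(SPECIAL[ch])
        elif ch in GLYPH_NAMES:
            out.append(GLYPH_NAMES[ch])
        else:
            # Anything not in the sample tables is passed through unchanged.
            out.append(ch)
    return "".join(out)
```

For example, `to_portable_groff("Stra\u00dfe")` yields `Stra\[ss]e`, which avoids any dependence on the input encoding.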


