aboutsummaryrefslogtreecommitdiff
path: root/src/cmd/tcs/ex5.utf
diff options
context:
space:
mode:
authorwkj <devnull@localhost>2004-04-21 01:15:41 +0000
committerwkj <devnull@localhost>2004-04-21 01:15:41 +0000
commita31db67d14c0e50353eac3db342f3c969cabdf76 (patch)
tree0cebd9a65bab940355f5be0751f53238366555d7 /src/cmd/tcs/ex5.utf
parentba03b8910dd07511378e6f1007faf0539ed25598 (diff)
downloadplan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.tar.gz
plan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.tar.bz2
plan9port-a31db67d14c0e50353eac3db342f3c969cabdf76.zip
Add tcs; it compiles in my world, but I haven't tried it in Russ's yet.
Diffstat (limited to 'src/cmd/tcs/ex5.utf')
-rw-r--r--src/cmd/tcs/ex5.utf50
1 files changed, 50 insertions, 0 deletions
diff --git a/src/cmd/tcs/ex5.utf b/src/cmd/tcs/ex5.utf
new file mode 100644
index 00000000..b4baca0b
--- /dev/null
+++ b/src/cmd/tcs/ex5.utf
@@ -0,0 +1,50 @@
+.tr -\(hy
+.TL
+Hello World
+.br
+or
+.br
+Καλημέρα κόσμε
+.br
+or
+.br
+こんにちは 世界
+.AU
+Rob Pike
+Ken Thompson
+.AI
+.MH
+.AB
+Plan 9 from Bell Labs has recently been converted from ASCII
+to an ASCII-compatible variant of Unicode, a 16-bit character set.
+In this paper we explain the reasons for the change,
+describe the character set and representation we chose,
+and present the programming models and software changes
+that support the new text format.
+Although we stopped short of full internationalization\(emfor
+example, system error messages are in Unixese, not Japanese\(emwe
+believe Plan 9 is the first system to treat the representation
+of all major languages on a uniform, equal footing throughout all its
+software.
+.AE
+.SH
+Introduction
+.PP
+The world is multilingual but most computer systems
+are based on English and ASCII or worse.
+The pending release of Plan 9 [Pike90], a new distributed operating
+system from Bell Laboratories, seemed a good occasion
+to correct this chauvinism.
+It is easier to make such deep changes when building new systems than
+by retrofitting old ones.
+.PP
+The ANSI C standard [ANSIC] contains some guidance on the matter of
+`wide' and `multi-byte' characters but falls far short of
+solving the myriad associated problems.
+We could find no literature on how to convert a
+.I system
+to larger character sets, although some individual
+.I programs
+have been converted.
+This paper reports what we discovered as we
+explored the problem of representing multilingual