From a31db67d14c0e50353eac3db342f3c969cabdf76 Mon Sep 17 00:00:00 2001 From: wkj Date: Wed, 21 Apr 2004 01:15:41 +0000 Subject: Add tcs; it compiles in my world, but I haven't tried it in Russ's yet. --- src/cmd/tcs/ex5.utf | 50 ++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 50 insertions(+) create mode 100644 src/cmd/tcs/ex5.utf (limited to 'src/cmd/tcs/ex5.utf') diff --git a/src/cmd/tcs/ex5.utf b/src/cmd/tcs/ex5.utf new file mode 100644 index 00000000..b4baca0b --- /dev/null +++ b/src/cmd/tcs/ex5.utf @@ -0,0 +1,50 @@ +.tr -\(hy +.TL +Hello World +.br +or +.br +Καλημέρα κόσμε +.br +or +.br +こんにちは 世界 +.AU +Rob Pike +Ken Thompson +.AI +.MH +.AB +Plan 9 from Bell Labs has recently been converted from ASCII +to an ASCII-compatible variant of Unicode, a 16-bit character set. +In this paper we explain the reasons for the change, +describe the character set and representation we chose, +and present the programming models and software changes +that support the new text format. +Although we stopped short of full internationalization\(emfor +example, system error messages are in Unixese, not Japanese\(emwe +believe Plan 9 is the first system to treat the representation +of all major languages on a uniform, equal footing throughout all its +software. +.AE +.SH +Introduction +.PP +The world is multilingual but most computer systems +are based on English and ASCII or worse. +The pending release of Plan 9 [Pike90], a new distributed operating +system from Bell Laboratories, seemed a good occasion +to correct this chauvinism. +It is easier to make such deep changes when building new systems than +by retrofitting old ones. +.PP +The ANSI C standard [ANSIC] contains some guidance on the matter of +`wide' and `multi-byte' characters but falls far short of +solving the myriad associated problems. +We could find no literature on how to convert a +.I system +to larger character sets, although some individual +.I programs +have been converted. +This paper reports what we discovered as we +explored the problem of representing multilingual -- cgit v1.2.3