[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Dvorak (& Lojban)



>>>>> "HACKER" == HACKER G N <c9709244@ALINGA.NEWCASTLE.EDU.AU> writes:


    HACKER> Does this mean that there are only as many keys on a
    HACKER> Chanjie keyboard as there are on a QWERTY keyboard?

Yes.  Indeed, it uses only 25 keys (excluding the space bar).

To input Chinese  into  the computer, the  typical device  is a normal
(e.g.  US 101-key) keyboard.  Nowadays, there  are other devices, such
as pen input  tablet (a digitizer,   used usually by  serious users of
CAD)  and   voice   input (employing   voice  recognition  techology).
Although these latter devices are more intuitive to use, the inputting
speeds are far below that of using a keyboard.

Many  "methods"  have been invented  for   inputting Chinese  into the
computer using  the standard keyboard.  Some  of them require the user
to type in a phonetic transcription  (e.g.  Pinyin) of a character and
then lists all  characters with  that pronunciation  for the user   to
select from.  These methods  are generally slow, because the selection
lists are long and the  user has  to  be distracted by the  selection.
Blind-folded typing is  not   possible.  Changjie, however, bases   on
character shape (as mentioned before).   There are around 100 Changjie
radicals, classified into _24_ groups, with  each group mapping to ONE
of the 26  letter  keys of  a US  keyboard,  ignoring the  distinction
between upper/lower cases.   One more  key  is used for special  cases
(those character that cannot be easily  broken up into radicals).  So,
only 25 keys are needed.

In  Changjie, the  average number of  keystrokes  (excluding the space
bar, which has to  be depressed after  keying in the radicals of  each
character)  per character is   3.5.  Touch-typing is possible, because
the Changjie codes of characters seldom have collisions.  So, the user
seldom has to  select from  candidate lists  as in the  phonetic-based
methods.  The typing  speed can  be  up  to  60 char/min for   trained
typists, and up to 200 char/min for wizards.



    HACKER> Because if so, that's pretty impressive, considering how
    HACKER> many characters there are in Chinese.

There  aren't many Chinese   characters  in  common use.  There    are
altogether tens of thousands of them, if we count  all those that have
appeared in these 3000 years, and count variants of the same character
as different characters.

However,  the number  of  characters frequently  used by newspapers is
around 6000 -- the number of characters that a ordinary student should
have learnt  in primary  school.   For casual writings (e.g.  personal
letters, diaries, etc.) this count would drop to 3000, or even less.


On  the computer, around 20000  Chinese characters are available.  So,
theoretically, if we  use only those   26 Latin letters to  encode the
characters, we can at best achieve a code length of

        log_{26} (20000) = 3.04

for each character.  Some people design inputting methods with as much
as 40 base letters (mapped to those 40 keys on and  below the row of 1
-- 0) and for such methods, the minimum code length is about:

        log_{40} (20000) = 2.68

Since it is possible  to assign shorter codes  to more frequently used
characters, the *weighted* average code  length can approach _2_, i.e.
an average  of 2 keystrokes (excluding  the space bar which terminates
the input of a character)  per character is possible.  Such  inputting
methods exist,  but are not as  common as Changjie, which is available
on almost all system supporting BIG5 encoding.




--
Lee Sau Dan                     'u&u40(Big5)                    ~{@nJX6X~}(HZ)
.----------------------------------------------------------------------------.
| http://www.cs.hku.hk/~sdlee                        e-mail: sdlee@cs.hku.hk |
`----------------------------------------------------------------------------'