Amaravati: Abode of Amritas

19.10.26.23:02: KHITAN SMALL SCRIPT DIAMONDS AND PYRAMIDS

1. Most Khitan small script character blocks have these structures. In short, characters are arranged from left to right and top to bottom in rows that are two characters wide.

What I call 'vertical stacks' have one character directly atop another instead of side by side: e.g.,

<052.e'> instead of <052.e>.

Today I came up with the term 'diamond' for blocks with more characters in the center than at the top or bottom:

<294.051.098.151> <294.gha.al.ghu> (Xing 28.3), <364.051.098.151> <364.gha.al.ghu> (Dao 26.24)

364 is a variant of 294. By itself, 294 represents a word 'south'. Perhaps 294 ~ 364 is an alternate spelling of <t.093> ~ <t.ie.093> ~ <d.093> 'south'.

And today I came up with the term 'pyramid' for blocks with only one character on the top atop one or more rows of two:

<280.028.097> <280.sh.ur> (Xing 28.1), <294.051.098.051.122> <294.gha.al.gha.ai> (Xing 19.22)

I hate 'this exists, that's it, bye'-type posts, but I don't have time to look into these blocks any further tonight. (I found all four yesterday and didn't have time to write about them then.)

2. Last night I noticed the similar-looking Khitan small script blocks

<282.128.339> <282.128.i> and <282.196.339> <282.bu.i>

next to each other in the block index of 契丹小字研究 Studies on the Khitan Small Script (1985: 401) and wondered if

<128> <?> and <196> <bu> (Shimunek 2017: <ebu>)

might be variants of each other. I'll have to see if they occur in other identical environments.

3. I'm curious about how Southern Min was romanized in this 17th century Southern Min-Spanish dictionary.

4. 懦 GSR 134e is an oddity in a phonetic series whose usual common denominator is Old Chinese *no: e.g.,

需 134a *sIno 'to wait; weak', *sInorʔ 'supple', *norʔ 'weak'
繻 134b *sIno
儒 134c, 濡 134f, 𣽉 134g, 襦 134i *CIno

𣽉 134g has a variant of 需 I've never seen before

臑 134h *CIno 'pliant, soft', *naw(s) or *CAnu(s) 'shoulder; upper part of an animal's front leg'
孺 134d *CInos
醹 134j *CIno(ʔ)

懦 GSR 134e 'weak' not only has the expected reading *Cino 'weak, timid' but also has three other readings (also all meaning 'weak, timid'):

*Cinorʔ
*norʔ
*nojʔ < *nolʔ?

The *liquid-final readings of 懦 134e and its phonetic 需 134a reflect confusion with 耎 238a *CInorʔ 'soft, weak' as noted by Karlgren (1940: 163). I suspect 耎 238a *CInorʔ 'weak' was homophonous with 需 134a *sInorʔ 'supple', an alternate spelling of the same word, but there is no evidence within series 耎 238 directly pointing to *s-: e.g., modern readings with [s] such as Cantonese seoi1 for 需. It is possible that *CInorʔ and *sInorʔ had a common root *norʔ with different prefixes.

Both 需 134a and 耎 238a in turn have 而 982a *nə 'and' as their phonetic. 而 982a is originally a drawing of whiskers and represented a word 'whiskers' - probably the same word as 須 ~ 鬚 *sIno 'beard'. I suspect *nə 'and' might be an unstressed reduction of an earlier *no.

How one phonetic became three:

Primary phonetic	Derived phonetic	Further derivatives
而 *sIno? 'beard'	需 *sIno 'to wait; weak'	繻 *sIno 'frayed silk', etc.
	耎 *CInorʔ 'weak'	需 CInorʔ 'supple', CInorʔ 'weak', 懦 CInorʔ ~ nojʔ 'weak'
	而 no? > nə 'and'	栭 *nə 'fungus', etc.

5. I have never seen an English name begin with <Jch> before. I presume the female name <Jchandra> is from Sanskrit candras 'moon' (masculine!). <Jch> would be understandable as a rendering of Sanskrit voiceless unaspirated c, though I don't think that was the motivation for that unusual letter combination.

19.10.25.22:05: VERTICAL STACKS IN KHITAN: 052 (PART 4)

1. The last instance of vertically stacked <052.e>

in the Khitan small script that I know of is in line 27 of the eulogy for Emperor 道宗 Daozong (1101):

<mu.188 dau.o.gho tau.is.en 371.a 079? 318.er j?.u.ur 232 sh.eu.013 ur.u.er ghu.as.al ung.su 336 c.er>

Incomprehensible to me except for <c.er> 'wrote' at the end of a sentence which is a convenient point for breaking the line into two. The second half has a few words I recognize:

<COMPOSE.l.ghu ui.l us.g 066.l.ge.en mu.u.j.i 026.s.g ts.ar hua.315 ghu.ogh 268.ir.o.o.on mo.ge.de y.ie.er 042.a p.ge.ii 282.128.i eu.ul.ge.y as.ar p.ge.ii m.g.un 052.e dau.o.on as.320>

'compose-CAUS/PASS-? affair? character V-CAUS/PASS-PTCP.M sagely ? ? ? ? ? N-LOC/DAT ? ? V-CVB ? ? pure V-CVB V-PTCP.M? 052.e ? ?'

Unfortunately, I understand none of the words around <052.e>. <m.g.un> looks like a masculine perfective participle modifying a noun <052.e>.

2. It's taken me years to notice this rare case of semi-overlap between the Khitan large and small scripts:

1864 <ar> (large) and 123 <ar> (small)

I'm so accustomed to the same shape standing for different things in the two scripts (e.g., 一 is 'one' in the large script but 'above, north' in the small script) that I've never looked for any homophonous lookalikes.

3. Here's a fun Cantonese character I learned today: a semantic compound with a Khitan small script-like¹ two-on-one structure: 膥 ceon1.

未 <NOT.YET>	成 <BECOME>
肉 <MEAT>

I can't think of any other semantic compound with that layout of elements, and 肉 is a rare bottom element.

Can you guess what 膥 means? Answer here (visible if you select the blank space before the period): reptile or bird egg.

I've found three Mandarin readings for 膥: chun1, cun1, and cen1. Jun Da's modern Mandarin rank for膥 is #8529; 膥 only appears twice in the corpus, presumably as a loan from Cantonese. Are the Mandarin readings borrowings from Cantonese?

I seem to be the only person who ever looked at the CantoDict entry for 膥 (other than the editor of the page).

This word has been viewed 1 times since 30th Oct 2012

¹But of course Khitan small script blocks are phonetic symbol combinations, not semantic compounds.

The only common character with 肉 <MEAT> on the bottom is 腐 'decay, rotten' (Jun Da #1576). There turn out to be at least fifty more characters with 肉 <MEAT> on the bottom, but I don't know what they represent:

肏𦘫𦘬𦘻䏑𦙂𦙡𦙩胔胬𦙲𦚇𦚘𦚨𦚮𦚷𦛄𦛆𦛇𦛝𦜑𦜜𦜾𦝍𦝎𦞍𦞏膐𦟡𦟴𦠂𫆰𦠕𦠩𦠪𦠬𦠶𦡈𦡜𦡭𦡼𦢆𦢰𦢽臠臡𦣚

Some interesting characters for future investigation:

A rare (and unique?) case of both the abbreviated (月) and full forms of 肉 <MEAT> in the same character:

𦝾

己 at the top right is <SELF>.

A rare (and unique?) case of 肉 <MEAT> as a bottom right corner element the same size as a bottom left corner element:

𦞋

A rare (and unique?) case of 肉 <MEAT> as a top element:

𫆭

An 夕 <EVENING>-like variant of 肉 <MEAT> I've never seen before:

𦠹𦠺𦣐𤓏

Finally

𮌾

is ⿰腐丑 <DECAY.OX>. Is it a semantic compound or a semantic-phonetic compound? If it is the latter, which half is semantic and which half is phonetic?

19.10.24.23:14: VERTICAL STACKS IN KHITAN: 052 (PART 3)

1. There are at least two more instances of vertically stacked <052.e> in the Khitan small script:

Let's look at the first from line 20 of the eulogy for Emperor 道宗 Daozong (1101).

<b.as s.188.o.oi.en h.su.gu 193.ad REGION.y.i y.au.ul.ghu em.d.er neu.e dau.ui t.c t.092.gu HEAVEN 050.u.lia neu.e sh.ur.l.b.ñ k.ii k.ii mu.u.j us.g m.g.s ETERNAL s.092 n.am.ur sh.ul.g.s ETERNAL as.ar heu.ur s.eu.ur.un ta.u.d.b.356 c.ie.271.ñ 052.e gho.l.g pu.ulji.l.g sh.al.s dau.ur s.iau>

'again N-GEN? ? N-PL? region-GEN? ? V-PFV.FIN.M? earth V-CVB? ? ? heaven ? earth ? ? ? sage written.character ? eternal ? autumn ? eternal pure spring N-GEN? ? V-PFV.PTCP? N? ? ? ? ? ?'

<c.ie.271.ñ> might be an <-ñ> participle modifying <052.e> which could be a noun here as it seems to be in the epitaph for 蕭仲恭 Xiao Zhonggong (1150). The only other form of <c.ie.271-> that I know of, <c.ie.271.i>, may end in a converb <-i>. 271 may end in a vowel since a final consonant-[ɲ] cluster is unlikely.

2. I've been wondering since maybe 1992 why 旺角 wong6 gok3 is called Mong Kok in English. Tonight I finally learned why:

The current English name is a transliteration of its older Chinese name 望角 (Jyutping: mong6 gok3; IPA: [mɔːŋ˨ kɔːk˧]), or 芒角 (Jyutping: mong4 gok3; IPA: [mɔːŋ˨˩ kɔːk˧])

Or reviewed why - that explanation sounds vaguely familiar.

Mong Kok belongs to the category of conservative English exonyms for places. Others are Peking (preserving *k), Belgrade (preserving *l), and Prague (preserving *g). But in those cases, the modern autonyms are descendants of the forms underlying the English exonyms, whereas Mong Kok is only half-cognate to modern wong6 gok3.

New question: Why was the name of Mong Kok changed?

3. Today I learned that an American superstar is called the 貓王 Cat King in Chinese. Who is he?

4. I had never heard had waked before, and was surprised to see that it was more common than had woken until recently.

19.10.23.23:51: VERTICAL STACKS IN KHITAN: 052 (PART 2)

1. The Khitan small script epitaph for 蕭仲恭 Xiao Zhonggong (1150; hereafter 'Zhong') has two spellings which might represent the same word:

<052.348> (7.13) and <052.109'> (48.1)

<'> indicates vertical stacking.¹

Khitan small script character 109 <e> is a five-stroke variant of six-stroke 348 <e>, so the only potentially significant difference between the two spellings is the orientation of the two characters within each block. "Potentially" only if the two blocks represent different (though homophonous) words.

How can I tell if they represent different words? If the two blocks appear in identical contexts, they are unlikely to be different words (unless a play on words was involved).

Let's look at the contexts of those blocks:

Zhong 7: <qid.i c.iau qu.i us.g h.an na al.a.ar c.iau qu.ui qid.i shï COMPOSE.un 052.e eu.ul po.ogh s.eu.ur.un ta.u.en ...>

'Khitan-GEN China-GEN character ? ? ?-PFV.FIN.M China-GEN Khitan-GEN master N compose-PFV.PTCP N cloud ? ? ?'

'... al-ed Khitan [and] Chinese characters ... 052.e [that was] composed by a Chinese [and] Khitan master ... cloud ...'

052.e is a noun modified by <COMPOSE.un> 'composed', so it presumably refers to a composition of some sort.

Zhong 48 (line 47 might end at the end of a sentence):

<052.e ku YOUNG.qu.ETERNAL ◊ h.zu.ge.en pu.ulji.i p.ir.en t.e.er ◊ gur.en>

'N person ? (a noble title?) ? ? ?.FIN.M?. country.GEN ...'

052.e seems to be a noun modifying ku 'person'. If this is the same 052.e that was composed in line 7, then a 052.e ku might have been a scribe or even a poet or storyteller (if the composition was oral).

It seems that the writer of Zhong, is.g.un h.ang.n.u (named in line 618), might not have made a semantic distinction between

<052.348> (7.13) and <052.109'> (48.1).

They do at least both seem to be nouns, though I can't be sure they both meant the same thing. Next I'll see if the vertically stacked version might also be a noun in other contexts.

¹I just realized the short vertical stroke could symbolize vertical stacking. If I used short horizontal strokes to indicate left-to-right sequences and placed direction indicator strokes between character numbers, the first block would be <052-348> in contrast with <052'109> for the second.

2. Shimunek (2017: 243) proposes that Khitan did not allow the clusters rc and rj "perhaps ... word-finally or across morpheme boundaries". I did a quick search for such clusters and could only find <n.ar.c.en> (Xu 60.2) which might be narc-en, the feminine perfective of a verb narc- '?'.

3. I've mostly used the Leipzig glossing rules for Pyu and other South and Southeast Asian languages, so I never thought about how to gloss forms like German gesehen 'seen'. The Leipzig rules gloss it as

PTCP-see-PTCP or
PTCP-see-CIRC

Oddly, CIRC (for circumfix?) is not in the rules' list of abbreviations. Until now I would have glossed it simply as 'see.PTCP' without breaking it up.

I suppose I could then gloss Sanskrit a-paśa-t 'saw' as

IPFV-see-IPFV.3SG or
IPFV-see-CIRC.3SG

though I would prefer 'see.IPFV.3SG'.

19.10.22.23:32: VERTICAL STACKS IN KHITAN: 052 (PART 1)

1. Last night, while looking for what 契丹小子研究 Research on the Khitan Small Script (1985) had to say about

342 <?> (see my previous post),

I saw the block <052.348'> which is one of a minority of vertically stacked blocks. (I indicate vertical stacking with <'>.)

block
#	052.109	052.109'	052.348	052.348'
Dao	unattested	-	-	20.31, 27.34
Xu		-	4.2, 5.13, 26.15, 27.24, 61.13	-
Gu		-	4.21	-
Zhong		48.1	7.13	-

Kane (2009: 41) states that 052 "is only found in the word

<RECORD.gi>

'record', with various suffixes." But 052 can also occur by itself (only once in Gu 10.7) or followed by characters other than 334 <g> (Kane's <gi>):

109 <e>, 112 <ge>, 348 <e>, 349 <ge>, 354 <j>

It is true, though that 052 is most often followed by 334 <g>. And it's possible that <052.ge> contains the same root as <052.g> 'record'. I could even go so far as to say that

'record' was Xg <052> ~ <052.g>

but there are no other known cases of Khitan small script logograms with both full logogram and logogram-phonogram spellings

<052.ge> was a derived form Xg-e

but there is no known suffix -e, so maybe Xge is an indivisible root unrelated to 'record'

<052.j.de> (the only known instance of <052.j>) could be a dative/locative of a derived noun Xg-j

could -j be an agentive suffix like Mongolian -ci?

<052.j.de> could also represent an Xj word unrelated to the Xg-words meaning 'record'

In any case, 052 never occurs in noninitial position in a block. Nor does it ever seem to transcribe any Liao Chinese syllables. 052 could have represented a string that was only in 'record' (and an unrelated <052.e>?).

Going back to the stacking of <052.e>:

<052.348> is the majority spelling
<052.348> coexists with <052.109'> in Zhong
<052.348'> is unique to Dao
<052.109> doesn't exist as far as I know

Is there any significance in vertical stacks as opposed to the more common horizontal stacks? I'll start comparing the contexts of the two types of stacks next time.

2. Yesterday when discussing a possible of case of root-final <p> ~ <b> alternation I forgot to mention a case of possible noun plural suffix alternation:

<-t> ~ <-d> (cf. Written Mongolian -d)

And tonight I thought of the converb

<-c> ~ <-j> (cf. Written Mongolian -ju/-jü)

What determines the forms of the suffixes? Could they be predictable from a historical perspective? E.g.

<ai.d> < *ai-d 'fathers'
<mo.t> < *moH-d? 'mothers'

In the above scenario, the aspiration of <t> [tʰ] is a trace of a root-final *-H that was lost in isolation (the singular of 'mother' is <mo> < *moH?).

Compare the <d> ~ <t> alternation with the -da ~ -tha alternation of the Korean finite suffix:

나다 nada < *na-ta 'to come out'
놓다 notha < *noh-ta 'to put'

the *h has no trace after a vowel: 놓아 noa < *noh-a 'put and ...'
ㅎ <h> is consistently spelled in Korean even though it is not pronounced in all members of the paradigm

And compare the <j> ~ <c> alternation with the -ja ~ -cha alternation of the Korean hortative suffix:

나자 naja < *na-ca 'let's come out'
놓자 nocha < *noh-ca 'let's put'

3. Last night I rediscovered the problem of 幽靈漢字 ghost characters in JIS (and Unicode) encoding:

Of these [twelve], it is conjectured that several glyphs came about due to copying errors.

That made me wonder about the problem of ghost characters in TJK (Tangut/Jurchen/Khitan).

Three Tangut characters that puzzle me are 5997-5999 in Li Fanwen (1997) which are only known from Sofronov (1968):

LFW1997-5997 𘅈 1lhy1 looks like a variant of 𘀮 3932 1lhy1
LFW1997-5998 𗤗 2zwiq2 has no (near-)homophone lookalikes (or any homophones)
LFW1997-5999 𗧄 (class IX initial; rhyme unknown) has no (near-)homophone lookalikes

Okay, make that two that puzzle me.

See Wu and Janhunen (2010: 48) for a brief discussion of Khitan small script ghost characters ("mistakes and overlappings").

I don't know of any discussions of ghost characters in the Khitan large script or the Jurchen (large) script.

19.10.21.23:59: KHITAN 'STABLES' AND 'ALTERNATORS': BEYOND ONSETS?

1. Last night while copying line 28 of the Khitan small script epitaph of 耶律迪烈 Yelü Dilie (1092), character 343 caught my eye in block 12 <p.al.s.gha.al.343> (as printed in Kane 2009: 205).

343 (left) looks a lot like 342 (right):

Visual similarity is no guarantee of phonetic identity or even phonetic similarity in the Khitan small script. Nonetheless the fact that 342 and 343 occur in similar contexts could imply that they are homophonous variants of each other:

<342.295.140> <342.p.en> 'wine-GEN'?
<342.311> <342.b> 'wine'
<343.295.140> 'wine-GEN'?
<343.311> <343.b> 'wine'?
<343.311.140> <343.b.en> 'wine-GEN'?

The only one of those five that has been identified with some degree of certainty is <342.b> 'wine', identified as the object in the Khitan object-verb expression corresponding to Chinese 酣飲 han 'rapturously drunk' in the bilingual Langjun inscription.

342 has even been regarded as a logogram <WINE>, though I doubt I am not sure that is the case since 'wine' is <342.b> and not <342> by itself. As far as I know, neither 342 nor 343 ever occur alone which would seem to rule out logogram status unless <342.b> is a redundant spelling with <b> added to clarify a final -b in the reading of <342>.

Most interesting of the five is <342.p.en> which might be a variant spelling of <343.b.en> , possibly 'wine-GEN'. Might 'wine' have [p] in final position but [pʰ] before a vowel like Korean /pʰ/ words such as /apʰ/ 'front'?

앞 <Ø.a.ph> /apʰ/ [ap] 'front'
앞에 <Ø.a.ph Ø.ŭ.i> /apʰe/ [apʰe] 'in front'

In earlier Korean, /apʰ/ in isolation was spelled phonetically as <Ø.a.p>¹. Khitan <342.b> and <343.b> may be similar phonetic spellings reflecting a final [p].

Khitan <342.p.en>, on the other hand, reflects a medial [pʰ].

So far I have been talking about alternations in Khitan initial consonants. Could there be alternations in Khitan medial and final consonants as well? If <342.p> and <342.b> both represent 'wine', how many other nononset alternators are there?

Jishi (2012) has read both 342 and 343 as [səm]. I don't know what the underlying reasoning is; I know of no Mongolian word like sem- for 'wine'. Moreover, if Jishi is right, <342.b> looks like it should be read [səmp], though it seems Khitan did not have final consonant clusters.

¹10.22.0:45: In post-1933 Korean orthography, 앞 <Ø.a.ph> is the spelling used in all contexts, but in premodern Korean, there were at least three spellings used before vowels:

아ㅍ- <Ø.a ph->
압ㅍ- <Ø.a.p ph->
압ㅎ- <Ø.a.p h->

All three have <(p)h> in a second block. See Yu (1964: 531) for examples.

2. Last night I forgot to mention another unusual Japanese spelling in the entry for 王冠・叡智の光 Gōremu Keterumarukuto:

原初 <ORIGIN BEGINNING>

for adamu (rather than the normal gensho 'very beginning'). Obviously the logic is that Adam is the first man.

In theory one could create a semantic compound for adam like ⿰亻初 <PERSON.BEGINNING>.

傆 <PERSON.ORIGIN> already exists and represents two morphemes, Middle Chinese *ŋuan 'anger' and *ŋuanʰ 'hypocrite' (related to 鄉愿 <VILLAGE HONEST.CAREFUL> *xɨaŋ ŋuanʰ 'hypocrite', whose logic eludes me?).

I would rather not create a semantic compound with gender-specific 男 <MAN> because 男 <MAN> forms few compunds, and ⿰男初 <MAN.BEGINNING> and ⿰男傆 <MAN.ORIGIN> are complex.

²10.22.0:38: Or three? The 大徐本 Great Xu edition (986) of 說文解字 Shuowen jiezi gives the fanqie 魚福切 *ŋɨə + *puk = *ŋuk and the gloss 黠 'shrewd'. But 原 is an *-n phonetic which would normally never represent *-k syllables.

3. Last night I was surprised to learn that 智 <WISE> in 王冠・叡智の光 Gōremu Keterumarukuto isn't a jōyō kanji even though it's so common and well within my figure of 2000 for must-know kanji. Here are its Shpika stats, Kanken level, and Jun Da's general Chinese rank:

Aozora	news	Twitter	Wikipedia	Kanken	Jun Da
946	1064	1129	882	pre-1	956

智 is mostly in names. One doesn't have to learn it in school because one will encounter it effortlessly in everyday life. Hence I don't think it belongs at the Kanken pre-1 level; lots of lower level characters are less frequent and more difficult. But I understand why it has such a high rank; non-jōyō JIS level 1 kanji are automatically assigned to pre-1.

4. Looking at the top of Jun Da's combined Mandarin and classical frequency list, I get the sense that the ratio of Mandarin to classical isn't 50/50. High-frequency Mandarin morphemes outrank their classical equivalents:

genitive: Md 的 (#1) vs. Cl 之 (#13)
change of state marker: Md 了 (#5) vs. Cl 矣 (#830)

了 has a broader range of meanings in Mandarin than 矣 in classical, so there should be a gap between them, but I would put 矣 in the top 100 characters for classicla students to learn.

'this': Md 这 (#11) vs. Cl 此 (#89)

5. I would never have guessed that 𢦕 <HALBERD.ALL> is an old variant of the change of state marker 矣. The visual gap between 𢦕 and 矣 is roughly comparable to the visual gap between many Khitan or Jurchen large script characters and their Chinese character translation equivalents. The hypothetical Parhae script ancestral to Khitan and Jurchen large script characters could have been as different from mainstream sinography as 𢦕 is from 矣.

6. Today I was surprised to learn that ridings in Canada have nothing to do with riding anything.

If English were written in a Japanese-like script, it might be spelled as something like 三部 <THREE PART> even though ri- < thri- no longer sounds like three.

7. Tonight I learned that 說文解字 Shuowen jiezi is a TV series!

19.10.20.23:54: THE FIFTH, SIXTH, AND SEVENTH APPROACHES TO KHITAN 'STABLES' AND 'ALTERNATORS'

1. Approaches 1-3 are here and 4 is here. In short, they are

the allographic approach (2-series)
the allophonic approach (2-series)
the phonemic approach (3-series)
the diachronic approach (3>2 series)

Approach 5 is the geographic approach. Are alternators characteristic of texts in certain locations? Might they reflect Khitan dialects in which the two series of obstruents merged (in certain environments)? I have not yet tried to answer that question.

The geographic hypothesis does not necessarily predict that all texts in a certain region will have only one type of spelling. I would expect nonstandard Khitan texts to have a mixture of 'correct' and 'incorrect' spellings. The 'incorrect' spellings would reflect the local pronunciation.

Suppose in standard Khitan, a word X is pronounced with initial [tʰ] and written with <t>.

Then suppose that in a nonstandard Khitan dialect, X is pronounced with initial [t] in some or all environments. Speakers of that dialect may spell X either with <t> (reflecting the standard or a [t]-allophone) or with <d> (reflecting the local pronunciation [tʰ]).

2. Approach 6 is the chronological approach. Suppose an alternator has two spellings A and B. Do all B spellings postdate the year X? If so, B may reflect a sound change predating the year X.

The chronological hypothesis does not necessarily predict that all texts after the year X will have only spelling type B. I would expect post-X Khitan texts to have a mixture of conservative type A spellings and innovative type B spellings.

3. Approach 7 is the KSL (Khitan as a second language) approach. Khitan was an official literary language of the Jurchen empire until 1192. There is no doubt that Jurchen was a 2-series language. What if Jurchen speakers had trouble with Khitan which may have been phonetically and/or phonemically different from their own language?

Ethnicity does not equal language. What if under Jurchen rule, some Khitan scribes came to adopt Jurchen as their first language and wrote Khitan with errors characteristic of Jurchen speakers?

The KSL hypothesis does not necessarily predict that all Jurchen Empire Khitan texts will only have 'wrong' spellings. I would expect Jurchen Empire Khitan texts to have a mixture of correct and 'incorrect' spellings.

I have no evidence in favor of any of the seven approaches. Nor have I tested any of the approaches by searching for counterevidence. I am still in an exploratory phase. Much basic work needs to be done to resolve the question of alternators:

compliation of a list of alternators
compliation of a list of all occurrences of alternators
cataloguing all occurrences of alternators by phonological context, time, place, and author

Many problems besides the alternator issue may be clarified by ceasing to view the Khitan corpus as a unified whole. Khitan was in use over centuries over a large area by by both native and KSL speakers, and clearly its spelling was never standardized.

4. In a draft of my last entry that I didn't upload, I mentioned the Khitan small script characters for 'ten thousand' as examples of characters with reconstructed readings that to the best of my knowledge don't seem to be supported by either Khitan spelling alternations or Chinese transcriptions of Khitan.

#	character	Chingeltei 2009	Wu & Janhunen 2010	Jishi 2012	Takeuchi 2012	Shimunek 2017
187		t'um	MYRIAD	ɔlu	tum	tum
312		t'um	tum	?	tum	tum

The reading t(')um is reminiscent of Mongolian tümen, Jurchen tume(n), and Manchu tumen, all 'ten thousand'. Although I am unaware of any alternate spelling like <tu.m> for 'ten thousand', I forgot that 312 seems to correspond to the Chinese transcription 圖木 *tʰu mu. So I was wrong to regard 312 as having an ungrounded reading.

I don't know Jishi's reasoning for reading 187 as ɔlu. I don't know of any Mongolian olu-word meaning 'ten thousand' or 'myriad'.

I don't even know if I should be treating both characters as logographs for <TEN.THOUSAND>. Why would a numeral have two different logographs? Moreover, both also function as phonograms in noninitial position where they cannot be interpreted as stems followed by suffixes: e.g.,

Gu 22.9 <323.341.187> <qi.er.187> (the only instance of 187 in noninitial position I know of)
Zhong 25.6 <020.098.051.312> <y.al.gha.312> (the longest block I know of with 312)

I know of no similar blocks with other characters in place of 187 and 312.

312 is quite common in noninitial position which makes me think it is a phonogram. Perhaps 187 is a logogram <TEN.THOUSAND> and 312 is a homophonous phonogram <tum(u)>.

5. The Sinogenesis hypothesis: Last week I wrote,

Liao Chinese had two series [of obstruents] in speech. If a speaker of a two-series language worked on the Khitan small script, he might project those two series onto Khitan.

"A speaker" doesn't necessarily have to be a native speaker of Liao Chinese (or any form of Chinese). Clearly Chinese was known to the ethnic Khitan elite. Khitan educated in Chinese would draw upon their knowledge of Chinese to devise the Khitan small script. The writing of most sylllables as onset-rhyme character sequences is reminiscent of fanqie onset-rhyme spelling.

The trouble with this hypothesis is that a Chinese-literate Khitan might be familiar with the Chinese rhyme table tradition which has three series, not two. The tables would have provided a prestigious precedent for three series in the Khitan small script. But the actual script only seems to have two.

At the moment I am turning away from approach 3 to the stable/alternator problem and favoring 4-7 which do not require three series, at least in the Liao dynasty. (I still think pre-Khitan could have had *three series and perhaps some Khitan dialects still had all three. But I don't think the dialect that was the initial basis of the small script had three series.)

6. Why did Noah Webster drop k from magick?

Since magical lacks a "k," it only makes sense (thought Noah Webster, and as far back as his 1806 dictionary) that magic too would go without a "k"—hence no magick in American English. The same reasoning gave us also our public and traffic spellings, as well as others. The words gimmick and haddock and maverick have no k-less relation (there is no gimmical, haddocal or maverical) to argue away their "k's."

So I wonder - how did magic, public, and traffic lose their K's in the UK?

Center turns out to not be a Websterism:

[Samuel] Johnson's choice [of centre] was, according to the Oxford English Dictionary, likely based on a very particular edition of an earlier dictionary by one Nathan Bailey: while all 30 editions of Bailey's dictionary featured the "er" spelling of center (which was likewise used in editions of Shakespeare, Milton, and others), the folio edition of Bailey's dictionary was the one Johnson used, and it employed centre.

7. WWII Japanese-language officer Gordon McLendon's Correct Spelling in Three Hours (1962) is a rarity: a modern English-language book not listed at Amazon! What rules did he think could be learned in three hours?

8. This is the most bizarre Japanese spelling I've ever seen:

王冠・叡智の光

<KING CROWN LIGHT ASTUTE WISE no LIGHT>

pronounced

ゴーレム・ケテルマルクト
gōremu keterumarukuto

< golem + כתר keter 'crown' + מלכות malkhut 'kingship'

rather than the expected ōkan eichi no hikari 'royal crown: light of wisdom'. Golem corresponds to nothing in the spelling, and eichi no hikari corresponds to nothing in the Hebrew.

Tangut Yinchuan font copyright © Prof. 景永时 Jing Yongshi
Tangut character image fonts by Mojikyo.org
Tangut radical and Khitan fonts by Andrew West
Jurchen font by Jason Glavy
zAll other content copyright © 2002-2019 Amritavision