(Return to the Table of Contents)

Chapter 6
WORDS AND GROWTH

6.1
Language and Growth 403
6.2
Seven Kinds of Lexical Growth 404
6.3
Making Composite Primitives 411
6.4
Making Complex Predicates 419
6.5
Borrowing Predicates 430
6.6
Rules for Rewriting Source Words 438
6.7
Rules for Adjusting the Endings of Loan Words 440
6.8
"Gluing" Trial Words 441
6.9
Curing Segmentation Problems 444
6.10
Some Non-Scientific Borrowings 449
6.11
The Borrowing Pathways and Their Products 450
6.12
Making Single-Source Primitives 452
6.13
Making Names 454
6.14
Importing the Linnaean Names of Biology 458
6.15
Little Words and Structural Change 462

Notes 466

6.1 Language and Growth

Languages behave in many ways as if they were alive. For one thing, all languages grow. For another, each language seems to grow in its own way, as if it had its own unique pattern of openness and resistance which causes it to change quite readily in some directions but not at all in others. For example, one of the ways in which English seems to grow is by swallowing the words of other languages whole. As Otto Jespersen, the late 19th, early 20th century Danish linguist, once put it, English eats other languages alive. Not so with French, for example, which prefers to redefine old words to meet new needs or coin new metaphors from old roots. Chinese also shies away from foreign loan-words and grafts new ideas onto ancient stock. Japanese, on the other hand, is more like English, adopting and repronouncing foreign words with gay abandon. Changes in the pronunciation of words over time seem also to follow special patterns which are unique to individual languages, or to small groups of closely related tongues. And it is a major, but so-far untested, hypothesis of linguistic theory that each language tends to develop its own metaphysical outlook on the world. All this has suggested to nearly all students of language that languages are rather idiosyncratic creatures, disorderly in externals but internally autonomous, and in any case very much alive.

Now if you plant something that you thought was a tree and it doesn't grow, then you were mistaken. What you planted was a stick: a poor wooden thing, stiff and unyielding, and without the essential expansiveness, that ability to stretch and swell and fill its niche, that is the very stuff of life. So with languages; if they are alive, they grow. Some partisans of the international language movement have argued that one of the advantages of a constructed language over a natural one, as a universal second tongue, would be that the constructed language wouldn't change. They may be right; but certainly it is a mistake to hope so. For just to the extent that such hopes are fulfilled, they fail. An unchanging language would be a useless corpse, suitable for ritual incantations, perhaps, but quite incapable of serving the changing needs of humans. Latin is said to be a "dead language"; and so it may be. But not because it isn't spoken. For it is obediently intoned in churches and schoolrooms even yet. If it is dead, it is because it has stopped growing. Nobody lives in the house of Latin anymore. Its vast, still rooms no longer echo to the sounds of scuffling life.

Now if Loglan is a language, one of the ways that we will know it, will be that it will grow. The ways in which Loglan has been built to grow are a very carefully considered part of its structure. We have realized from the outset that if the language is to be easily learned, its initial vocabulary must be very small. But even though small, that initial vocabulary must be capable of very rapid growth...growth both in the mind of the learner, as da encompasses the language, and in the community of speakers as they continually expand its corpus to match their expanding need. The machinery we have provided for rapid, spontaneous and yet orderly growth is the main topic of this chapter.

But first a warning. Natural languages grow rather slowly. The proportion of people who now speak English who will contribute a living word to it during their lifetimes is rather small. A few scholars will do it; a journalist now and then; a poet to implement a new perception; an inventor gazing fondly at the new thing da has made; or an occasional gifted urchin who, searching for a word de doesn't know, coins one in desperation that happens to stick among de's fellows and then lives on as slang. But the frequency of these events reckoned over the populations who speak these modern languages is very small. Most of us live with our native languages as we find them. To make new words ("neologisms," as they are called by pedants) is not approved in college classrooms. Word-building is a game we are taught not to play.

Not so with Loglan. If Loglan lives, it will be in part because its very structure stimulates the poetic impulse in a substantial proportion of its speakers. Or because the word-making game in Loglan turns out to be easy and fun to play. In most of this chapter we will be describing the rules of that word-making game. But first let us review the resources with which we start the play.

6.2 Seven Kinds of Lexical Growth

Let us take stock of what we have. At the present time (1989) the vocabulary of Loglan stands at around 9,000 words. Of these, about 300 are simple little words, the 1-, 2- and 3-letter structure words such as a, ia, ta and tai with which we have been mainly concerned in describing the grammar of Loglan; another 500 or 600 are compound little words, the less commonly used, longer structure words, such as pana and anoi, with which we have had less to do in this book; another few hundred are illustrative names such as Djan and Frans (the number of names in any language is of course unlimited; the set provided in the current Loglan lexicon simply illustrates the naming process); and the remaining 8,000, or more than 90% of the present vocabulary, are predicate words of various kinds.

Among the little words are the hard-worked common words, like le, la, da, pa and ba, some of which occur in nearly every utterance. Such words lay out the grammatical and logical structure of ideas. But if grammar and logic, through their little words, provide the bones of a language, predicates provide its infinitely varied flesh. Grammars, and the little words that impart grammatical structure, are necessarily finite affairs; the vocabulary of predicates, in contrast, is in principle infinite in size. This is true in any language. In theory, and perhaps also in practice, there may be many learned persons who know the whole grammar of English. No one knows the whole predicate vocabulary of English, and now that it has passed the half-million mark, no one ever will. Obviously, if we are concerned with how Loglan might grow, it is with the mechanisms for augmenting its predicate vocabulary that we will be primarily concerned.

Of the 8,000 predicate words that are currently defined in Loglan, approximately 1,500 are borrowings--most of those from the vocabulary of science--and another 1,300 are "primitive" in the sense that they are not derived from any other word or words in the language. The largest subset of the primitive predicates are the basic semantic building-blocks of the language, the 860 "composite" words like mrenu and fumna. Recall that these have been derived as broadly as possible from eight natural languages. But the 5,200 current Loglan predicates which are neither borrowed nor primitive, and which we accordingly call "complex," is by far the largest category of words in the language. These have nearly all been derived from two or more of the composite primitives...with an occasional assist from a little word like ne, no or nu. In English, 'ease' is a primitive predicate: 'easy' and 'uneasy' are complex. More obviously, 'black' and 'berry' are primitive; 'blackberry' is complex.

With relatively few exceptions, the predicates used in the specimen sentences in this volume have been composite primitives. Indeed, only about a hundred of the 860 composites have been used to write the specimens used in this book. These are the words like kamla and godzi, matma and farfu, ditca and kicmu, which we have encountered over and over again. Occasionally a complex predicate has been used to illustrate a grammatical point; but these have been introduced metaphorically as we went along. For example, faltaa [fahl-TAH-ah] ('liar') was introduced as 'false-talker'. The word is a complex formed of parts of the two primitive predicates falji ('false') and takna ('talk'), from which we learn that 'false-talker' is the basic metaphor underlying the Loglan conception of a liar. Similarly, mormao [MAWR-mough] ('kill') is made from mor- and -mao; and these parts come from the primitives morto ('dead') and madzo ('make'). So to kill something is to "make it dead" in Loglan. By a similar metaphor, a scientist is a "science-maker", someone who contributes to the growth of science; so the Loglan word for 'scientist' is sesmao. The word is made from the ses- of sensi ('science') plus -mao again. There are numerous -mao words in Loglan. For example, rojmao comes from rodja madzo or 'grow-make' and so means 'cultivate'. Agronomy can then be conceived as "cultivation-science"; and this metaphor can be rendered by the three-term complex rojmaosensi. Loglan for 'agronomist' then unfolds as the four-term word rojmaosesmao ("grow-make-science-maker"). In this way the pool of predicates can be indefinitely extended by the metaphorical elaboration of the fundamental ideas of the language. It is clear that the elaboration of new complex predicates by the invention of new metaphors will be the major source of future word growth in the language.

But it is not the only source. Our studies of these matters over the years have suggested that there are at least six further ways in which the vocabulary of Loglan will continue to grow. In addition to new complexes, there is a small pool of what might be called "international" words--words like 'telephone', 'football' and 'beer'--which has been only partly incorporated into Loglan. Most of these now-international words originally spread across the planet in the wake of commerce or conquest during the centuries of European expansion; so they are nearly all European words. But despite their local origins, they are now universally recognized. They are therefore ready to be added gratuitously, as it were, to the vocabulary of Loglan. Usually they are incorporated as primitive-form words: futbo, telfo, paspo. Others come in as longer borrowings: tcokolate. About a hundred of these words have already been brought in. Perhaps a few hundred more remain.

A morphological aside. When a Loglan word is primitive in form--i.e., when it looks and sounds like a primitive, as telfo and futbo do--but is derived as these words are from only one or a few languages, it is called a one-source primitive in order to distinguish it from the composite primitives, like sensi, morto and madzo, which are derived from as many languages as possible. Complex predicates like mormao and sesmao are largely made, of course, from composite primitives; for it is these that have been provided with the short affixes, like ses-, mor- and -mao, that are the principal building blocks of the language. But, morphologically, locally-derived primitives like futbo are also primitive...even if they are not semantically so. Words that are neither primitive nor complex, but are still predicates, are called borrowings. These are words like trombona, iglu and proteini which are obviously predicates (they have all the necessary features) but are neither primitive nor complex in form. Armed with these few principles, let us continue our survey of the possible sources of future word-growth in Loglan.

In addition to the international words--which come into Loglan either as one-source primitives or borrowings--there is a second, even larger pool of "native" words which, when their meanings are required by Loglan scholars and story-tellers, will probably also be adopted. These are the local words for languages, peoples, artifacts, articles of food and dress, or even local plants and animals. Such things are often unique to the places where they originated--or at least, like kayaks and parkas, they were originally--and so they should obviously be predicated in Loglan by words which resemble the local words for them as closely as possible. Most of these words, like Innuit 'igloo', are primitive notions to the original users. So they should probably not be built as complexes even though they could be. Thus nichaa [neesh-HAH-ah] ('snow-house') is really not an adequate translation of Innuit 'igloo'. As it happens, the Loglan borrowing can be an exact imitation of the original Innuit word, namely iglu.

By a similar route Swahili simba for 'lion' also entered our international language intact. Like parka, the natural word 'simba' already has the shape of a Loglan primitive. Since the once-widespread lion is now virtually confined to Africa, it seems only fair to use the most widely-used African word for the beast. The Innuit word 'kayak' also names a unique category of objects on this planet, and so also deserves to be preserved and not metaphorized. But as the word /KAiak/ is vowel-rich and consonant-final, it is more difficult to import into Loglan than iglu, parka and simba were. We will learn how to borrow it in Section 6.5.

Another source of locally-derived words are the language-nationality-culture triplets which we find in Loglan: words like spana, spani and spano. As we have already noted, all three of these were derived from Spanish 'Espan(i)ol' and mean in some sense 'Spanish'. Such words are brought into Loglan either as one-source primitives, as these three were, or as borrowings. So these, too, form a large pool of importable natural words only a small portion of which has yet been imported. In addition to the "Spanish" triplet above--spana for 'is an instance of the Spanish language', spani for 'is a Spaniard/Spanish person', and spano for 'is a Spanish custom/instance of Spanish culture'--we also have, with parallel meanings, dotca/-i/-o from German 'Deutsch' and 'deutsche', hinda/-i/-o from Hindi 'Hindi' and 'Hindu', ponja/-i/-o from Japanese 'Nippon' and 'nippon-ji', and junga/-i/-o from Chinese 'Zhung' and 'Zhungwo'. There are perhaps a dozen more of these primitive-form triplets in the current lexicon. But this just scratches the surface. There are at least 5,000 nameable languages on this planet; and a distinct people and culture--or at least a sub-culture--can usually be associated with each language.

There is still a fourth body of internationally-used concepts waiting to be incorporated into Loglan, and it is the largest yet. This is the international vocabulary of science. Of these, by a conservative estimate, there are now upwards of a quarter of a million terms. Science words were mainly derived from Latin or Greek by European scholars during the few centuries since the European Renaissance, and they are found similarly-spelled, or spelled with only minor and often quite regular variations, in nearly all European languages. Indeed, when suitably transliterated into other writing schemes, they are to be found in all the other world languages whose speakers now "do science".1 In incorporating these and other scholarly words into Loglan, we regularly use not the sound but the appearance of the scientific word as our model. We do this because, while scientific words are often pronounced very differently in different languages, their appearance on the printed page is usually quite uniform over many languages. The uniformity is often faithfully preserved even when the original Graeco-Latin word has been transliterated into some other alphabet, for example, into Cyrllic, the alphabet of Russian. So it is the scientific reader, not the listener, we must keep in mind when we are borrowing a scientific word for Loglan.

In making a scientific borrowing we try to retain those portions of the international word that are most consistently retained in other languages. This means avoiding the special endings that are often imposed by certain languages as well as language-specific spellings. At the same time, the resulting word must be a Loglan predicate, made with Loglan phonemes, and so display at least one consonant-pair as well as being vowel-final. It must also be regularly pronounceable as a Loglan word however it is spelled. Proteini [proh-TAY-nee] is an example of a word which meets all these requirements handsomely. It not only captures the look of what amounts to the "same word" in all European languages, but its final vowel is the standard /-i/ given to all Loglan scientific borrowings. So to a Loglan reader it has the unmistakable look--and sound, when da pronounces it--of a scientific borrowing. As many science words in Loglan do, it sounds a bit Italian. But that's fair; the scientific Renaissance commenced in Italy.

Not all ISV words ("International Scientific Vocabulary") fare so well. Take the word that is spelled 'atom' in English. Adding the conventional ending -i to the English word (which happens also to be the invariant international stem) gives ?atomi. (A leading question-mark, recall, is the mark of a trial word.) But heard in a Loglan utterance this trial word would break up as the little word phrase a to mi = 'Or two of me'. Thus ?atomi lacks the essential consonant-pair that is the distinctive feature of the Loglan predicate. But this lack is easily supplied. We insert a "gluing consonant"--conventionally the phoneme /h/--between the /t/ and /o/ of ?atomi, and this gives the word the required consonant-pair: ?athomi. Moreover, it puts the consonant-pair in a position that will prevent the trial-word from breaking up. The result is athomi [aht-HOH-mee], a word that now passes all tests. While not so immediately recognizable as proteini is, the Loglan word for 'atom' is still recognizable on second glance... especially by loglanists who know the gluing art and understand the vital need for it.2

Probably the International Scientific Vocabulary is the largest single population of words that will ever be incorporated into Loglan. Now approaching uniformity in all the languages whose speakers follow science, the terminology of science has become a truly planetary linguistic phenomenon, its numbers probably now growing more swiftly than that of any other body of words on the planet. In a sense it is already an international language, one waiting only for its grammar. Loglan may supply that international grammar. Sections 6.5-9 will be devoted to the art of ushering this flood of international words into Loglan.

So far we have mentioned making complex predicates and borrowing three kinds of concepts: international but once-local words in common use, language-nationality-and-culture words, and the vocabulary of science. We have treated these as four sources of potential growth for Loglan. A fifth but much smaller source of future growth is that we may yet find that the present set of semantically basic predicates, the composite primitives, is not yet complete...that is, not yet entirely adequate to the growth needs of the language. It is these semantically basic words, of course, that are employed in building complex predicates. They represent concepts that are nearly universal in human experience, and are represented by primitive predicates in nearly every human language. We will know that the Loglan set of them is incomplete when we find that words that we wish to build as complexes in Loglan may not be built without certain constituent notions that are not yet in our language.

As I mentioned, there are at present about 860 of these composite primitives in Loglan. It is pleasant to report that their number has grown only very slowly over the years. They have been tested three times for constructive adequacy against lists of the most frequently-used concepts in four major European languages (English, Spanish, French and German). So far they have passed these tests with flying colors. That is to say, they have proved collectively capable, with a negligible addition-rate, of supplying metaphors for all the complex notions we have so far encountered in these languages.3 But they have not been tested against languages which are representative of other language-families than the Indo-European one. It will therefore not be surprising, on Whorf's hypothesis, to discover that in embracing the concepts of the non-European languages, as we are now on the verge of doing, a handful of new "fundamental notions" may yet be required. If so, a new handful of composite primitives will have to be added. These will then be used to express the complex notions of still other major branches of human thought.

As their name implies, composite primitives are constructed compositely from words drawn from many languages. This procedure is described in Section 6.5 below.

A sixth source of word-growth is obviously inherent in the privilege every loglanist has of adding new names to the language. These may be either imitations of natural names, or entirely new coinages. There is in principle no limit to the number of names that may be added to the language. The making of names is discussed in Section 6.13.

Seventh and finally, there are certain "open" classes of CVV-form little words to which new members may be added from time to time. Chief among these are the modal operators, such as lia = 'like', and the discourse operators, such as sui = 'also', which may be indefinitely augmented. In addition, compound attitude indicators, like aiui = 'Yes, gladly', may also be added at will. Changes in, or additions to, other classes of little words will be more difficult to accomplish; but it is probable that needs for new punctuation words will be discovered, and it is even conceivable that whole new systems of little words will have to be incorporated to accommodate grammatical features of natural tongues which have so far been overlooked in our analyses. The unused CVV-form words listed in Paradigm L of Loglan 4 & 5 are available for making these additions. The V, VV and CV word-spaces have long ago been exhausted.

In sum, there are seven sources of future word-growth for Loglan: (1) the addition of new complex predicates (like djasolsensi for 'sociology of knowledge'); (2) the incorporation of international words (like futbo) and (3) local food, tool, plant and animal words (like gorgonzole and atlatlu), (4) the coining of language, nationality and culture triplets (like inhuita/-i/-o = 'Innuit' or 'Eskimo'), (5) taking in yet more of the international vocabulary of science (like deoksiribonukli), (6) bringing in new person and place names (like Betcua'naland), and finally (7) inventing new compound little words (like pacenoinacefacenoifazu = 'once-and-future-but-not-now-and-not-for-long', an ad hoc invention for the present paragraph) and the assignment of meanings to little word forms presently unassigned. In the remainder of this chapter we will consider how these seven elastic chambers of the language may be utilized by the speaker who wishes to contribute to its growth.

6.3 Making Composite Primitives

The composite primitives of Loglan are the semantically universal predicates of human experience. Such words have all been made in Loglan by deriving them from the eight source languages in such a way that the probability of recognizing them, as measured over the population who speak one or more of these eight languages, has been maximized. In general this has been done by ensuring that each phoneme in the constructed word appears in as many natural words of similar or related meaning as possible. The result is that these Loglan primitives have been made of overlapping pieces of natural words. It is in this sense that they are composite words.

The decision to add a new composite primitive to the language is a major one. Not only is the work involved many times greater than that required to make a complex, a borrowing, or a local primitive, but the presupposition that the machinery for making complex predicates either cannot or should not be applied to making a new predicate word for the needed notion, or that the notion is not already conveyed by a word that has spread internationally, which would justify our borrowing it, or by one that is of only local origin or significance--in which case, for a different reason, we would also be justified in borrowing it--is, by this time, likely to be a questionable one. To fail to find a suitable metaphor does not mean one cannot be found; and the discovery of a new, truly basic primitive is, by this time, an exceedingly rare event.

One late arrival in the set of primitives will illustrate this. It was the word setci = 'set'. The word klesi for 'class' or 'category' had been around for a long time. But there are many collectivities which are not classes, in that their members do not share any taxonomically useful characteristic--as mammals do, for example, and divorced American mothers also do--and which are, therefore, only sets. For example, the 37 objects bigger than dust-speck that are currently on my desk constitute a set, as do every tenth word in this book. The distinction between sets and classes is semantically fundamental...in a logical language. Moreover, there are certain complex notions--like 'genotype', 'character', 'kin', and so on--that can hardly be defined, or even thought about, without using the notion of a set. Klesi, to be sure, could have been made from the idea of a set; but not the other way round. Klesi was already deeply embedded in the semantic structure of the language; and anyway, until their differences from sets have been analyzed, classes and categories are more likely to be objects of direct perception than sets are. One sees the cardinals flitting through the woods as representative of their larger, unseen class: the species Pyrrhuloxia cardinalis. But one does not see the scattered objects on one's desk as members of any collectivity...until, that is, one begins to think metalinguistically about just such phrases as 'the objects scattered on my desk'. It is from such historically late thoughts about meaning that the concept of sets arises. But late or not it is fundamental to our hypermodern language. For all these far-flung reasons, then, setci was reluctantly added to the primitive notions of Loglan, and klesi was retained.

Still, the occasion will again arise--though one would hope, infrequently--when a concept of truly general human significance will be found to have no good translation into Loglan and for which the existing set of primitives provides no ready route by which to express it metaphorically. As mentioned in the previous section, just this occasion may arise when and if the productive adequacy of the existing primitive set is tested against the complex concepts of some non-Indo-European language, say Chinese; for when this is done it may turn out that a supplementary set of "fundamental notions" will have to be added to our international language. It is well, then, to record here the methods by which composite primitives have already been built, and by which new ones may still be added to the language.

In order to achieve a kind of cultural neutrality for Loglan, its primitive predicates were formed as composites of words of similar meaning in the natural languages. As I indicated while discussing Loglan design principles in Chapter 1, this should make cross-cultural experimentation with Loglan not only possible but fair.4 To this end, it became our technical objective to make at least this one very important category of Loglan words--arguably the most important, for among them are the elementary semantic notions of any language--maximally recognizable over the largest possible population base...or at least one large enough to afford a variety of both linguistic and cultural contrasts around which to design cross-cultural experiments of a Whorfian cast. This objective was easier to meet than it now sounds. The first eight languages with which we tried to meet it turned out to be entirely adequate--perhaps more than adequate--to the task. For the total amount of recognizability it was possible to build into Loglan words from a consideration of just these languages was surprisingly large.

The languages chosen were, in the order of the combined totals of their primary and secondary speakers in 1955: English, Mandarin Chinese, Hindi, Russian, Spanish, Japanese, French and German. These are the eight most widely spoken human tongues. The trial addition of the ninth language, Arabic, while it would have appreciably increased the size of the population base, as well as the linguistical and cultural variety of future experimental designs, turned out to make no significant contribution to the average recognizability of our composite words; so Arabic was not retained. Tests in the other direction--that is, of the effects of dropping German, French, Japanese, etc., in reverse order from our list--were not carried out. We were satisfied with what may well be the over-achievement of our aim.

Linguistically considered, this set of what we may now call the "source languages" of Loglan contains representatives of three language-families: Sino-Tibetan (Chinese); Japanese, which is sui generis; and Indo-European (all the rest of the source languages). Looked at geographically, the source set contains the three most important Asiatic languages--Chinese, Hindi and Japanese--as well as the five most important European ones. Among the source European languages, there is a Slavic language (Russian); two Germanic languages (German and English); and two Romance languages (French and Spanish). Some of the most interesting structural differences among human languages, from the Whorfian point of view, are spanned within this group of eight. As to culture, differences large enough to satisfy the most eager Whorfian also exist among the peoples represented in this group. The Chinese-Western difference, the Chinese-Japanese difference, the Indian-Western difference, the Romance-Germanic one--differences which express themselves in some of the widest philosophical, aesthetic, religious, political, metaphysical and ideological contrasts now partitioning the planet--all exist within the domain of Loglan's carefully contrived neutrality to be experimentally explored.

The population from which subjects for these experiments might be enlisted, and about which, therefore, conclusions might be drawn, is also satisfactorily large. The number of human beings who spoke one or more of the eight source languages as a native tongue, or, if none of them, then one or more of them as a second language, was, on the statistics available in 1955, roughly three-quarters of the population of the Earth.5 The average recognizability score of a Loglan composite word over this immense source population is 45%. This figure means that a person selected at random from this population would have the mathematical expectation of finding that about 45% of the sounds of some word of related meaning which da knows, appear in the same order in each new Loglan word with which da might be presented. Given the linguistic diversity of the source languages, this figure is surprisingly high. One would not have thought there would be so much commonality in the sounds of such historically diverse languages. But in part this high figure derives from the generosity of the Loglan five-letter form. For very often two very short natural words can be accommodated, as it were, side by side. This often happens for example, with Chinese and English, the two dominant languages on the list, both of which abound in short words. Thus Loglan forli ('strong') contains three-fourths' of English 'fort' followed by all of Chinese 'li'.

For the immense number of primary and secondary speakers of English--now reckoned at well over 30% of the world's five billion people--the probability of finding recognizable phonemes in a Loglan composite primitive is substantially more than .45; in fact it is about .70. English speakers are not only marginally more numerous than the second largest group, namely speakers of the Beijing dialect of Chinese, but unlike Chinese, English has lexical affinities with three other languages on the list. It shares some of its vocabulary with both Romance languages in the source set as well as other words with the other Germanic tongue. In other words, English has more linguistic allies in the source set than any other language in it. This, too, contributes substantially to the high average amount of English to be found in the typical composite word. Japanese, being sui generis, and so with the fewest allies, is also among the less widely spoken languages on the list. So it makes the smallest contribution to the Loglan composite word, with an average containment of Japanese phonemes of only 12%. The wide spread between these two figures is a consequence of our method of making--or, more accurately, discovering--high-scoring composite sequences, which is the method we will now describe.6

To maximize the probability of recognition of a given word over the source population, it was only necessary to weight the contributions made by each language to each trial word by a number which represented the proportion of the total source population who were speakers of that language.7 Then the trial word that demonstrably had the highest weighted-score was taken to be the best Loglan word available for the concept. The proportions, or scoring weights, which were used in this work are given below:

English .28
Chinese .25
Hindi .11
Russian .10
Spanish .09
French .06
Japanese .06
German .05

Let us now consider some examples of how these weights were used.

For the concept 'week' the trial word which received the highest R-score ("recognition score") was likta; and likta contains:

2/3 of English 'week' /uik/ yielding 2/3 X .28 = .19
3/5 Chinese 'li bai' /libai/ 3/5 X .25 = .15
3/7 Japanese 'isshukan' /iscukan/ 3/7 X .06 = .03
2/5 Hindi 'saptah' /sapta/ 2/5 X .11 = .04
3/9 Russian 'niedielia' /niedielia/ 3/9 X .10 = .03
Total R-Score: .44

The phonemes in the natural word which were counted as matching phonemes in the Loglan word are given in boldface.

Likta is a typical word in more ways than one. Not only is its score very near the average, but the fact that the only substantial contributions to its recognizability are made by English and Chinese is also typical of a very large group of composite primitives.

Here is the derivation of a fairly high-scoring word, djano = 'know', to which three languages made substantial contributions; for djano contains:

2/2 English 'know' /no/ 2/2 X .28 = .28
4/4 Hindi 'jan-na' /djan/ 4/4 X .11 = .11
4/5 Chinese 'j dao' /djdao/ 4/5 X .25 = .20
Total R-Score: .59

Note that the suffix '-na' is omitted in the phonemic transcription of the Hindi word 'jan-na' and that English 'know' is not transcribed as /nou/, as it would be if we were concerned with the exact phonetics of this word. This is because English speakers hear this diphthong as /o/.

As a third example, consider the derivation of a low-scoring word, dzoru = 'walk', which happens to be exclusively derived from the two Far Eastern languages; for dzoru contains:

4/4 Chinese 'dzou' /dzou/ 4/4 X .25 = .25
2/3 Japanese 'aru-ku' /aru/ 2/3 X .06 = .04
Total R-Score: .29

Again, a suffix, '-ku', is ignored in the calculation.

Here is another moderately high-scoring word, morto = 'dead'. This one comes exclusively from the six Indo-European languages; and it contains:

3/3 Spanish 'mor-ir' /mor/ 3/3 X .09 = .09
3/3 French 'mort' /mor/ 3/3 X .06 = .06
4/5 English 'mortal' /mortl/ 4/5 X .28 = .22
3/4 Hindi 'mrit' /mrit/ 3/4 X .11 = .08
2/3 German 'tot' /tot/ 2/3 X .05 = .03
3/5 Russian 'smert' /smert/ 3/5 X .10 = .06
Total R-Score: .54

Each word adopted was, of course, the survivor of a competition among as many as a dozen trial words. In nearly all cases the accepted word was demonstrably the best word that could be made from the lexical materials considered for each language. The exceptional cases were those in which the best word conflicted with an existing word, and so a slightly lower-scoring word had to be adopted.

As the above derivations show, a natural word was held to make a non-negligible contribution to a trial word only under fairly rigid circumstances. Any sequence said to be a "matching" one had, in fact, to meet the following three conditions:

  1. At least two phonemes in the natural word had to match phonemes in the trial Loglan word.
  2. If exactly two phonemes matched, then they had to occur in the same order in both words and be either (a) adjacent in both words, or (b) separated by exactly one consonant in both or by exactly one vowel in both.
  3. If three or more matching phonemes occurred, they had only to occur in the same order in both words to be counted. That is, no match between the phonemic neighborhoods was required.

In other words, we found that if only two phonemes were involved in a given match, they had to be reasonably close together and found in the same consonant-vowel pattern in both words. Thus the /aCo/ pattern in ?flako was said to match the /aCo/ pattern in 'tabol', but this sequence was not taken as matching either the /aVo/ in 'saio' nor the /aCCo/ in 'tabro'. On the other hand, we found that if as many as three phonemes were matchable between the trial and the source word, then the similarity between them was evidently more robust. For it apparently didn't matter whether the contexts matched or not. We found that the order of even three phonemes continued to matter, however.

What sounds of what source languages were taken to "match" what Loglan phonemes was a fairly complex problem in local phonemics which was settled differently for each source language. In Spanish, for example, any 'a' is said to match Loglan /a/; in English, only the 'a' of 'father', the 'o' of 'not', and the 'a'-sound occurring in diphthongs like 'aye' and 'ow' were said to match Loglan /a/. But because it "sounds like" 'o' to English speakers, the English final diphthong /ou/, as in /nou/ = 'know', was said to match Loglan /o/. In French, nasal 'a' and 'o' were taken to match Loglan /a/ and /o/ respectively provided they were followed by /n/ or /m/ in the Loglan words, for then it seems so to French ears. In Chinese, the sounds represented by 'hs' and 's' in the Wade system of transcription were both taken to match Loglan /s/; the sound written 'j' in Wade but with 'r' in the Yale system and in Pinyin is matched with Loglan /r/ even though this sound is not recognizably 'r' to any European ear; and the Chinese sound written 'ssu' in Wade, 'sz' in Yale, and 'si' in Pinyin is unmatchable with Loglan because the sequence /sz/, which is its approximate phonemic value, is proscribed in Loglan predicates. Of course all eight source languages have sounds that do not match any Loglan phonemes at all. But the above remarks will give some idea of the problems encountered in comparing the sounds of Loglan trial words to the words of several phonologically quite different languages simultaneously, and some of the ways in which these problems have been solved. In general, our ruling principle has been to set up such identities between Loglan and each source language as would fairly predict which sound-pairs would seem "recognizably identical" to a listener from that language, and which would not.

The recommended procedure for contributing a new composite primitive to the language is as follows: First, make a list for each of the eight source languages of all possible words in that language which might serve as mnemonic cues to the concept to be defined. They do not have to be synonyms.8 Eight good pronouncing dictionaries will be required. Second, transcribe each such potential cue-word into Loglan phonemics. Use such matching principles for each language as you can devise, or which you can induce from the derivations given in the Loglan-English dictionary. Use asterisks (or some other non-phonemic character) to record non-Loglan sounds in these transcriptions; for although they can make no contribution to a trial word, they must be counted as contributing to the overall length of each cue-word.9 Third, make trial words. Each word must, of course, be of either CV'CCV-form or CCV'CV-form, and any initial or medial consonant-pairs must appear in Tables 2.1 and 2.2, respectively, of Chapter 2; that is, they must be permissible. In making trial-words, you may be guided by four hypotheses, given here in the diminishing order of their liklihood: H1: The best word will maximize the joint contribution of Chinese and English. H2: The best word will maximize the contribution of English. H3: The best word will maximize the contribution of Chinese. H4: The best word will not maximize either Chinese or English, or Chinese and English jointly, but will capitalize on some adventitious commonality among cue-words of other languages. H4-words are exceedingly rare. There may be, of course, several trial-words to be tested under each hypothesis, and words produced under H1, H2 and H3 may be identical. Fourth, prove that the highest-scoring trial word or words is in fact the best possible word on the lexical materials you have assembled. Fifth, ascertain whether the best word conflicts with some existing word, and if it does, adopt the best of the words that do not conflict.

A trial-word is said to conflict with an existing word if (a) they are phonemically identical (no homonyms are allowed), (b) they differ only in their final vowels (likta and ?likti would conflict, for example), or (c) their only difference is a pair of consonants that occupy adjacent vertices on the "square of sibilants":

                          s------c
                          |      |
                          |      |
                          z------j

What this third kind of conflict means is that a pair of primitive-form words that differ only in the /s c/ difference, for example--as cimra and ?simra ([SHEEM-rah vs. SEEM-rah] would--are very likely to be confused in conditions of moderate noise; so this may not be the only difference between them. (Cimra/?simro would have an acceptably larger difference between them, for instance.) /c j/, /j z/ and /z s/ as only differences would also lead to word-pairs that are likely to be confused. However, the "diagonal differences" /s j/ and /c z/ in the square of sibilants are quite acceptable. For example, monca = 'mountain' and monza = 'morning' function side-by-side in the language with no evident problems. This is probably because two phonetic features discriminate the diagonal pairs--specifically the presence or absence of voice and the front-back position of the tongue--whereas only one feature discriminates the pairs on the sides of the square. Our research has shown us that differences in two features are always sufficient to distinguish word-pairs like monca/monza and jurna/surna even in conditions of moderate noise. Notice, also, that words differing only in their stressed vowels, like kerti/kurti, are quite distinct; whereas the same phoneme-difference in an unstressed syllable, as in larte/?lartu, would have generated conflict had we allowed such phonologically close pairs to exist in the language.10

Members of spana/-i/-o-type triplets do, of course, conflict with one another by this last criterion. But this is allowed. In fact, it is a deliberate feature of these semantically closely-related sets of local primitives. The fact that they are similar in all but their final vowels signals the close semantic relationships between the members of these language-nationality-culture triplets.

When you have made your new composite primitive, and assured yourself that it generates no conflicts in the existing lexicon, you may send your new word with its definition and derivation--arranged more or less as the dictionary entries for composite primitives are arranged in Loglan 4 & 5--to the address of The Institute given on the title page of this volume. Accompany it with a brief argument as to why you think we need it. For example, lists of useful complexes which might be made with your primitive would be germane. If, in the judgement of the Word-Makers Council, your word and the concept which it expresses are useful additions to the Loglan composite primitives, it will ultimately appear in the next edition of our dictionary, and, in the meantime, notice will be given of its acceptance, along with other current new words, in the various bulletins to that effect that appear from time to time in The Institute's other publications.

6.4 Making Complex Predicates

To add a new complex predicate to the language one must first coin a metaphor or metaphors capable of suggesting the meaning of the new predicate to future learners of the language. Remember that for the forseeable future Loglan will be learned as a second language by adults, only later by children. It is in this adult context that the importance of good metaphors will be seen. For example, the metaphor "sign-know" is very likely to suggest to a newcomer to Loglan the meaning of a predicate that is well-translated by English 'understand', especially if it is to be used in the sense of knowing the meaning of some sign or message. Surely it will do so a little better than 'under' plus 'stand' have ever succeeded in doing for adult learners of English!

Second, after one has made a metaphor, one must make sure that this metaphor or these metaphors are in fact expressible in Loglan primitives or borrowings. The metaphor "sign-know" is so-expressible; for both sanpa ('sign') and djano ('know') are elemental words...in Loglan as well as English.

Third, one must make a trial word or words from the Loglan version of each metaphor, making sure that it or they are in morphologically permissible form. For example, sanpa has two short affixes, san- and -saa, while djano has but one: dja. So the only possible 6-letter renderings of this complex notion are ?sandja and ?saadja. The morphology tells us (Table 2.3) that the */n+dj/-joint is not permissible. In fact it is unintelligible. The /d/ becomes inaudible in this context; so the string that is intended to be */ndj/ will be heard as /nj/. Hence ?sandja will be heard as sanja, which is primitive in form and thus misinforming. So if we want to use san + dja, we must buffer the */n+dj/-joint with the hyphen /y/. The result is sanydja [SAHN-nuh-jah], a 7-letter word. The hyphen looks awkward for so common a word. So we decide we prefer the one surviving 6-letter form saadja [sah-AH-jah].11

Finally fourth, one must discover which, if any, of the trial-words is in fact still available in the sense of falling within the remaining "free word-space" of the language. At the time it was made, the saadja-slot was still free, so saadja was adopted as the word for 'understand'. The metaphor was adopted in 1962, and the word itself was remade with the new morphology in 1982.

Let us take up these points one at a time from the perspective of the maker of a new complex.

Behind every complex predicate, old or new, stands a metaphor. One's insight into the meaning of that metaphor may be as sudden as a hammer-blow, like the immediate understanding that morto madzo ('dead-make') can mean nothing else than 'kill', or, like the French phrase 'savoir faire' ('to know to do'), one's understanding of its meaning may sink in only slowly over the first half dozen occasions of its use. (The same metaphor in Loglan, by the way, is durzo djano = 'do-know'; this is the thought that lies behind the Loglan complex predicate duodja [doo-OH-jah] = 'know how to do'.)

Sometimes one's understanding of a metaphor depends on the contrasts it makes with other, similarly constructed ideas. For example, the difference between the active kind of knowing conveyed by duodja and the more passive sort conveyed by saadja is a larger difference than the one between saadja and siodja [SYOHD-jah], which also means 'understand' but this time in the very different sense of comprehending the workings of some individual or system. Siodja, happily enough, is derived from the metaphor sisto djano, or 'system-know', which is an idea that seems very satisfactorily to convey the subtle difference between these two basic kinds of understanding ('I understand him' vs. 'I understand what he is saying').

A good metaphor must also avoid the emptiness of one term's being included in the sense of the other. If the meaning of one term in a metaphor is already implicitly contained in the meaning of any of its companions, then including it does not add much. Thus ?bersakli = berti sakli ('carry-sack') does not express the idea of a suitcase very effectively because (nearly) any sack or bag can be carried. ?Berbao = berti bakso ('carry-box') is an improvement since perhaps not all boxes can be; but ?berbao does not quite yet make the point about luggage. Racysakli [rah-shuh-SAHK-lee] made from traci sakli ('travel-sack') is a better metaphor since travelling is at least a surprising thing for a sack to do; but the word itself is awkwardly long. Note that the proscribed sequence */cs/ in ?racsakli required hyphenation. Moreover, there is no 3-letter affix for sakli. So racysakli is perhaps longer than we'd like the word for 'luggage' to be. So the metaphor we first chose to convey the idea of a piece of luggage in Loglan was traci bakso ('travel-box' or 'traveller's-box'). This yielded the complex racbao. We decided that this word was best for 'suitcase' because (i) all pieces of luggage travel, (ii) not all boxes travel, and (iii) nearly all luggage these days is rectilinear or boxlike in shape. Indeed, those that are not--like the duffle bags that sailors still use--could well be called racysakli ('travel-sack') in Loglan, a word with plain affinities to racbao. But having a pair of words for different kinds of luggage then suggested that the general term for luggage ought to be not racbao but racveo [rahsh-VEIGH-oh], the second part of this new complex being derived from veslo, a word which means 'vessel' and is the generic word for 'container' in Loglan. Thus racveo could mean 'travel-container'. So after a number of false starts it appeared that we had at last arrived at our destination. The lesson to be learned from this story is not to fall in love with one's early metaphors. Let them lie around unloved for awhile. If you do, better ones are very likely to come along.

Some metaphors require a fairly close analysis of the concept to be expressed. The English metaphor to be 'on the verge of', in the sense of being about to do something, is an example of an idea that defied successful metaphoric capture for some time. We tried moidru from modvi durzo ('intend-do') and durmoi from durzo modvi ('do-intend'); but intention is not the essential feature of an actor on the verge of something. Da may have been intending da's action for some time. It is the fact that da is on the edge of acting now that distinguishes a person about to do something from one who only placidly intends. Once this is seen, a happy metaphor is immediately forthcoming; and the Loglan word for 'be about to do' became durbiesni [door-BYESS-nee], a contraction of the phrase durzo bidje snire, a phrase which in turn translates literally and happily as 'do-edge-near'. So in the end we went back to the figure lurking behind the English metaphor. A person who is "on the verge of" something is indeed "near its doing-edge".

One might conclude from these examples that metaphor-making is essentially a poetic act. This would not mean, however, that only poets can do it...or if it did, that would not be much of a restriction. For there is probably a poet lurking inside every human head.12

Occasionally one makes two or more related complexes at once. Thus groracbao meaning 'trunk', made from groda racbao ('big-suitcase')--actually, from groda traci bakso ('big-travel-box'); for the terms of metaphors within metaphors must also be meaningful when unravelled--was made at the same time as racbao was, and suggests that the essential difference between a suitcase and a trunk is size. And so it is; for both are boxlike and both are luggage. And both contrast in shape with racysakli, the duffle-bags. On the other hand, all are racveo, the travel-vessels. Thus by contriving this quartet of words all at once, we permit racveo to be used generically, whereby Kambei lomi racveo eo ('Bring my luggage, please') may be expected (in Loglandia) to fetch all one's luggage however varied.

Suppose, now, we have a satisfactory metaphor or set of metaphors for some new predicate idea we wish to express, and that we have been able to write these metaphors in Loglan using only the existing set of primitives. How then do we build trial words? The complex-making procedure is algorithmic; which means that a computer program can do it for you.13 This is how that program works:

Step 1. It assembles all possible affixes for each position in the metaphor. Thus for sanpa djano it assembles:

1st Term 2nd Term
san dja
saa djano
sanpy

Note that a CVC-form affix need only be considered for non-final positions. CCV- and CVV-form affixes may be used in any position, but, as we'll see in Step 3, if one of the latter is initial in a 3-term or longer complex, it must be hyphenated to the body of the word to keep it from "falling off".

Step 2. It then generates all possible trial-words. In this case, there are six of them:

?sandja ?sandjano
?saadja ?saadjano
?sanpydja ?sanpydjano

Step 3. The algorithm then examines the "joints" in these trial-words to discover whether any need hyphenating, using Tables 2.2-3 to do so. Three joints do. ?sandja and ?sandjano require hyphen /y/ because of the unintelligible */n+dj/, and ?saadjano requires hyphen /r/ because, without it, /saa/ would fall off.

So hyphenated, the new set of trial words is:

?sanydja ?sanydjano
?saadja ?saardjano
?sanpydja ?sanpydjano

Step 3 reminds us that there are two kinds of intraverbal hyphens in Loglan as was mentioned in Chapter 2. One is hyphen /y/, which is used to buffer otherwise unpronounceable or unintelligible consonant sequences and to attach 4-letter affixes like sanp- to their neighbors; the other is hyphen /r/ (with its allomorph /n/) which is used to join two CVV-form affixes together in two-term complexes and to prevent initial CVV-form affixes from falling off three- or higher-term complexes.

Step 4. The algorithm then looks for and eliminates trial-words that break either of two rules: (a) No word is allowed to have adjacent identical vowels unless one of them is stressed. (b) The trial-word must not resolve into a CV-word followed by some unintended complex.

Part (a) of Step 4 eliminates ?saardjano but allows ?saadja because the second /a/ in the shorter word is stressed: /saADja/. Part (b) is called the "Tosmabru Rule" because the trial-word ?tosmabru has this property. The trial word ?tosmabru was once intended to be heard as tos+mabru; but instead it resolved as the phrase to sma+bru. None of our trial-words fails the Tosmabru Test.

Five trial-words survive Step 4:

?sanydja ?sanydjano ?saadja ?sanpydja ?sanpydjano

Step 5. Determine the best word among the survivors.

To do this, the algorithm scores them all. Unless the concept is one with a low expected frequency of use, in which case a longer word may be desirable, the shortest words score highest. So on the grounds of length alone the computer will pick saadja from this set. Among words of the same length, however, words with more vowels score higher than those with fewer vowels. This is because Institute policy at present is to keep the vowel-consonant ratio in complex words as high as possible. This not only makes them easier to pronounce--judged on a world-wide basis--but increasing vowel-ratio also reduces the amount of consonant-buffering required in the buffered dialects; and this will reduce the average length of buffered words.

Human word-makers short-circuit the complex-making algorithm at many points. In fact, the human worker who looks up, or already knows, the short affixes of sanpa and djano (as found in Appendix C) will recognize immediately, on seeing san saa and dja, that ?saadja is the best of the two words that can be made from this metaphor with short affixes. For one thing, ?saadja will be seen to be richer in vowels than ?sandja is even before the possible difficulties of the sequence ?/n+dj/ are noticed and investigated. Therefore these difficulties do not need to be investigated. The best word is clearly saadja and that's the end of it.

With a little practice such decisions can be made in a few seconds. All one needs is a list of primitives and their affixes (such as Appendix B or C) and access to a few short rules (such as those embodied in Tables 2.2-3).

Technical words may be as long as the maker feels is necessary to get da's point across. Let's look at the word for 'agronomy' again. Its deriving metaphor is rodja madzo sensi = 'grow-make-science', or the science of cultivation. The algorithm makes rojmaosensi [rohzh-mough-SEN-see] from this metaphor, the long affix, -sensi, for the 'science'-term having been dictated by the fact that sensi has no V-final short affix. In fact, its only short affix is ses-. Building on rojmaosensi, the algorithm will then make the word for 'agronomist' by exploiting the one short affix that sensi does have. Indeed, ses- is involved in the general word for 'scientist', which is sesmao. Thus, as we observed earlier, the scientist, in Loglan, is seen as a "science-maker". So replacing -sensi in rojmadsensi with -sesmao we get 'agronomist' = rojmaosesmao [rohzh-mough-SESS-mough], the "cultivation-science maker". The fact that -mao is used in two quite different senses in this word, one figurative, the other literal--or both figurative, but to different degrees--is part of its poetry...the sort of thing that a poet but not a logician might be expected to do.

Most Loglan complexes are made up, as rojmaosesmao and saadja are, of 3-letter segments. As you may already have discovered, the separateness of these triplets virtually leaps off the page. Like the codons of the genetic code, their intelligibility derives from their regularity. Some irregularities do exist, of course...those caused by hyphenation and the occasional long affix, such as the long final segment in rojmaosensi, for example. But, as we'll see in a moment, even irregular segment boundaries are easily spotted by the resolver. Moreover, once resolved, each segment may always be assigned to exactly one primitive predicate. As a result, each Loglan complex will always be uniquely decipherable. So when it is new to a learner, it will be heard as a string of elemental meanings, the whole string to be understood as a metaphor. What is remarkable is that the decipherment and subsequent understanding of such metaphors can be done so easily by persons who have never seen or heard the word before.

To guess the intended sense of a new metaphor from a string of elemental meanings is of course an inductive leap, full of insight and daring. It is one of the most amazing performances of the human mind. Yet that we should be able to make this leap with the near-infallibility with which we do make it is perhaps only the other half of the poetic gift, the gift by which we understand the poetry of others.14 In fact, in what will often be their instant understanding of the metaphors behind the new complex predicates they encounter, human users of Loglan will be enjoying one of the few advantages they will ever have over their computers. The latter will, of course, have to look each word up: old or new, simple, borrowed or complex. (Fortunately computers can perform these incessant dictionary lookups at great speed.) In contrast, human users of Loglan will seldom have need for dictionaries. Once they have mastered the basic kit of Loglan primitives and their affixes, human auditors and readers will seldom have to look up even new words. When they do, it will usually be to confirm and extend an insight they will already have gained from the metaphorical combination of easily deciphered parts.

In making this kind of dictionary-free learning possible, Loglan is by no means unique. Germans, too, seldom use dictionaries. It is said that by the age of eight, each German-speaking child has mastered the basic building blocks of the entire adult German vocabulary...with the result that, from that point on, and except for foreign loan-words, almost nothing ever has to be looked up. We have tried to engineer this same desirable property into Loglan.15

One apparent irregularity in the segmentation of Loglan complexes is that occasionally they are hyphenated. But this is only apparent. As the hyphens--whether /y/ or /r n/--always occur at the boundaries between segments, they may hardly be regarded as hindering segmentation. In fact, except for two irregular element words, ytrio and yterbio ('yttrium' [Y] and 'ytterbium' [Yb]), which are obliged to be spelled with the irregular phoneme /y/ because of the 'Y' in their international symbols, /y/ never occurs anywhere else in Loglan predicates except at segment-boundaries. So if an instance of /y/ is found in a predicate word, and the word is not ytrio or yterbio, that /y/ is marking some segment boundary; and the predicate itself, of course, can immediately be known to be complex. So the presence of hyphens helps rather than hinders both the segmentation process and the swift classification of words to which it leads.

The /r n/-hyphen is of course more difficult to see than the /y/ one. The reason is plain. The sound /y/ has few functions in non-names. Except for the two /y/-bearing element words, the phoneme /y/ occurs only in its letter-words and in predicates. In the latter it occurs only as a hyphen or, in the buffered dialects, as a consonant buffer. So the appearance of a /y/ in a V-final, CC-bearing word in a non-buffered dialect is an instant signal to both eye and ear that the /y/-bearing word is a complex predicate.

Not so for the /r n/-hyphen. Both /r/ and /n/ occur in many words that are not complex, and in places even in complexes which are not the joints between its segments. Still, the /r n/-hyphen--/r/ being its primary or preferred allomorph, /n/ being used only when the next phoneme is /r/--is easy enough to spot. It is used for just two, easily recognized purposes, both of them quite rare. One is to fasten two CVV-form segments together to make a single two-term word, as bao and mao are joined in baormao [BOUGH-rr-mough] to make 'box-maker'; and the other, even rarer use is to tack a CVV-form segment onto the front of a 3-syllable, or longer, complex. Suppose one wanted a single short word that meant 'market-science'. 'Market' is marte; and looking it up in Appendix C we would find that it has exactly one short affix, mae. Since mae is CVV in form, if it is to be used at the beginning of a 3-syllable or longer word--and all -sensi words have at least three syllables--it must be /r n/-hyphenated. The result is maersensi [mah-ehr-SEN-see]. Maersensi is a pretty word; but perhaps a little obscure...as, in fact, /r n/-hyphenated words tend to be. So probably a better choice would be to settle, in this case, for the long affix marty-, with its incorporated /y/-hyphen, and build martysensi [mahr-tuh-SEN-see]. This new option, though certainly less mellifluous (at least to this word-maker's ear) than maersensi, is transparently the word for "market-science"; and so it is arguably a better word for so technical a concept.16

Even /y/-type hyphens are fairly rare in Loglan complexes. They occur most frequently in words like mekykiu ('eye-doctor' or 'ophthalmologist') in which hyphenation is unavoidable. The deriving metaphor for this word is menki kicmu, and each of its two words has exactly one short affix that will serve in these positions: mek- and -kiu. (Kicmu has another short affix, kic-; but kic- may not be word-final.) *Mekkiu and *mekkicmu are illegitimate constructions, threatening to be reduced in speech to the phrases me kiu and me kicmu immediately. So the four legitimate options are ?mekykiu, ?menkykiu, ?mekykicmu and ?menkykicmu. We see that hyphenation is literally unavoidable. Unless the concept of an eye-doctor is deemed to be infrequently enough used to deserve a longer form--like 'opthalmologist'?--the algorithm will again pick the short word, mekykiu.

Long affixes are another source of harmless irregularity in the segmentation process. Five-letter affixes like -sensi--which are nothing more than the primitive itself treated as an affix--appear as final segments whenever (i) the final term in the metaphor has no V-final short affix, as is the case with sensi, or (ii) a long suffix is deemed to be deserved by a more formal word. Mresurva [mreh-SOOR-vah] and fumsurva [foom-SOOR-vah] for 'manservant'/'valet' and 'womanservant'/'maid', respectively, are instances of the first case. The long affix is required because surva has only one short affix, namely suv-, and it is not V-final. All science-words are also built on this pattern in that they all have -sensi as their final segment: fidsensi ('physics') from fizdi = 'physical', livsensi ('biology') from clivi = 'alive', tarsensi ('astronomy') from tarci = 'star', numsensi ('mathematics') from numcu = 'number', tetsensi ('meteorology') from tetri = 'weather', and so on.

Another kind of long affix is the one used in non-final positions, such as marty- in martysensi. Counting only their meaning-bearing portions, these affixes are really just four letters long; but they are extended to five by the necessity of being joined to the rest of the word with a hyphen. All these long non-final affixes are formed by replacing the final vowel of the chosen primitive with /y/, as marte ('market') is so-modified to produce marty-. As all Loglan primitives are open to this affix-making move, and as all pairs of them except those that happen to be members of the same language-nationality-culture triplets differ in more than their final vowels, such long affixes are always uniquely assignable to exactly one primitive or primitive-triplet. The vagueness of being derived from a triplet rather than a singlet, as is the case of the language-nationality-culture words, may seem a defect of the system. But when one considers that what is common to the meanings of the individual words in a triplet like spana/spani/spano is a very large and robust thing indeed--in this case, the very soul of "Spanishness"--it becomes apparent that it is precisely that large, robust, common thing--that very Spanishness--that is signified in a deriving metaphor by the long affix spany-.

Finally, there is the kind of complex predicate in which one or more of the terms in the deriving metaphor is not a primitive but a borrowing. Athomynukle [aht-hoh-muh-NOOK-leh] is such a complex; the deriving metaphor is athomi nukle ('atomic-nucleus') and the borrowed term, of course, is athomi. We will look at how such borrowings are made in the next section. At the moment all we wish to point out is that, by convention, whenever any term in the deriving metaphor of a complex predicate is a borrowing, then all the joints between the segments of that complex must be hyphenated. (The ambiguity that is cut off at the pass by this rule will be identified in a moment.) The hyphen that is always used to attach such borrowed segments to the rest of the word is /y/. Thus iglymao [EEG-luh-mough] means 'igloo-maker' and is derived from the metaphor iglu madzo, of which the first term is obviously a borrowing. To make a non-final affix from the loan-word, its final vowel (or vowel group, if it had one) has been replaced by /y/.

Take the complex predicate bakteryrodhopsini [bahk-tehr-ruh-rohd-hohp-SEE-nee]. We know it is a complex and a predicate by its internal /y/. The single hyphen breaks this long word into just two parts, baktery- and -rodhopsini. Each part is derived from a borrowing. Baktery- comes from bakteri for 'bacterium/-ia', and -rodhopsini for 'rhodopsin', being final, represents itself. So the deriving metaphor behind this word was evidently bakteri rodhopsini or 'bacterial rhodopsin'. Thus its proper translation (into English) is 'bacteriorhodopsin', a single scientific word which actually means 'bacterial rhodopsin'. That, at least, is the inside story of bakteryrodhopsini as known to its builder.

But what about the outside story? How can such a word be known by the reader or listener to be a 2-term complex made from just two borrowings? Why don't bak and ter also count as segments of this word, representing, as they normally do, bakto ('bucket') and te/teri/tera ('three'/'third'/'triad'), respectively? The answer is that because we know that one of the terms in the deriving metaphor was a borrowing, we may infer that both of them are. Which one do we know to be a borrowing, and how do we know it? We know that rodhopsini is a borrowing because it does not break into segments as a complex and therefore can be nothing else. We also know the rule just stated that if any term in a complex is a borrowing, then all of its joints must be hyphenated. Only one joint in this borrowing-containing complex is hyphenated. Therefore, if the word-maker was obeying this rule--and we must assume da was; that is what rules are for...to ensure the safety of just such assumptions--no other apparent joint, such as the one between bak and ter, is a real one. Therefore, bakter- is a single segment and must come from bakteri, and not from bakto tera, for example. If in fact the word-maker had wanted to make a word for 'bucket-triad-rhodopsin', then da would have hyphenated the bak+ter joint thusly: bakyteryrodhopsini. In so doing, da would be following the rule that requires us to hyphenate all joints of loan-bearing complexes.

It is of course quite possible to create text in which all these hyphenated words have actual hyphens instead of 'y's. In this textual style 'bakter-rodhopsini' would contrast even more visibly with the unintended 'bak-ter-rodhopsini'.

It is now time for a phonological observation. It is permissible to pause after any of the /y/s in one of these long loan-bearing complexes, and even to stress the /y/-preceding syllable, without spoiling its resolution as a single word. Thus the pronunciation [bahk-TEHR-ruh . roh-dohp-SEE-nee] of the word bakteryrodhopsini is just as acceptable as its somewhat swifter delivery as [bahk-tehr-ruh-roh-dohp-SEE-nee]. The production with the pause will sound superficially like a two-word phrase but will turn out not to be one. It is the anomalousness of the [uh . ] sequence that tells us that it isn't. The [uh] is a hyphen; and as a word may not end with a hyphen, the pause [ - ] that follows it must be inconsequential, that is, ignorable by the resolver. The resolver is, in a sense, advised by the appearance of each hyphen that it is still in the midst of a predicate word. Knowing this it can, in effect, start its resolution over again. So the resolution of this string as a single predicate word is not in the least troubled by the practice of pausing after /y/s.

Close relatives of scientific complexes made with borrowings are the complexes which have letter-words or number-words among their parts. Usually, these non-predicate elements in the metaphor appear in its early portions; they are in any case never final. Little words, too, are joined to the predicate stem with hyphens. Thus Xaiykre [KHIGH-uh-kreh] is 'X-ray', the Xai being the letter-word for upper-case Latin 'X', and the kre being from kreni = 'is a ray from source...' Xaiykre may be variously written, e.g., as 'X-kre', 'Xykre', 'Xai-kre' or, indeed, as 'Xaiykre'. But all such expressions are read aloud as [KHIGH-uh-kreh]. The sound [kh], the reader may recall, is the gutteral "k" or "rough breath" of Russian or Greek, and an irregular sound in Loglan. The contrasting word Kaiykre [KIGH-uh-kreh], by the way, would be 'K-ray'; so one must, in this case, be careful of one's pronunciation...cultivating, for the purpose of talking about X-rays, one's Russian [kh].

Incidentally, the above example reveals how another potential ambiguity is avoided. Kaiykre, meaning 'K-ray', is hyphenated with /y/...as it should be. Suppose we mistakenly used the /r/-hyphen, writing kairkre and saying [KIGH-rr-kreh]. Unintentionally we would have invoked an entirely different predicate, one that means 'property-of-a-ray' or perhaps 'having-a-ray'; for in this word kai is not the letter-word for upper-case 'K' but the short affix of katli ('has property/quality/feature..'). The difference in meaning between the two words Kaiykre and kairkre is entirely conveyed in speech by the two hyphens. The rule is that letter- and number-word segments are always hyphenated to a predicate stem with /y/; while normal CVV-form affixes, when initial segments of their complexes, are always hyphenated with /r n/. Remember baormao, the "box maker".

Let us close this section on building new complexes by observing that if a Loglan speaker finds a complex predicate in the Loglan dictionary whose deriving metaphor strikes him as less effective, less beautiful, or less apt than one that has occurred to da, let da construct a new one. By submitting da's alternative word to The Loglan Institute da will ensure that it will be considered for inclusion in the next edition of our dictionary. If the Loglan Council of Word-Makers does consider it worthy of inclusion, da's coinage will, for a time, live side by side with the old one...whence, in competing for the attention of other users of the language, only one, perhaps, will survive.17

6.5 Borrowing Predicates

Borrowings are not so much made as seen. Once one has mastered the borrowing art, new borrowings may be written into Loglan almost as fast as one can write them down. So the most troublesome question is not how to borrow a given concept but whether to borrow it.

The borrowing process itself is virtually algorithmic. This is especially true when the word in question has a widely-borrowed root that appears in slightly different forms in many different languages already, as is true of nearly all science words. So recasting such a word one more time, this time in a form acceptable to Loglan morphology, is usually a trivial matter that will add only seconds to the borrowing process. Thus, 'protein', 'hormone' and 'interferon' come into Loglan as proteini, hormoni and interferoni almost as fast as you can type them; and writing 'atom' as athomi--once the borrowing rules are known--requires only a few more centiseconds. So typically the most difficult part of adding a new borrowed word to Loglan is making the decision to borrow it in the first place.

The temptation to borrow a word usually arises because its concept is a necessary one for some literary, scholarly or scientific enterprise, and one suspects that it should not be made as a Loglan complex. The other possibility, namely making it as a composite primitive, is seldom even considered. By this time it is an extremely rare event that a new concept is best rendered as a composite primitive. But choosing between borrowing an existing international word for a useful concept and making it as a Loglan complex sometimes requires some thought, and sometimes even preparatory work. Institute policy, while increasingly clear in the case of science words, may not yet provide guidelines that cover the case you are considering. So the most workable policy for individual word-makers is a tentative but optimistic one. Study this chapter; borrow the words you think we need; then let your fellows on the Loglan Word-Makers Council review your borrowings. They will decide whether your new words should be permanent features of the language or not. By considering everybody's work from their community-wide perspective, the Council is likely to develop the same kind of "nearly algorithmic" borrowing policies in other areas as have already emerged for science.

Borrowing policy as it affects science words may be simply stated. If a word has already been borrowed with minor local adjustments in the majority of European languages--in a sense, Europeans were the founding members of the now intercontinental community of science and still deserve to be consulted--and its concept is an exclusively scientific one, then borrow it once more for Loglan. Borrowing it will be to follow a path well-trodden.

But how does one know that the scientific word one is looking at is one of those much-borrowed words that is already part of what the Merriam-Webster people call "ISV", the "International Scientific Vocabulary"? Without consulting a stack of foreign language dictionaries? The easiest answer is to shift the inquiry to another, more easily answered question: Are its roots Graeco-Latin? Did the maker of the prototype--the neologism that is the ultimate source of all this borrowing--go back to those classic languages of European antiquity to get the semantic elements with which to coin it? Well, if you know the most commonly used Greek and Latin roots that appear as "combining forms" (affixes) in these modern scientific coinages, you will probably recognize immediately that 'protein', 'carbohydrate', 'rhododendron', 'horizon' and 'oxygen', but not 'light-year', show indubitable signs of having been made of just such roots. 'Light-year', in contrast, though equally "scientific", has been made of good old English roots and is therefore almost certainly not ISV. (Neither 'light' nor 'year' is either Greek or Latin.) An even simpler test, and probably equally decisive, is to look up the word or words in question in a good-sized dictionary of some Romance language. Any Romance language will do...for example, French, Spanish or Italian. They all have almost identical borrowing habits as far as science is concerned. If a scientific word is substantially the same word--or at least similar enough to attest to having been borrowed from some common source--in both one Germanic language (English) and any one of these Romance tongues, then it is almost certainly ISV and should probably be borrowed.

Applying this test to English 'protein', 'carbohydrate', 'rhododendron', 'oxygen', 'horizon' and 'light-year' we would get 'proteina', 'idrato di carbonio', 'rododendro', 'ossigeno', 'orrizonte' and 'anno luce' if our test language were Italian; 'proteina', 'carbohidrato', 'rododendro', 'oxigeno', 'horizonte' and 'año de luz' if our test language were Spanish; and 'proteine', 'hydrate de carbone', 'rhododendron', 'oxygene', 'horizon' and to 'anneé-lumière' if it were French. Clearly all of these words but the 'light-year' set are ISV. The point is that consulting any of the three Romance languages would have been sufficient to find that out.

Indeed, going back to our earlier point about the origins of these particular words, simply knowing their etymology (i.e., the history of their derivation, as given in almost any large dictionary) would have been sufficient to find out that all but 'light-year' are ISV. For under etymological inspection, 'light-year', being composed of two Germanic words (the German for it is 'Lichtjahr') stands out like a sore thumb. Even with a good etymological dictionary, however, other cases will not be so clear. All things considered, the Romance language test is probably more reliable.

So let us return to that test and its implications for our examples. Since English departs from the Romance languages in using 'light-year' for that important astronomical measure (instead of some anglicized Latin compound, say, like *'annolumen'), we may conclude that neither the Germanic nor the Romance rendering of it should be borrowed for Loglan. Instead the Loglan term for 'light-year' should be made as a Loglan complex...as it easily can be. In fact, in making our complex predicate we might as well use the same metaphor as the one that both the Germanic and the Romance languages use: "light-year" or "year of light", it amounts to the same thing. Thus the Loglan word for this concept was built as litnirne [leet-NEER-neh]. It comes from litla nirne, which is of course nothing more than the literal translation of the English phrase 'light year' into Loglan. Note that we use the Germanic word-order, not the Romance one. Loglan, too, is an Adjective-Noun language as neither Latin nor any of its descendants is.

For a very different reason, the concept of "horizon" should probably also not be borrowed. While the word is apparently ISV in all the European languages, the concept itself is part of everyday human experience. Every pair of human eyes has seen horizons. Unlike protons, oxygen and carbohydrates, horizons are directly perceived by human sense organs and noted--and probably named--in all human cultures. Therefore the Loglan word for 'horizon' should be reduced by metaphor to the common ingredients of human experience, which is what we do when we make it as a Loglan complex predicate. Following this decision, 'horizon' was easily made as telbie [TEL-byeh] from the metaphor 'Earth-edge'. An horizon, I reasoned, marks the edge of the planet...as everyone who has sailed, or flown, or stood in high places, knows. The Loglan metaphor that conveys this high-flying image is terla bidje.

All the other words on our test list may be safely borrowed...and from either the Romance or the English version of the word; the borrowing procedure has been contrived in such a way that it won't matter greatly. I personally prefer using the Spanish version as the source word. I use the Spanish language in my "Romance language test". So while I'm about it, I keep the Spanish word I've just looked up to see if it is like the English, and use it as the source word for the borrowing I will make only if it is. For example, Spanish 'sicopatico' is more useful as a starting point for a Loglan borrowing than English 'psychopathic' is. Both are "local versions" of exactly the same international scientific word; but in the Spanish language most of the necessary rewriting has already been done. The Spanish dictionary one uses to perform this test must be a fairly large one, of course, or specialized in the scientific direction. For it must in any case contain a usably large proportion of the ISV.18

Once the decision to borrow a scientific word has been made, then transforming, let us say, the Spanish version of it into a predicate-form Loglan word is accomplished in four steps, two of them tests which most words pass immediately. In the rest of this section I shall describe the four steps in a summary fashion in order to give the reader an overview of the borrowing procedure as a whole. Then, in Sections 6.6-9, the full range of moves under each step will be discussed in greater detail.

Before leaving our example, let me report that the four words we have been talking about, all of which we found to be safely ISV, were borrowed as proteini, carbohidrati, rodhodendroni and okso. The latter could have been borrowed as oksigeni, but as it is a very commonly used element word, the shorter okso seemed justified.

Now let us walk through the steps of the borrowing process. Suppose we want to import the word 'insulin' into Loglan. The Spanish word is 'insulina'; so we are assured that it is ISV. Clearly insulin is not, like horizons and blue-birds, directly perceived in human experience. Hence it is borrowable.

These preliminary questions out of the way, the first step is to rewrite the source word in Loglan phonemes if it needs to be. 'Insulina' is already in Loglan phonemes. So no respelling is necessary. Our first trial-word is therefore ?insulina itself.

The second step is to equip it with an appropriate ending. As you may have noticed, /-i/ is the ending conventionally assigned to Loglan science words. So we replace the final /-a/ of ?insulina with /i/. Again it is no surprise that ?insulini sounds Italian. Most Loglan science words do.

The third step is to test our trial-word for breakup. ?insulini doesn't break up into a phrase the way ?atomi did (a to mi). The initial /i/ is stuck fast to /ns/ because /ns/ is an impermissible initial sequence. After that, nothing breaks.

The fourth step is to inspect our trial-word for segmentation problems. (a) Does ?insulini segment like a complex? Clearly it doesn't. (b) Does it commence with a consonant-pair? It doesn't; but if it did, we would have to ask, Does the sequel to its first consonant segment as a complex? ?insulini is not CC-initial, so the second question doesn't apply. So ?insulini passes all tests. Insulini may now be used in the language as a predicate meaning 'is insulin/a quantity of insulin from animal/source...'

Reviewing what we have done,

Step 1. We rewrite the source word in Loglan phonemes if necessary. (It wasn't necessary.)
Step 2. We supply it with an appropriate ending. (As insulin is a scientific concept, we replace the single-V ending of the Spanish source word with /i/.)
Step 3. If it breaks, we glue it. (It didn't break. We didn't have to glue it.)
Step 4. If it, or the sequel of an initial consonant, segments like a complex, we spoil the segmentation pattern. (We didn't have to.)

'Insulin' is a fairly typical word. About a tenth of the science words we've borrowed have been like it in being consonant-final and requiring no gluing: 'interferon', 'protein', 'proton', 'interleukin', etc., are all of this pattern. They've all gone into Loglan with (a) little or no respelling, and (b) the addition of final /i/ to the English form of the word. This usually amounts to the replacement with /i/ of the single final vowel of the Spanish source word. The results are words like insulini, interferoni, proteini, protoni and interleukini that sound like Italian plurals and yet are distinctly loglandical. Because they neither break as phrases nor segment as complexes, they have the properties of a simple (i.e., non-complex) Loglan predicate quite naturally. Such words can be borrowed almost as fast as they can be written. The source of the borrowing is always plain.

Now let's look at several additional difficult-to-borrow words. This time we'll consider a word that does require some respelling: 'cercopithecine'. It's based on a Linnaean genus word, so we'll probably fail to find it even in a large Spanish-English dictionary; but looking it up in an unabridged English one we find that it is derived from 'Cercopithecus', the name of a genus of long-tailed African monkeys that includes the guenons. Our convention is to use a Linnaean word as the source of a borrowing whenever one is available. The reason we do this is that endings like '-cine' tend to be language-specific; and so it would be a mistake to imitate any one of them in Loglan. In contrast, the Linnaean terminology is universal. So as we want our word to be as international in flavor as possible, we use 'Cercopithecus' as our source for the Loglan translation of the English word 'cercopithecine'.

In Step 1 we ask, Does 'Cercopithecus' require respelling? The answer is yes. Two of the 'c's in this word precede "strong vowels" (/a o u/) and so are turned into /k/s. Thus ?cerkopithekus. Note that the first 'c', which precedes a "weak vowel" (/e i y/), is unchanged. This follows a general custom in the Romance languages...and many others. However, instead of pronouncing this unaltered 'c' as /s/, as it is the Romance custom to do, we will of course continue to pronounce it as Loglan /c/, i.e., as [sh]. What about the 'th'? Our rule here is to rewrite this digraph as /t/ whenever it does not precede a stressed, i.e. penultimate, vowel. When it does precede such a vowel--as it does, for example in ?ethili--we keep the /h/ and pronounce it: for example, [et-HEE-lee]. The 'e' in '-thecus' is also going to be stressed. So we keep this /h/ and pronounce it: [shehr-koh-peet-HEH-koos]. Thus our trial-word is still ?cerkopithekus.

Step 2 asks us to fit a new ending if our trial-word requires one. ?cerkopithekus does. It is not only a science word but its source is Linnaean. So special rules apply. One of those rules requires that we replace the /us/ on words derived from Linnaean sources with /ui/; we'll see why this special treatment is required in Section 6.6. So ?cerkopithekui (pronounced [shehr-koh-peet-HEK-wee]) is the new shape of the trial-word as it emerges from Step 2.

Step 3 asks us to test our trial-word for breakup. Does it break up as a Loglan phrase? No; again the initial /ce-/ is prevented from being heard as a CV-word by the fact that the /rk/ that follows it in ?cerkopithekui is impermissible at the head of a word. So we pass on.

Step 4 asks us to look for segmentation problems. Can ?cerkopithekui be heard as a Loglan complex? We can start to segment it as cer+kop+..., but then the sequence +ithekui comes along and we cannot continue. If a word cannot be completely segmented, it is not a complex. Finally, ?cerkopithekui is not CC-initial so the second segmentation test does not apply. Apparently there are no segmentation problems; so there is nothing to be done in Step 4. Cerkopithekui is evidently the Loglan word for 'is a cercopithecine, a member of the genus Cercopithecus'.

Let's now borrow a word that illustrates why Spanish makes a better source language than English does. Let's borrow the medical term 'psychopathic'. The words 'psychopath' and 'psychopathy' will of course come along with it. The corresponding Spanish cluster is 'sicopatico/-ta/-tia'...attesting to the ISV-ness of the psychopathy concept. In Step 1 we rewrite the two 'c's as /k/s; for both precede strong vowels. In Step 2 we replace the ending /-iko/ with /i/ getting ?sikopati. (It wouldn't have mattered which member of the source cluster we had used, because /-a/ and /-ia/ are also replaced with /i/.) In Step 3 we find that ?sikopati breaks up as si ko pa ti; so we glue it together by inserting /h/ after the consonant (or consonant group) that follows the first word-break, in this case, the one between si and ko. This gives us ?sikhopati. In Step 4 we discover that there are no segmentation problems. So sikhopati is evidently the Loglan predicate for 'is psychopathic/a psychopath, someone suffering from psychopathia'. Psychopathia itself, of course, will be designated by lopu sikhopati, the mass of psychopathic properties, or psychopathies. The mass of psychopathic states can then be designated by lopo sikhopati.

Let's borrow one more medical word and then go on to consider the four borrowing steps one at a time. Let's take another illness word, this time 'tubercular' in the sense of 'suffering from tuberculosis'. In Loglan, we usually use the disease noun, the '-osis' or '-pathy' word, as the source of these medical borrowings. In Step 1 we rewrite 'c' as /k/ because /u/ is a strong vowel. In Step 2 we drop the 's' from '-osis' getting /osi/. In Step 3 ?tuberkulosi breaks up into the phrase Tu berkulosi ('You are "bercular", whatever that means'); so we have to glue it. We insert /h/ after the consonant that follows the word-break; and this gives ?tubherkulosi. In Step 4 we notice that the sequence /-kulosi/ cannot end a Loglan complex unless it is part of a borrowed segment. There are no hyphens; so this settles the matter without further testing. The Loglan word is tubherkulosi and it means 'suffers from tuberculosis/is tubercular'. Again, the disease itself will be designated by lopu tubherkulosi, and the mass of all disease states, by lopo tubherkulosi. (If you want to develop some skill in pronouncing these long words, practice saying [loh-poh-toob-hehr-koo-LOH-see] a few times. You might alternate it with the distinctly easier production [loh-poh-seek-hoh-PAH-tee]. Soon you will be talking like a loglandian physician.)

Borrowing will never be completely algorithmic, of course. Every now and then a first-cut borrowing will imitate a complex, and when it does, a repair appropriate to the local circumstances of the problem will have to be devised. Section 6.9 describes the several strategies that have been devised to deal with this non-algorithmic side of the borrowing problem.

In the above examples we have concentrated on problems. So let me conclude this section by showing the reader how swiftly the borrowing procedure works with non-problematic source words. Here is a group of Italian musical words, also widely borrowed internationally and so borrowable in Loglan as well. The procedure works like an algorithm here; it brings all six words through Step 4 without appealing to human judgement:

ItalianStep 1Step 2Step 3Step 4
viola - - violha -
violino - violina violhina -
violone - violona violhona -
violoncello violoncelo violoncela violhoncela -
tromba - - - -
trombone - trombona - -

This is a typical set of borrowed words. As they are Italian, we see that almost no rewriting is necessary in Step 1. In fact, only the double 'll' of 'violoncello' gets rewritten as 'l'. In Step 2, however, four of the six trial-words exchange their Italian '-o' and '-e' endings for the Loglan music-word ending '-a'. (The other conventional word-endings are given in Section 6.7.) In Step 3, another four require /h/-insertion to prevent break-up (into the phrases vio la, vio li na, vio lo na, and vio loncela). In Step 4, none of the six words prove to have segmentation problems. Finally, we may note that honcela [hohn-SHEH-lah] will make a nice abbreviation of Loglan's new violhoncela, the also overlong name for the mellow instrument whose name we abbreviate as 'cello' in English.

In the next four sections we consider the four steps of the borrowing procedure in more detail.

6.6 Rules for Rewriting Source Words

Once a word is chosen to be the source of a borrowing, one should rewrite whatever characters in it match the left halves of the following rules. Rewriting rules will not be very different for the various types of borrowings--for music as opposed to food words, for example--but certain problems arise in borrowing science words that require special rules. Table 6.1 includes the solutions to all known instances of such problems for all categories of borrowings. The trial-words given as examples in the table are the products only of rewriting; no other borrowing steps have yet been performed on them.

Table 6.1 Rules for Rewriting Source Words
aa aa => a. 'aardvark' => ?ardvark.
ae ai-or-ae- => e. 'Aegyptopithecus' => ?egiptopithekus.
c c before C/a/o/u => k; c followed by e/i/y, or matched in the symbol of an element word, unchanged. 'canid' => ?kanid; but 'violoncello' => ?violoncelo and 'californium' = ?californium.
cc cc followed by e/i/y => kc; otherwise cc => k. 'succinta' => ?sukcinta; but 'Echinococcus' => ?ekinokokus.
CC Any double instance of a C not c is replaced by a single instance of that C. 'Bettongia' => ?betongia.
ch ch => k. 'Escherichia' => ?eskerikia.
ee ee => i. 'sakeen' => ?sakin.
eigh eigh => ei. 'leightonii' => ?leitoni.
ew ew => u. 'Andrewsarchus' => ?andrusarkus.
h h in VhC => VCh to make it pronounceable. 'ahli' => ?alhi.
ie -ie => i; non-final ie unchanged. 'calorie' => ?calori.
igh igh => ai. 'lightfooti' => ?laitfuti.
ii -ii => i; non-final ii unchanged. 'livingstonii' => ?livinstoni.
lh lh- => elh; non-initial lhC => lC; non-initial lhV unchanged. 'lhoesti' => ?elhoesti.
ng ngC => nC; ngV unchanged. 'livingstonii' => ?livinstoni, but 'Bettongia' => ?betongia. If ngC => nC is later found to cause segmenting, then repair with nC => ngiC. E.g., ?livinstoni breaks as 'li vinstoni' and /h/-insertion then gives ?livhinstoni which segments as liv+hin+stoni. This triggers the rule nC => ngiC which inserts /gi/ into the trial word, giving ?livhingistoni, which no longer segments.
oo oo => u; except in 'zoo-' and other disyllables in which oo remains unchanged. 'lightfooti' => ?laitfuti, but 'Zoothera' => ?zoothera.
ou ou => u when within a natural form: 'youngi' => ?iungi; when split between two forms, unchanged: 'thiourea' = 'thio-' + '-urea' => ?tiourea.
ow ow => ao. 'owstoni' => ?aostoni.
ph ph => f; ph- => eph in very short words; non-initial ph unchanged when it will glue an otherwise breaking word. 'phenyl' => ?fenil; but 'phyla' => ?ephila; and 'ophiocomina' => ?ophiokomina.
pt pt- => ept; non-initial pt remains unchanged. 'Pterocera => ?epterocera.
pn pn- => n; non-initial pn remains unchanged. 'pneumonia' => ?neumonia.
q q => k. 'antiquitas' => ?antikuitas.
qu in Sp. source words quV => kV; but in a Linnaean, quV => kuV. 'braquiopodo' => ?brakiopodo; but 'Madoqua' => ?madokua.
rh rh- => r; rhC => rC; non-initial rhV unchanged. 'rhodopsin' => ?rodopsin.
sc sc => sk. 'sclerosis' => ?sklerosis.
th when not followed by a stressed V, th => t; when followed by V', h is retained. Also, if the th-containing word is related to one in which h precedes V', keep the h. Thus 'ethylene' ?ethileni [et-hee-LEH-nee] is related to 'ethyl' ?ethili [et-HEE-lee] and so keeps its h.
y y => i. 'Amphyosemian' => ?amfiosemian.
yn yn => un whenever the n is the last C in the word. 'butenyne' => ?butenune; but 'dynamo' => ?dinamo.
x x- => z unless matched in the symbol of an element word, when unchanged; when non-initial, => ks. 'xenyl' => ?zenil; but 'Atilax' => ?atilaks; 'xenon' unchanged.
w w => u unless matched in the symbol of an element word, e.g., ?lawrencium.

Note that 'j' is not rewritten; it is retained and pronounced as Loglan [zh].

Perhaps the most awkward rewriting problems are presented by the Linnaean vocabulary of biology. This is true principally because species and genus names often incorporate the name of their discoverer. This brings into science a riotous variety of names and spelling styles: 'Escher' in 'Escherichia', 'Livingston' in 'livingstonii', 'Lightfoot' in 'lightfooti', and so on. Many rewriting rules are necessary just to make such words pronounceable in Loglan phonemes. In general our goal has been to develop words from these Linnaean sources that will be regularly pronounced in Loglan much as Spanish or Italian scholars would pronounce the originals.

6.7 Rules for Adjusting the Endings of Loan Words

By convention, certain characteristic vowels are used for ending loan words of given types when they are six letters long or longer. Shorter words, like iglu, may keep their natural endings. In fact, the endings of 4-letter loans and single-source primitives, like simba, should follow the natural sources as closely as possible without regard to these conventions except that all element words, short or long, should end in /o/.

These are the five characteristic vowel endings:

-a for musical forms and instruments.
-e for local foods, dishes, plants, and animals.
-i for scientific, technological and medical words.
-o and -io for element words.
-u for all the rest, e.g., tools, clothing, dwellings.

Most of the rules for adjusting the ending of a borrowing are obvious. For example, if after being rewritten, a trial loan word already ends in its characteristic vowel, e.g., ?livinstoni from the Linnaean 'livingstonii' and ?tromba from Italian 'tromba' ('trumpet'), then we of course leave the ending unchanged. If it ends in a single V which is not characteristic of its type, then we change it to the proper one. E.g., the science word ?amfidela becomes ?amfideli, and the musical word ?trombone becomes ?trombona. There is one exception to these obvious rules: the Spanish endings '-ico/-ica', which will have been rewritten as /-iko -ika/ during Step 1, are entirely replaced by whatever vowel is appropriate. Thus, as we have already seen, the medical term ?sikopatiko is reduced to ?sikopati.

Genuine problems arise only with trial words which end in vowel-groups or consonants. Then the following rules apply. Remember that adjusting the end of a trial-word takes place after it has been rewritten, if necessary, in Loglan phonemes:

1. All word-final vowel-groups are reduced to the characteristic vowel except the endings /-ae -ea -oa -ua/ on Linnaean-derived science words. In the case of /-ae/, the whole ending is replaced by /ei/ if the source is Linnaean. In the case of the /-Va/ endings, the vowel /i/ is added. Thus the Linnaean-derived trial-words ?kimera (from 'chimaera') ?sturio and ?artemia are changed or reduced to ?kimeri ?sturi and ?artemi, respectively, but the rewritten Linnaeans ?kimerae (from 'chimaerae') ?cinerea and ?madokua are changed or expanded to ?kimerei ?cinereai and ?madokuai. This is to preserve distinctions which are finer-grained among Linnaeans than among other categories of possible source words.

2. The characteristic vowel is added to all consonant-final trial words except those which end with /-(i)es -is -(i)us -(i)um -ik/. In these exceptional cases, the word-maker should replace the entire ending with the characteristic vowel unless it is an

/-(i)um/ ending on a Linnaean, in which case add /i/ to it, or an

/-(i)us/ ending on a Linnaean, in which case replace it with /ui/. Thus the trial scientific words ?andrias, ?acipenser and ?dalton issuing from Step 1 become ?andriasi, ?acipenseri and ?daltoni, while ?intestinalis, ?humerus and ?organic become ?intestinali, ?humeri, and ?organi after end-adjustment. But the Linnaean 'Bittium' keeps its ending by becoming ?bitiumi, and the /-us/ ending of the Linnaean 'Gymnorhinus' is changed to /ui/ in order to preserve its distinction from 'Gymnorhina'. Thus ?gimnorhinui and ?gimnorhini are the ending-adjusted words, and the gender difference is preserved.

Thus as far as possible the special requirements of each body of words to be borrowed--for example, the occasional importance of a gender distinction among the biological Linnaeans--are taken into account.

There is more about the importation of Linnaean words in Section 6.14.

6.8 "Gluing" Trial Words

That a trial-word should break into a phrase of two or more shorter words--as ?atomi breaks into a to mi--is perhaps the most common problem that the Loglan word-borrower will encounter. The frequency of the problem is understandable. Science words, especially, are mainly derived from Greek or Latin roots, and in these languages, simple consonant-vowel alternation is a very common word pattern. This is a common pattern in Loglan, too, of course. But in Loglan consonant-vowel alternation means that one is listening to a string of compound or simple little words. So if we were to borrow words like 'leukemia', 'modularis' and 'molecule' intact, they too would be heard, after end-adjustment, as strings of Loglan little words: leu ke mi, mo du la ri, mo le ku li, and so on.

Even when there is a consonant-pair in a Greek or Latin source word, it often comes too late to hold the whole Loglan trial-word together. Thus after end-adjustment 'rhododendron' and 'kilocycle' will be heard as ro do dendroni and ki lo cikli. Besides, not all consonant-pairs save trial-words from breakage. In ?isopropili', for example, the /pr/ pair is itself a permissible initial. So despite its having a consonant-pair, the trial-word breaks up into the quite reasonable phrase I so propili = 'And six propyls'. Similarly, 'retrovirus/-ri', after end-adjustment, comes out ?retroviri; and this also breaks up, in this case as the phrase re troviri. Obviously, such fragile words may not be permitted in Loglan, where speech and writing are isomorphic. After all, we speak a language in which one can infallibly set down in writing what one hears...provided the word-borrowers do their job right.

Despite the frequency of the breakage problem, however, there is always a simple algorithmic solution to it once identified. There are two principal gluing moves, each called for by its own particular set of circumstances. Inserting "gluing /h/" after the first word-break is, as we have seen, by far the most common of these repair moves. It fixes all the breakages except the last one, re troviri, that were listed in the last paragraph. In all other cases the phoneme /h/ is simply inserted after the first consonant or consonant-group that comes after the first word-break: after the /k/ in leu ke mi, the /d/ in mo du la ri, and so on. This gives us leukhemi [leigh-ook- HEH-mee] for 'leukemia', modhulari [mohd-hoo-LAH-ree] for the Linnaean species name 'modularis', molhekuli [mohl-heh-KOO-lee] for 'molecule', and rodhodendroni [rohd-hoh-den-DROH-nee], kilhocikli [keel-hoh-SHEEK-lee], and ishopropili [ees-hoh-proh-PEE-lee] for 'rhododendron', 'kilocycle' and 'isopropyl'. Once /h/s have been installed, these are all perfectly good words which have no segmentation problems. (Kilhocikli will, however, have to compete with the complex kilcikli ('thousand-cycles') for our vote as the best word for this concept; and it will probably lose.) It is not until we get to the breakup of ?retroviri as re troviri that the second, and far less common gluing move, is called into play.

We will call the second repair move "continuant doubling". It is applied only when one of the four continuants /r l m n/ is the last consonant in a group of two or more consonants that immediately follows the first word-break in a pseudo-phrase. This is the case in the phrase re troviri, for example. The cure is to double the continuant /r/. This produces the new trial word ?retrroviri [reh-trr-oh-VEE-ree], which does not break up.

But how does continuant-doubling arise? In particular, how is it related to /h/-insertion? Well, if we followed the more common practice in the above case and tried to insert an /h/ after the consonant-group in re troviri-- not noticing, perhaps, that the last element in the group was the continuant /r/--we would get ?retrhoviri as our new trial-word. Certainly this looks more difficult to pronounce; and if we do try to pronounce it in a Loglan way, we are very likely to produce a vocalic rendering of the continuant anyway: [reh-trr-hoh-VEE-ree]. Indeed, this is probably the only way one could pronounce the medial consonant sequence /trh/ in Loglan, that is, by vocalizing the /r/ in this inter-consonantal context. So, since vocalicization is an all but inevitable consequence of /h/-insertion in such contexts, and the vocalicized consonant is in many ways simpler to both produce and hear than the inserted /h/, we decided to simplify matters and not insert the /h/ at all in these cases, but just double the continuant. Thus retrroviri is now the recommended respelling of 'retrovirus/-ri' as imported into Loglan. Note that its pronunciation is surprisingly easy: [reh-trr-oh-VEE-ree]. Indeed, for most people, this will be an easier word to pronounce than words that have been fixed by normal /h/-insertion. 'Retrroviri' is also visually simpler than 'retrhoviri', as well as closer to their common source in 'retroviri'. So except for the necessity for introducing an ad hoc resolution rule that will keep such words from being heard as phrases, continuant doubling is a simplification of the /h/-insertion move in a context in which the latter encounters phonological difficulties.

The required ad hoc rule is, of course, that doubled continuants be confined to loan words and to the borrowed parts of complexes, and that even in these contexts they not be allowed in word-initial syllables. It is this rule, of course, that makes the sequence /trr/ an impermissible one initially, and so saves /retrroVIri/ from being heard as the two-word phrase re *trroviri.

Both /h/-insertion and continuant-doubling require a certain care in pronunciation. All such repaired words are meant to contrast successfully with the phrases into which they were not allowed to break: thus leukhemi [leigh-ook-HEH-mee] should contrast intelligibly with leu ke mi [leigh-oo-kEH-mee], and retrroviri [reh-trr-oh-VEE-ree] must successfully contrast with re troviri [reh-troh-VEE-ree], which is the phrase it was not allowed to be. Notice that the second contrast is far easier to produce and hear than the first one. The vocalicized continuant is in fact quite audible and gets an extra syllable of its own. Indeed, even syllable count is different in the word compared to the phrase (5 syllables vs. 4). In producing leukhemi intelligibly, however, one must deliberately separate the [k] from the vowel [eh] with a strongly aspirated [h]. A little practice listening to one's /h/s makes one reasonably good at this. However, you needn't be concerned about perfecting this skill. It is mainly your computer that will care if you fail to "emphasize your /h/s" in a way that can always be heard. Your human interlocutors will probably understand you even if you mumble them.

It is true that after all the breakups have been repaired--and about half your scientific borrowings will require gluing repairs--a Loglan text that is largely made up of such borrowings--a scientific article, say--will at first fairly bristle with its 'h's. But experience has shown that the 'h's soon drop into the visual background in such documents. They become virtually invisible, in fact. What the English-trained eye comes to see in athomi, molhekuli, ishopropili, rodhodendroni and leukhemi are strings of letters in which the 'h's and 'i's have been blanked out: 'at-om-', 'mol-ekul-', 'is-opropil-', 'rod-odendron-' and 'leuk-em-'. A possible cause of this phenomenon is that the 'h's bear almost no semantic burden. They seldom occur in such positions in the original Greek and Latin constructions. 'Methane', 'atherosclerosis' and 'ethyl' and kin are the chief counter-examples I've run into. Besides, in such words the "real" 'h's have a way of coming back to life: methani, atherosklerosi, ethili, ethileni, and so on. As for the inserted 'h's, all they have to say to the reader is 'This is probably a Loglan borrowing' The often-added final /i/s bear a similarly slender metalinguistic message: 'This is probably a Loglan scientific borrowing.'

So in the end what the reading eye does is look past the gluing 'h's and the characteristic vowels to the phonemes that actually identify each borrowing...in a sense the load-bearing ones. Among these the reader will almost always find the main ingredients of an often-borrowed international word laid out in seemly order. Beyond atlatlu is the Nahuatl word 'atlatl'; beyond ionhi is the Greek word 'ion'.

6.9 Curing Segmentation Problems

Loglanists, we have seen, expect to be able to spot and decipher new complexes as they read and listen. So a loan-word that ends up looking and sounding like a complex is no good to us. It throws the attentive reader/listener off the track. And loglanists are likely to be fairly attentive readers and listeners. In fact, we are likely to be constantly on the lookout for segmentation patterns like mek+y+kiu, roj+mao+ses+mao, num+sensi and marty+sensi in what we see and hear. These are the hallmarks of the Loglan complex predicate. Complexes are far more important in our language than borrowings are. So any borrowing that can be seen or heard as a complex probably will be.

In a world of well-constructed borrowings, attention to segmentation patterns will be amply repaid in Loglan. For one thing, new complexes are likely to be much easier to understand than novel scientific words are. The reason is that all regular complexes, i.e., those made of non-borrowed parts, are combinations of about 600 plainly derived affixes as well as the 860 Loglan primitives themselves. In Loglan, these basic semantic ingredients will be routinely learned by everyone. Once they are learned, any new complex--no matter how strange or long--can be easily deciphered. So deciphering a strange new complex is likely to be a much easier task for the ordinary loglanist than fathoming an unfamiliar scientific borrowing...like albocinerei, for example.

The word I've chosen to make this point is obviously a Loglan borrowing; its form is not primitive and it doesn't segment. But what it borrows is a natural complex, originally made from several parts. Its source was 'albocinereous', an English ISV word that was once derived from Latin 'albus' = 'white', 'cinis' = 'ashes' and '-eus' = 'composed of'. So 'albocinereous' is a technical word meaning 'composed of white and grey material'. It is used mainly by anatomists.

To know this one has to know a lot about how Greek and Latin roots have been used in making scientific words. At least one must know how to use the etymological information given about them in large dictionaries. All scientists must learn at least a portion of this immense body of Western scholarship; but usually they master only enough of it to understand the derivations of the words used in their own and related disciplines. It will probably not be different for the Loglan-reading scientist. So da will probably be unwilling to give up the clarity of the words da can easily decipher--the regular Loglan complexes--just to make the borrowing of the natural complexes a little more automatic.

For it is the automaticity of the borrowing process--and very little else--it turns out, that is at stake here. That is what we give up when we require that every word-borrower check da's products for segmentation, and then correct them before da publishes them. For, unlike breakage, there is no automatic way of curing a segmentation problem once one has been found. Suppose we have made ?palheozoi from the English source word 'Paleozoic' (Spanish 'Paleozoico' gives the same result). If we fail to notice that our trial word could be segmented as pal+heo+zoi, then at least some of our unfortunate readers would be sent on a wild goose chase when they encounter our new word. The most attentive of them would attempt to decipher pal+heo+zoi [pahl-heigh-OH-zoy] as a three-term complex. If they do, they will soon be frustrated. To be sure, pal- has an assigned meaning as an affix of spali = 'side'; but both -heo and -zoi are still unassigned. That doesn't mean that they couldn't have meanings, however, or that the day will not come when they do. Were such a day to come, borrowings like ?palheozoi, if we permitted them, would be un-untiable knots in the fabric of the language. They would be words that could only be legitimately interpreted as complexes but which context demanded be seen as loans.

So we avoid such words with their mixed messages. We carefully examine each loan-word we make for the two ways in which it might imitate a complex: (1) all by itself, as ?palheozoi imitates pal+heo+ zoi; and (2) as part of a CV-word initiated phrase, as ?spektri in the phrase to spektri imitates tos+pek+tri. For this reason words like ?spektri are also disallowed.19

The tests themselves are easy to conduct. Test 1 is simply to attempt to resolve one's trial-word as a complex. If one succeeds, as one does in the case of ?palheozoi, the word must be repaired; its segmentation pattern must be spoiled. Test 2 applies only to CC-initial words. Of these one asks, Will the sequel of the initial consonant segment as a complex? If it does--if ?spektri segments as s+pek+tri--it is a Spektri-type word; and it, too, has a segmentation pattern that must be spoiled. The tests require no human judgement. So performing them could well be part of a borrowing algorithm. It is the repair of a segmentation problem once identified that cannot yet be done algorithmically.

As mentioned earlier, the direct imitation of a complex, as ?palheozoi imitates pal+heo+zoi, produces by far the most common type of segmentation problem: it affects about 5% of the scientific borrowings. But the second, s+pek+tri-type problem, while much rarer--occurring as it does in less than 1% of the scientific borrowings--may be repaired by exactly the same set of maneuvers. So from the standpoint of the word-maker, we may consider both sets of problems to be one.

The task in either case is to spoil the segmentation pattern of the pseudo-complex. Once spoiled in any particular, the pattern will disappear. Thus there is no such thing as a word which is "half complex and half borrowed" in Loglan. If any part of a word fails to segment as a complex, and it is not a primitive, then the whole word is a borrowing. So to spoil a segmentation pattern at any point in it is to solve the segmentation problem.

Here are some of the most commonly used "pattern-spoiling" maneuvers:

1. Restore an old VV, VC or VCV ending, preferably one that was replaced in Step 1 or 2, and add the characteristic vowel if necessary.

The failed word ?palheozoi affords a good example of the usefulness of this kind of repair. In Step 2, the /-iko/ of ?paleozoiko was replaced by /i/; we now restore it and change the /o/ to the characteristic vowel /i/. This produces ?palheozoiki [pahl-heigh-oh-ZOY-kee], and, miraculously, the segmentation pattern disappears. Segmenting from left to right, we get as far as pal+heo+... before the "indigestible" residue +zoiki appears. No sequence of the shape -CVVCV can be the final segment of a regular Loglan complex because no string of that shape can be decomposed into regular Loglan affixes. Therefore any word that ends with such a segment must be either wholly or partially a borrowing.

So palheozoiki is now a satisfactory Loglan form of the international word that is in English 'paleozoic' and in Spanish 'paleozoico'; and while, to the knowing scientist, all three of these words are natural complexes, composed as they all are of variations on the two Greek roots 'paleo-' and 'zoi-' (meaning 'old' and 'life' respectively), there is now no temptation to interpret it as a Loglan complex. Its parts are in this case Greek, not Loglan.

There is a simple feature of a regular Loglan complex which may be used to detect the goodness of a Step 4 trial borrowing at a glance. This is that the last six letters of a regular complex must either (a) be -CVVCVV in form or (b) contain at least one CC. Several final patterns occur very commonly among borrowings that show neither of these characteristics. Such words may be known immediately to be borrowings...or to contain them. This is true no matter what their front ends look like...provided, of course, that all breakup problems have been solved. For example, if a predicate shows the final pattern -VCVCV, then it can only be a borrowing. This is also true of other final patterns: -CVVCV (as we have just seen) and -CVVVV and -VCVCVV. Predicates with such final sequences cannot be regular. Therefore if a predicate does end in any of these ways, it is either a borrowing or a complex in which at least its final segment has been borrowed.

Returning to our main task, which is the repair of segmentation problems, here is another pattern-spoiling move:

2. Insert a vowel between the members of any consonant-pair other than the first one.

This assumes that there are at least two CC-pairs, as there are, for example, in both ?spektri, which fails because of s+pek+tri, and ?galhaksi, which segments as gal+haksi, the latter being the trial-word which emerged from the gluing of ?galaksi (ga laksi). Usually the inserted vowel is /i/, and if there are more than two CC-pairs available, usually the earlier the pair that is chosen as the site for this surgery, the better the result. But in the case of ?s+pek+tri, choosing the third C-pair, namely /tr/ as the site for vowel-insertion gives the best result; and the best vowel, this time, seems to be /e/. Thus, spekteri is the word we chose, but spektiri would have spoiled segmentation just as well. Applying the same move to ?galhaksi produced galhakisi; but this is a resolvable word we didn't accept. Instead, we looked to other spoiling moves to produce possibly better results for this important notion.

Another trial word that segments is ?plasmi = p+lasmi (from 'plasma'). This time there is only one possible site for vowel-insertion, and that is /sm/. This time /i/ does seem to be the best vowel even though /a/ was tempting. The word we finally settled on was plasimi. Notice that 'plasmid' follows the same route: ?plasmidi segments as pla+smidi and repairing it by vowel-insertion gives plasimidi. So the two borrowings plasimi and plasimidi exhibit the same degree of affinity as their sources do.

Here's a final spoiling move:

3. Eliminate one of the two consonants in any consonant-pair other than the first one.

For example, gal+haksi, which is what happens to ?galhaksi, which is in turn the glued form of ?galaksi from 'galaxie', imitates a complex. Spoiling Move 1, which restores the original ending and then extends it, produces the resolvable word galhaksiei [gahl-hahk-SEE-ay]. Move 2, which inserts a vowel in the second CC, produces the equally resolvable galhakisi. Both moves cure the problem, but neither result seems as attractive (to this word-maker, at least) as one of the results of spoiling it by Move 3. Move 3 can give either galhaki or galhasi. Something about the simplicity of galhasi [gahl-HAH-see], for the notion of an "island universe", appeals to me.

Notice that while Move 3 works here, it would have given unattractive repairs of both of the problem-words we chose to cure with Move 2. That is, it gives speki, speti or speri for 'spectra', and all these are, in my opinion, less suggestive of the source than spekteri is; and it gives plasa and plama for 'plasma', and these are less successful words than plasimi is...or at least it seems so to these ears. But the reader has observed that the words that emerge from segmentation repair are nearly always less handsome than other borrowings. Fortunately, only about 5% percent of borrowed science words require such repairs; see Section 6.11 for the proportions in other categories.

There are other ways to repair segmentation problems; but these three are the main ones. It is obvious that human esthetic choices are involved at nearly every point in the repair process, and would be no matter how we chose to do it. It is thus the segmentation problem--the occasional word that looks like a complex and requires spoiling--that makes the borrowing procedure non-algorithmic.

6.10 Some Non-Scientific Borrowings

While the largest number of borrowed words in current Loglan are science words, local food, plant and tool words are also eminently borrowable. Here is a list of food and drink words, for example, for which the characteristic ending is /-e/. We start with cheeses:

gorgonzole glaosetere gaodhe
rokfore dansiko blanu gruiere
limburgere tcedare munsitere
brihe kambere provolone
montereidjeke stiltone edhame

Dansiko is from the dansika/i/o-triplet for Danish, Danes and Danish culture, which includes the art by which they make their cheese. Note that, for the most part, we follow sounds rather than spellings in making these borrowings. One talks and listens to talk about food; much less frequently does one write about it. So the principal habitat of the culinary vocabulary is speech.

We continue with some fruit and vegetable words:

aspharage abrrikote gumbo ('okra')
articoke fighe kolhe ('cabbage')
tcokolate melhone brokhole
mustharde selhere kolhyflora ('cauliflower')
banhane mango pampelmuse ('grapefruit')
kranbere zukhine persa ('pear')

Kolhyflora is of course a complex made from kolhe flora ('cabbage-flower'), a practically universal metaphor for this vegetable.

We finish with some words for wines and other potables:

bordo vermute rainvaine
bojhole uiski sakhi
mozhele konhiake absinte
porto madhiere campeine
herhese (cerhi) marsale muskatele
cabli sotherne klarete

One observes that words originating in the Romance languages--especially from Spanish and Italian--suffer less distortion in being imported into Loglan than words from other languages. Compare Italian-derived gorgonzole with German munsitere. This is, of course, a natural consequence of the deep phonological affinity between Loglan and other instances of what might be called "five vowel" languages.

6.11 The Borrowing Pathways and Their Products

Before we leave the topic of borrowing it will repay us to look at the outcome of all these arrangements. What borrowing pathways are most common? What do the words produced on these pathways look like?

We have a sample of about 1,000 scientific borrowings to look at. In the period 1985-88 The Institute added about 1,600 scientific words to the lexicon during its studies of how best to translate scientific prose into Loglan; and about 1,000 of these were added as borrowings. The rest were single-source primitives, complexes, and scientific names. The added words covered a wide variety of scientific topics, ranging from archeology to biochemistry and cosmology.

The largest group of borrowed words that were outcomes of a single borrowing pathway--neglecting rewriting differences--were words like hormoni. These are words that are made by replacing a single final vowel with /i/. They neither broke nor segmented; so they needed no repairs. Let us call them Group 1. They amounted to about 15% of the 1,000 borrowings. Three examples are kartilagi, intestini and gimnorhini (for 'cartilage', 'intestine' and 'Gimnorhina').

The next most productive pathway was single final-V replacement combined with /h/-insertion. No words with segmentation problems are included in this group. Call these Group 2. These were a little fewer than 15% of the sample. Three examples are anhilini, kimheri and cithosini (for 'aniline', 'chimaera' and 'cytosine').

The next biggest group were words that came from naturally C-final sources--C-final in English, anyway--but broke, and so had to be fitted with a final /i/ as well as glued with /h/. Call these Group 3. They were a little more than 10% of the sample. Three examples are hemhoglobini, neurhoni and achipenseri (for 'hemoglobin', 'neuron' and 'Acipenser').

The fourth group came from naturally C-final sources that didn't break but had to be fitted with final /i/. These required no gluing and were a little less than 10% of the sample. Next to Group 8, which required no alteration beyond rewriting, these are perhaps the most recognizable of the Loglan science borrowings. Three examples are antigeni, kanceri and etheri (for 'antigen', 'cancer' and 'ether').

Then we come to six smaller groups, each with about 5% of the sample. In order of size, they are Group 5, which were made by replacing a VC-ending, usually /-is/, with /i/ and gluing with /h/. Examples are anhalgesi, kathalisi and filhariasi (for 'analgesic', 'catalysis' and 'filariasis').

Next were those with VC-endings that required replacement with /i/ but no gluing. Examples are brakeocefali, sklerosi and karpoidi (for 'bracheocephalic', 'sclerosis' and 'Carpoides').

Then there were the Linnaeans that didn't break but were /-us/-final. These, recall, had to be made /ui/-final so that they would contrast with the occasional feminine form of the same words. Examples are amfibolurui, granulatui and gimnorhinui (for 'Amphibolurus', 'granulatus' and 'Gymnorhinus'; compare the latter with 'Gimnorhina' => gimnorhini in Group 1.)

Then there are the naturally Loglan-form words, perhaps as rewritten. Examples are arteri, singami and prosimi (for 'artery', 'syngamy' and 'prosimii'). These were also about 5% of the sample.

Another "five-percent group" were those with natural /i/-endings--again, perhaps as rewritten--but requiring gluing with /h/. Examples are ephoksi, ephitaksi and helhioterapi (for 'epoxy', 'epitaxy' and 'heliotherapy').

The last of the six "five-percent groups" were the Linnaeans with /us/- endings that were required to be replaced by /ui/, but which also needed gluing with /h/. Examples are pomharinui, orhiolui and methopidui (for 'pomarinus', 'oriolus' and 'Metopidius').

With the ten groups described so far, we have accounted for about 80% of the borrowed science words. Four more groups had about 5% each; but these were not products of single production pathways as the first ten groups were. For example, in the order of size, Group 11 was composed of all those words that had had segmentation problems, no matter what their other characteristics. About half of these were glued with /h/ and the rest required no gluing. Many different solutions to their segmentation problems were represented. Here are three examples: bronkikui, bronkiai and krisitali (for 'bronchus', 'bronkia' and 'crystal').

Fewer than 5% of the borrowings required doubled continuants as their solution to the breakup problem. Examples are neutrroni, membrrani and apllasi (for 'neutron', 'membrane' and 'aplasia').

A still smaller group had VV-endings which were replaced by /i/. Again about half were /h/-glued and half unglued: malhari, anhemi and albuminuri (for 'malaria', 'anemia' and 'albuminuria').

The last group were characterized by VV- and VC-endings that were augmented by /i/, and these, too, were about half /h/-glued and half unglued. Nearly all of this final group were Linnaeans: antropoideai, aurheai and cinhereai (for 'Anthropoidea', 'aurea' and 'cinerea').

To see some of these words in action, you may wish to examine Appendix G, where the first few paragraphs of a number of scientific articles are translated into Loglan.

6.12 Making Single-Source Primitives

Adding a local (parka), scientific (carbo), or other international primitive (futbo) to the language is probably the least demanding of the several word-making arts. There are only two primitive forms, the fumna- and the mrenu-one; and only a few of the words one wishes to borrow will fit neatly into either of them. But those that do usually do so quite easily, either exactly (simba), with some padding (genhe), or by abbreviation when some shortening is justified by use-frequency (telfo). Some very short words (e.g., 'beer') require several bits of padding to fill out the primitive form (birju). But unlike other borrowings, single-source primitives do not require elaborate tests.

Here are some primitive-form words that were derived from a single source either because they are words for once-local things that have since spread, like parkas, or because they are words like 'passport' that have long been part of the international scene. All of these words could have been made as longer borrowings; but either their form or their use-frequency seemed to justify the primitive form:

sinma sofha tulpi telfo
telvi kabre birju resra
konsu paspo kapta banko

Note that characteristic final vowels play little or no role in either making or recognizing these primitive-form words.

In making single-source primitives, the word-maker's task is to preserve as much that is phonemically prominent in the natural word in similarly prominent positions in the new one. Thus 'beer' could have been equally padded out as *jbiru, but this would have put the sequence -bir- in the middle of the word where it is surely less recognizable than in birju. The sound /j/, as we shall see presently, comes from Chinese.

Sometimes /h/ makes the best "filler consonant" (as in genhe and sofha) when one is required. Since /h/ is nothing more than an unvoiced version of the following vowel, it is naturally a sort of "minimum consonant". Also, although /h/ is widely distributed among the source languages, it is a fairly infrequent sound in all of them. Therefore it makes an excellent filler sound, ideal for use here as well as to mend broken borrowings. Knowing that word-makers are observing this convention helps one decipher sofha, brihe, sakhi and fighe as the words for sofas, brie, saki and figs, respectively.

Occasionally one is helped in making these primitive-form words by examining what happened to the international word when it arrived as a loan-word in other languages. Thus, when the Japanese borrowed 'beer' it became 'biiru', while the Chinese version of this foreign word--one of the few non-scientific imitative borrowings in Chinese, by the way--became 'bijiu'; so from these two Oriental renderings of the original Germanic word we may draw the final vowel /u/. We can also use /j/ as the filler consonant, taking it from Chinese. The result is birju. So in this case birju makes a more recognizable word than the more conventional birhu to the billions who know this word as 'beer', 'bir', 'Bier', 'biere', 'bir(r)a', 'birah', 'bere', 'biiru' or 'bijiu'.

Sometimes a truly universal but vowel-rich word like 'radio' cannot be built into Loglan by conventional methods in any way that seems adequately to preserve its flavor. ?Radio itself breaks of course into the phrase ra dio. Inserting /h/ to cure the breakage produces rad+hio, which segments like a complex. Dropping the /i/ gives the workable primitive-form ?radho...but it just doesn't seem to make it. To bring this one back into the realm of the recognizable, I remembered a substitution for the [y] allophone of /i/ that I had observed in northern Spain. In the province of Asturias one often hears the sound [zh] being substituted for [y] in such words as 'Yo' ('I'), as in [zhoh-VOY] for 'Yo voy' ('I go'). I wondered whether the same substitution might make ?radjo more recognizable as a variant of 'radio' to an international bag of loglanists. After living with it for some months--a necessary test of any morphological innovation--I came to believe it would. And so I rendered the European word 'radio' as Loglan radjo.

6.13 Making Names

Adding new predicates to a language is an occasional, even rather solemn affair in which the word-maker must persuade daself--and others--that da's proposed addition is a good one. But adding names to a language is a continuous and possibly even a frivolous activity. No dictionary, for example, even tries to keep track of all the names the speakers of its language coin and use.20 So it will be in Loglan. And just because name-making is an easy and casual activity, we must now treat the name-making game more fully. Everyone who speaks Loglan will have many occasions to build names.

As we learned in Section 2.11, the morphology of Loglan names is exceedingly simple. All that is required of a Loglan name is that it be any sequence of two or more Loglan phonemes of which the last is a consonant. We also learned that if a natural name does not end in a consonant, the consonant /s/ is conventionally supplied. Thus 'Italia' becomes Italias; 'Roma' becomes Romas; 'Mary' (/MEri/), Meris; and so on. Moreover, since we follow the stress of the natural name in pronouncing the Loglan one, if that natural stress is not penultimate, we mark the vowel that is stressed with an apostophe ('). Thus 'Paris' is Pari's [pah-REES] and 'Washington' is Ua'cintyn [WAH-sheen-tuhn]. Similarly, if the syllabification of a natural name does not follow the standard Loglan pattern, close-commas may be used to mark the special syllable breaks required: in Lo,is [LOH-ees], for example, but not in Djois [joyss]. Thus virtually any natural name whose sounds can be at least approximated in Loglan phonemes may easily be rendered as a Loglan name.

One is free to use the three irregular phonemes /x q w/ ([kh th ue]) in importing natural names, or not to use them as one chooses. Thus, 'Bach' may be written Bax or Bak depending on whether it is the goodness of the imitation that one wishes to maximize, or the number of people on the planet who will be able to pronounce what one has written. Some word-makers pursue one objective; some, another. Similarly, Russian 'Kruschev' may be either Xructcyf or Kructcyf, and Theodore the Englishman may write and speak his Loglan name as either Qi'ador or Tiador again depending on whether it is verisimilitude he is after or easy reproduction. It is often possible, of course, to get good approximations without recourse to the irregulars. (The only natural name I have yet encountered which cannot be well-approximated in Loglan sounds--even including the irregulars--is the Malaysian single-phoneme name which in English transcription is spelled 'Ng'.21

Before we leave the matter of natural names, there is one set of them that one would think would be proscribed in Loglan but are not. These are the names--very common in France--that commence with the sequence /la-/, a pair of sounds that happen to invoke our name-operator exactly. One would think that importing the name of Laplace, the 19th Century French astronomer and mathematician, exactly as it is pronounced, namely as Lapla's, would be to invite confusion with some Dutchman called 'Plass'. Would not the call Lapla's be indistinguishable in speech from the designation la Plas? The productions seem identical: [lah-PLAHSS] and [lah-PLAHSS]. Indeed the two utterances would be identical if persons with /la/-initial names were ever to be called by their names alone. But they're not. Loglanists, being alert to such difficulties, insist that all such names be preceded by either Hoi or la whenever they are used: Hoi when used as vocatives, la when used as designations. Thus Hoi Lapla's and la Lapla's contrast unmistakably with Hoi Plas and la Plas, while Plass has the extra privilege of being callable to dinner by his unadorned name Plas. But LaPlace can't be. If we ever call such a person, it will be by shouting [HOY.lah-PLAHSS]. For his name alone is indistinguishable from a potential designation of someone else.22

Let us now review the several ways in which new name-words may be coined in Loglan. In Section 2.11 we observed that the CVC-form was preferred for internally-derived Loglan names, and Sol ('Sun'), Ter ('Earth') and Lun ('Moon') were given as examples. Many other name-words can and have been made in this form, or in elaborations of it. For example, the standard way to make a numerical name is simply to add /n/ to the numerical quantifier. Thus, the number One is Nen, Two is Ton, Three is Ten, Thirteen is Neten, and so on. And these numbers may also be used for the names of the local hours: 'nine (o'clock)', 'ten (o'clock)' and 'eleven (o'clock)' being most economically referred to as Ven, Nenin and Nenen.

To provide names for local days of the week--as are provided in many languages, English included (thus 'He came on Tuesday' seems to be the preferred form in English over all the more exact alternatives: 'He came on tuesdays/a tuesday/the fourth tuesday, etc')--one may easily follow the precedent of these compact numerical expressions and generate the Loglan series Neden = 'Monday' (from ne + den from denli = 'day'), Toden = 'Tuesday', Teden = 'Wednesday', Foden = 'Thursday', and so on. To use them for every purpose for which day-names are used in natural language would probably not be good Loglan, however. In a language in which one can easily build predicates for these same classes of temporal objects, why not do so? Thus nerdei might be derived from neri denli = 'first-day' and mean 'is a monday of week/month...'--note my use of lower-case initial letters in both languages--and the series then continued as tordei, terdei, fordei, ferdei, sordei and serdei; and this is lower- case sunday this time.23 These predicate words are, in any case, the words that one would require, in a logical language, to translate such logically more fastidious remarks as 'He came on a tuesday' = Da pa kamla na ne tordei. Similarly, 'He came on tuesdays' finds satisfyingly precise expression as Da pa kamla na lo tordei.

Once such predicates are in hand, another kind of Loglan name-usage can be developed by pursuing the following route:

(1)Da pa kamla na ne tordei He came on a tuesday.

(2)Da pa kamla na lo tordei He came on tuesdays.

(3)Da pa kamla na la Tordei He came on Tuesday.

In the last step we are using the predicate as a name. We first encountered this usage with such relational words as matma and farfu, as in La Farfu pa fadkaa na lepo natli = 'Father arrived last night'. And this example reminds us of the informal alternatives to these rather formal predicate names, such as La Far in La Far, pa kamla na lepo natli = 'Dad arrived last night' Far, of course, is an actual consonant-final Loglan name-word, not just a predicate word being used as a name. Could it be that Toden will come to bear the same relationship to Tordei as Far bears to Farfu? That one will be a familiar version of the other? A nickname? If this development is foreseeable, we may then add the following step to our series:

(4)Da pa kamla na la Toden He came on Toozie.

And what is 'Toozie' but the just-invented English nickname for 'Tuesday' which we apparently now require to translate Loglan Toden.

Of course the tuesdays the loglanists would bother to name in this familiar way are just those special, local tuesdays which were singled out by them as deserving a local name, just as dads are singled out for nicknaming in such local groups as our own particular families. It's hard to think of a non-fanciful use for a nickname sense of 'Tuesday', I admit. But that's the use for which Loglan Toden is apparently headed. For most other uses, the predicate tordei will usually be a better choice.

Assuming that loglanists will have some use for temporal name-words--and surely they will, if only to translate natural language texts with fidelity--the names of the months are built on a similar pattern to that of day-names, this time using -men from mensa ='month'. Thus, Nemen = 'January'; Tomen = 'February'; Temen = 'March'; and so on up to Netomen for 'December'. Again, the corresponding predicates could be built as a series of complexes, this time using -mea from mensa. Thus nermea ('is the january of year..'), tormea, termea, up to nertormea for lower-case 'december' this time. It will be interesting to observe whether or how readily loglanists will take to using predicate words (common nouns) for objects which in their native tongues are uniformly (but logically, incorrectly) treated as deserving local names (proper nouns).

As mentioned earlier, any predicate may not only be used as a name, it may also yield a name-word. Thus, just as Far may be drawn from farfu to mean informal 'Dad', so Fum may be used as the informal call 'Woman!' In contrast, Hoi Fumna would be the more formal address 'O Woman!'. The latter form might, indeed, approximate the respectfulness of English 'Lady' or 'Madame'. Similarly, mrenu yields the familiar call Mren = 'Man!', while Hoi Mrenu is the formal 'O Man!' which might even be used to translate 'Sir'. Adding -n to a little word also produces a name and hence a potential call. Thus Tun is 'You!', and Min is the proper familiar vocative for people who talk to themselves.

By an odd turn in the analysis this takes us to a point from which we can speculate on how the mechanism of insult might run in Loglan. To call a man a fool or a liar, for example, is an insultingly familiar act in most cultures, one tolerated only between the closest relatives or the fastest friends. Outside these privileged circles, the use of such words is a provocation...indeed, among simple peoples, it is often a clear invitation to combat. To say such dangerous things in Loglan is, therefore, to say Bun} not Hoi Bunbo ('O Fool'), Faltak}, not Hoi Faltaa ('O Liar'), Tsitok} not Hoi Tsitoa ('O Thief'). On the other hand, we now notice that these other, more formal forms are still available, and that they will have a very clear if different sense in Loglan, a sense which may (miraculously) allow them to be used. Thus there may be circumstances in which to say Hoi Tsitoa in this formal way may be to remove the familiarity component from the insult, and thus to withdraw its sting. Is it possible that loglanists will be able to engage in straight talk with one another without offense, using these delightfully formal, and so not very insulting, verbal forms? Cannot one imagine a loglandian play, for example, in which the characters courteously addressed one another as Hoi Faltaa and Hoi Tsitoa? How dramatically this might contrast with Faltak} and Tsitok} when at last the chips went down.

6.14 Importing the Linnaean Names of Biology

A special category of imported names in any language whose speakers practice science is the Linnaean nomenclature of biology. 'Homo sapiens' and 'Escherichia coli' are examples of our respectful treatment of these words in English. By international convention, no writer may alter the original spelling of these biological names no matter what language da is writing in. Even scientists writing in languages that do not use the Latin alphabet--Chinese, Japanese, Hindi and Russian, among our source languages--are obliged to reproduce such expressions as 'Australopithecus afarensis' in their scientific texts exactly as they are spelled in Latin letters. Scholars who write in Loglan will certainly feel obliged to follow this same convention. But in that case we must have a way of indicating to our readers that our usual phonemic spelling rules are being temporarily set aside whenever they and we encounter a Linnaean name. We introduce the special Linnaean name operator lao [lough] to do this work. Apart from its role as a designator, the meaning of lao is that the text we produce within its scope is not to be interpreted phonemically.

Lao is an obvious kin of la, the standard name operator. Unlike la, however, lao is always separated from the name that follows it by a pause. Like the pause between names, this pause is occult; that is, it is not indicated by a comma in text. Thus,

(1)Lao Escherichia coli, intestini bakteri Escherichia coli is an intestinal bacterium.

Like all serial names, each of the Linnaean names in a string of names is followed by a pause. But only the final pause appears as a comma in text.

Another problem that inevitably arises with Linnaean names is that, without a special signal, the listener can't know beforehand how many separate words each lao-construction will contain. Sometimes there is only one word, as in lao Mammalia or lao Primates. Such solitary Linnaean words are usually class, family, or order names as these two are. But most often there are two elements in the Linnaean name, usually a genus-species pair as in lao Homo erectus or lao Pan troglodytes. Sometimes three Linnaean words are strung together, as in lao Homo sapiens neanderthalensis, which is the current scientific name for Neanderthal Man. The third name-word is usually the name of the subspecies or variety if these have been named.

There is no uncertainty about the scope of lao in text, of course. But in speech, while one can hear the lao, and know that at least the first part of the ensuing string will be Linnaean, one cannot know its scope beforehand. Most Linnaean names end with vowels, and so may imitate Loglan words that are not names. True; Linnaean names are separated by pauses. But how many pauses should one listen for before deciding that the Linnaean name is over?

To answer this question, a special signal for the listener is required, one that will determine the scope of the spoken lao-operator absolutely. We use what is normally our intraverbal hyphen /y/ to link strings of two or more Linnaean names together into what are always grammatically simple structures. Thus all sutori (second and subsequent) terms in Linnaean strings are preceded by the sound but not the sight of /y/. If the term to be attached to the string is consonant-initial, the obligatory pause precedes that spoken [uh]: [lough . ess-kehr-REEK-yah . uh-KOH-lee]. If it is vowel-initial, the pause follows the spoken hyphen: [lough . HOH-moh-uh . ehr-EK-toos]. The Linnaean sequence ends, notice, only when a pause arrives that is not accompanied by an [uh]. This stratagem links each spoken Linnaean string into a unified name, and the scope of each lao is that name.

Of course the Linnaean-linking hyphens are not shown in text...neither as letters nor as hyphens. To reveal them in writing would be to break the international convention concerning the invariant appearance of the Linnaean name. So the hyphens we embed in our Linnaeans exist only as sounds for us. They are the little grace notes we play on otherwise unruly tunes to conserve the audiovisual isomorphism of our language.

One other feature of the textual appearance of Linnaeans should be mentioned. It is the international convention to italicize, underline or put in boldface all the Linnaean terms in any scholarly text. Any writer is of course free to choose among the enhancements that are available to da. In this book, we underline Linnaeans in English and in the freestanding Loglan specimens, and we boldface them in running text. Unfortunately, this does not distinguish them from ordinary Loglan words in running text, for these are already bolded.

So much for the appearance of the Linnaeans. But how to pronounce them when they do appear? How to speak them, in fact, when we wish to talk science with other loglanists?

Each natural language community in which biology is practiced has its own standard way of pronouncing, or mispronouncing, the Linnaean vocabulary. When devising our own Loglan way of speaking Linnaeans, we will be well-advised to follow the customs of scholars who speak a phonologically kindred language, say Spanish or Italian. Happily enough, we already have a full set of guides for doing precisely that. This is the set of rewriting rules given in Section 6.6. We need only pronounce our Linnaeans exactly as we would do if we had rewritten them for borrowing. In other words, all we have to do to pronounce any Linnaean word "correctly"--that is, by our own loglandical standards--is to mentally perform Step 1 of the borrowing procedure on it, and then "read" that word aloud as we have mentally rewritten it. We don't put a special ending on it; we don't look for breakup or segmentation problems. We are not borrowing it. We are just rewriting it in our minds as if we were preparing it for borrowing; and we then speak it aloud as we have mentally rewritten it. We do this in order to standardize our pronunciation of the Linnaeans: to make us intelligible to one another.

This is easier to do than it sounds. Thus, coli comes out [KOH-lee] and not [SHOH-lee] or [KOH-ligh]; and Escherichia comes out [ess-kehr-REEK-yah] and not [esh-rr-EEK-ee-ah] as American and German scientists typically pronounce it.

If we all follow this loglanizing convention, we will have a clear, easily understood, Latinate way of pronouncing our Linnaeans. It will be a way that will not only be consistent with the rest of Loglan phonology, but also very similar to the ways Spanish and Italian biologists pronounce their Linnaeans.

Try reading the following expressions aloud as if you had first rewritten the Linnaean words for borrowing. Hyphenate the serial terms together orally even though the text shows no signs of hyphenation:

lao Escherichia coli lough . ess-kehr-REEK-yah . uh-KOH-lee
lao Mammalia lough . mah-MAL-yah
lao Primates lough . pree-MAH-tess
lao Homo sapiens lough . HOH-moh . uh-SAHP-yens
lao Homo erectus lough . HOH-moh-uh . ehr-REK-toos
lao Pan troglodytes lough . PAN . uh-trohg-loh-DEE-tess
lao Homo sapiens ne anderthalensis lough . HOH-moh . uh-SAHP-yens . uh-neigh-ahn-dehr-tahl-LEN-seess
lao Cercopithekus lhoesti lough . shehr-koh-peet-HEH-koos-uh . ell-hoh-ESS-tee
lao Australopithecus afarensis lough . ah-oos-trah-loh-peet-HEH-koos-uh . ah-fahr-REN-seess

There are some words here that are difficult to pronounce in any language. What Loglan pronunciation does is draw out such words in a Romance way, giving each syllable its due: [ah-oos-trah-loh-peet-HEH-koos]. Such a practice makes these long words even longer; but there is a very real sense in which it makes them easier as well. In English we adopt the opposite tack: we rush through such long, difficult-appearing words as 'Australopithecus' at breakneck speed, grunting the sounds out as we go, almost as if we didn't know how to pronounce them properly: [awss-truh-luh-PIHTH-uh-kuhss]. The trouble is that so much information is lost by such a pronunciation that unless one has heard it before, it is almost impossible to produce it from the spelling...and of course quite impossible to spell it from the pronunciation! The Loglan Spanish-type pronunciation will lead to far more creditable performances on both counts.

Except for our distinctively loglandic pronunciation of 'c' before weak vowels as [sh], rather than [s], our fastidiousness about pronouncing /h/s before stressed vowels, and our quite extraordinary use of linking [uh]s between pairs of Linnaean words, this is approximately the way a Spanish or Italian biologist would pronounce these scientific names.

Before we leave the subject of Linnaean names, let us remind ourselves that Linnaeans may also be borrowed as predicates as well. Thus, Loglan mamla ('is a mammal') comes ultimately from the Linnaean name 'Mammalia' just as primati ('is a primate') comes from 'Primates'. The alternative to using Ta mamla to say 'That's a mammal' would be to "predify" the Linnaean name with me and say Ta melao Mammalia = 'That's a member of (the Class) Mammalia.'. One can speak in this stiff Linnaean way in Loglan just as one can in English. But it's a bit more precise than one usually needs to be. Ta mamla is ordinarily quite sufficient.

However, if we do wish to say precisely that a given zoo specimen, say, is a "common chimpanzee", we might wish to eschew cimpa and cimpanizi altogether, which both mean 'chimpanzee' in the widest sense, and either use the exact Linnaean phrase,

(2)Da melao Pan troglodytes It's a (member of the species) Pan troglodytes.

which employs the me-operator to convert the Linnaean name into a predicate, or make two loan-words out of it. Thus, we could borrow 'Pan' as the predicate panhi and 'Troglodytes' as trogloditi, using the borrowing methods described earlier in this chapter; and then either use these predicates in the reverse order (since Loglan, unlike Latin, is an Adjective-Noun language).

(3)Da trogloditi panhi It's a "trogloditic pan" (i.e., a member of species troglodytes of genus Pan).

0Or, in deference to the original Linnaean, we might use the Loglan go- operator to put the matter in its original Latin word-order:

(4)Da panhi go trogloditi It's a "pan of trogloditic type" (i.e., a member of genus Pan and species troglodytes).

Note that panhi and trogloditi are now genuine Loglan predicates. They do not have to be predified and they are both written and pronounced in Loglan phonemes; so they require no special handling. So these predicate derivatives of the Linnaeans have certain stylistic advantages for the loglanist over the original Linnaean names.

In doing science we will often want to borrow predicate words which are the equivalents of '-cine' and '-oid' words in scientific English. We can always do this by borrowing Loglan predicates from Linnaean genus names, whether we have a precedent in the natural languages or not. For example,

(5)Da austrralopitekui It's an australopithecine.
(6)Da panhi It's a "panoid" (i.e., a chimpanzee of either the dwarf or common species).

Note again that these expressions are both easier to pronounce and grammatically more elegant than the Linnean predications which they replace.

I will not hazard the prediction that the entire Linnaean vocabulary is about to be absorbed by borrowing into Loglan. There are, after all, over a million of these words, many of them the names of rare, precious, and now vanishing things. Some of them are known only to a handful of specialists. The main thing is that we are ready to borrow them. We are ready to welcome them in any numbers in which such well-formed and easily handled predicates may yet prove useful to the international community of scholars.

6.15 Little Words and Structural Change

Throughout this chapter we have taken for granted that new content words could be added to Loglan without changing the language in any essential way. This is true; and it is true because, apart from their major partition into names and predicates, content words in Loglan do not differ grammatically in any other way. Thus, in any position in which one name appears, any other name may also appear...perhaps not sensibly but always grammatically. Similarly, and far more startlingly, any predicate may be replaced by any other predicate, and the replacement operation will leave the basic structure of the sentence quite unchanged. As a consequence, adding new names and predicates to Loglan cannot change the grammar of the language. For the differences between new content words and old ones can have nothing to do with grammar.

What this means is that predicates and names constitute in Loglan just two huge, rather amorphous "parts of speech." For their members are grammatically interchangeable. But though they comprise the bulk of the vocabulary, these are only two of about fifty parts of speech, or word-classes, in Loglan grammar.24 Most of the other forty-odd word-classes typically have only a small number of members, and about half of the word-classes, which includes all the punctuators, have only one. But the main point is that approximately 48 of the 50 word-classes of Loglan are occupied by little words and their compounds. We have called these grammatically finely divided words the "structure words" of the language. We must now consider what it means to change a structure word, or to add to the existing set of them.

Let us review briefly what we mean by "structure word." In the three preceding chapters of this book we have been exclusively concerned with questions of grammar. What this has meant is that we have considered in turn how each distinct group of little words could be used to determine the over-all structure of Loglan utterances. Thus, whether an utterance was a question, an answer, a claim about individuals, a claim about masses, a connection between claims, an imperative, and so on, depended entirely on the pattern and kind of little words that appeared in it. Consider the following utterance:

(1)Le mrenu pa ditca The man was a teacher.

In both languages we know what is going on grammatically because of the pattern of little words. We can reveal that pattern most clearly by withdrawing the content words altogether:

(2)Le.....pa..... The.....was a.....

Let us call such content-free expressions "grammatical frames". Now the point is that thousands of content words in both languages can be freely used to fill the blanks in such grammatical frames without in any way changing the structure of their remarks. But what is that structure? In Loglan it is the information carried by the little words in frame (2) that tells us that a certain individual or set of individuals, whom the speaker means to designate by mentioning one of da's properties, was in the past characterized by a certain property or quality, or by a relationship with other individuals or sets which is here expressed incompletely. As a whole, therefore, the frame alone expresses a kind of claim; moreover, the kind of claim it makes is a predicating claim, not an identifying one. So much can be told without any content whatsoever, that is, without looking at any names or predicates.

Now it is true that words like le in Loglan and 'the' in English can be replaced by certain other little words without significantly changing the structure of the remark. For example, the grammatical frame below:

(3)Levi.....na..... This.....is now a.....

does not differ significantly from that in (2). In fact, for most purposes, the grammar of these two frames may be regarded as identical. For in Loglan, at least, the two words le and levi, as well as the pair of tense operators na and pa, may replace each other wherever they occur without ever turning a grammatical sentence into an ungrammatical one, or vice versa. Thus, le and levi, like any two predicates, are also grammatically interchangeable. But the important thing is that the group of interchangeable words to which le and levi belong, unlike the group of predicates, is a very small one.

Now what happens if you change, withdraw, or add members to, a small class of structure words? Some idea of the effect can be obtained by considering the following playful changes in English. Suppose you try replacing the word 'the' by the new word 'foo' in a few sentences. You may find it nearly impossible even to begin. But now try replacing the word 'book' by the similarly nonsensical word 'thoo' in a few remarks. This is not only possible, but it can even be done with a certain heedless glee.25 The point is that structure words are too intimately associated with our fundamental linguistic habits for us to suffer their changes gladly. Content words, however, can be added to, redefined, even wholly replaced, without causing much pain to the human mind. A student in a first-year course in almost any university subject spends half da's time learning new content words; and some of them will be perfectly ordinary old words which da must now learn to understand in a new way. But it is apparently quite easy to do this. Similarly, any apprentice to a new craft must learn dozens of new content words a month; and again some of the words will have old meanings de must now reject. But no craft and no scholarly discipline can dispense with words like 'the' or 'and' or 'but' and very few dare to change or redefine them. None but the most impudent of sciences, mathematics and logic, dare to change them. Logic abandons the structure words of ordinary language almost completely...and therefore, perhaps, provokes the greatest linguistic bewilderment in its students.

If we look at the grammatical types of the words that occur as slang, we see the same fundamental difference. There are plenty of slang nouns, verbs, adjectives and adverbs in English; but there are very few slang articles or prepositions and no slang connectives that I know of. The reason is clear. Structure words change very slowly. They may be added to from time to time; but even this is historically resisted in most languages. For example, the forms 'ain't' and 'this-here' have been around for a long time and are still not quite accepted as "good English." In contrast, note how short a time it has taken the word 'astronaut' to install itself in the very lap of the language.

Now what this means for Loglan is that the arrangement of structure words with which its life commences will (i) almost undoubtedly have to be changed or added to, but (ii) those changes, at least, will be mightily resisted. The second point is now obvious; but what about the first?

Let us first make the categorical prediction that such changes will be necessary. For while the laboratory testing of the grammar--in a certain sense, its engineering--has long been completed, the actual construction of a human grammar inside human minds is a continuous, never-finished process; and we will almost certainly find that there are unsolved structural problems still. Mistakes will be found; ambiguities will arise; important natural functions will turn out to be inadequately served.26 A human grammar is an immense affair, a mansion of many rooms, and we cannot claim to have examined every nook and cranny of the structure we have designed. Not only that, but some questions--and some of which we are now aware--can only be settled by use. For example, I am interested to see how loglanists will handle their potentially "preposition-free" language. Will they use case-tags? Or ignore them? If they use them, under what circumstances will they do so? On another level, I sometimes fear that the connective system I have designed may prove too intricate for human minds to handle in the "real time" world of speech; and so on. If so, parts of that vast structure will remain unused by the majority of speakers. Adjustments may have to be made.

As questions of this kind are gradually being answered by experience or experiment, some speakers of the language will want to change it to bring its structure into harmony with those answers as they emerge. If such changes involve its most deeply embedded structure words, there is some question in my mind whether they can actually be executed without exciting open rebellion among those who have learned these "erroneous" structures.27

Some changes, of course, will be relatively easy to accomplish. Additions to the list of modal operators--tie, lia, sea, and the like--should be nearly as easily accomplished as the addition of a predicate. The list of discursive operators--bea, pou, sui, and so on--should also prove easy to augment. But additional case tags will, I predict, be more difficult to introduce. Compound indicators may also be added with great freedom. And the less frequently used tense and location operators may prove relatively easy to augment or change. Perhaps even the rarer "punctuation marks" may yield to pressure. But the connective system, or the tense system, or the system of descriptive operators, will probably prove more stubborn. Let us hope that these more fundamental structures of the grammar will now prove to be well-engineered, and that such changes as take place in them will be augmentative rather than corrective.

In the end some change in the fundamental structure of the language will almost certainly be necessary. This may seem unfortunate; but the alternative policy of freezing its early forms could well mean its early death. For only by accommodating productive change is the language likely to struggle through its own early traditions and come alive and grow.

Notes

1 For example, I was surprised to find that in a recent Chinese-English dictionary (Wu Jingrong 1983) some 15% of the science entries were phonemic borrowings from the Greco-Latin, or rather from European adaptations of it. E.g., English 'amoeba' is Chinese 'amiba'; 'aniline' is 'anilin'; 'aspirin' is 'asipilin'; 'atropine' is 'atuopin'; and so on. My distinct impression is that this is a larger proportion of imitative loans than one found in Chinese dictionaries published forty years ago, and it suggests to me that there may be a long-range trend at work in even this most reluctant of languages that is causing Chinese scholars to turn away from metaphor-building, their classic way of incorporating foreign notions, toward simple phonemic imitation...at least for the international vocabulary of science. That would make sense. Although formerly a European province, science is now an international enterprise and needs a uniform vocabulary in which to conduct its business. Reclothing the verbal tokens of this enterprise in a body of local metaphors would be to reprovincialize it, to impede the swift conduct of this most international of human activities.

2 During the work on the borrowing algorithm one worker suggested that glued words like athomi could be written as 'at:omi', a textual style in which close-colons would replace all gluing consonants. This move would certainly enhance visual recognizability; but it has a minor disadvantage. It slightly disturbs the audio-visual isomorphism of the language by requiring "local knowledge" on the part of a stenographer--human or machine--who is given the task of transcribing a heard speech in this "punctuated" way, i.e., to write close-colons for all the gluing /h/s in it but normal 'h's for all the ones that are non-gluing. It would be impossible for such a stenographer to perform this task unless da were first given some independent means of distinguishing gluing from non-gluing /h/s in positions in which the latter could be gluing. For example, the genuine /h/ in ethili ('ethyl') is in such a position. But how to distinguish it from the one in ekhinodermati ('echinoderm') which is gluing? Apart from supplying our stenographer with a list of all the loan-words in the language in which /h/s had been used in gluing positions non-gluingly, that might be difficult to do.

3 The first test was conducted in 1961 prior to, and in preparation for, the compilation of the first (1963) edition of the dictionaries. Working through Eaton's (1940) list of the most frequently used concepts in four European languages, English, French, German and Spanish, we undertook to make new complexes when we could from the 784 composite primitives then in hand, to add new primitives when we could not, and to call the test concluded when the rate of adding primitives went to zero. This occurred earlier than expected: between frequency ranks 1,500 and 2,000. Before rank 1,500 we had added 43 new primitives but at a decreasing rate, bringing the total number of composite primitives to 817 by rank 1,500. Between ranks 1,500 and 2,000 none were added. We built a total of 1,262 new complex predicates during this early test, having started with 689, bringing the total in this category to 1,951. We then allowed ourselves the prediction that very few new composite primitives would ever have to be added to the language, that the current set would be found sufficient to make indefinite numbers of new complex predicates for the concepts of at least these four major languages, whatever the degree of conceptual intricacy of the concepts.

That prediction was substantially upheld during a second test of the constructive adequacy of the set of primitives during 1972-74. This time the test was made during the first revision and augmentation of the dictionaries. In performing this essentially editorial task we discovered that very few new composite primitives were required to be made. During that revision, 311 new complexes were added, most of them to build a dense translation interface between Loglan and English below frequency rank 2,000, but some to meet certain lexicographical considerations (resolution of English homophony, for example). During this same period 11 new composite primitives were added, bringing the total in the primitive set to 838. But fully nine of these new words were not occasioned by the work of making complex predicates; instead they were the words for nine body parts and functions, normally regarded as indelicate, which had apparently been deliberately "overlooked" during the 1961-63 dictionary work. So the constructive adequacy of the 1961 set of primitives for at least one of these four languages is attested by the fact that only two substantial additions had to be made to its set of primitives in order to add more than 300 new complexes to the language ten years later.

A third test of the adequacy of the composite primitives took place during the 1983-87 period of dictionary work. This most recent work was aimed at extending the solid portion of the interface between the Loglan lexicon and the Eaton four-language list through frequency rank 5,000 (through Eaton page 171). This was done by making over 2,500 new complexes but only eight new composite primitives. But again, only a very small number of these composites--three of the eight, in fact--were actually required for complex-building. These semantically crucial additions were, interestingly enough, the words for 'parent', 'offspring' and 'sibling', none of which is an English primitive and all of which had been originally made in Loglan, too, as complexes (as it happens, with the Chinese-type metaphors "mother-father", "daughter-son" and "sister-brother"). But certain types of kinship relations (grandparent, cousin, for example) cannot be economically expressed without these gender-free notions. So the earlier complexes have been replaced by the added primitives penre, tciha and sibli, the first two compositely built, the third, a single-source borrowing.

4 We may not have done this, since, as will be pointed out in Note 6, the "largest" languages have the largest representations in the word-stock of Loglan. But this is, at least, a variety of "fairness." Observe what would have happened, for example, had we "unfairly" taken the entire fundamental word- stock of Loglan from English. Or from Russian or Chinese. Or even from what Whorf called 'Standard Average European'...which is the source, very largely, of the word-stock of Esperanto. No cross-cultural experiments could have been conducted with such a language, much less ones designed to elicit subtle Whorfian effects. For the very process of importing the words of a language into any grammatical frame whatever can be expected also to import the cultural values and conceptual biases associated with those words in their natural frames. Take the word which is to have the denotation of the word 'copulate'. If English is to furnish our word-stock, are we to use 'fuck'? It is the word, perhaps, which is most widely-known. Or the word 'copulate' itself? Or either of the metaphors 'make love to' or 'go to bed with', for use in our "experimental" language? It makes a difference which we choose. For the connotative values of these expressions differ hugely in English...as the reader has no doubt been reminded by encountering them here. Moreover, their connotations differ in ways in which denotatively corresponding words and phrases do not differ, for example, in Chinese...a language which is rather less suitable for the expression of either obscenity or Victorian obscurantism than English is, one gathers. Yet their denotations are the same. Are we, then, in our experiments, to reckon with English cultural values every time an English subject uses such a word? Or with Russian, if our words are Russian? Or with Chinese, if Chinese? Clearly no experimental elucidation of the Whorfian phenomenon could be expected with such "instruments" as these. The compositely-built Loglan word kitsa, in contrast, which shares the denotation of these diversely-laden natural words, is itself nearly neutral in connotation ("nearly" because those who know Latin may come to associate 'coitus' with the phonemes it shares with kitsa, and so kitsa itself may come to bear a burden of connotations for them...but, we think, a small one). So in the main we may say that whatever connotations kitsa will acquire it has not acquired yet. And it will acquire these connotations, as will every other composite word in Loglan, under conditions which are at least in principle subject to scientific observation and experimental control.

Now none of this means that it would not be interesting, and perhaps useful, to plug English words into Loglan grammar. Or Russian words. Or Chinese words. It has more than once occurred to me, as it has to other loglanists who work with computers, that such "hermaphroditic languages"--possessed of the grammatical and logical faculties of Loglan, but with predicate vocabularies which were utterly familiar to those who were going to use them--would make excellent computer languages: that is, languages capable of penetrating the machine/man interface the toughness of which is perhaps the central problem in computer technology now. But such a language would not be a scientific instrument for finding out how the structure of language mediates human thought. It would be a practical tool of strictly limited applicability: a logician's Pidgin English--or Pidgin French, or Pidgin Chinese--for talking with machines.

5 The building of the Loglan composite primitives was largely accomplished during the interval 1955-59. The latest year for which language statistics were available when the work began was 1950. In 1950 there were 1,108 million native speakers of the 8 source languages, and 778 million more who spoke at least one of them as a second language, according to statistics made available by the United Nations in 1955. The best estimate then available of the 1950 world population was 2,438 million, of which these primary and secondary speakers of our source languages constituted 78%.

6 This method is clearly biased in favor of the larger language groups. Thus the "larger" languages make the largest average contributions to Loglan composite words, the "smaller" languages, the smallest. From one point of view, this is unfortunate; for it means that speakers of linguistically very interesting but less widely spoken languages like Japanese will not be simply pairable, in experimental designs, with speakers of dominant languages like Chinese or English. For it may well take them longer to learn the to-them less recognizable primitive vocabulary of the experimental language than subjects who are native speakers of the favored languages. On the other hand, differences like this can be explored experimentally, and then adjusted by providing longer learning intervals for the groups that will find less in Loglan to recognize. The average level of recognizability is maximized by our procedure, so that experimental learning rates should in general be far higher than if we had used a procedure which equalized the contribution of each language. Speed of learning is certainly an advantage to the experimenter, at least in da's earliest experiments; and da may be confined, therefore, in these experiments, to subjects drawn from the favored language groups. At the same time, the cultural neutrality of the experimental language is, I think, not seriously marred by the unevenness of the distribution of recognizability over the several source language groups. Thus, the fact that English 'go' appears wholly in Loglan godzi does not deprive that word of its interesting exoticness; for the mysterious (Chinese) sounds -dz- also occur in it. Godzi is not likely, therefore, to import the connotations of English 'go' unmixed into the language. Thus, the fact that the incorporation of short native words within longer Loglan ones occurs more frequently for dominant languages like Chinese and English should not, I think, impair the over-all neutrality of the design...even for these favored speakers.

7 In calculating the sum of speakers of a language, we counted all native speakers of it plus half of those who had acquired it as a second language provided they did not speak any of the other source languages as a native tongue. Thus, half of all native Swahili speakers who speak English were counted as English-speakers, while all native Hindi speakers are counted in the Hindi column whether they also spoke English or not. If some group spoke both English and Hindi as second languages, say, and none of the source languages as a native tongue--this is true of many native Tamil-speakers, for example--then half their number was divided equally between the English and the Hindi columns.

8 That good cue-words to the foreign words that one is learning as an adult do not need to have the same, or even similar, denotations, is demonstrated by the way in which English 'vaccine' serves as an excellent reminder of the meaning of Spanish 'vaca' ('cow') to one who knows the history of the English word. Similarly, English 'tempo' serves as a cue to Loglan ckemo ('time interval'), 'reverie' to Loglan revri ('to dream') and 'polity' to Loglan poldi ('nation'); although none of these pairs of English words can be regarded as good synonyms.

9 The supra-segmental Chinese phonemes known as "tones" were not counted as part of the "phonetic length" of Chinese words. This is an error. Its not unpleasant consequence is that Chinese is slightly over-represented in Loglan.

10 As they were allowed, indeed, in 1975...a circumstance which led to the discovery of this flaw in the language and to the engineering research that finally "unpacked" the primitive set. Speaking technically, the 1975 language was not redundant enough to function well even in moderate noise. More important, learning was seriously impaired by having to master sets of what seemed only trivially different words. For example, the pair clivi/clivu and the kanta/kante/kanti/kanto/kantu set. No such "primitive-packing" exists now in the language.

11 Vowel-rich short forms like saadja are currently preferred by the Institute over hyphenated forms like sanydja, but the Loglan speaker should be able to recognize the alternative forms, which can be used in repetition of a sentence to aid its comprehension, or where desirable for poetic meter or rhyme.

12 William Greenhood and I have argued elsewhere (Brown and Greenhood, in press) that the earliest adaptive event in at least one scenario for the evolution of human language was the deliberate misuse of genetically-coded signs, an act that might be called proto-metaphor. If this conjecture turns out to be correct, then metaphor-making is not only an essential part of the human language performance, it is phylogenetically its earliest part. If so, it is extremely likely to be fixed in the human gene pool, that is, to be a hum-drum human skill, and therefore part of the basic language kit of everyone.

13 A program embodying this algorithm is available from The Institute.

14 Greenhood and I have argued (Brown and Greenhood, in press) that this skill is at bottom what has been called the "role-taking" skill: seeing the world from the perspective of others, an adaptation that seems to have been in some ways shared with the other Hominoidea. Role-taking was posited by the American philosopher-psychologist George Herbert Mead (1937) to have been the basic cognitive skill required for the evolution of language. Much recent data about language seem now to support this older view.

15 A computer program supplied by The Institute allows the learner to teach daself the 860 composite primitives and their 600 affixes in a few weeks. This is all the knowledge da will ever require to decipher any regular Loglan complex da might ever encounter. Irregular ones--i.e., complexes like iglymao ('igloo-maker') that are derived at least in part from borrowings--are a different matter. No teaching program is capable of supplying the learner with advance information about all the borrowings da might ever encounter. This is one of the deepest and most far-reaching advantages of regular complexes over borrowings, and the reason why loglanists make complexes instead of borrowing whenever that is both appropriate and possible.

16 At the moment the evaluation routine employed by the complex-making algorithm scores /r n/-hyphenated words so much lower than /y/-hyphenated ones that it picks martysensi over maersensi even though the latter is both shorter (a plus value, according to the algorithm) and higher in its V/C ratio (and currently, this is an even greater plus value). So the human word-maker who prefers a word like maersensi--and in certain categories of complexes such forms might be definitely preferable to their /y/-hyphenated alternatives--will have to flag its dictionary entry as "a handmade word". This move will prevent the complex-making algorithm from remaking it as martysensi on its periodic "update passes" through the dictionary. The policies that control the choices of the algorithm on these update passes are of course subject to revision and refinement as we come to understand more about the aesthetics of our language.

17 Two complexes meaning substantially the same thing offer no problem to a language, since they represent different ways of approaching the same, perhaps elusive, idea by what is at first an essentially poetic act, the coining of a metaphor. Two primitives meaning the same thing would be awkward, however, since every primitive notion of a language predicates whatever it does predicate, say blueness, by fiat, that is, by convention only. Thus blanu is not the "best" word for blue, it is the word for blue; there is no other. But that two rival coinages, say racbao ('travel-box') and racysakli ('travel-sack'), might both be used to predicate pieces of a traveller's luggage, and live a long time side by side in the language, is surely possible...and the two words might each acquire distinct nuances of "betterness" in complementary contexts...as indeed these two already have.

18 The Institute's choice for this work is the Simon and Schuster International Dictionary: English/Spanish, Spanish/English, (1973), a work of some 200,000 entries, and therefore very likely to contain all but the most esoteric portion of the ISV. It is weak on Linnaeans, but has good representation of nearly everything else.

19 If we allowed words like *spektri in the language, then words like tospektri would have to be disallowed; and vice versa. It is because complexes are so much more important than borrowings in the structure of the language that we tip the design-scales in favor of the former. Besides, the domain of possible complexes of any given length is much smaller than the domain of possible borrowings of that length; for unlike borrowings, complexes are a morphologically restricted class. So by disallowing *spektri-type words in Loglan, we withdraw from a large, sparsely-populated domain (the domain of borrowings) a certain word-space that can then be unambiguously allocated to a smaller, more densely-populated domain (the domain of complexes). On both grounds, it is a justified trade-off.

20 Names, to be sure, play a special role in any language; and there are philosophers who insist that names are not part of individual languages at all (e.g., Ziff 1960). This view cannot be supported linguistically, however, since all languages have definite rules for forming or incorporating names. Loglan is no exception.

21 Since [ng] is a Loglan phone but not a Loglan phoneme--it is the allophone of /n/ used before /k g/--there is no way of spelling it in Loglan as a single sound. Either Loglan Nng or Nnk would serve, I suppose, but both would render the name as a rather difficult two-sound monosyllable: [ng+g] or [ng+k], the former being the middle part of English '(ti)ng(le)', the latter, the last part of '(i)nk'.

22 I am indebted to Robert A. McIvor for suggesting this 'Laplace'-saving convention.

23 Sunday used to be the first day of the week--in the Judeo-Christian world, anyway--and on most Western calendars it still is. But the five-day work-week seems to have displaced it in the industrial countries with Monday, which has always been, apparently, the first day in the Western work-week. But now that Sunday has been joined to Saturday in a week-end, these are pretty clearly the sixth and seventh days of some new conception of the week, one in which Sunday is last, not first. The results of a small sample survey (N=30) showed that this perception is probably widely shared in the U.S. population at least, 75% of our respondents reporting that Monday is the first day of their week. We are told that the first workday is also taken to be the first day of the Chinese week as well.

24 In the technical argot of Loglan workers, classes of words whose members are grammatically interchangeable with one another are called lexemes, and members of any one such class are called its allolexes. So the question in hand may be rephrased as, What happens to the grammar if you add or delete an allolex from some lexeme? The answer: If there is still one allolex left, nothing.

25 'This is a good thoo. I like these two thoos. This thoo is bigger than that thoo' comes easily enough. But not 'Foo man gave foo book to foo girl'. One can hardly get one's tongue around the latter sequence, let alone one's mind! The difficulty experienced in making such "allolexic" replacements--as measured by overall time, hesitation, and the errors committed in making them--might well be used as an experimental measure of the size of the lexemes (see Note 22 above) in a natural language, i.e., the number of their "allolexes." Loglan, having lexemes of known size, offers an opportunity to test this notion.

26 These predictions--originally made in 1975--have already been once fulfilled. But the "important natural function" that was not well-served by the 1975 language turned out not to be grammatical but morphological: the 1975-style complex predicates were not regularly decipherable, and so had to be learned as quasi-primitives. This defect had wide-ranging consequences, the most critical of which was that it slowed down the acquisition of this portion of the vocabulary. There were others. The discovery of this raft of morphological problems during the 1977-78 learning trials led to what was called the "Great Morphological Revision", a program to build and test the best possible set of "decipherable affixes", an engineering program that was to last through 1982. The unsolved problem in the grammar, in contrast, was well-known in 1975: Loglan grammar was not formally, but only heuristically, unambiguous. Discovering a formal proof of its freedom from ambiguity, and disambiguating it where it was not, was a task that had been necessarily left undone for want of powerful enough tools. This lack was filled in 1975--the very year of Loglan's second public debut--by Aho, Ullman and Johnson (1975). With the "automatic parser generators" that issued from their work, Loglan grammar was made, and demonstrated to be, unambiguous in 1982, and has remained so ever since.

27 In fact, something like this actually happened in the course of the reengineering of Loglan morphology during the years 1978-85, a project called the "Great Morphological Revision" or GMR. A rebellion did occur--or did very nearly--in response to the publication of Notebook 2, a report on the early findings of the GMR research. Some loglanists did not welcome such fundamental changes in the morphological structure of their language. See Brown (1983c) for some reflections on why resistance of this kind might be more likely to occur when changes in morphology are proposed than when the changes proposed affect only grammar. I argue in that paper that this observed difference in resistance to change, if it turns out to be a real one, could very well reflect the very different evolutionary histories of these two compartments of the human language-handling apparatus, the early Pleistocene one, which was arguably evolved to handle the mimetic acquisition of morphology and a proto-lexicon, as opposed to the late (geologically) Recent one, evolved to handle the inductive acquisition of grammar. I argue that mimesis and induction involve very different kinds of mental programming, the first (in several senses) being essentially conservative and change-resisting, the second, essentially exploratory and change-welcoming.

(Return to the Table of Contents)