Skeptophilia (skep-to-fil-i-a) (n.) - the love of logical thought, skepticism, and thinking critically. Being an exploration of the applications of skeptical thinking to the world at large, with periodic excursions into linguistics, music, politics, cryptozoology, and why people keep seeing the face of Jesus on grilled cheese sandwiches.
Showing posts with label constructed languages. Show all posts
Showing posts with label constructed languages. Show all posts

Tuesday, December 17, 2024

A linguistic labyrinth

It's funny the rabbit holes fiction writers get dragged down sometimes.

This latest one occurred because of two things that happened kind of at the same time.  First, I was chatting with a friend about one of my books, a fall-of-civilization novel called In the Midst of Lions that in the current national and global situation is seeming to cut a little close to the bone.  In the story, one of the characters is a linguist who saw what was coming, and wrote a conlang -- a constructed (invented) language -- so he could communicate with people he trusted without it being decipherable by enemies.

My friend asked how I managed to develop the conlang, which is called Kalila, and what process I'd gone through to make it sound like a real language.

Following in the footsteps of the Star Trek folks with Klingon and J. R. R. Tolkien with Quenya and Sindarin (two of the languages of the Elves) was not an easy task.  My MA is in linguistics (yes, I know, I spent my career teaching biology; it's a long story) so I know a good bit about language structure, and I wanted to make the language different enough from the familiar Indo-European languages to seem (1) an authentic language, not just a word-for-word substitution, and (2) something a smart linguist would think up.  Unfortunately, my specialty is Indo-European languages, specifically Scandinavian languages.  (My wife gives me grief about having studied Old Norse.  My response is that if the Vikings ever take over the shipping industry, I'm gonna have the last laugh.)


A sample of Tolkien's lovely Quenya script [Image is in the Public Domain]

So I started out with a pair of blinders on.  There are a lot of rules specific to Indo-European languages that we tend to take for granted, which was exactly what I didn't want to do with my conlang.  But in order to identify those, you have to somehow lift yourself out of your own linguistic box -- which is awfully hard to do.

The second thing, though, was that shortly after chatting about my conlang with my friend, I stumbled on a question on Quora that asked, "What is the hardest language to learn to speak fluently?"  By "hardest" most people assumed "for speakers of English," which went right to what I'd been discussing earlier -- finding out what would seem odd/counterintuitive (and therefore difficult) for English speakers.

Well, between the conversation and the post on Quora, I was led directly into an online research labyrinth, literally for hours.

One respondent to the hardest-language-question said his choice would be the Northwest Caucasian languages of Georgia, Azerbaijan, and Armenia -- a group made up of Abaza, Abkhaz, Adyghe, Kabardian, and Ubykh -- the last-mentioned of which became extinct in 1992 when the last native speaker died of old age.  These languages form an isolate family, related to each other but of uncertain (and undoubtedly distant) relationship to other languages.

So naturally, I had to find out what's weird about them.  Here's what I learned.

Let's start out with the fact that they only have two vowels, but as many as 84 consonants depending on exactly how finely you want to break them up based on the articulation.  They use SOV (subject-object-verb) word order, plopping the verb at the end of the sentence, but that's hardly unique; Latin does that, giving rise to the old quip that by the time a Roman got to the verb in his sentence, his listeners had forgotten who he was talking about.

But in the parlance of the infomercial, "Wait, there's more!"  The Northwest Caucasian languages use agglutination -- gluing together various bits and pieces to make a more specific word -- but only for verbs.  In these languages, a verb is actually a cluster of parts called morphemes that tell you not only what the core verb is, but the place, time, manner of action, whether it's positive or negative, and even the subject's and object's person.

Then, there's the fact that they're ergative-absolutive languages.  When I hit this, I thought, "Okay, I used to know what this meant," and had to look it up.  It has to do with how the subject and object of a sentence are used.  In English (a nominative-accusative language), the subject has the same form regardless of what kind of verb follows it; likewise, the object always is the same.  So the subject of an intransitive verb like "to walk" is the same as the subject for a transitive verb like "to watch."  (We'd say, "she walked" and "she watched [someone or something];" in both cases, you use the form "she.")  The object form of "he" is always "him," regardless of any other considerations in the sentence.

Not so in the Northwest Caucasian languages, and other ergative-absolutive languages, such as Tibetan, Basque, and Mayan.  In these languages, the subject of an intransitive verb and the object of a transitive one have the same form; the subject of a transitive verb is the one with the different form.  (If English was an ergative-absolutive language, we might say "He watched her," but then it'd be "her walked.")

So there are lots of things that seem normal, obvious even, which in fact are simply arbitrary rules that we've learned are universal to English, but which are hardly universal to other languages.  It always puts me in mind of the Sapir-Whorf hypothesis, which is that the language you speak shapes your cognitive processes.  In other words, that speakers of languages differently structured from English literally perceive the world a different way because the form of the languages force different conceptualizations of what they see.

I've gone on long enough about all this, and I haven't even scratched the surface.  There are tonal languages like Thai, where the pitch and pitch change of a syllable alter its meaning.  There are languages like Finnish and Japanese where vowel length -- literally, how long you say the vowel for -- changes the meaning of the word it's in.  There are inflected languages like Greek, where the ending of a word tells you how it's being used in the sentence (e.g., in the phrases "the cat walked," "she pet the cat," "it's the cat's bowl," "give the food to the cat," and "the dog is with the cat," the word "cat" would in each case have a different suffix).

So it was a struggle to make my conlang something that would be believable to a linguist, and I can only hope I succeeded well enough to get by.  (Or, in the context of the story, something an actual linguist would invent.)  Of course, being that it's only one small piece of the story, in the end I used something like a dozen phrases total from the language, so it was kind of a lot of work with very little obvious result.

But I figure that in any case, what I came up with has still gotta be more realistic than the Judoon "ro po fo so no do" language from Doctor Who, which I'm only throwing in here because after yesterday's post my author friend Andrew Butters commented that I can always somehow find a way to work in a Doctor Who reference regardless of the topic, and I couldn't just refuse to rise to that challenge.


So there.

****************************************

Monday, August 7, 2023

One language to rule them all

The aphorism "No matter what you know, there's always more to learn" is something you'd be likely to see on one of those cheesy "motivational posters" that cheery type-A personalities like to pin up on office walls, but there's a lot of truth to it.  I rather prefer the formulation credited to Socrates -- "The more I know, the more I realize how little I know."

I ran into a fun example of this principle a while back, when a member of the online linguistic geekery group Our Bastard Language posted an article from The Public Domain Review called "Trüth, Beaüty, and Volapük," about a constructed language (or "conlang," in the lingo of the field) called Volapük that I had never heard of before.

My M.A. is in linguistics, but my field of study was historical/reconstructive linguistics (my thesis was about the effects of the Viking invasions on Old English and Old Gaelic, and should have won some kind of award for research that has absolutely no practical application).  But even though conlangs aren't my specialty, I've always had a fascination from them, and in fact I created a conlang called Kalila as a part of my soon-to-be-released novel In the Midst of Lions.  There are a remarkable number out there, from the familiar (Esperanto, Klingon, Elvish) to the obscure but fascinating (such as John Quijada's Ithkuil, which attempts to express concepts in a combinatory way from the smallest possible number of root words).

A sample of Tolkein's lovely Elvish script [Image is in the Public Domain]

But despite my interest in conlangs, I had never run across Volapük, which is strange because next to Esperanto, it's apparently one of the most studied constructed languages ever created.  It was the invention of a German priest named Johann Schleyer, who not only wanted to create a regularized speech that came from familiar roots (to Europeans, anyhow) and was easy to learn, but was also "beautiful sounding."  Schleyer had an inordinate fondness for umlauts, which he added because he thought that "A language without umlauts sounds monotonous, harsh, and boring."

Which reminds me of the opening credits in Monty Python and the Search for the Holy Grail, especially the "A mööse once bit my sïster" part.  One of Schleyer's contemporaries couldn't resist poking some gøød-natured fün at him over his umlautophilia, and published the following limerick in the Milwaukee Sentinel:
A charming young student of Grük
Once tried to acquire Volapük
But it sounded so bad
That her friends called her mad,
And she quit it in less than a wük.
To my ears, it doesn't sound bad at all, and kind of has a Scandi-Slavic lilt to it. Here's a sample:


The author of the article in The Public Domain Review, Arika Okrent, attributes the relative failure of Volapük to its plethora of umlauts and the easier word roots of its competitor Esperanto, which currently has about two million fluent speakers (an estimated one thousand of whom learned it as their first language).  I'm a little doubtful about that; certainly umlautiness hasn't discouraged anyone from learning Finnish.  I think it's more that the idea of a universal language is one of those high-flown ideals that won't ever catch on because most people are going to be resistant to giving up their native tongue in favor of an invented system of speech, however easy it is to learn.  Language is such a deep part of culture that to jettison our own mode of communication runs counter to every social instinct we have.  (Note that one of the most common things conquerors do to conquered people is to outlaw the speaking of the native language -- it's a sure way to deal a death blow to the culture.)

Even so, I find the whole conlang thing fascinating, and was tickled to run across one I'd never heard of.  Back in my teaching days I every so often had students who participated in an independent study class I offered in introductory linguistics, and the final project was to invent the framework of a language -- a phonetic and phonemic structure, morphological scheme, and syntax, along with a lexicon of at least a hundred words.  They then translated a passage from English into their language.  (One of the best ones I've ever seen involved a charming translation of Eric Carle's The Very Hungry Caterpillar.)

The result of this project was twofold -- students found out how hard it is to create a realistic language, and they learned a tremendous amount about the structure of our own language.  And that's just from producing a rudimentary skeleton of a language.  For people like Schleyer, who created a rich and fully functional language, it was the result of many years of devotion, hard work, and love for language itself.

So it's kind of a shame that people didn't appreciate Volapük more.  Schleyer's dream of having a language that would bring the entire world together in a common mode of communication may be as far off as ever, but even so, it's a beautiful dream.  Even if it would mean making friends with the mäjestïc ümlaüt.

****************************************



Friday, October 16, 2020

A linguistic labyrinth

It's funny the rabbit holes fiction writers get dragged down sometimes.

This latest one occurred because of two things that happened kind of at the same time.  First, in my work-in-progress, a fall-of-civilization novel called In the Midst of Lions that in the current national and global situation is seeming to cut a little close to the bone, one of the characters is a linguist who saw what was coming and wrote a conlang -- a constructed (invented) language -- so he could communicate with people he trusted without it being decipherable by enemies.

So of course, to make it authentic, I've had to write the language, following in the footsteps of the Star Trek folks with Klingon and J. R. R. Tolkien with Quenya and Sindarin (two of the languages of the Elves).  My MA is in linguistics (yes, I know, I spent my career teaching biology; it's a long story) so I know a good bit about language structure, and I wanted to make the language different enough from the familiar Indo-European languages to seem (1) an authentic language, not just a word-for-word substitution, and (2) something a smart linguist would think up.  Unfortunately, my specialty is Indo-European languages, specifically Scandinavian languages.  (My wife gives me grief about having studied Old Norse.  My response is that if the Vikings ever take over the shipping industry, I'm gonna have the last laugh.)

A sample of Quenya script, with the English transliteration.  It translates to, "Ah! like gold fall the leaves in the wind, long years numberless as the wings of trees!"  [Image is in the Public Domain]

So I started out with a pair of blinders on.  There are a lot of rules specific to Indo-European languages that we tend to take for granted, which was exactly what I didn't want to do with my conlang.  But in order to identify those, you have to somehow lift yourself out of your own linguistic box -- which is awfully hard to do.

The second thing, though, was a nice kick in the rear that came from a question on Quora that asked, "What is the hardest language to learn to speak fluently?"  By "hardest" most people assumed "for speakers of English," which went right to what I was interested in -- finding out what would seem odd/counterintuitive (and therefore difficult) for English speakers.

Well, this is what led me directly into the research labyrinth, literally for hours.

One respondent answered that the hardest ones would be the Northwest Caucasian languages of Georgia, Azerbaijan, and Armenia -- a group made up of Abaza, Abkhaz, Adyghe, Kabardian, and Ubykh -- the last-mentioned of which became extinct in 1992 when the last native speaker died of old age.  These languages form an isolate family, related to each other but of uncertain (but undoubtedly distant) relationship to other languages.

So naturally, I had to find out what's weird about them.  Here's what I learned.

Let's start out with the fact that they only have two vowels, but as many as 84 consonants depending on exactly how finely you want to break them up based on the articulation.  They use SOV (subject-object-verb) word order, plopping the verb at the end of the sentence, but that's hardly unique; Latin does that, giving rise to the old quip that by the time a Roman got to the verb in his sentence, his listeners had forgotten who all he was talking about.

But in the parlance of the infomercial, "Wait, there's more!"  The Northwest Caucasian languages use agglutination -- gluing together various bits and pieces to make a more specific word -- but only for verbs.  In these languages, a verb is actually a cluster of parts called morphemes that tell you not only what the core verb is, but the place, time, manner of action, whether it's positive or negative, and even the subject's and object's person.

Then, there's the fact that they're ergative-absolutive languages.  When I hit this, I thought, "Okay, I used to know what this meant," and had to look it up.  It has to do with how the subject and object of a sentence are used.  In English (a nominative-accusative language), the subject has the same form regardless of what kind of verb follows it; likewise, the object always is the same.  So the subject of an intransitive verb like "to walk" is the same as the subject for a transitive verb like "to watch."  (We'd say, "she walked" and "she watched [someone or something];" in both cases, you use the form "she.")  The object form of "he" is always "him," regardless of any other considerations in the sentence.

Not so in the Northwest Caucasian languages, and other ergative-absolutive languages, such as Tibetan, Basque, and Mayan.  In these languages, the subject of an intransitive verb and the object of a transitive one have the same form; the subject of a transitive verb is the one with the different form.  (If English was an ergative-absolutive language, we might say "He watched her," but then it'd be "her walked.")

So there are lots of things that seem normal, obvious even, which in fact are simply arbitrary rules that we've learned are universal to English, but which are hardly universal to other languages.  It always puts me in mind of the Sapir-Whorf hypothesis, which is that the language you speak shapes your cognitive processes.  In other words, that speakers of languages differently structured from English literally perceive the world a different way because the form of the languages force different conceptualizations of what they see.

I've gone on long enough about all this, and I haven't even scratched the surface.  There are tonal languages like Thai, where the pitch and pitch change of a syllable alter its meaning.  There are languages like Finnish and Japanese where vowel length -- literally, how long you say the vowel for -- changes the meaning of the word it's in.  There are inflected languages like Greek, where the ending of a word tells you how it's being used in the sentence (e.g., in the phrases "the cat walked," "she pet the cat," "it's the cat's bowl," "give the food to the cat," and "the dog is with the cat," the word "cat" would in each case have a different suffix).

So I have some work to do to make my conlang something that would be believable to a linguist.  Or, in the context of the story, something an actual linguist would invent.  Of course, being that it's only one small piece of the story, in the end I'll probably use something like a dozen phrases total from the language, so it'll be a lot of work with very little useful result.

But hey, if J. R. R. Tolkien did it, who am I to criticize?

***************************************

This week's Skeptophilia book recommendation is brand new, and is as elegiac as it is inspiring -- David Attenborough's A Life on Our Planet: My Witness Statement and a Vision for the Future.

Attenborough is a familiar name, face, and (especially) voice to those of us who love nature documentaries.  Through series such as Our Planet, Life on Earth, and Planet Earth, he has brought into our homes the beauty of nature -- and its desperate fragility.

At 93, Attenborough's A Life on Our Planet is a fitting coda to his lifelong quest to spark wonder in our minds at the beauty that surrounds us, but at the same time wake us up to the perils of what we're doing to it.  His message isn't all doom and gloom; despite it all, he remains hopeful, and firm in his conviction that we can reverse our course and save what's left of the biodiversity of the Earth.  It's a poignant and evocative work -- something everyone who has been inspired by Attenborough for decades should put on their reading list.

[Note: if you purchase this book using the image/link below, part of the proceeds goes to support Skeptophilia!]



Tuesday, May 30, 2017

One language to rule them all

The aphorism "No matter what you know, there's always more to learn" is something you'd be likely to see on one of those cheesy "motivational posters" that cheery type-A personalities like to pin up on office walls, but there's a lot of truth to it.  I rather prefer the formulation credited to Socrates -- "The more I know, the more I realize how little I know."

I ran into a fun example of this principle yesterday, when a member of the online linguistic geekery group Our Bastard Language posted an article from The Public Domain Review called "Trüth, Beaüty, and Volapük," about a constructed language (or "conlang," in the lingo of the field) called Volapük that I had never heard of before.

My M.A. is in linguistics, but my field of study was historical/reconstructive linguistics (my thesis was about the effects of the Viking invasions on Old English and Old Gaelic, and should have won some kind of award for research that has absolutely no practical application).  But even though conlangs aren't my specialty, I've always had a fascination from them.  There are a remarkable number out there, from the familiar (Esperanto, Klingon, Elvish) to the obscure but fascinating (such as John Quijada's Ithkuil, which attempts to express concepts in a combinatory way from the smallest possible number of root words).

A sample of Tolkein's lovely Elvish script [image courtesy of the Wikimedia Commons]

But despite my interest in conlangs, I had never run across Volapük, which is strange because next to Esperanto, it's apparently one of the most studied constructed languages ever created.  It was the invention of a German priest named Johann Schleyer, who not only wanted to create a regularized speech that came from familiar roots (to Europeans, anyhow) and was easy to learn, but was also "beautiful sounding."  Schleyer had an inordinate fondness for umlauts, which he added because he thought that "A language without umlauts sounds monotonous, harsh, and boring."

Which reminds me of the credits in Monty Python and the Search for the Holy Grail, especially the "A mööse once bit my sïster" part.  One of Schleyer's contemporaries couldn't resist poking some gøød-natured fün at him over his umlautophilia, and published the following limerick in the Milwaukee Sentinel:
A charming young student of Grük
Once tried to acquire Volapük
But it sounded so bad
That her friends called her mad,
And she quit it in less than a wük.
To my ears, it doesn't sound bad at all, and kind of has a Scandi-Slavic lilt to it.  Here's a sample:


The author of the article in The Public Domain Review, Arika Okrent, attributes the relative failure of Volapük to its plethora of umlauts and the easier word roots of its competitor Esperanto, which currently has about two million fluent speakers (an estimated 1,000 of which learned it as their first language).  I'm a little doubtful about that; certainly umlautiness hasn't discouraged anyone from learning Finnish.  I think it's more that the idea of a universal language is one of those high-flown ideals that won't ever catch on because most people are going to be resistant to giving up their native tongue in favor of an invented system of speech, however easy it is to learn.  Language is such a deep part of culture that to jettison our own mode of communication runs counter to every social instinct we have.  (Note that one of the most common things conquerors do to conquered people is to outlaw the speaking of the native language -- it's a sure way to deal a death blow to the culture.)

Even so, I find the whole conlang thing fascinating, and was tickled to run across one I'd never heard of.  Every so often I have students who participate in an independent study class I teach in introductory linguistics, and the final project is to invent the framework of a language -- a phonetic and phonemic structure, morphological scheme, and syntax, along with a lexicon of at least a hundred words.  They then translate a passage from English into their language.  (One of the best ones I've ever seen did a charming translation of Eric Carle's The Very Hungry Caterpillar.)

The result of this project is twofold -- students find out how hard it is to create a realistic language, and they learn a tremendous amount about the structure of our own language.  And that's just from producing a rudimentary skeleton of a language.  For people like Schleyer, who created a rich and fully functional language, it was the result of many years of devotion, hard work, and love for language itself.

So it's kind of a shame that people didn't appreciate Volapük more.  Schleyer's dream of having a language that would bring the entire world together in a common mode of communication may be as far off as ever, but even so, it's a beautiful dream.  Even if it would mean making friends with the mäjestïc ümlaüt.