Indo-European languages

The family of Indo-European languages is a collection of several hundred languages, including the majority of languages spoken in Europe, the plateau of Iran and the subcontinent of India, that share a considerable common vocabulary and linguistic features. These shared traits have led many scholars to believe that these languages derive from a common ancestor, usually designated Indo-European or Proto-Indo-European (or PIE). Among the most famous languages that belong to this group are English, French, German, Greek, Hindi-Urdu, Italian, Latin, Persian (Farsi), Portuguese, Russian, Sanskrit and Spanish.

Naming
The native name of the first Indo-European populations remains unknown.

A general name, accepted by nearly all scholars, is Indo-European, since this language family used to cover, during Antiquity and the Middle Ages, a vast territory stretching from India to Europe.

Another name is Indo-Germanic, used mostly by German scholars during the 19th and 20th centuries, but largely obsolete since the second half of the 20th century. The explanation of this name is simple: German scholars played an important role in the development of Indo-European studies.

A more recent alternative name is Indo-Hittite, which stresses the view that the Anatolian branch of Indo-European (including the Hittite language) was a very early offshoot from the Indo-European urheimat. This name has not found wide acceptance among scholars.

The name Aryan was used as a synonym for Indo-European by several authors during the 19th century and the beginning of the 20th century. In fact, Aryan (from Sanskrit Arya) designates chiefly the Indo-Iranian branch of Indo-European, rather than the Indo-European family as a whole. The name Arya for the original Indo-European people is only a hypothesis. The main problem is that some racist authors of the 19th century, and then Nazi ideology, misappropriated the term Aryan to express the absurd idea of the supposed supremacy of a European “race”. After the massive crimes committed by the Nazis during the Second World War, scholars abandoned the term Aryan as a synonym of Indo-European. It is still accepted in its attested Sanskrit sense, as a synonym of the Indo-Iranian branch.

Classic list of branches
The family of Indo-European languages is subdivided into a number of subgroups. These are:


 * 1) Indo-Iranian languages, comprising two close subfamilies: Indian and Iranian.
 * 2) Indian languages. These languages are now spoken in the modern countries of India, Bangladesh, and Pakistan. The oldest literary texts preserved in any Indo-European language are the Vedas. The oldest of them date to around 1500 BC and are written in an early form of Sanskrit. Among the modern languages belonging to this subgroup are Hindi, Urdu, Bengali, and Punjabi.
 * 3) Iranian languages. These languages are spoken on the plateau of Iran. There are close affinities between Iranian and Indian languages, suggesting that the peoples who speak dialects of these respective language subgroups have lived in close proximity with each other for a long time. Many historical linguists believe that both Indian and Iranian descended from a common ancestor, Proto-Indo-Iranian. The Iranian languages are divided into an eastern and a western branch. The modern language of Farsi (or Persian) is the main representative of the Iranian languages, and it belongs to the western branch. Other Iranian languages are Afghan (or Pushtu) and Beluchi, both spoken in parts of Afghanistan, and Kurdish, which is spoken in an area covering northern Iraq, eastern Turkey, and northwestern Iran.
 * 4) Armenian. Armenian is somewhat isolated within Indo-European, since it does not appear to be linked to any other group by shared linguistic (grammatical) features, though its vocabulary contains numerous items borrowed from Farsi as a result of many centuries of Persian domination. Other lexical items found in Armenian come from Semitic languages, Greek, and Turkish.
 * 5) Greek or Hellenic. The Greek people (or Hellenes) entered the area now known as Greece around 2000 BC, where they displaced numerous other peoples. The early-flowering Greek culture produced a number of masterpieces, including the Iliad and the Odyssey, both Homeric poems. The Greek language comprised the following notable dialects in classical Antiquity: Ionic, Aeolic, Arcadian-Cyprian, Doric, and Northwest Greek. The inclusion of Ancient Macedonian in Greek is debated. The most prestigious dialect was Attic, the dialect of Ancient Athens, which belonged to the Ionic group. Attic attained supremacy in the fifth century BC through the dominant political and commercial position of Athens. Attic formed the basis of a koiné or lingua franca, that is, a mixture of several dialects to facilitate communication between different parts of the Greek world and for use as a unified standard in foreign commerce and diplomacy. Modern Greek, or Demotic, is ultimately descended from koiné Greek.
 * 6) Albanian. Albanian is an independent member of the Indo-European family, but this has been recognized only since the mid-nineteenth century because the language is permeated with influences from Latin, Greek, Turkish, and Slavonic. Records for Albanian go back only to the fifteenth century AD.
 * 7) Italo-Celtic languages, comprising three close subfamilies: Italic, Ancient Ligurian and Celtic.
 * 8) Italic languages (including the Romance languages). This group includes numerous languages now extinct, such as Faliscan and Umbrian, but the main historical representative of this group is Latin, originally the language of Latium (the area around Rome). Vulgar dialects of Latin were spread throughout the Balkans, the Mediterranean and Western Europe and over time these developed into the Romance languages which are from east to west: Romanian, Italian proper and Northern Italian, Sardinian, Corsican, Friulian, Ladin, Romansh, French, Francoprovençal, Occitan, Catalan, Aragonese, Spanish, Asturian-Leonese and Galician-Portuguese.
 * 9) Ancient Ligurian language. This language was intermediate between the Italic and the Celtic languages. It was spoken in Antiquity in what are now Provence and Liguria.
 * 10) Celtic languages. These languages were once spoken throughout Western and Central Europe, but are now confined to the British Isles and Brittany. There are two branches: Goidelic or Gaelic and Brythonic or Britannic. The former is represented by the modern languages of Irish Gaelic, Scottish Gaelic, and Manx. The latter includes Welsh, Cornish and Breton. The prospects of survival for the remaining Celtic languages are not good, as all have declined sharply in favor of English or French.
 * 11) Balto-Slavic languages fall into two main close groups: Baltic and Slavic.
 * 12) The Baltic languages have three representatives: Latvian (sometimes called Lettish), Lithuanian, and the now extinct Prussian. Lithuanian is one of the most conservative Indo-European languages still spoken and is therefore of great interest to historical linguists.
 * 13) The Slavic languages are further subdivided into three groups: East Slavic, which includes Russian (also known as "Great Russian"), Belarusian (also known as "White Russian"), and Ukrainian (also known as "Little Russian"); West Slavic, which includes Polish, Czech, and Slovak; and South Slavic, which includes Bulgarian, Slovenian, and Serbo-Croatian. The oldest texts we have in Slavic are fragments of the Bible and other liturgical texts written by St. Cyril in the ninth century in a language usually referred to as Old Church Slavonic.
 * 14) Germanic languages. The Germanic languages differ from other Indo-European languages by the First or Germanic Consonant Shift (described by Grimm's Law). The common ancestor of the Germanic languages is called either Germanic or Proto-Germanic. This subgroup has three branches: East Germanic, North Germanic, and West Germanic. The first of these is now extinct, but it is relatively well known through the fragments of Wulfila's Gothic Bible, which dates to the fourth century AD. The North Germanic branch comprises the Scandinavian languages Swedish, Norwegian, Danish, Icelandic, and Faroese. The West Germanic branch includes English, German, Dutch, and Frisian.
 * 15) Tocharian, more exactly called Arshi-Kuchi. This is the most obscure branch of Indo-European since it has been extinct since at least the ninth century AD and because we have virtually no data for it. We know of two (or perhaps three) different languages belonging to this branch, usually referred to as Tocharian A (Arshi) and Tocharian B (Kuchi).
 * 16) Anatolian. Although this most ancient branch of Indo-European has been extinct since ca. 1100 BC, a good deal is known about it thanks to the discovery, in the early twentieth century, of cuneiform tablets with inscriptions in Hittite, the main representative of this branch.

Sergent's classification
A comprehensive and detailed classification was proposed in 1995 by Bernard Sergent in his extensive synthesis of the Indo-European question, which draws on a large body of previous work. Sergent's classification covers all groups and subgroups.


 * 1) Northwest group
 * 2) Italo-Celtic
 * 3) Macro-Celtic
 * 4) Celtic
 * 5) Gaelic, including Irish, Manx, Scottish Gaelic.
 * 6) Brythonic, including Welsh, Cornish, Breton, most varieties of Gaulish (extinct).
 * 7) Lepontic (extinct)
 * 8) Celtiberian (extinct)
 * 9) Ancient Asturian (extinct)
 * 10) Ancient Ligurian (extinct; intermediate between Celtic and Italic)
 * 11) Italic or Macro-Italic
 * 12) Osco-Umbrian (extinct), including Umbrian and the Sabellic languages (Sabinian, Samnite, Oscan, Pelignian, Volscan, Marse, Marrucine, Vestinian…).
 * 13) Latino-Faliscan, including Faliscan (extinct) and Latin.
 * 14) Deriving from Latin: the Romance languages, including Galician-Portuguese, Asturian-Leonese, Spanish, Aragonese, Catalan, Occitan, French, Francoprovençal, Romansh, Ladin, Friulian, Northern Italian, Italian, Corsican, Sardinian, Romanian.
 * 15) North Adriatic (extinct), including Venetic.
 * 16) Dalmato-Pannonian (extinct)
 * 17) possibly: Rhaetic (extinct)
 * 18) Siculian-Elymian (extinct)
 * 19) Northwest block or Belgian (extinct)
 * 20) Germanic
 * 21) East Germanic (extinct), including Gothic, Burgundian, Vandal, Rugian, Gepid, Taifal.
 * 22) North Germanic or Scandinavian, becoming Old Norse in an early stage, then giving birth to Danish, Swedish, Norwegian, Faeroese, Icelandic.
 * 23) West Germanic, including English, Frisian, Low German, Dutch, Afrikaans, German proper (or High German), Yiddish.
 * 24) Balto-Balkanic
 * 25) Macro-Baltic (a better name than Balto-Slavic)
 * 26) Baltic, including Old Prussian (extinct), Latvian, Lithuanian.
 * 27) Slavic (in fact a particular, southern offshoot of Baltic), including Old Church Slavonic (extinct), Polish, Sorbian, Kashubian, Czech, Slovak, Slovene, Serbo-Croatian, Bulgarian (with Slavomacedonian), Russian, Belarusian, Ukrainian.
 * 28) Balkanic
 * 29) Daco-Thracian
 * 30) Dacian or Daco-Mysian or Getic, including Dardanian, Moesian (extinct), Mysian (extinct).
 * 31) Albanian, probably descending from Dardanian.
 * 32) Thracian, including Thracian proper (extinct), Thynian (extinct) and Bithynian (extinct).
 * 33) Armenian, a distant offshoot of Thracian (but developing a close contact with Helleno-Phrygian).
 * 34) Illyro-Messapian (extinct), including Illyrian and Messapian.
 * 35) South Italic (extinct; hard to classify)
 * 36) Philistine, maybe the same language as Pelasgian (extinct; hard to classify, possibly a branch of Macro-Italic).
 * 37) Arshi-Kuchi (extinct), often called improperly Tocharian, including Arshi (or Tocharian A) and Kuchi (or Tocharian B).
 * 38) Southeast group
 * 39) Helleno-Phrygian
 * 40) Greek or Hellenic, including probably Ancient Macedonian and Aetolian.
 * 41) Phrygian (extinct)
 * 42) (Armenian, a distant offshoot of Thracian, but developing a close contact with Helleno-Phrygian)
 * 43) Aryan or Indo-Iranian
 * 44) Iranian, including:
 * 45) Extinct languages such as Cimmerian, Old Persian, Avestan, Scythian/Saka (with Sarmatian, Alanian, Parthian, Mede), Pahlavi.
 * 46) Current languages such as Modern Persian (including Tajik), Ossetian (which comes from Alanian, initially a variety of Scythian), Afghan (or Pashto), Baluchi, Kurdish, Zaza, Lur, Gorani, Mazandarani, Gilani, various languages of Pamir.
 * 47) Indo-Aryan, including Sanskrit (extinct), the various Dardic languages (including Kashmiri), Nuristani, Lahnda, Sindhi, Gujarati, Marathi, Bhili, Rajasthani, Punjabi, the various Pahari languages (including Nepalese), Hindi-Urdu, Oriya, Bengali, Bihari, Assamese, Singhalese, Divehi, Romany.
 * 48) Anatolian (extinct), including Hittite, Palaic, Luwian, Hieroglyphic Luwian, Lycian, Sidetic, Lydian, Pisidian, Carian (possibly).
 * 49) Indo-European languages with undetermined status
 * 50) Lusitanian
 * 51) Alteuropäisch (“Old European”)
 * 52) Prehellenic A (possibly belonging to the Anatolian group)
 * 53) Prehellenic B (possibly belonging to the Balto-Balkanic group)
 * 54) Hypothetically Indo-European languages
 * 55) Tartessian
 * 56) North Picenian
 * 57) Etruscan (possibly close to the Anatolian group)

Origins
The origins of the Indo-European family have been explained by various hypotheses.

Kurgan hypothesis
By far the most widely accepted explanation is the kurgan hypothesis. According to it, all Indo-European languages come from a common mother tongue, called Indo-European or Proto-Indo-European (PIE), that was spoken during the mid or late Neolithic by a people of pastoralists across the Pontic-Caspian Steppe (Ukraine, south Russia and west Kazakhstan). The best-known archeological remains of this people are the “kurgans” (a type of tumulus or burial mound).

Several variants of the kurgan hypothesis suppose that the Pontic-Caspian Steppe could be a secondary and late urheimat, preceded by an earlier urheimat located somewhere south of the Caucasus, in the Near East or east of the Caspian Sea. This location could help to explain some interesting features shared by Indo-European, by the Semitic languages (such as the root for seven: Indo-European *septm-, Semitic *sab‘-at-u-m) and by the Kartvelian languages of the South Caucasus.

According to this hypothesis, the hard lifestyle of the Indo-European pastoralists in the Pontic-Caspian Steppe led them to invade other lands repeatedly and eventually to impose their language and culture on the conquered peoples. This tendency to expand, favored by an early mastery of the horse and the wheel, would have split the Indo-European mother tongue and created new languages and cultures that kept essentially Indo-European features, but mixed with remnants of the former languages and cultures of the conquered peoples.

Among the numerous scholars who support the kurgan hypothesis (linguists, archeologists, historians, specialists in religion, anthropologists), notable syntheses have been written by archeologists Marija Gimbutas and J.P. Mallory and by historian Bernard Sergent.

Anatolian hypothesis
A minority of scholars suppose that Indo-European spread slowly along with the languages and cultures of peoples who were expanding agriculture from Anatolia. This scenario is supported especially by archeologist Colin Renfrew.

Paleolithic continuity theory
A very small minority, whose main proponent is linguist Mario Alinei, holds that the Indo-European family has existed in Europe since the Paleolithic, which implies a very old continuity. According to Alinei, many boundaries between current Indo-European languages are very old, even if some former Indo-European languages within those boundaries have been replaced several times by new Indo-European languages.
