Origin of Pashtuns
Abstract
A Pashtun (also spelled Pakhtun) can be defined from ethnic, linguistic, and cultural perspectives:
- Ethnically:
Pashtuns are an Indo-Iranian ethnic group indigenous to the region historically known as Afghanistan and the adjoining areas of western Pakistan (especially Khyber Pakhtunkhwa, northern Balochistan, and parts of Punjab). They are regarded as the native inhabitants of southeastern Afghanistan, particularly around Paktia, Paktika, and Ghazni, from where they expanded eastward and southward. - Linguistically:
They speak Pashto, an Eastern Iranian language of the Indo-European family. Pashto has several dialects, broadly classified into southern (Kandahari) and northern (Yousafzai) groups. - Culturally:
Pashtun identity is deeply rooted in Pashtunwali, an ancient code of honour and conduct emphasizing hospitality (melmastia), revenge (badal), courage (tura), and protection of women (nang). This code predates Islam but has been integrated into Islamic ethics over time. - Genetically and Historically:
Genetic studies show a blend of ancient South-Central Asian ancestry with Iranic and steppe Indo-Iranian elements, reflecting their position as a bridge between Central and South Asia. Historically, they are linked to ancient peoples of Arachosia and Gandhara, with possible descent from the ancient Paktuēs or other Iranic tribes recorded by Greek historians.
In essence,
A Pashtun is an Iranic-speaking ethnic group native to the region straddling modern Afghanistan and Pakistan, united by the Pashto language and the traditional moral code of Pashtunwali.
1. Paktia: The Homeland
In his Histories (Book 3, c. 440 BCE), the Greek historian Herodotus mentions a tribe called the Paktues (Πακτυες) living in the eastern parts of the Achaemenid Empire.
- He places them in the 20th satrapy, which included the Gandarians, Sattagydae, Dadicae, and Aparytae — regions roughly corresponding to eastern Afghanistan and northwestern Pakistan today.
- Herodotus specifically refers to a region called Paktia (or Pactyica).
This Paktia (or Paktika) is widely identified with the area around the Khyber Pass, Nangarhar, Paktia, and Paktika provinces of modern Afghanistan.
2. Linguistic and Geographical Correspondence
The linguistic resemblance between Paktues (Greek) → Pakti/Paxtūn (Pashto) → Pashtun is striking:
- Greek “-es” was a common plural suffix.
- “Paktues” could represent the same ethnonym as Paxtūn, with the Greek transliteration softening the “x” (kh) sound.
- The region “Paktia/Paktika” still exists and is part of the traditional Pashtun homeland — suggesting ethnic continuity.
3. Archaeological and Historical Correlations
- The Achaemenid inscriptions (6th–4th centuries BCE) refer to nearby satrapies like Arachosia, Bactria, Gandhara, and Sattagydia, matching the same geographic belt as the Pashtun tribal areas.
- The Pashto language is an Eastern Iranian language — directly descended from the same Iranian branch spoken in this region since Avestan times.
- This supports the view that Pashtuns are an indigenous Eastern Iranian people of the ancient Paktia region, rather than late migrants.
4. Competing Theories
While the Paktues–Pashtun connection is strong, some alternative theories have been proposed:
- Israelite theory: claiming descent from the lost tribes of Israel (mainly a medieval and genealogical tradition, not supported by archaeology).
- Scythian–Saka origin: connecting Pashtuns to Eastern Iranian nomadic tribes that settled in the area between 1000–500 BCE.
- Mixed heritage theory: suggesting that modern Pashtuns are a synthesis of the ancient Paktues, Saka, and Indo-Iranian settlers.
5. Scholarly Consensus
Most modern historians and linguists agree that:
- The Pashtuns are descendants of ancient Eastern Iranian tribes native to the region between the Helmand and Indus rivers.
- The Paktues of Herodotus likely represent one of these tribes — making them the earliest known reference to the Pashtun people in recorded history.
So, the claim that Pashtuns descend from the Paktues mentioned by Herodotus is plausible and well-supported linguistically, geographically, and historically.
In summary
Evidence Type Observation Implication Historical (Herodotus) Mentions Paktues in Paktia Early Greek record of Pashtun-like people Geographical Region corresponds to modern Paktia–Khyber belt Same homeland continuity Linguistic Paktues ≈ Paxtūn / Pashtun Direct ethnonymic link Cultural-Linguistic Continuity Eastern Iranian dialects persisted Pashtuns are native Eastern Iranians
Here’s a summary of what current genetic research reveals about the Pashtuns (also spelled Pashtūn, Pathān) — their affinities, genetic make-up, how they relate to neighbouring groups — and also some caveats. The genetics don’t give a single neat origin story, but they do help clarify major components of their ancestry.
What the genetic evidence shows
Paternal (Y-chromosome) lineages
- A considerable proportion of Pashtuns carry the Y-chromosome haplogroup R1a (in some studies ~50-65 % or more) among men from Afghanistan-Pakistan. (Wikipedia)
- One important sub-clade is R1a-Z2125 (sometimes written Z2123) which appears among Pashtuns and also among Tajiks, Uzbeks, and other Central Asian groups. (Wikipedia)
- Other Y-haplogroups among Pashtuns: some presence of Q‑M242 in northern Pashtuns in Afghanistan; also haplogroups L, G, J2 etc at lower frequencies. (mr.kuchewar.com)
- These data suggest that the male lineages of Pashtuns reflect both: (a) deep Indo-Iranian (or Eastern Iranian) heritage, and (b) some gene flow/migration from neighbouring regions (South Asia, Central Asia).
Maternal (mtDNA) and autosomal ancestries
- Maternal lineages in Pashtun groups show a mix: many West Eurasian haplogroups (H, U, J, K, etc) but also South Asian lineages (M, R) in appreciable proportions. For example a study of four tribes in Khyber Pakhtunkhwa found ~28 % South Asian haplogroup M and a considerable share of West Eurasian lineages. (PubMed)
- Autosomal (whole‐genome) data and broader population‐structure studies show that Pashtuns form part of a “Afghan/Indian–type” cluster in some analyses, showing affinity to populations in northern India, Pakistan, Afghanistan, while being distinct from South Indian Dravidian groups. (LiquiSearch)
Genetic affinities and relations with other groups
- Pashtuns show close genetic affinity with neighbouring Iranian-language speaking and Indo-Iranian peoples (e.g., Tajiks) more than with Turkic groups (e.g., Uzbeks) or with groups with strong East Asian ancestry (e.g., some Hazaras). For instance: “Pashtun, Tajik, North Indian, West Indian populations” form a genetic cluster in one study. (LiquiSearch)
- There is evidence of gene flow from India/subcontinent into Afghanistan, reflected especially in Pashtuns and Tajiks: for example, Indian-lineages such as L-M20, H-M69, R2a-M124 appear among Pashtuns in some analyses. (a.osmarks.net)
- Some more specific studies found a close genetic link between the Indian Gujjar population and Pashtuns from Afghanistan/Pakistan. (Telegraph India)
What does this mean for the “origin” of Pashtuns?

1. Genetic Evidence
The chart reflects that Pashtuns in Afghanistan carry slightly higher proportions of R1a-Z2125 and J2 haplogroups, both of which have strong Indo-Iranian and Central Asian connections.
- These markers are more ancient and widespread in Afghan Pashtuns than in KPK tribes, suggesting that the core Pashtun gene pool formed west of the Khyber Pass.
- KPK Pashtuns show more South Asian haplogroups (like L and H), indicating later eastward mixing as Pashtun tribes expanded into the Indus frontier.
2. Linguistic Evidence
- Pashto belongs to the Eastern Iranian branch of Indo-Iranian languages — linguistically rooted in Bactria and Arachosia (ancient regions of modern Afghanistan).
- The oldest dialects of Pashto (especially those in Kandahar, Helmand, and Ghazni) preserve archaic features missing in KPK dialects — a clue that Pashto evolved first in Afghanistan and later spread eastward.
3. Historical and Archaeological Context
- The Oxus (Bactria-Margiana) Civilization and Achaemenid satrapies of Arachosia and Gandhara show that eastern Afghanistan has been inhabited by Iranian-speaking peoples for millennia.
- Ancient references (e.g., Herodotus’s Paktues of Paktika) also describe tribes living around present-day southeastern Afghanistan and the Khyber region, implying a west-to-east movement, not the reverse.
4. Cultural Continuity
Tribal names, social codes (Pashtunwali), and genealogical traditions are more conservative and uniform in Afghanistan’s Pashtun belt than in KPK — another sign of cultural origin there, with later adaptation and diversity eastward.
The genetic and linguistic data together make it clear that Afghanistan — particularly the regions of Kandahar, Ghazni, and Paktika — is the original homeland of the Pashtuns, from where they later expanded eastward through the Khyber Pass into KPK and northern Pakistan.
Origin of Pashto
1. Linguistic Classification
Pashto belongs to the Eastern Iranian branch of the Indo-Iranian family, which itself is part of the larger Indo-European language family.
Family Tree (simplified):
Indo-European
↳ Indo-Iranian
↳ Iranian languages
↳ Eastern Iranian group → Pashto, Pamiri languages (Shughni, Wakhi), Ossetian, Yaghnobi
↳ Western Iranian group → Persian (Farsi, Dari, Tajik), Kurdish, Baluchi, Luri
So linguistically, Pashto is a sister language of Persian, not a dialect of it.
2. Geographic and Historical Origin
- Pashto originated in the mountainous region between the Helmand River and the Indus River, roughly covering southern and eastern Afghanistan and northwestern Pakistan.
- This is the same area that the ancient Greeks called Paktia or Pactyica (Herodotus’s Paktues).
- It developed from Eastern Iranian dialects that were already spoken here during the Achaemenid and Avestan periods (6th–4th centuries BCE).
In short, Pashto is native to Afghanistan and adjacent Pakistan — not an imported or recent language.
3. Earliest Evidence
The earliest written references to Pashto are relatively late:
- The oldest Pashto poetry appears in the 7th–8th centuries CE, attributed to Amir Kror Suri (a legendary figure).
- By the 10th–11th centuries CE, Pashto was a well-developed literary language used by the Ghaznavids and Ghorids.
- The oldest preserved manuscripts are from the 16th–17th centuries, including the works of Bayazid Ansari (Pir Roshan) and Khushhal Khan Khattak.
But linguistic reconstruction shows that Pashto must have evolved from much older Iranian dialects spoken in the region since at least the late 2nd millennium BCE.
4. Linguistic Characteristics
Pashto preserves archaic Iranian features and adds unique innovations:
| Feature | Description | Example |
|---|---|---|
| Archaic Sounds | Retains ancient “š” and “ž” sounds lost in Persian | Pashto “sho” (what) vs Persian “che” |
| Retroflex Consonants | Likely influenced by Indo-Aryan neighbors | ṭ, ḍ, ṇ |
| Vocabulary | ~70% Iranian origin, ~15% Indo-Aryan (Sanskrit/Prakrit), ~10% Arabic & Persian loans | – |
| Grammar | Preserves ergative structure (like Sanskrit and Old Iranian) | “Zma khalaka raaghlee di” — “My people have come” (past ergative) |
Thus, Pashto is deeply Iranian in structure, but shows South Asian contact influence due to centuries of coexistence with Indo-Aryan speakers.
5. Allied Languages (Linguistic Alliances)
Pashto’s closest relatives within the Eastern Iranian group are:
| Related Language | Modern Region | Relationship |
|---|---|---|
| Wakhi | Wakhan Corridor (Afghanistan/Tajikistan) | Shares old Iranian lexicon |
| Shughni | Badakhshan (Afghanistan/Tajikistan) | Closely related in phonology |
| Yidgha & Munji | Nuristan–Badakhshan area | Transitional dialects toward Pashto |
| Ossetian | Caucasus (North Ossetia) | Distant cousin preserving ancient Iranian forms |
| Yaghnobi | Tajikistan | Descendant of ancient Sogdian; shares common ancestry |
This shows that Pashto’s alliances are with northeastern Iranian mountain languages, not with Persian or Baluchi — though all share a common proto-Iranian ancestor.
6. Cultural and Ethnic Associations
- Pashto is the ethnic and cultural language of the Pashtuns (Afghans).
- It is deeply tied to Pashtunwali, the traditional code of honor.
- Its oral poetry, proverbs, and tribal ballads preserve many pre-Islamic and early Islamic motifs.
- It is the official language of Afghanistan, alongside Dari Persian.
7. Summary
| Aspect | Details |
|---|---|
| Language Family | Indo-European → Indo-Iranian → Iranian → Eastern Iranian |
| Origin Region | Southern & Eastern Afghanistan, Northwestern Pakistan |
| Ancestral Root | Proto–Eastern Iranian dialects (Avestan-related) |
| Earliest Record | 7th–8th century CE (oral), 16th century (written) |
| Closest Relatives | Munji, Yidgha, Shughni, Wakhi, Ossetian |
| Influences | Indo-Aryan (phonetic), Persian & Arabic (vocabulary) |
| Cultural Identity | Language of the Pashtun nation; code of Pashtunwali |
In essence
Pashto is an indigenous Eastern Iranian language, born in the Afghan highlands, sharing ancestry with ancient Avestan and Sogdian tongues.
Its alliances lie firmly with other Eastern Iranian mountain languages, and its development reflects a continuous linguistic tradition in the region for over 3,000 years.
Short History of Afghanistan
That’s an excellent and nuanced historical question — and one that scholars still debate, because “Afghanistan” as a political unit is quite recent, while its land has hosted many ancient peoples.
Here’s a concise historical breakdown:
1. Prehistoric and Protohistoric Inhabitants
Archaeological finds from sites like Mundigak (Kandahar) and Shortugai (Takhar) show that the earliest settled peoples of Afghanistan date back to the 3rd millennium BCE, part of the Indus Valley and Bactria–Margiana Civilizations.
- These were proto-Iranian and proto-Dravidian peoples, sedentary agriculturalists with early urban centers.
- The Helmand culture and Oxus civilization (BMAC) formed the earliest cultural foundations of the region.
2. Indo-Iranian and Avestan Period (c. 2000–1000 BCE)
The next major population movement was that of Indo-Iranian (Aryan) tribes migrating from Central Asia into what is now Afghanistan and Iran.
- Eastern Iranian tribes (the ancestors of modern Pashtuns, Tajiks, and Pamiris) settled in the highlands.
- The Avestan texts (Zoroastrian scripture) mention regions like Bakhdi (Bactria), Haroyu (Herat), and Arachosia (Kandahar) — all within modern Afghanistan.
Thus, the Eastern Iranian peoples are the oldest continuously traceable inhabitants of Afghanistan.
3. Classical Antiquity: Iranian Kingdoms and Hellenistic Era
By the 1st millennium BCE:
- Bactrians, Sogdians, and Arachosians inhabited northern and southern Afghanistan.
- The Achaemenid Empire (6th–4th century BCE) ruled them as satrapies.
- After Alexander’s conquest, Greco-Bactrians and Indo-Greeks merged with the local Iranian populations.
4. Early Islamic and Medieval Period
When Islam spread into the region (7th–8th centuries CE):
- The core population remained Eastern Iranian.
- New Turkic groups (e.g., Ghaznavids, Karakhanids) entered later and became politically dominant, but not demographically majority.
5. Modern Ethnic Descendants
Today’s major Afghan ethnic groups trace back to these ancient layers:
| Ethnic Group | Historical Ancestry | Linguistic Family | Remarks |
|---|---|---|---|
| Pashtuns | Eastern Iranian tribes (Avestan Arachosians, Sakas) | Indo-Iranian | Likely the oldest indigenous group of southern & eastern Afghanistan. |
| Tajiks | Eastern Iranian (Bactrians, Sogdians) | Indo-Iranian | Indigenous to northern and western Afghanistan. |
| Hazaras | Mixed Iranian–Turko–Mongol | Indo-Iranian (Dari) | Later admixture during Mongol period but rooted in local highlands. |
| Uzbeks & Turkmen | Turkic migrants from Central Asia (10th–15th c.) | Turkic | Not original but long-established. |
| Nuristanis | Descendants of pre-Islamic Indo-Aryans | Indo-Iranian (distinct branch) | Possibly preserve pre-Zoroastrian Aryan traditions |
The original inhabitants of Afghanistan were the Eastern Iranian peoples, ancestors of today’s Pashtuns, Tajiks, and Nuristanis — all of whom predate later Turkic and Mongol migrations.
They are direct descendants of the Indo-Iranian (Aryan) settlers who absorbed even older Oxus/BMAC populations.
The Oxus Civilization
The Oxus Civilization—also known as the Bactria–Margiana Archaeological Complex (BMAC)—is one of the most remarkable but relatively less-known Bronze Age civilizations of Central and South Asia. It represents a crucial bridge between the early urban cultures of Mesopotamia, the Indus Valley, and the Iranian plateau, as well as the early Indo-Iranian world. It was originally an Iranian civilization who laid a great influence on Persia, Central Asia, and South Asia. Balkh was the center and capital city of the Oxus River Civilization, located in the northern part of modern-day Afghanistan. The Persian kings adopted Zoroastrianism as their state religion, so they propagated this religion of fire throughout the state. (Al-Muntazam)

1. Discovery and Naming
The Oxus Civilization was first identified in the 1970s by the Soviet archaeologist Viktor Sarianidi, who excavated sites in modern southern Turkmenistan, northern Afghanistan, eastern Iran, and western Uzbekistan.
- The name “Oxus Civilization” comes from the Oxus River (Amu Darya), which flows through the heart of its territory.
- The archaeological term “Bactria–Margiana Archaeological Complex (BMAC)” refers to its two core regions:
- Bactria – Northern Afghanistan
- Margiana – The Merv oasis of Turkmenistan

2. Chronology
- Period: ca. 2300 – 1700 BCE
- Contemporaneous with:
- Mesopotamia’s Akkadian Empire
- Indus Valley Civilization (Mohenjo-Daro, Harappa)
- Elamite culture in southwestern Iran
3. Geographical Extent
The civilization stretched from:
- Northern Afghanistan (Balkh, Shortugai, Dashly)
- Across southern Turkmenistan (Gonur Tepe, Togolok)
- Into northeastern Iran (Shahr-i Sokhta, Tepe Hissar)
These fertile oases were surrounded by deserts and mountains, yet connected by trade routes that would later form part of the Silk Road.
4. Urban Planning and Architecture
Excavations reveal advanced urban characteristics:
- Fortified citadels and walled cities (e.g., Gonur Tepe)
- Central temples and palatial compounds
- Irrigation canals and planned streets
- Granaries and craft workshops
Their architecture shows a clear sense of urban organization, security consciousness, and religious centralization—suggesting powerful priest-kings or administrative elites.
5. Material Culture
The Oxus civilization is renowned for its:
- Metallurgy: Skilled in bronze, gold, and silver work.
- Stone and lapis lazuli carving: Especially in seals, amulets, and ritual objects.
- Pottery: Fine wheel-made ceramics, often with geometric designs.
- Figurines: Representations of deities and mythic beings, often female, suggesting fertility cults.
- Trade items: Lapis lazuli, carnelian, tin, and precious metals.
6. Religion and Iconography
- Temples at Gonur Depe and Togolok 21 contain altars, fire installations, and ritual vessels.
- Some scholars (including Sarianidi) saw in them proto-Zoroastrian or Indo-Iranian religious practices—with possible early fire altars and haoma (soma) preparation evidence.
- However, this interpretation remains debated, as no written records exist.
7. Language and Writing
No written language from the Oxus Civilization has been found.
It likely used a proto-Iranian or pre-Indo-Iranian tongue, given its geographic and cultural ties to later Iranian peoples.
Its artistic motifs influenced the later Avestan and Indo-Aryan symbolisms.
8. Economy and Trade
- Based on irrigated agriculture (wheat, barley) and animal husbandry.
- Engaged in long-distance trade with:
- Mesopotamia (via Elam)
- Indus Valley Civilization (Shortugai on the Oxus was a Harappan outpost)
- Iranian Plateau and possibly China (via tin and jade routes)
- The Shortugai site in northern Afghanistan was strategically located to control lapis lazuli trade from Badakhshan.
9. Decline
The Oxus Civilization declined around 1700 BCE, possibly due to:
- Climate change and desertification are reducing water supplies.
- Invasions or migrations by Indo-Iranian nomads from the north.
- Integration into broader regional networks, leading to cultural blending rather than destruction.
10. Legacy
The Oxus Civilization:
- Represents the earliest known urban culture of Afghanistan and Central Asia.
- Formed the cultural matrix out of which later Eastern Iranian civilizations (Bactria, Sogdia, and Arachosia) emerged.
- Its spiritual and artistic traditions likely influenced early Indo-Iranian religious thought, contributing to what later became Zoroastrianism and Vedic culture.
In Summary
| Feature | Description |
|---|---|
| Period | 2300–1700 BCE |
| Core Regions | Bactria (Afghanistan), Margiana (Turkmenistan) |
| Major Sites | Gonur Tepe, Togolok 21, Shortugai, Dashly-3 |
| Economy | Irrigated farming, metallurgy, lapis trade |
| Language | Undeciphered (likely proto-Iranian) |
| Religion | Fire cults, ritual altars, fertility worship |
| Legacy | Cultural and spiritual ancestor of Eastern Iranian peoples |