Swaveda
Geneticsancient DNApopulation history

Why Harappan DNA Is Still Missing: The Preservation Problem That Leaves the Indus Valley Blank

No verified ancient DNA from Indus Valley sites has been published. The gap reflects bone preservation failures in tropical soil that make one of history's largest urban cultures a genetic blank.

Dr. Anil Patel for SwavedaMay 23, 2026

Photo by Boris Hamer on Pexels

Why Harappan DNA Is Still Missing: The Preservation Problem That Leaves the Indus Valley Blank

Ancient DNA has rewritten Indian prehistory over the past decade. Researchers have sequenced genomes from the Rakhigarhi burial site dated to around 2800 BCE, from Iron Age populations across the subcontinent, and from individuals who lived in the same river valleys as the Harappans—just thousands of years earlier or later.

But from the Indus Valley Civilization itself, during its urban peak between roughly 2600 and 1900 BCE, we have nothing verified and published.

The gap is not a sampling problem. Archaeologists have excavated Harappan sites for nearly a century. Mohenjo-daro, Harappa, Dholavira, Lothal, and dozens of smaller settlements have yielded pottery, seals, urban planning, drainage systems, and skeletal remains. Teams have attempted DNA extraction. The Rakhigarhi samples, analyzed by Vasant Shinde and collaborators including scientists from Harvard's Reich lab, came from a cemetery at the periphery of a Harappan-period settlement. That study, published in 2019, reported one individual's genome. Even that sample came from the mature Harappan period's cultural horizon but has been debated regarding its direct association with urban Harappan material culture.

Since then, no additional verified genomes from the urban core of the civilization have entered the published record.

The problem is preservation. DNA degrades. The double helix breaks into shorter and shorter fragments over time, and environmental conditions determine how fast that happens. Cold and dry environments slow the process. The permafrost of Siberia and the cool caves of Europe have yielded Neanderthal genomes tens of thousands of years old. High-altitude sites in the Andes and the Himalayas preserve DNA across millennia.

The Indus Valley is none of these things.

The region experiences high heat and monsoonal humidity. Soil pH in much of the alluvial plains skews neutral to alkaline, and seasonal flooding introduces microbial activity that accelerates organic breakdown. Bones become porous, then crumble. Collagen—the protein that encases DNA within bone—degrades faster in warm, wet conditions. By the time an archaeological team reaches a burial that has spent 4,000 years in Sindh or Gujarat soil, the skeleton may look intact, but at the molecular level, it is often already gone.

Researchers have documented this pattern across South Asia. A 2016 overview by Ron Pinhasi and collaborators noted that ancient DNA recovery in the Indian subcontinent faces "taphonomic challenges" that do not apply in temperate zones. Taphonomy is the study of what happens to organic remains after death. In this case, the term is a polite way of saying the bones rot.

The Rakhigarhi sample succeeded partly because of location. The site lies in Haryana, where the climate is somewhat less humid than the Indus river plains. The burial context—shallow graves in a dry sediment layer—may have limited water exposure. Even so, the DNA yield was low, and the sequencing required significant technical effort. The researchers did not publish additional individuals from the same cemetery, which suggests the other samples did not yield usable data.

Other Harappan sites face worse odds. Mohenjo-daro, in present-day Sindh, sits near the Indus River and experiences regular flooding. Dholavira, in the Rann of Kutch, alternates between intense heat and seasonal waterlogging. Lothal, near the Gujarat coast, is humid year-round. These are not environments where collagen survives 4,000 years.

Comparisons with other regions clarify the challenge. Ancient DNA from Bronze Age Anatolia, the Pontic steppe, and the Iranian plateau has reshaped our understanding of Indo-European migrations. Those samples come from burials in semi-arid zones, often in kurgans or stone-lined graves that limit water infiltration. The steppe itself is cold and dry. The Indus Valley is neither.

Researchers have attempted DNA extraction from Harappan sites beyond Rakhigarhi, but results have not appeared in peer-reviewed journals. This is not unusual. Failed extractions are common in ancient DNA work and rarely get published. The absence of data is itself data, but it does not generate papers.

Some have asked whether the lack of Harappan DNA is deliberate—whether political or cultural considerations have delayed publication. Evidence does not support this. The Rakhigarhi study proceeded despite intense public interest and nationalist pressure from multiple directions. Genetic research on Indian populations continues, including work on later periods that touches politically sensitive questions about migration and ancestry. The simpler explanation is that the samples do not exist because the bones do not preserve DNA.

This leaves the Indus Valley Civilization as a genetic blank. We know its trade networks reached Mesopotamia. We know it had standardized weights, a script no one has deciphered, and urban planning that rivals any contemporary society. We do not know, from direct genetic evidence, how its people related to earlier South Asian hunter-gatherers, to the pastoralists of the Central Asian steppe, or to later Iron Age populations. We rely on individuals from the margins—like the Rakhigarhi burial—and on populations that came before or after.

The gap matters because genetic data anchors debates that would otherwise remain speculative. Did the Harappans speak a Dravidian language, an early Indo-Aryan dialect, or something unrelated to any modern family? Did they share ancestry with the Bactria-Margiana Archaeological Complex to the northwest? Did their population decline after 1900 BCE result from migration, climate shift, or social reorganization? Genetics cannot answer these questions alone, but it can narrow the possibilities. Without it, we guess.

Future methods may help. Techniques for extracting DNA from petrous bone—the dense part of the skull's temporal bone—have improved recovery rates in other difficult contexts. Sequencing methods that target heavily fragmented DNA have pushed the limits of what counts as viable material. If Harappan samples exist in museum or excavation collections, reanalysis may eventually succeed.

But the underlying problem remains. The Indus Valley Civilization flourished in one of the worst environments on Earth for DNA preservation. Until that changes, the genetic record will stay blank.

ShareXBlueskyLinkedInEmail