Or press ESC to close.

The Mountains and Hills of South Asia’s Linguistic Landscape

29 Jun 2026
The Mountains and Hills of South Asia’s Linguistic Landscape

South Asia is one of the most intensely multilingual regions on the planet. Spanning over five million square kilometers and home to nearly two billion people, which is roughly a quarter of humanity, the subcontinent is a brilliant tapestry woven from five distinct language families: Indo-European, Iranian, Dravidian, Austro-Asiatic (Munda), and Tibeto-Burman. Around eighty percent of the population speaks an Indo-European tongue, meaning these languages alone are spoken by about one-fifth of the human race.

Yet, trying to pinpoint exactly how many languages and dialects exist across this landscape is notoriously difficult. The blur between where a dialect ends and a standalone language begins leaves even modern census takers guessing. For instance, the 1991 Census of India consolidated more than 10,400 raw language names collected on the ground into 114 official languages. Just a decade later, the 2001 Census identified 122 languages from a vastly different pool of roughly 6,600 names.

This is not a new dilemma. A century ago, a monumental colonial project ran headfirst into the exact same problem.

The Man Who Tried to Map the Subcontinent's Speech

Between 1901 and 1928, the Linguistic Survey of India (LSI) was published in Calcutta. Spanning twenty-one massive volumes, it remains one of the most ambitious linguistic undertakings in human history. The man behind it was Sir George Abraham Grierson (1851 to 1941), an Irish-born civil servant who had studied mathematics at Trinity College Dublin before developing a lifelong obsession with the languages and literatures of India.

When Grierson compiled his introductory volume in 1927, he admitted the sheer scale of Indian speech was dazzling. His informants initially threw 231 languages and 774 dialects at him. Realizing many were the same tongues hidden under different local names, Grierson conservatively pared the list down to 179 languages and 544 dialects. By comparison, the official 1921 Census counted 188 languages, leaving the number of dialects as an unsolved mystery.

To explain the headache of sorting these out, Grierson famously compared the process to distinguishing a mountain from a hill:

One has no hesitation in saying that Everest is a mountain, and Holborn Hill [in London], a hill, but between these two the dividing line cannot be accurately drawn. Moreover, we often talk of the 'Darjiling Hills' which are over 7,500 feet high, while everyone calls Snowden [in Wales] with its poor 3,500 feet, a mountain.

Linguistic identity, in other words, is often a matter of perspective, scale, and politics rather than objective science.

A Mixed Legacy: Dividers or Preservers?

Weighing heavily on the political landscape, Grierson’s Survey was far more than an academic exercise. It addressed two very different audiences: British colonial officers looking for a practical guide to rule their districts, and global scholars trying to understand the evolution of human speech.

Because the project attempted to draw hard lines on a fluid map, its legacy is deeply complicated. The modern nations that succeeded the British Raj still struggle with the very questions Grierson’s work brought to light: Where do you draw the borders of a language? How do you balance regional mother tongues against a singular national language?

1. Weaponized for Local Identity

Throughout the twentieth century, local communities used the Survey to fight for their own recognition. In the 1920s and 30s, Assamiya activists wrote frequently to Grierson, using his own findings to prove that Assamese was an independent language with its own rich literary history, rather than just a dialect of Bengali. Grierson had explicitly written that Assamese had "won for itself the right of a separate, independent existence," a quote that became a political shield for the community. Today, activists in Pakistan demanding a separate Siraiki province still point to the Survey to validate their linguistic distinctiveness.

2. The Communal Divide

On the flip side, Grierson viewed Indian society through a heavily fractured, religious lens. This was a worldview likely sharpened by his background in the Protestant Anglo-Irish elite during a time of bitter sectarian strife back home in Ireland.

This bias heavily warped his scholarship. When compiling his Dictionary of the Kashmiri Language, Grierson intentionally stripped away Persian influences, focusing almost exclusively on upper-caste Hindu texts and Kashmiri Shaivism, largely ignoring the region’s syncretic, Muslim-majority reality.

More damagingly, his views solidified the split between Hindi and Urdu. While treating Hindi as a native, Hindu language, he framed Urdu as an artificial, foreign, and Semitic tongue imported by Muslim rulers. His personal letters reveal a highly sensationalized, dark view of Islamic history in the subcontinent, using terms like "tyranny" and "persecution" to describe its impact. By ignoring the deeply shared, blended heritage of Hindustani, where centuries of poetry and everyday speech thoroughly blurred the lines between the two, the Survey inadvertently fed into the communal politics that eventually fractured the subcontinent.

The Human Side of the Data

It would be a mistake, however, to dismiss the Linguistic Survey of India as a mere tool of colonial divide and rule policies. The project was far too vast, messy, and human for such a simple label.

Crucially, the Survey was not a solo effort. It relied entirely on thousands of Indian scholars, local officials, and everyday speakers who gathered data and challenged its categories. Grierson was remarkably open about this, frequently exchanging books and maintaining warm friendships with Indian intellectuals. He wrote the preface to the groundbreaking doctoral thesis of legendary Bengali linguist Suniti Kumar Chatterji, titled The Origin and Development of the Bengali Language (1927), and local academic bodies like the Bangiya Sahitya Parishad honored Grierson for his genuine contributions to the preservation of local literatures.

Furthermore, the project took an immense personal toll on Grierson. Running a massive operation with virtually no financial backing from a stingy colonial government, he often had to bully overworked local bureaucrats into sending him field data. Having retired to Surrey, England, in 1903 for health reasons, he spent twenty-five years managing this endless jigsaw puzzle from across the ocean without a salary. The stress triggered severe bouts of depression and temporary blindness.

Facing the Indefinite

Perhaps the most human aspect of the Survey is its self-awareness. Throughout its pages, Grierson freely admitted his own ignorance, pointing out that his linguistic maps were merely "conventional methods of showing definitely a state of things which is in essence indefinite."

Today, the Linguistic Survey of India remains an indispensable reference. Its skeletal grammars, vocabulary lists, and early gramophone recordings offer an invaluable freeze-frame of South Asian speech at the dawn of the twentieth century. While modern readers must approach its political and religious biases with a critical eye, it is impossible to deny the enduring value of the work, which stands as a monument built by a man who looked at the dazzling, overwhelming ocean of South Asian speech and, despite the fog, tried his best to map it.


Latest Articles

Latest News