Google improves discoverability in Maps in Indian languages with automatic transliteration

Google Maps is now more useful for Indian language users after incorporating an ensemble of learning models that automatically transliterate the names of points of interest (POIs) to 10 prominent local languages. This new feature will deliver a richer, more intuitive language experience for users, and will enable millions to issue queries in their own language and find information on Maps such as restaurants, petrol pumps, hospitals, grocery stores, banks, bus stops, train stations, and numerous other services. By using automatic transliteration from the Latin script (English) name of a POI, Maps now displays their names in  Hindi, Bangla, Marathi, Telugu, Tamil, Gujarati, Kannada, Malayalam, Punjabi, and Odia. We recently launched the language picker on Google Maps (via Settings) in India first, and this will ensure a seamless user experience in the language of their choice.

Speaking of this feature, Cibu Johny, Software Engineer, Google Maps said, “Nearly three-quarters of Indians interact with the web primarily using local languages rather than English, and this number is only growing. To make Google Maps as helpful as possible for millions of Indian language users, we’ve introduced an updated automatic transliteration system that enables us to deliver more accurate results when users search for POIs in their preferred language. In a country where names of establishments are written with words from multiple languages and acronyms, phonetically mapping these words into their native language  will help us more accurately surface the results that local language users are looking for.”

Explaining how the feature works, he added, “Common English words are frequently used in names of places in India, even when written in the native script. How the name is written in these scripts is largely driven by its pronunciation. For example, एनआईटी from the acronym NIT is pronounced ‘en-aye-tee’, not as the English word ‘nit’. Therefore by understanding that NIT is a common acronym from the region, Maps can derive the correct transliteration. In the past when Maps could not understand the context of एनआईटी, it would instead show a related entity that might be farther away from the user. With this development, we can find the desired result from the local language query. Additionally, users can see the POI names in their local language, even when they do not originally have that information.”

Using an ensemble of learned models, various transliteration dictionaries, and a module for acronyms — with a very large sample of names from on-line text — Google Maps has already added names in local languages to millions of POIs across India, increasing the quality and the coverage, which became nearly twenty-fold in some languages.

Language  Coverage Improvement Quality Improvement
Hindi 3.2x 1.8x
Bengali 19x 3.3x
Marathi 19x 2.9x
Telugu 3.9x 2.6x
Tamil 19x 3.6x
Gujarati 19x 2.5x
Kannada 24x 2.3x
Malayalam 24x 1.7x
Odia 960x *
Punjabi 24x *

* Unknown / No Baseline.

Leave a Reply

Your email address will not be published. Required fields are marked *