ChatGPT vs. Google Translate Comparison: Which AI Chatbot is The Best Language Translator

In a head-to-head comparison between language translation services, ChatGPT has emerged as a top contender, surpassing Google Translate, Bing AI, and Google Bard. Fluent speakers of seven non-English languages participated in a blind test to evaluate the AI chatbots' translations against those of Google Translate.

While Google Translate offers quick and easy translations to over 100 languages, users are well aware of its limitations and room for improvement. Large language models like ChatGPT have the potential to revolutionize language translation, drawing from vast amounts of training data and real-time user feedback to generate coherent and human-like sentences in multiple languages.

To determine whether ChatGPT could replace Google Translate as the go-to translation service for various needs such as travel, work, and cross-border communication, a thorough evaluation was conducted by PCMag alongside Microsoft Bing and Google Bard.

Bilingual speakers were given English paragraphs to translate into their respective languages using Google Translate, ChatGPT, and Microsoft Bing. After completing the exercise, the participants were informed about which service produced each translation.

The languages tested included Polish, French, Korean, Spanish, Arabic, Tagalog, and Amharic. Although this study is not comprehensive and further rigorous testing is necessary for a statistically significant evaluation, the results provided valuable insights into the performance of AI models.

The selected paragraphs were carefully crafted to expose the limitations of each translation service. The first paragraph contained colloquial phrases like "blow off steam" and "Cheers!" as well as measurements in USD and miles. The second paragraph, sent only to half of the participants, included slang terms such as "hooligans" and "pop champagne."

Surprisingly, the AI chatbots—ChatGPT, Google Bard, and Microsoft Bing—outperformed Google Translate in all 12 examples sent to the participants. ChatGPT received the highest rankings overall.

According to Ana Romero, who ranked the Spanish translations, ChatGPT provided a translation that closely resembled a normal conversation. The level of formality in the key questions was consistent and the translation of "to blow off steam" was accurate.

While Google Bard was rarely successful and even suggested using Google Translate, the few times it did work, participants ranked its translations higher than Google Translate. However, all the chatbots, including ChatGPT, fell short in accurately translating currency and distance measurements, displaying inconsistencies across languages and services.

One notable weakness of Google Translate was its tendency to provide literal translations, missing out on contextual nuances. In contrast, the chatbots excelled in capturing subtleties and context. The models with larger amounts of training data and more user interactions in a specific language demonstrated a better understanding of cultural phrases and produced more appropriate translations.

ChatGPT's success can be attributed to its reinforcement learning with human feedback (RLHF) approach, which allows for the selection of culturally appropriate responses based on user preferences. On the other hand, Google Bard and Google Translate employ different underlying technologies, with Google Translate being specifically optimized for translation tasks.

Although the chatbots offered improved translations compared to Google Translate, they were not flawless and occasionally produced awkward or inaccurate word choices. Nevertheless, they demonstrated potential for surpassing Google Translate's capabilities with further training and more data in each language.

In the test, Google Translate performed exceptionally well in Tagalog and Amharic, two languages with a relatively small population of speakers. For languages with low resources or insufficient data, the chatbots struggled to provide nuanced translations and often resorted to literal interpretations, contrary to their performance in other languages.

While ChatGPT presents a viable alternative to Google Translate for language translation needs, it is important to note that AI models are not yet a complete substitute for human translators. As AI continues to evolve, the challenge lies in accurately measuring its performance and ensuring effective translation across diverse languages.

Google is also experimenting with search generative experiences (SGE), which offer ChatGPT-style answers to queries. However, these experiments are ongoing, and Google has not provided any information on the possibility of replacing Google Translate in the future.

As the capabilities of AI translation models progress, the ultimate goal is to develop robust and culturally sensitive models that can make the web more accessible on a global scale. The future of language translation hinges on the continual improvement of AI models and their ability to provide accurate and natural translations across different languages.



Read next: Could AI Make the Loneliness Epidemic Worse?
Previous Post Next Post