Addressing The Idiom Challenge in Machine Translation: A Review Focused on Low-Resource Languages

Authors

  • Manjot Kaur
  • Jasvir Kaur
  • Jasmin Kaur Gahlot
  • Prof. Palak Sood

DOI:

https://doi.org/10.5281/zenodo.18431088

Keywords:

Machine Translation, Idiomatic Expressions, Low-Resource Languages, Large Language Models, Survey

Abstract

Machine translation (MT) has made significant progress for high-resource languages, yet idiom translation remains a persistent challenge, especially for low-resource languages such as Punjabi. Idioms are non-compositional and culturally grounded: their meaning cannot be derived from the meanings of their individual words, so literal translation is insufficient. This paper presents a systematic review of idiom translation for low-resource-to-high-resource language pairs, using Punjabi-English as a case study. We analyze key challenges, including dataset limitations, figurative-literal ambiguity, structural complexity, and evaluation limitations, and examine existing approaches: rule-based, statistical, neural, and large language model (LLM)-based methods. We identify gaps in idiom-specific datasets, evaluation frameworks, and multilingual transfer techniques. Finally, we outline directions for future research, highlighting hybrid models, community-driven datasets, multimodal translation, and idiom-aware evaluation metrics. This review aims to guide the development of more accurate and culturally aware MT systems for low-resource languages.

Published

2025-12-29

Section

Review Article

How to Cite

Addressing The Idiom Challenge in Machine Translation: A Review Focused on Low-Resource Languages. (2025). Journal of Global Research in Multidisciplinary Studies (JGRMS), 1(12), 59-65. https://doi.org/10.5281/zenodo.18431088
