Addressing The Idiom Challenge in Machine Translation: A Review Focused on Low-Resource Languages

Manjot Kaur; Jasvir Kaur; Jasmin Kaur Gahlot; Prof. Palak Sood

doi:10.5281/zenodo.18431088

Published: 2025-12-29

Keywords:

Machine Translation, Idiomatic Expressions, Low-Resource Languages, Large Language Models, Survey

Abstract

Machine translation (MT) has made significant progress for high-resource languages, yet idiom translation remains a persistent challenge, especially for low-resource languages such as Punjabi. Idioms are non-compositional and rooted in cultural context, so literal translation fails to convey their meaning. This paper presents a systematic review of idiom translation for low-resource-to-high-resource language pairs, using Punjabi-English as a case study. We analyze key challenges (dataset limitations, figurative-literal ambiguity, structural complexity, and evaluation limitations) and examine existing approaches, including rule-based, statistical, neural, and large language model (LLM)-based methods. We identify gaps in idiom-specific datasets, evaluation frameworks, and multilingual transfer techniques. Finally, we outline directions for future research, highlighting hybrid models, community-driven datasets, multimodal translation, and idiom-aware evaluation metrics. This review aims to guide the development of more accurate and culturally aware MT systems for low-resource languages.

Issue

Vol. 1 No. 12 (2025): December-2025

Section

Review Article

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain the copyright of their work and grant the Journal of Global Research in Multidisciplinary Studies (JGRMS) the right of first publication. This license permits unrestricted use, distribution, adaptation, and reproduction in any medium or format, provided the original author(s), source, and publication are properly credited. Users may copy, redistribute, remix, transform, and build upon the published material for any purpose, including commercial use, in accordance with the terms of the CC BY 4.0 License.

How to Cite

Kaur, M., Kaur, J., Gahlot, J. K., & Sood, P. (2025). Addressing The Idiom Challenge in Machine Translation: A Review Focused on Low-Resource Languages. Journal of Global Research in Multidisciplinary Studies (JGRMS), 1(12), 59-65. https://doi.org/10.5281/zenodo.18431088
