DTW at Qur'an QA 2022: Utilising Transfer Learning with Transformers for Question Answering in a Low-resource Domain

Abstract

The task of machine reading comprehension (MRC) is a useful benchmark to evaluate the natural language understanding of machines. It has gained popularity in the natural language processing (NLP) field mainly due to the large number of datasets released for many languages. However, the research in MRC has been understudied in several domains, including religious texts. The goal of the Qur'an QA 2022 shared task is to fill this gap by producing state-of-the-art question answering and reading comprehension research on Qur'an. This paper describes the DTW entry to the Quran QA 2022 shared task. Our methodology uses transfer learning to take advantage of available Arabic MRC data. We further improve the results using various ensemble learning strategies. Our approach provided a partial Reciprocal Rank (pRR) score of 0.49 on the test set, proving its strong performance on the task. © 2022, CC BY.

Publication DOI: https://doi.org/10.48550/arXiv.2205.06025
Additional Information: Copyright 2022 with a Creative Commons Attribution 4.0 International (CC BY 4.0) license
Last Modified: 29 Oct 2024 16:56
Date Deposited: 02 Mar 2023 13:11
Full Text Link: http://www.scop ... tnerID=MN8TOARS
Related URLs: https://arxiv.o ... /abs/2205.06025 (Publisher URL)
PURE Output Type: Conference contribution
Published Date: 2022-05-12
Authors: Premasiri, Damith
Ranasinghe, Tharindu (ORCID Profile 0000-0003-3207-3821)
Zaghouani, Wajdi
Mitkov, Ruslan

Download

[img]

Version: Published Version

License: Creative Commons Attribution

| Preview

Export / Share Citation


Statistics

Additional statistics for this record