
Ad Hoc Retrieval with Marathi Language

Type of publication Peer-reviewed
Form of publication Contribution to book (peer-reviewed)
Publication date 2013
Author Akasereh Mitra, Savoy Jacques
Project Multilingual and Domain-Specific Information Retrieval

Contribution to book (peer-reviewed)

Book Multilingual Information Access In South Asian Languages
Editor P. Majumder, M. Mitra, P. Bhattacharyya, L. Subramaniam, D. Contractor & P. Rosso
Publisher Springer-Verlag, Berlin
Page(s) 23 - 37
ISBN 978-3-642-40086-5
Title of proceedings Multilingual Information Access In South Asian Languages

Open Access


Our goal in participating in the FIRE 2011 evaluation campaign is to analyse and evaluate the retrieval effectiveness of our retrieval system on the Marathi language. We developed a light and an aggressive stemmer for this language, as well as a stopword list. In our experiments, seven IR models (language model, DFR-PL2, DFR-PB2, DFR-GL2, DFR-I(ne)C2, tf-idf and Okapi) were used to evaluate the influence on retrieval performance of these stemmers and of the language-independent n-gram and trunc-n indexing strategies. We also applied pseudo-relevance feedback (blind query expansion) to estimate its impact on retrieval effectiveness. Our results show that, for Marathi, the DFR-I(ne)C2, DFR-PL2 and Okapi models yield the best performance. For this language, the trunc-n indexing strategy gives the best retrieval effectiveness compared with the other stemming and indexing approaches. The adopted pseudo-relevance feedback approach also tends to improve retrieval effectiveness.
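
As an illustration of the two language-independent indexing strategies named in the abstract, the following is a minimal Python sketch and not the authors' implementation. It assumes the usual definitions: trunc-n keeps only the first n characters of each token, and n-gram indexing emits every overlapping window of n characters. The function names, the whitespace tokenizer, and the value n = 4 are assumptions chosen for the example; the abstract does not state which n was used. Python slices strings by Unicode code point, so for Devanagari text a dependent vowel sign counts as a separate character here.

```python
def trunc_n(token: str, n: int = 4) -> str:
    """Keep only the first n characters (code points) of a token."""
    return token[:n]


def char_ngrams(token: str, n: int = 4) -> list[str]:
    """Return overlapping character n-grams; short tokens are kept whole."""
    if len(token) <= n:
        return [token]
    return [token[i:i + n] for i in range(len(token) - n + 1)]


def index_terms(text: str, strategy: str, n: int = 4) -> list[str]:
    """Turn raw text into indexing terms (hypothetical helper).

    Assumes whitespace tokenization; a real system would also
    remove stopwords and normalize punctuation.
    """
    tokens = text.split()
    if strategy == "trunc":
        return [trunc_n(t, n) for t in tokens]
    if strategy == "ngram":
        return [g for t in tokens for g in char_ngrams(t, n)]
    raise ValueError(f"unknown strategy: {strategy}")


if __name__ == "__main__":
    print(index_terms("information retrieval", "trunc"))
    print(index_terms("retrieval", "ngram"))
```

Under this sketch, trunc-n produces exactly one term per token, which keeps the index small, while n-grams produce several terms per token and so enlarge the index.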