The Impact of Online Indexing in Improving Arabic Information Retrieval Systems

Tahar Dilekh; Saber Benharzallah; Ali Behloul

doi:10.31449/inf.v42i4.2297

Abstract

This paper proposes a new approach to indexing Arabic-language text that helps improve the quality of information retrieval systems (IRS). The proposed method belongs to the semi-automatic category of indexing and consists of two types. The first is online indexing, in which a single document is the indexing unit. Indexing begins directly after each unit has been written, which allows the system to assist the human expert (the author of the text) in selecting appropriate Arabic descriptors to improve search results. The output of this process is a Partial index. The second type is offline indexing, which indexes a collection of textual documents drawn from different corpora. The output of this process is a General index. We illustrate the application and performance of the method using an Arabic text editor designed and developed to support online semi-automatic indexing, together with an information retrieval tool that contains an offline automatic indexing system. We also describe the construction of a new form of Arabic corpus suited to the necessary experiments. Our findings show that the online indexing model successfully identifies the descriptors most relevant to the document, primarily because of the human expert's intervention in the descriptor identification process. In addition, the model is more efficient: it helps minimize index storage size and consequently improves the response time of requests. Finally, the paper proposes a solution to the issues and deficiencies that Arabic language processing suffers from, especially regarding corpus building and information retrieval evaluation systems. The latter enables researchers to test their indexing and retrieval algorithms.
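The distinction between the two index types can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the frequency-based candidate suggestion, and the `approve` callback (standing in for the author's decision in the editor) are all hypothetical, and whitespace splitting is a placeholder for real Arabic normalization and stemming, which the abstract does not specify.

```python
from collections import Counter, defaultdict


def tokenize(text):
    # Placeholder tokenizer: a real Arabic pipeline would normalize and stem.
    return text.split()


def suggest_descriptors(text, stopwords, k=5):
    # Assumption: candidates are simply the k most frequent non-stopword terms.
    counts = Counter(t for t in tokenize(text) if t not in stopwords)
    return [term for term, _ in counts.most_common(k)]


def build_partial_index(doc_id, text, stopwords, approve):
    # Online, semi-automatic step: runs once a single document is finished.
    # `approve` is a hypothetical callback modelling the author's choice.
    candidates = suggest_descriptors(text, stopwords)
    return {doc_id: [t for t in candidates if approve(t)]}


def build_general_index(collection, stopwords):
    # Offline, automatic step: inverted index over a whole collection.
    index = defaultdict(set)
    for doc_id, text in collection.items():
        for term in tokenize(text):
            if term not in stopwords:
                index[term].add(doc_id)
    return dict(index)
```

In this sketch the partial index stores only the descriptors the author approved for one document, whereas the general index keeps every non-stopword term of the collection, which is one way to picture why the former can be smaller.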

Published

06/26/2018

Issue

Vol. 42 No. 4 (2018)

Section

Regular papers

License

Authors retain copyright in their work. By submitting to and publishing with Informatica, authors grant the publisher (Slovene Society Informatika) the non-exclusive right to publish, reproduce, and distribute the article and to identify itself as the original publisher.

All articles are published under the Creative Commons Attribution license CC BY 3.0. Under this license, others may share and adapt the work for any purpose, provided appropriate credit is given and changes (if any) are indicated.

Authors may deposit and share the submitted version, accepted manuscript, and published version, provided the original publication in Informatica is properly cited.

How to Cite

The Impact of Online Indexing in Improving Arabic Information Retrieval Systems. (2018). Informatica, 42(4). https://doi.org/10.31449/inf.v42i4.2297
