An N-gram-based Information Retrieval Approach for Surveys on Scientific Articles
Abstract
Humans are constantly searching for knowledge. This quest for knowledge has pushed back the boundaries of science. As a result, new scientific contributions are published daily in a variety of fields. However, it is not easy for a novice researcher to visualize all existing scientific contributions to a specific research problem in a short period of time. This study proposes an approach for extracting useful information from the metadata of scientific documents. Then, the design of an intelligent search system exploits the metadata contained in scholarly documents to provide an overview of scientific contributions to a research problem. The proposed model uses a new similarity measure based on the extraction of n-grams from the metadata of scientific articles. The model offers each user the possibility of visualizing the results of scientific contributions proposed by researchers in the form of a graph. Experiments carried out on a dataset of 126k data show that the model we propose achieves an overall precision of 0.89, a recall of 0.84 and an F1-score of 0.86. This shows that the model can refine the search to provide scientific contributions that have a direct correlation with a user’s need.DOI:
https://doi.org/10.31449/inf.v49i20.5895Downloads
Published
How to Cite
Issue
Section
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







