Improving the Relevancy of Document Search using the Multi-Term Adjacency Keyword-Order Model

Main Article Content

Lim Bee Huang
Vimala Balakrishnan
Ram Gopal Raj

Abstract

This paper presents an enhanced vector space model, Multi-Term Adjacency Keyword-Order Model, to improve the relevancy of search results, specifically document search. Our model is based on the concept of keyword grouping. The keyword-order relationship in the adjacency terms is taken into consideration in measuring a term’s weight. Assigning more weights to adjacency terms in a query order results in the document vector being moved closer to the query vector, and hence increases the relevancy between the two vectors and thus eventually results in documents with better relevancy being retrieved. The performance of our model is measured based on precision metrics against the performance of a classic vector space model and the performance of a Multi-Term Vector Space Model. Results show that our model performs better in retrieving more relevant results based on a particular search query compared to both the other models.

Downloads

Download data is not yet available.

Article Details

How to Cite
Huang, L. B., Balakrishnan, V., & Raj, R. G. (2012). Improving the Relevancy of Document Search using the Multi-Term Adjacency Keyword-Order Model. Malaysian Journal of Computer Science, 25(1), 1–10. Retrieved from http://mojem.um.edu.my/index.php/MJCS/article/view/6584
Section
Articles

Most read articles by the same author(s)

1 2 > >>