JSW 2012 Vol.7(6): 1315-1320 ISSN: 1796-217X
doi: 10.4304/jsw.7.6.1315-1320
doi: 10.4304/jsw.7.6.1315-1320
Semantically Enhanced Uyghur Information Retrieval Model
Bo Ma1, Yating Yang1, Xi Zhou1, and Junlin Zhou2
1Research Center for Multilingual Information Technology, Xinjiang Technical Institute of Physics and Chemistry,
Chinese Academy of Sciences, Urumuqi, China
2Xinjiang Branch of Chinese Academy of Sciences, Urumuqi, China
Abstract—Traditional Uyghur search engine lacks semantic information, aiming to solve this problem, a semantically enhanced Uyghur information retrieval model was proposed based on the characteristics of Uyghur language. Firstly word stemming was carried out and web pages were represented by the form of 3-triples to construct the Uyghur knowledge base, then the matching between ontologies and web pages was established by computing concept similarity and relation similarity. Semantic inverted index was built to save the association between semantic entities and web pages, and user query analysis was implemented by expanding the queries and analyzing the relations between the queries, finally by combining the benefits of both keyword-based and semantic-based methods, ranking algorithm was implemented. By comparing with the Google search engine and the Lucene based method, the experiments validate the effectiveness and the feasibility of the model preliminarily.
Index Terms—Uyghur, ontology, semantic search, semantic relation, information retrieval
2Xinjiang Branch of Chinese Academy of Sciences, Urumuqi, China
Abstract—Traditional Uyghur search engine lacks semantic information, aiming to solve this problem, a semantically enhanced Uyghur information retrieval model was proposed based on the characteristics of Uyghur language. Firstly word stemming was carried out and web pages were represented by the form of 3-triples to construct the Uyghur knowledge base, then the matching between ontologies and web pages was established by computing concept similarity and relation similarity. Semantic inverted index was built to save the association between semantic entities and web pages, and user query analysis was implemented by expanding the queries and analyzing the relations between the queries, finally by combining the benefits of both keyword-based and semantic-based methods, ranking algorithm was implemented. By comparing with the Google search engine and the Lucene based method, the experiments validate the effectiveness and the feasibility of the model preliminarily.
Index Terms—Uyghur, ontology, semantic search, semantic relation, information retrieval
Cite: Bo Ma, Yating Yang, Xi Zhou, and Junlin Zhou, "Semantically Enhanced Uyghur Information Retrieval Model," Journal of Software vol. 7, no. 6, pp. 1315-1320, 2012.
General Information
ISSN: 1796-217X (Online)
Frequency: Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKI, Google Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
-
Mar 01, 2024 News!
Vol 19, No 1 has been published with online version [Click]
-
Jan 04, 2024 News!
JSW will adopt Article-by-Article Work Flow
-
Apr 01, 2024 News!
Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec) [Click]
-
Apr 01, 2024 News!
Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP [Click]
-
Nov 02, 2023 News!
Vol 18, No 4 has been published with online version [Click]