Volume 5 Number 2 (Feb. 2010)
Home > Archive > 2010 > Volume 5 Number 2 (Feb. 2010) >
JSW 2010 Vol.5(2): 195-205 ISSN: 1796-217X
doi: 10.4304/jsw.5.2.195-205

High-speed Detection of Ontological Knowledge and Bi-directional Lexico-Syntactic Patterns from the Web

Hiroaki Ohshima and Katsumi Tanaka

Department of Social Informatics, Graduate School of Informatics, Kyoto University, Japan

Abstract—We propose a high-speed method of detecting ontological knowledge from the Web. Ontological knowledge in this paper means a term related to a given term. For example, hypernyms and hyponyms are basic related terms that are treated in dictionaries. Synonyms and coordinate terms are also well-defined related terms. Topic terms and description terms represent topics of the given term and they are vaguely defined. There are other related terms such as abbreviations and nicknames. The proposed method can be used for detecting many kinds of related terms. It extracts related terms from text resources only from Web search results, which consist of the titles, snippets, and URLs of Web pages. We use two different kinds of lexico-syntactic patterns to extract related terms from the search results, and these are called bi-directional lexico-syntactic patterns. The proposed method can be applied to both languages where words are separated by a space such as English and Korean and ones where words are not separated by a space such as Japanese and Chinese. The proposed method does not need any advanced natural language processing such as morphological analysis or syntactic parsing. It works relatively fast and has excellent precision. We also propose a method of automatically discovering superior bi-directional lexico-syntactic patterns using Web search engines because it is sometimes difficult to find appropriate patterns to detect related terms in a certain relationship.

Index Terms—Knowledge search, Knowledge acquisition.

[PDF]

Cite: Hiroaki Ohshima and Katsumi Tanaka, "High-speed Detection of Ontological Knowledge and Bi-directional Lexico-Syntactic Patterns from the Web," Journal of Software vol. 5, no. 2, pp. 195-205, 2010.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]