Volume 11 Number 11 (Nov. 2016)
Home > Archive > 2016 > Volume 11 Number 11 (Nov. 2016) >
JSW 2016 Vol.11(11): 1089-1101 ISSN: 1796-217X
doi: 10.17706/jsw.11.11.1089-1101

Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network

Ying Cai, Fang Huang*, Mengya Peng
School of Information Science and Engineering, Central South University, Changsha, China

Abstract—Discovering inherent correlations and hot research topics among various disciplines from massive scientific documents is very important to understand the scientific research tendency. The LDA (Latent Dirichlet Allocation) topic model can find topics from big data sets, but the number of topics must to be told before topic clustering. There is a lot of randomness to determine the number of topics for the unknown structure of data sets. Therefore, this paper introduces the Hierarchical Dirichlet Process (HDP) to achieve topic clustering with discipline division. Those clustering topics are composed by a discrete set of words, and these words do not have semantic relation. For this problem, this paper proposes a method to find out relationships between topic words so as to extract discipline hotspots. This method contains classifying topics with the co-occurrence of subject words, constructing co-word network and analyzing discipline hotspots with weak co-occurrence theory. The experiment results indicate that the Hierarchical Dirichlet Process can mine topic word-sets, and effectiveness better than the LDA topic model. The co-word network based on the weak tie theory can effectively find the discipline hotspots, which explicitly reflects the research hotspots and inherent connections of disciplines.

Index Terms—Co-word network, discipline research hotspots, hierarchical dirichlet Process (HDP), weak co-occurrence theory.


Cite: Ying Cai, Fang Huang*, Mengya Peng, "Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network," Journal of Software vol. 11, no. 11, pp. 1089-1101, 2016.

General Information

ISSN: 1796-217X
Frequency: Monthly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, ProQuest, INSPEC, ULRICH's Periodicals Directory, WorldCat, CNKI,etc
E-mail: jsw@iap.org
  • Dec 22, 2017 News!

    Papers published in JSW Vol. 12, No. 1- Vol. 12, No. 11 have been indexed by DBLP.    [Click]

  • Dec 22, 2017 News!

    [CFP] 2018 the annual meeting of JSW Editorial Board, ICCSM 2018, will be held in Nice, France, July 17-19.   [Click]

  • Dec 22, 2017 News!

    Vol.12, No.6 has been indexed by EI (Inspec).    [Click]

  • Dec 29, 2017 News!

    Vol 12, No. 12 has been published with online version 6 original aritcles from 4 countries are published in this issue.      [Click]

  • Dec 22, 2017 News!

    Vol.12, No.7 has been indexed by EI (Inspec).   [Click]