Volume 11 Number 11 (Nov. 2016)
Home > Archive > 2016 > Volume 11 Number 11 (Nov. 2016) >
JSW 2016 Vol.11(11): 1089-1101 ISSN: 1796-217X
doi: 10.17706/jsw.11.11.1089-1101

Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network

Ying Cai, Fang Huang*, Mengya Peng
School of Information Science and Engineering, Central South University, Changsha, China

Abstract—Discovering inherent correlations and hot research topics among various disciplines from massive scientific documents is very important to understand the scientific research tendency. The LDA (Latent Dirichlet Allocation) topic model can find topics from big data sets, but the number of topics must to be told before topic clustering. There is a lot of randomness to determine the number of topics for the unknown structure of data sets. Therefore, this paper introduces the Hierarchical Dirichlet Process (HDP) to achieve topic clustering with discipline division. Those clustering topics are composed by a discrete set of words, and these words do not have semantic relation. For this problem, this paper proposes a method to find out relationships between topic words so as to extract discipline hotspots. This method contains classifying topics with the co-occurrence of subject words, constructing co-word network and analyzing discipline hotspots with weak co-occurrence theory. The experiment results indicate that the Hierarchical Dirichlet Process can mine topic word-sets, and effectiveness better than the LDA topic model. The co-word network based on the weak tie theory can effectively find the discipline hotspots, which explicitly reflects the research hotspots and inherent connections of disciplines.

Index Terms—Co-word network, discipline research hotspots, hierarchical dirichlet Process (HDP), weak co-occurrence theory.


Cite: Ying Cai, Fang Huang*, Mengya Peng, "Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network," Journal of Software vol. 11, no. 11, pp. 1089-1101, 2016.

General Information

ISSN: 1796-217X
Frequency: Monthly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, DOAJ, ProQuest, INSPEC, ULRICH's Periodicals Directory, WorldCat, CNKI,etc
E-mail: jsw@iap.org
  • Aug 02, 2017 News!

    Papers published in JSW Vol. 12, No. 1- Vol. 12, No. 8 have been indexed by DBLP.    [Click]

  • Jan 05, 2017 News!

    [CFP] 2017 the annual meeting of JSW Editorial Board, ICSTE 2017, will be held in Hong Kong, October 27-29, 2017.   [Click]

  • Sep 27, 2017 News!

    Vol.12, No.5 has been indexed by EI (Inspec).   [Click]

  • Oct 16, 2017 News!

    Vol 12, No. 10 has been published with online version 7 original aritcles from 5 countries are published in this issue.     [Click]

  • Oct 16, 2017 News!

    The papers published in Vol.12, No. 10 have all received dois from Crossref.