Volume 11 Number 11 (Nov. 2016)
Home > Archive > 2016 > Volume 11 Number 11 (Nov. 2016) >
JSW 2016 Vol.11(11): 1089-1101 ISSN: 1796-217X
doi: 10.17706/jsw.11.11.1089-1101

Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network

Ying Cai, Fang Huang*, Mengya Peng

School of Information Science and Engineering, Central South University, Changsha, China

Abstract—Discovering inherent correlations and hot research topics among various disciplines from massive scientific documents is very important to understand the scientific research tendency. The LDA (Latent Dirichlet Allocation) topic model can find topics from big data sets, but the number of topics must to be told before topic clustering. There is a lot of randomness to determine the number of topics for the unknown structure of data sets. Therefore, this paper introduces the Hierarchical Dirichlet Process (HDP) to achieve topic clustering with discipline division. Those clustering topics are composed by a discrete set of words, and these words do not have semantic relation. For this problem, this paper proposes a method to find out relationships between topic words so as to extract discipline hotspots. This method contains classifying topics with the co-occurrence of subject words, constructing co-word network and analyzing discipline hotspots with weak co-occurrence theory. The experiment results indicate that the Hierarchical Dirichlet Process can mine topic word-sets, and effectiveness better than the LDA topic model. The co-word network based on the weak tie theory can effectively find the discipline hotspots, which explicitly reflects the research hotspots and inherent connections of disciplines.

Index Terms—Co-word network, discipline research hotspots, hierarchical dirichlet Process (HDP), weak co-occurrence theory.


Cite: Ying Cai, Fang Huang*, Mengya Peng, "Discipline Hotspots Mining Based on Hierarchical Dirichlet Topic Clustering and Co-word Network," Journal of Software vol. 11, no. 11, pp. 1089-1101, 2016.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]