Volume 9 Number 1 (Jan. 2014)
Home > Archive > 2014 > Volume 9 Number 1 (Jan. 2014) >
JSW 2014 Vol.9(1): 11-19 ISSN: 1796-217X
doi: 10.4304/jsw.9.1.11-19

CWAAP: An Authorship Attribution Forensic Platform for Chinese Web Information

Jianbin Ma1, Ying Li2, Guifa Teng1

1College of Information Science and Technology, Agricultural University of Hebei, Baoding, China
2College of Economic and Trade, Agricultural University of Hebei, Baoding, China


Abstract—Illegal web information is common on the Internet. To prevent phenomena of illegal web information from happening, providing effective evidence for court to punish the criminals by means of law is one effective method. In this paper, an authorship attribution platform for Chinese web information, CWAAP, is described. Based on the language characteristics of Chinese web information, lexical features and structural features which can express the author’s writing habit are extracted. Support vector machines (SVM) are used for learning author’s writing features. To test the effectiveness of CWAAP, literature, Blog and BBS datasets are used in the experiments on the platform. Five experiments are performed. Experimental results show that lexical features and structural features are effective. The number of words in training samples should exceed 200 at least. By Information Gain feature selection methods, 800 lexical features can express the authors’ writing style. There is a small difference between the authors’ topics. All the parts of speech reserved are perfect. These results confirm that the platform is effective and feasible for cybercrime forensic.

Index Terms—CWAAP, Authorship attribution, Forensic, Support Vector Machine, Chinese, Web information

[PDF]

Cite: Jianbin Ma, Ying Li, Guifa Teng, "CWAAP: An Authorship Attribution Forensic Platform for Chinese Web Information," Journal of Software vol. 9, no. 1, pp. 11-19, 2014.

General Information

ISSN: 1796-217X (Online)
Frequency:  Quarterly
Editor-in-Chief: Prof. Antanas Verikas
Executive Editor: Ms. Yoyo Y. Zhou
Abstracting/ Indexing: DBLP, EBSCO, CNKIGoogle Scholar, ProQuest, INSPEC(IET), ULRICH's Periodicals Directory, WorldCat, etc
E-mail: jsweditorialoffice@gmail.com
  • Mar 01, 2024 News!

    Vol 19, No 1 has been published with online version    [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Nov 02, 2023 News!

    Vol 18, No 4 has been published with online version   [Click]