Volume 15 Number 3 (May 2020)
Home > Archive > 2020 > Volume 15 Number 3 (May 2020) >
JCP 2020 Vol.15(3): 105-113 ISSN: 1796-203X
doi: 10.17706/jcp.15.3.105-113

A Novel Document Weighted Approach for Text Classification

S. Sai Satyanarayana Reddy1, N. Hanuman Reddy1, T. Raghunadha Reddy2
1Department of CSE, Vardhaman College of Engineering, Hyderabad, Telangana, India.
2Department of IT, Vardhaman College of Engineering, Hyderabad, Telangana, India.


Abstract—The textual data in the internet is increasing exponentially through blogs, twitter and various social media sites. The users are not specifying the type of text that they are uploading into the internet. In this regard most of the researchers are looking for automated tools for classifying the data or assigning class label to the unknown documents. Text classification is one such area used for classifying the texts. Several solutions were provided for text classification by the researchers. The text classification approaches generally contains collection of training data, preprocessing of the text, features extraction, feature reduction, document representation and finally applying classification algorithms to build the model for class label prediction of a new textual document. In the phases of text classification, the document representation is one important step to increase the efficiency of the accuracy of text classification. In this work, a new document representation approach is proposed. The experimentation conducted on 20-Newsgroup and Reuters-21578 datasets and different types of classification algorithms. Our approach attained best accuracy results for text classification and observed that the results are more promising than most of the popular approaches for text classification.

Index Terms—Accuracy, bag of words model, document representation, document weight measure, term weight measure, text classification.

[PDF]

Cite: S. Sai Satyanarayana Reddy, N. Hanuman Reddy, T. Raghunadha Reddy, "A Novel Document Weighted Approach for Text Classification," Journal of Computers vol. 15, no. 3, pp. 105-113, 2020.

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

General Information

ISSN: 1796-203X
Abbreviated Title: J.Comput.
Frequency: Bimonthly
Editor-in-Chief: Prof. Liansheng Tan
Executive Editor: Ms. Nina Lee
Abstracting/ Indexing: DBLP, EBSCO,  ProQuest, INSPEC, ULRICH's Periodicals Directory, WorldCat,etc
E-mail: jcp@iap.org
  • Nov 14, 2019 News!

    Vol 14, No 11 has been published with online version   [Click]

  • Mar 20, 2020 News!

    Vol 15, No 2 has been published with online version   [Click]

  • Dec 16, 2019 News!

    Vol 14, No 12 has been published with online version   [Click]

  • Sep 16, 2019 News!

    Vol 14, No 9 has been published with online version   [Click]

  • Aug 16, 2019 News!

    Vol 14, No 8 has been published with online version   [Click]

  • Read more>>