NON-PARAMETRIC CLASSIFICATION OF OPINION MINING BASED ON NAIVE BAYES WITH KERNEL DENSITY ESTIMATION
Abstract
With the rapid growth of social media in recent years, opinion mining, also called Sentiment Analysis, has gained much attention. Opinion mining refers to the use of Natural Language Processing (NLP), text analysis and biometrics to analyse, extract and identify the customer views of a product. The polarity classification is a main task of opinion mining for getting useful decision from online customer reviews and survey responses. Existing approaches in opinion mining performs the classification based on the parametric form. In this paper, we introduce the Naive Bayes classifier with Kernel Density Estimation (KDE) as a non parametric way of opinion mining that computes the probability density function based on the kernel estimator. The features are extracted from the opinions in the document level and are used to train the classifier in a supervised manner. KDE also performs data smoothing problem that inferences about the population that are based on finite data sample. We underwent some pre-processing techniques like stemming and stopword removal. Experimental results show that the proposed algorithm performed well, in terms of accuracy, precision, recall and F-measure.
Keywords
References
Abhishek Bhattacharya & David Dunsonb, “Nonparametric Bayes classification and hypothesis testing on manifolds,†in Journal of Multivariate Analysis: Elsevier, Vol.111, pages 1–19, 2012.
Al Amrani, Mohamed Lazaar, Kamal Eddine El Kadiri, “Random Forest and Support Vector Machine based hybrid approach to sentiment analysis,†in Proceedings of the 1st International Conference on Intelligent Computing in Data Sciences: Elsevier, Vol. 127, pages 511–520, 2018.
Aritz Perez, Pedro Larranaga & Inaki Inza, “Bayesian classifiers based on kernel density estimation: Flexible classifiers,†in International Journal of Approximate Reasoning: Elsevier, Vol.50, pages 341–362, 2009.
Arun Meena & T.V. Prabhakar, “Sentence Level Sentiment Analysis in the Presence of Conjuncts Using Linguistic Analysis,†in European Conference on Information Retrieval: Springer, Vol. 4425, pages 573-580, 2007.
Bo Pang and Lillian Lee, “Opinion mining and sentiment analysis,†in Foundations and Trends in Information Retrieval: Now publishers, Vol. 2, pages 1-135, 2008.
B.W. Silverman, "Density Estimation for Statistics and Data Analysis,†in Monographs on Statistics and Applied Probability: Chapman and Hall Publishers, 1986.
Dharmendra Sharma and Suresh Jain, “Evaluation of Stemming and Stop Word Techniques on Text Classification Problem,†in International Journal of Scientific Research in Computer Science and Engineering, Vol. 3, Pages 1-4, 2015.
E. Fersini , E. Messina & F.A. Pozzi, “Sentiment analysis: Bayesian Ensemble Learning,†in Decision Support Systems: Elsevier, Vol.68, pages 26–38, 2014.
Janardhana D R and Manjunath Mulimani, “Sentiment Analysis and Opinion Mining using Machine Learning,†in International Journal of Innovative Research in Computer and Communication Engineering, Vol. 3, pages 9321-9329, 2015.
Jayashri Khairnar and Mayura Kinikar, “Machine Learning Algorithms for Opinion Mining and Sentiment Classification,†in International Journal of Scientific and Research Publications, Vol. 3, 2013.
Jyoti S. Deshmukh, Amiya Kumar Tripathy, “Entropy based classifier for cross-domain opinion mining,†in Applied Computing and Informatics, Vol.14, pages 55–64, 2018.
Kumar Ravi and Vadlamani Ravi, “A survey on opinion mining and sentiment analysis: Tasks, approaches and applications,†in Knowledge-Based Systems: Elsevier, Vol.89, pages 14-46, 2015.
Liu Yang, Han Liguo & Cai Xuesen, “A kernel density estimation based text classification algorithm,†in Advanced Science and Technology Letters, SERSC: Science & Engineering Research Support society, Vol. 78, pages 49-54, 2014.
Liu, B, Synthesis lectures on human language technologies, “Sentiment Analysis and Opinion Mining,†in Morgan & Claypool Publishers. 2012.
Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Voll and Manfred Stede, “Lexicon - Based Methods for Sentiment Analysis,†in Computational Linguistics, Vol. 37, pages 267-307, 2011.
Mangi Kang, Jaelim Ahn and Kichun Lee, “Opinion mining using ensemble text hidden Markov models for text classification,†in Expert Systems with Applications: Elsevier, Vol. 94, Pages 218–227. 2018.
Mans Hulden, Miikka Silfverberg and Jerid Francom, “Kernel Density Estimation for Text-Based Geolocation,†in Association for the Advancement of Artificial Intelligence, Pages, 145-150, 2015.
Michael P. Holmes, Alexander G. Gray and Charles Lee Isbell, “Fast Nonparametric Conditional Density Estimation,†in Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence, Pages, 175-182, 2012.
Min Xiao and Yuhong Guo, “Semi-Supervised Kernel Matching for Domain Adaptation,†in Proceedings of the 26th AAAI Conference on Artificial Intelligence, Pages 1183-1189, 2012.
Muhammad Bilal, Huma Israr, Muhammad Shahid and Amin Khan, “Sentiment classification of Roman-Urdu opinions using Naı¨ve Bayesian, Decision Tree and KNN classification techniques,†Journal of King Saud University – Computer and Information Sciences: Elsevier, Vol. 28, pages 330–344, 2016.
Wei Jin & Hung Hay Ho, “A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining,†in Proceedings of the 26th International Conference on Machine Learning, Pages 465-472, 2009.
Yoichi Murakami and Kenji Mizuguchi, “Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein–protein interaction sites,†in Bio informatics, Vol. 26, pages 1841–1848, 2010.
DOI: https://doi.org/10.26877/jiu.v8i1.11713
Refbacks
- There are currently no refbacks.
Copyright (c) 2022 RAJA RAJESWARI SETHURAMAN

