International Journal of Advanced Technology and Engineering Exploration (IJATEE) ISSN (Print): 2394-5443 ISSN (Online): 2394-7454 Volume - 10 Issue - 102 May - 2023

Implicit aspect based sentiment analysis for restaurant review using LDA topic modeling and ensemble approach

Shini George and V. Srividhya

Abstract

Technological advancements in e-commerce and Web 2.0 have changed how customers express opinions about services and features through reviews on various websites. This trend is particularly prominent in the travel industry, where online sources offer valuable insights into the food and accommodation at destinations. However, the abundance of online reviews makes it difficult for travelers to filter relevant information. Aspect-based sentiment analysis (ABSA) addresses this issue by extracting opinions about specific features. Two techniques support this analysis: topic modeling, which identifies thematic relationships among documents, and sentiment analysis, which determines the opinions expressed in the text. This study gathered customer reviews of different restaurants from Tripadvisor, one of the leading travel websites. The reviews were then subjected to ABSA using latent Dirichlet allocation (LDA) and an ensemble bagging support vector machine (EBSVM) classifier. The objective was to identify the most relevant aspect within the restaurant domain and to enhance sentiment analysis performance. To address class imbalance in the datasets, the synthetic minority over-sampling technique (SMOTE) was applied. The performance of LDA was evaluated using the coherence score, which indicates the quality of the topics generated for restaurant reviews. The effectiveness of the EBSVM classifier was measured using accuracy, precision, recall, and F1 score. The proposed model achieved an accuracy of 96.1%, outperforming the other techniques discussed in the existing literature. Overall, this study demonstrates the effectiveness of ABSA in extracting relevant opinions from a large volume of reviews and highlights the potential of machine learning techniques to enhance sentiment analysis performance.
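The class-balancing and evaluation steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give SMOTE parameters or feature details, so the function names `smote` and `binary_metrics`, the neighbour count `k`, the random seed, and the toy two-dimensional vectors are all assumptions made for illustration. In the study the feature vectors would come from the review text, not from hand-written numbers.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (SMOTE-style)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        x = minority[i]
        # Rank the other minority samples by Euclidean distance to x.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(x, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment x -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(x, neighbour)])
    return synthetic


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 with respect to one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy data (made up): three minority-class feature vectors.
minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
new_points = smote(minority, n_new=4, k=2)

# Toy labels (made up): 1 = positive sentiment, 0 = negative sentiment.
scores = binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
```

Each synthetic point lies on the line segment between two existing minority samples, which is why over-sampling of this kind adds minority examples without simply duplicating them. Over-sampling would normally be applied to the training split only, so that the reported metrics are computed on untouched reviews.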

Keywords

LDA, Topic modeling, ABSA, EBSVM, SMOTE.

Cite this article

George S, Srividhya V. Implicit aspect based sentiment analysis for restaurant review using LDA topic modeling and ensemble approach. International Journal of Advanced Technology and Engineering Exploration. 2023; 10(102):554-568. DOI:10.19101/IJATEE.2022.10100099
