ADVANCED TECHNIQUES FOR SENTIMENT ANALYSIS: A STUDY OF CLASSIFICATION AND GENERATIVE MODELS ON ONLINE COMMENTS
Keywords:
Sentiment analysis, RNN, CNN, Word2Vec, GloVe, BERT
Abstract
This paper addresses sentiment analysis of user comments using various natural language processing techniques and deep learning algorithms. The implemented approaches include recurrent neural networks (RNN), convolutional neural networks (CNN), Word2Vec, GloVe, BERT, and the generative llama-7b-hf model. The dataset was obtained from the Kaggle platform and contains user comments on various products. The models were evaluated with the F1 measure, which allows a detailed analysis of performance in the presence of imbalanced classes. The best results were achieved using transformers and an SVM classifier.
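To illustrate why the F1 measure is informative when classes are imbalanced, the following minimal sketch computes per-class and macro-averaged F1 in plain Python. The toy labels, the function names `per_class_f1` and `macro_f1`, and the choice of macro averaging are assumptions made for illustration; the abstract does not state which averaging scheme was used.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} computed from lists of true and predicted labels."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted label p, but it was wrong
            fn[t] += 1  # true label t was missed
    scores = {}
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Hypothetical toy data: 8 positive reviews and 2 negative ones.
y_true = ["pos"] * 8 + ["neg"] * 2
# A trivial classifier that always predicts the majority class.
y_pred = ["pos"] * 10

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
scores = per_class_f1(y_true, y_pred)
print(f"accuracy = {accuracy:.2f}")
print(f"F1 per class = {scores}")
print(f"macro F1 = {macro_f1(y_true, y_pred):.2f}")
```

In this toy case the majority-class predictor reaches high accuracy while its F1 on the minority class is zero, which is the kind of failure that accuracy alone would hide.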
References
[1] https://www.kaggle.com/datasets/nicapotato/womens-ecommerce-clothing-reviews (accessed March 2022)
[2] Abien Fred M. Agarap, “Statistical Analysis on E-Commerce Reviews, with Sentiment Classification using Bidirectional Recurrent Neural Network”, 2018.
[3] Shuangyin Xie, “Sentiment Analysis using machine learning algorithms: online women clothing reviews”, 2019.
[4] Manish Munikar, Sushil Shakya, Aakash Shrestha, “Fine-grained Sentiment Classification using BERT”, 2019.
[5] Jun Xie, Bo Chen, Xinglong Gu, Fengmei Liang, Xinying Xu, “Self-Attention-Based BiLSTM Model for Short Text Fine-Grained Sentiment Classification”, 2018.
[6] https://www.crummy.com/software/BeautifulSoup/bs4/doc/ (accessed March 2022)
[7] https://keras.io/ (accessed March 2022)
[8] https://www.tensorflow.org/ (accessed March 2022)
[9] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, Tie-Yan Liu, “MPNet: Masked and Permuted Pre-training for Language Understanding”, 2020.
[10] https://huggingface.co/meta-llama/Llama-2-7b-hf (accessed February 2024)
[11] https://spacy.io/ (accessed March 2022)
Published
2024-11-02
Section
Electrical and Computer Engineering