MSc. Thesis

This project proposes a way to analyze public sentiment toward a new technology, specifically generative language models (GLMs), based on social media data. As data, the project uses tweets about ChatGPT (a product used as a proxy for GLMs) and identifies the industries to which the users discussing this technology belong, as well as their sentiment.

The notebooks in this repository cover the following steps:

Data cleaning and preprocessing (e.g. removal of redundant data and lemmatization)
Topic modeling based on unsupervised machine learning (LDA)
Sentiment polarity scoring with the help of VADER
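
As a rough illustration of the first and last steps above, the following is a minimal sketch and not the code from the notebooks. The function names (`clean_tweet`, `deduplicate`, `polarity_label`, `vader_compound`) are hypothetical, the cleaning rules are assumptions, and the ±0.05 cut-offs are the thresholds conventionally suggested for VADER's compound score, which may differ from those used in the thesis. Lemmatization and LDA are left out to keep the sketch free of heavy dependencies.

```python
import re

# Assumed cleaning rules; the notebooks may use different ones.
URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")


def clean_tweet(text: str) -> str:
    """Strip URLs and @mentions, drop '#' signs, lowercase, squeeze spaces."""
    text = URL_RE.sub(" ", text)
    text = MENTION_RE.sub(" ", text)
    text = text.replace("#", " ")
    return " ".join(text.lower().split())


def deduplicate(texts):
    """Remove empty strings and exact duplicates, keeping first occurrences."""
    seen = set()
    unique = []
    for t in texts:
        if t and t not in seen:
            seen.add(t)
            unique.append(t)
    return unique


def polarity_label(compound: float, threshold: float = 0.05) -> str:
    """Map a VADER compound score in [-1, 1] to a polarity label."""
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


def vader_compound(text: str) -> float:
    """Return VADER's compound score (requires the vaderSentiment package)."""
    # Imported lazily so the helpers above work without the package installed.
    from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer

    return SentimentIntensityAnalyzer().polarity_scores(text)["compound"]
```

With the `vaderSentiment` package installed, a tweet could be labelled with `polarity_label(vader_compound(tweet))`. Whether to score the raw or the cleaned text is a design choice: VADER uses capitalization and punctuation as intensity cues, so aggressive cleaning before scoring can change the result.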

(Note: Once the university publishes the MSc. Thesis, a link to it will be added.)

1_Notebook_DataCollection & Anonymization.ipynb
2_Notebook_TopicModellingPreprocessing.ipynb
3_Notebook_UserDescription_TopicModelling.ipynb
4_Notebook_Tweets_SA_and_TopicModelling.ipynb
5_Notebook_Tweets_Post_Processing.ipynb
6_Users_Post_Processing.ipynb
README.md

oanaale95/MScThesis