This project focuses on the understanding of the impact of the covid-19 pandemic through social media discussion on Twitter and explore a dataset of over 13 million tweets with the keywords related to covid-19 and ‘vaccine’ or ‘vax’, spanning from March 2020 to February 2022. Due to the size of the data, the analysis was done on the Unity cluster. Various analysis, including topic modelling and emotion analysis were conducted to understand how the topic of the vaccine was discussed in Twitter, how the discussion of the topics changed over time and what is people’s emotion regarding this topic and how it differs by time and location.
The project explores the possibility/challenges of running state of the art natural language processing algorithm on a big data set using HPC.
This project contributes to our knowledge in the field of psychology and health care. The result of this project will provide insights on people’s attitude and emotion toward covid-19 vaccination, how such emotion differs by time and location. This finding helps understand the psychological impact of the pandemic and may facilitate the adoption of covid-19 vaccination.
None
{Empty}
None
None
None
As mentioned previously, the project is timely and will deepen our understanding of the impact of covid-19 pandemic by identifying dominant topics discussed and people’s emotions associated with this topic.
The student (Brenna Rojek) working on this project was able to learn start-of-art natural language processing algorithms and learn to use GPU cluster. Due to the large data size, it takes a very long time (more than one week) to process all data. A better approach needs to be developed to scale the data better in the future.
The four emotions (joy, optimism, sadness, and anger) were extracted from each tweet using Huggingface Carddiff NLP emotion model. The results show the dominant emotion regarding covid1-19 are anger and sadness. In addition, people’s emotion toward covid-19 vaccination change over time. There is a substantial increase in anger since August 2021 toward the discussion of covid-19 vaccination. In addition, some states (Arizona, Wyoming, and Florida) also show a higher level of anger compared to other states.
https://public.tableau.com/app/profile/brenna.rojek/viz/shared/KYCRFGDWT