adding time-series data to enhance performance of naural language processing tasks

zhao, jingtian

view/open

zhaoj2022m-1a.pdf (2.578mb)

date

2023

author

zhao, jingtian

metadata

show full item record

abstract

in the past few decades, with the explosion of information, a large number of computer scientists have devoted themselves to analyzing collected data and applying these findings to many disciplines. natural language processing (nlp) has been one of the most popular areas for data analysis and pattern recognition. a significantly large amount of data is obtained in text format due to the ease of access nowadays. most modern techniques focus on exploring large sets of textual data to build forecasting models; they tend to ignore the importance of temporal information which is often the main ingredient to determine the performance of analysis, especially in the public policy view. the contribution of this paper is three-fold. first, a dataset called covid-news is collected from three news agencies, which consists of article segments related to wearing masks during the covid-19 pandemic. second, we propose a long-short term memory (lstm)-based learning model to predict the attitude of the articles from the three news agencies towards wearing a mask with both temporal and textural information. then we added the bert model to further improve and enhance the performance of the proposed model. experimental results on the covidnews dataset show the effectiveness of the proposed lstm-based algorithm.

uri

https://knowledgecommons.lakeheadu.ca/handle/2453/5194

collections

electronic theses and dissertations from 2009 [1627]