Research: The Dark Side of Sentiment Analysis: An Exploratory Review Using Lexicons, Dictionaries, and a Statistical Monkey and Chimp

January 18, 2022

by Marcia Hannigan

Sentiment analysis (SA) uses a combination of natural language processing (NLP) methods to analyze a text to estimate the implied sentiment. Sentiment may be classified into categories such as positive, negative or neutral, or be measured by a range of numerical scores.  It is used frequently in business to determine consumer attitudes towards products where data such as customer reviews may be too voluminous for traditional analysis, but the use of SA and NLP helps to identify meaningful trends.  

SA can be used to analyze a wide range of texts, including short snippets of text such as Twitter feeds to generate meaningful insights. A new study by Jim Samuel (Rutgers University), Gavin C. Rozzi (Rutgers University),  Ratnakar Palle (Apple, Inc.) in SSRN (Jan. 2022) reviews known issues with SA as documented by prior research and then compares the application of multiple of-the-shelf lexicon and dictionary methods to stock market and vaccine tweets. The intention of this research is to identify and discuss critical aspects of the “dark side” of SA and develop a conceptual discussion of the characteristics of the dark side.

The study demonstrates flaws with a plug-and-play approach to SA and concludes with notes on conceptual solutions for the dark side of SA. It points to future strategies that could be used to improve the accuracy of SA. This research can help align researcher and practitioner expectations to understand the limits and boundaries of NLP-based solutions for sentiment analysis and estimation.

The study concludes that lexicons and dictionaries help in implementing sentiment analysis. While an in-depth analysis of SA is necessary before drawing conclusions, it is important to know the limits of SA methods and tools. SA modeling may need to be customized for some situations while acknowledging the absence of satisfactory SA solutions for other situations. SA tools are very useful and must continue to be used for research and practice – however, as demonstrated and described in this study, it is vital to understand the conflicts and ways to acknowledge and address them.

It is expected that this study will lead to deeper attention to applied SA and spur new strategies for the improvement of sentiment analysis research and practice.

Read the full study.

Recent Posts

Mi Shih Recognized with GPEIG Best Journal Article Award

Mi Shih, Ph.D., Associate Professor and director of the Urban Planning and Policy Development Program, was recognized with the Global Planning Educators’ Interest Group’s (GPEIG) 2025 award for the best journal article. The award honors outstanding, peer-reviewed...

Building Capacity to Support New Jersey Autism Professionals

Building Capacity to Support New Jersey Autism Professionals: A Workforce Study and Multi-state Comparative Landscape of Policies and Practices Daniel Rosario, Josephine O’Grady, Lily McFarland, Peter Walter, Ryne Kremer, Sean Nguyen, and Wun-cian Lin for Autism New...

Dr. Rushing Talks About AI for Sickle Cell and Beyond

Dr. Melinda Rushing recently appeared on the podcast Zora Talks. In this podcast, Dr. Rushing breaks down what sickle cell really is, why it disproportionately affects people of color, and how her team is developing a new approach called Clinically Guided AI to...

NJSPL: Increasing Enrollment of Paid Family Leave

The Increasing Enrollment of Paid Family Leave for Parents in the U.S. Over the past 10 years, many U.S. states have implemented mandatory paid family leave policies to help address the lack of such policy on the national level. In this post, we examine how paid...