Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vaderSentiment seems to underweight long sentences, not useful at detecting sarcasm, and can't interpret messages right after a hashtag. #120

Open
neldivad opened this issue Dec 8, 2020 · 2 comments

Comments

@neldivad
Copy link

neldivad commented Dec 8, 2020

I've tried vaderSentiment on tweets about the topic "Amber Heard". In a sample size of 100, all of the tweets are negative towards the topic. Here are some issues I've encountered:

  1. vader is bad at detecting sarcasm.

Sarcastic memes with a caption such as "Thing's that hit different: Amber Heard", "The Gorgeous Ad of Amber Heard" are falsely labeled as incredibly positive.

  1. vader does not understand words suffixed by hashtags.

analyzer.polarity_scores('You all speak about #ToxicMasculinity but where is #ToxicFeminity? #AmberHeard')
{'neg': 0.0, 'neu': 1.0, 'pos': 0.0, 'compound': 0.0}

  1. vader sometimes yield questionable results.

analyzer.polarity_scores('God I can\'t stand her face')
>>> {'neg': 0.0, 'neu': 0.704, 'pos': 0.296, 'compound': 0.2732}

@neldivad neldivad changed the title vaderSentiment seems to underweight long sentences, overweight short sentences, and not useful at detecting sarcasm. vaderSentiment seems to underweight long sentences, not useful at detecting sarcasm, and can't interpret messages right after a hashtag. Dec 8, 2020
@PeterH220
Copy link

Do you really expect detection of Amber Heard? wtf get real. Even 97% of people don't have a clue who the hell she is.

@internot169
Copy link

Might want to try to weight sentiment based on sentence length

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants