
Sentiment analysis laser #274

Merged

Conversation

NIXBLACK11
Contributor

No description provided.

@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Nov 28, 2023
@NIXBLACK11 NIXBLACK11 marked this pull request as draft November 28, 2023 17:00

To run the notebook in Google Colab, simply click the "Open in Colab" button below:

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/12gQUG7rPJvOVeWQkpMFzMiixqwDIdv4W?usp=sharing)
Contributor

@avidale avidale Nov 29, 2023


It seems that you have now two independent notebooks: one in Google Drive (tied to Colab), and another here in Github.

I suggest that instead, we have only one copy of the notebook, the one in Github, and modify the Colab url so that it always loads the version from Github. The url will look like https://colab.research.google.com/github/NIXBLACK11/LASER-fork/blob/Sentiment-analysis-laser/tasks/SentimentAnalysis/SentimentAnalysis.ipynb, only you'll need to update the path in a way that refers to the final destination (the main branch of the LASER repository).

(I found this trick here)

"metadata": {},
"outputs": [],
"source": [
"with open('/content/drive/MyDrive/dataset/train.csv', 'rb') as f:\n",
Contributor


You seem to be using Google Drive here, but there is no code above that mounts it. This is confusing.

Contributor Author


Yes, I have to add that.
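A minimal sketch of the missing mount step (the try/except structure and the local fallback path are assumptions for running outside Colab; `google.colab` is only available inside a Colab runtime):

```python
# Mount Google Drive so that /content/drive/MyDrive/... paths resolve.
try:
    from google.colab import drive  # only available inside Colab
    drive.mount('/content/drive')
    data_path = '/content/drive/MyDrive/dataset/train.csv'
except ImportError:
    # Hypothetical local fallback so the notebook can also run outside Colab.
    data_path = 'dataset/train.csv'

print(data_path)
```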

"metadata": {},
"outputs": [],
"source": [
"with open('/content/drive/MyDrive/dataset/train.csv', 'rb') as f:\n",
Contributor


Can we maybe add above some text about where and how to download this dataset?
Currently, those who open the notebook directly have no idea where to get it.

@NIXBLACK11 NIXBLACK11 marked this pull request as ready for review November 30, 2023 15:11
"source": [
"## Step 3: Download the Dataset\n",
"\n",
"Next, let's acquire a sentiment analysis dataset to train our model. We'll download a dataset from Kaggle and unzip it into a directory named ./dataset. Execute the following commands:\n",


What dataset are you using? Can you put a short description and a link to the Kaggle page presenting the dataset?
Also, I see that some credentials are included in the URL; did you use your own credentials for this?

Contributor Author


Yes, I used my own credentials for this.
I think I can just add steps on how to download the dataset from Kaggle.



Yes, maybe that would be better. It's a bit annoying to have to download the dataset, but I suppose your credentials might expire at a certain point and break the notebook. Isn't there another source to download the dataset with no need for credentials?
@avidale @heffernankevin might have ideas about this.

Contributor

@heffernankevin heffernankevin Dec 5, 2023


Let's maybe use one which we can download from HuggingFace. For this we can use the datasets library:

python -m pip install datasets

An example could be this dataset: https://huggingface.co/datasets/carblacac/twitter-sentiment-analysis
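A hedged sketch of loading that dataset with the `datasets` library (the column names `text` and `feeling` are assumptions taken from the dataset card; the offline fallback sample is invented for illustration):

```python
# Load the suggested tweet-sentiment dataset from the Hugging Face Hub.
try:
    from datasets import load_dataset  # pip install datasets
    ds = load_dataset("carblacac/twitter-sentiment-analysis")
    train_texts = ds["train"]["text"]      # assumed column name
    train_labels = ds["train"]["feeling"]  # assumed column name (0/1 sentiment)
except Exception:
    # Tiny invented fallback so the rest of the notebook can still be
    # exercised without network access or the `datasets` package installed.
    train_texts = ["great movie", "terrible service"]
    train_labels = [1, 0]

print(len(train_texts), len(train_labels))
```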

Contributor

@heffernankevin heffernankevin Dec 5, 2023


Also, I see you're reporting "accuracy" at the end for evaluating the trained model. However, earlier you show that the labels in the tweet dataset are not balanced. If you move to another dataset and the labels are balanced, then you can stick with accuracy. Otherwise, ideally, we should show precision and recall per label (your confusion matrix sheds light on this).
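As a sketch, per-label precision and recall can be read straight off the confusion matrix (the helper name and the example counts are invented for illustration):

```python
def per_label_metrics(cm, labels):
    """Precision and recall per label from a confusion matrix,
    where cm[i][j] counts items with true label i predicted as label j."""
    n = len(labels)
    metrics = {}
    for i, label in enumerate(labels):
        tp = cm[i][i]
        predicted = sum(cm[r][i] for r in range(n))  # column sum: predicted as i
        actual = sum(cm[i])                          # row sum: truly i
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        metrics[label] = (precision, recall)
    return metrics

# Imbalance makes accuracy misleading: a model can score high accuracy
# overall while having poor recall on the minority label.
cm = [[90, 10],   # 100 true "negative" tweets
      [30, 70]]   # 100 true "positive" tweets
print(per_label_metrics(cm, ["negative", "positive"]))
```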

}
],
"source": [
"# Sentiment Prediction with RNN Neural Network and Confusion Matrix\n",


Maybe it would be better to normalize the confusion matrix?
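Row normalization is a small transform; a pure-Python sketch (the function name and example counts are invented for illustration):

```python
def normalize_confusion_matrix(cm):
    """Row-normalize so each row shows, for one true label, the fraction
    of its examples assigned to each predicted label."""
    normalized = []
    for row in cm:
        total = sum(row)
        normalized.append([count / total if total else 0.0 for count in row])
    return normalized

cm = [[90, 10], [30, 70]]  # invented counts
print(normalize_confusion_matrix(cm))
```

Normalizing by row makes classes of different sizes directly comparable, which matters here given the label imbalance mentioned above.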

@heffernankevin
Contributor

heffernankevin commented Dec 7, 2023

I believe you're training the sentiment model on "eng_Latn". When you then try sentiments on other languages, perhaps just mention in the title that this is technically "zero-shot sentiment prediction" for languages other than English.

eg. "Step 14: Zero-shot Sentiment Prediction for Multilingual Texts"

It's one of the benefits of LASER that such a sentiment model, trained only on English, should hopefully do well in other languages (even though not explicitly trained on them). You can then also remove your first example in "english".

@heffernankevin heffernankevin merged commit 83c07d3 into facebookresearch:MLH-dev Dec 7, 2023
2 checks passed
5 participants