This repository contains the code and workflow for fine-tuning the DistilBERT (distilbert-base-uncased) model from Hugging Face on a sentiment analysis task. The dataset used for training is sourced from Kaggle.
This project covers:
- Fine-tuning distilbert-base-uncased for sentiment classification.
- Data preprocessing and tokenization using Hugging Face Transformers.
- Model training and evaluation.
- An inference script to predict sentiment on new text samples.
The three datasets used for fine-tuning are available on Kaggle, and the combined final dataset is hosted on Hugging Face. You can download them using the links below:
- IMDB dataset (Sentiment analysis) in CSV format link
- Sentiment Analysis Dataset link
- Stock News Sentiment Analysis(Massive Dataset) link
- Final dataset on Hugging Face link
The dataset is cleaned, preprocessed, and visualized using Pandas, Matplotlib, and Seaborn. Open and run the notebook:
📜 Notebook: notebooks/data_preprocessing.ipynb
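The exact cleaning and feature steps live in the notebook; the sketch below only illustrates the general shape of this stage. The CSV path and the `text`/`sentiment` column names are assumptions for illustration, not the repo's actual schema.

``` python
# Minimal preprocessing sketch (hypothetical file path and column names).
import pandas as pd
from transformers import AutoTokenizer

df = pd.read_csv("data/sentiment.csv")           # assumed location of the merged Kaggle data
df = df.dropna(subset=["text", "sentiment"])     # drop rows with missing text or label
df = df.drop_duplicates(subset=["text"])         # remove duplicate samples

# Map string labels to the integer ids expected by the model.
label2id = {"negative": 0, "positive": 1}
df["label"] = df["sentiment"].map(label2id)

# Tokenize with the same checkpoint that is later fine-tuned.
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
encodings = tokenizer(
    df["text"].tolist(),
    truncation=True,
    padding="max_length",
    max_length=256,
)
```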
Clone the repository and install the required dependencies:
``` bash
git clone https://github.com/KaushiML3/Fine-tuning-a-LLM-for-sentiment-analysis.git
cd Fine-tuning-a-LLM-for-sentiment-analysis
pip install -r requirements.txt
```
The DistilBERT model is fine-tuned using Hugging Face's Transformers library. Training includes learning-rate scheduling and evaluation metrics. Open and run the notebook (a minimal training sketch follows below):
- 📜 Notebook: notebook/Fine tune LLM with LoRA for sentiment analysis.ipynb
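The sketch below shows one way this training loop could look with the Trainer API and LoRA adapters via the PEFT library. The dataset name, hyperparameters, and output paths are illustrative placeholders, not the repo's exact configuration.

``` python
# Fine-tuning sketch: DistilBERT + LoRA (illustrative hyperparameters).
import numpy as np
from datasets import load_dataset
from peft import LoraConfig, TaskType, get_peft_model
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

checkpoint = "distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

# Wrap the base model with low-rank adapters so only a small set of weights is trained.
lora_config = LoraConfig(
    task_type=TaskType.SEQ_CLS,
    r=8, lora_alpha=16, lora_dropout=0.1,
    target_modules=["q_lin", "v_lin"],   # DistilBERT attention projection layers
)
model = get_peft_model(model, lora_config)

# Placeholder dataset; the repo trains on its merged Kaggle/Hugging Face dataset.
dataset = load_dataset("imdb")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=256)

dataset = dataset.map(tokenize, batched=True)

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {"accuracy": (preds == labels).mean()}

args = TrainingArguments(
    output_dir="outputs",
    learning_rate=2e-5,
    lr_scheduler_type="linear",          # learning-rate scheduling mentioned above
    num_train_epochs=3,
    per_device_train_batch_size=16,
)
trainer = Trainer(
    model=model, args=args,
    train_dataset=dataset["train"], eval_dataset=dataset["test"],
    compute_metrics=compute_metrics,
)
trainer.train()
print(trainer.evaluate())
```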
Alternatively, run the training script:
``` bash
python train.py
```
To test the model on new text inputs, run:
``` bash
python app.py
```
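For a quick check outside the app, the snippet below sketches how the fine-tuned checkpoint could be loaded for inference. The checkpoint path and example sentences are placeholders.

``` python
# Inference sketch: load a saved fine-tuned checkpoint and score new text.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="outputs/checkpoint-best",          # hypothetical path to the fine-tuned model
    tokenizer="distilbert-base-uncased",
)

samples = [
    "The product exceeded my expectations.",
    "The stock dropped sharply after the earnings call.",
]
for text in samples:
    print(text, "->", classifier(text)[0])    # e.g. {'label': ..., 'score': ...}
```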
- Sentiment analysis DistilBERT model demo
- Hugging Face for the DistilBERT model.
- Kaggle for the dataset.