NLP Sentiment Analysis Project

Project Overview

This project builds a text classification pipeline for emotion detection using NLP techniques. The workflow includes data loading, preprocessing, feature extraction, model training, and evaluation.

What Was Done

Loaded the dataset from train.txt.
Cleaned and preprocessed text data.
Converted emotion labels to numeric values.
Vectorized text using Bag-of-Words and TF-IDF.
Trained and evaluated multiple classification models.
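
The label conversion step can be sketched in plain Python. This is a minimal illustration, not the notebook's code: the helper names `fit_label_mapping` and `encode_labels` are hypothetical, and the example labels are made up. Assigning integers in sorted label order mirrors what scikit-learn's `LabelEncoder` does.

```python
def fit_label_mapping(labels):
    """Map each distinct label to an integer, assigned in sorted order."""
    classes = sorted(set(labels))
    return {label: index for index, label in enumerate(classes)}


def encode_labels(labels, mapping):
    """Replace every label with its integer id."""
    return [mapping[label] for label in labels]


# Illustrative labels only; the real label set comes from train.txt.
example_labels = ["joy", "anger", "sadness", "joy"]
label_to_id = fit_label_mapping(example_labels)
encoded = encode_labels(example_labels, label_to_id)

print(label_to_id)  # {'anger': 0, 'joy': 1, 'sadness': 2}
print(encoded)      # [1, 0, 2, 1]
```

Keeping the mapping around makes it possible to turn predicted integers back into emotion names later.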

Data Files

train.txt: Training dataset containing text and emotion labels.
test.txt: Optional test data file for further evaluation.
val.txt: Optional validation data file for model tuning.
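
A possible way to read these files is sketched below. The line format is an assumption: this README does not describe it, so the sketch supposes one example per line with the text and the label separated by a semicolon. Adjust `sep` if the files use a different layout. The function names `parse_lines` and `load_file` are hypothetical.

```python
from pathlib import Path


def parse_lines(lines, sep=";"):
    """Split each non-empty line into (text, label) at the last separator.

    Assumes a "text<sep>label" layout, which this README does not confirm.
    """
    pairs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        text, found, label = line.rpartition(sep)
        if not found:
            raise ValueError(f"no {sep!r} separator in line: {line!r}")
        pairs.append((text.strip(), label.strip()))
    return pairs


def load_file(path, sep=";"):
    """Read a data file such as train.txt into a list of (text, label) pairs."""
    with Path(path).open(encoding="utf-8") as handle:
        return parse_lines(handle, sep=sep)


# Made-up sample lines standing in for the contents of train.txt.
sample = ["i feel great today;joy", "", "this makes me furious;anger"]
print(parse_lines(sample))
```

Splitting at the last separator keeps any semicolons that appear inside the text itself.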

Preprocessing Steps

Lowercased all text.
Removed URLs.
Removed digits.
Removed emojis.
Removed punctuation.
Removed stop words.
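
The steps above could be combined into a single cleaning function along these lines. This is a sketch under stated assumptions rather than the notebook's implementation: the emoji pattern covers only the most common Unicode emoji ranges, and `STOP_WORDS` is a tiny hypothetical list standing in for whatever stop word list the project uses. URLs are removed before punctuation so that they are still recognizable as URLs.

```python
import re
import string

URL_PATTERN = re.compile(r"https?://\S+|www\.\S+")
DIGIT_PATTERN = re.compile(r"\d+")
# Approximate emoji coverage (an assumption, not an exhaustive list).
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F1E6-\U0001F1FF"  # regional indicator (flag) letters
    "\U0001F300-\U0001FAFF"  # pictographs, emoticons, transport, extended symbols
    "\u2600-\u27BF"          # miscellaneous symbols and dingbats
    "\uFE0F\u200D"           # variation selector and zero-width joiner
    "]+"
)
PUNCT_TABLE = str.maketrans("", "", string.punctuation)
# Hypothetical mini stop word list, for illustration only.
STOP_WORDS = {"a", "an", "the", "and", "or", "is", "am", "are",
              "i", "to", "of", "in", "it", "this", "so", "at"}


def clean_text(text):
    """Apply the preprocessing steps in the order listed above."""
    text = text.lower()
    text = URL_PATTERN.sub(" ", text)
    text = DIGIT_PATTERN.sub(" ", text)
    text = EMOJI_PATTERN.sub(" ", text)
    text = text.translate(PUNCT_TABLE)  # deletes ASCII punctuation
    tokens = [token for token in text.split() if token not in STOP_WORDS]
    return " ".join(tokens)


# "\U0001F600" is the grinning face emoji.
raw = "I am SO happy today!!! \U0001F600 Check https://example.com at 10am"
print(clean_text(raw))  # happy today check
```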

Models Evaluated

Multinomial Naive Bayes
Logistic Regression
Support Vector Machine (SVM)
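
One way to train these three models on both feature types is sketched below, assuming scikit-learn (this README does not name the library). The toy texts and labels are invented for illustration, `LinearSVC` is an assumed choice of SVM variant, and `max_iter=1000` is an arbitrary setting rather than a value taken from the notebook.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy data invented for this sketch; the real data comes from train.txt.
texts = [
    "happy wonderful day", "feeling joyful and glad",
    "angry furious mad", "this makes me so angry",
    "sad lonely crying", "feeling down and sad",
]
labels = ["joy", "joy", "anger", "anger", "sadness", "sadness"]

# Bag-of-Words counts versus TF-IDF weights.
vectorizers = {"bow": CountVectorizer, "tfidf": TfidfVectorizer}
classifiers = {
    "naive_bayes": lambda: MultinomialNB(),
    "logistic_regression": lambda: LogisticRegression(max_iter=1000),
    "svm": lambda: LinearSVC(),  # assumed SVM variant
}

# Fit every vectorizer/classifier combination as its own pipeline.
fitted = {}
for vec_name, make_vectorizer in vectorizers.items():
    for clf_name, make_classifier in classifiers.items():
        model = make_pipeline(make_vectorizer(), make_classifier())
        model.fit(texts, labels)
        fitted[(vec_name, clf_name)] = model

for (vec_name, clf_name), model in fitted.items():
    prediction = model.predict(["what a happy day"])[0]
    print(f"{vec_name} + {clf_name}: {prediction}")
```

Wrapping the vectorizer and classifier in one pipeline means the vocabulary is learned only from the data passed to `fit`, which avoids leaking validation or test text into the features.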

Results

The models were evaluated using:

Accuracy
Precision
Recall
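
For a multi-class problem, precision and recall are computed per class and then averaged. The sketch below uses macro averaging, which is an assumption: this README does not say which averaging the notebook uses. The function names and the example labels are hypothetical.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def per_class_precision_recall(y_true, y_pred):
    """Return {label: (precision, recall)} for every label seen."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        true_pos = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        predicted = sum(p == label for p in y_pred)
        actual = sum(t == label for t in y_true)
        precision = true_pos / predicted if predicted else 0.0
        recall = true_pos / actual if actual else 0.0
        scores[label] = (precision, recall)
    return scores


def macro_average(scores):
    """Unweighted mean of the per-class precision and recall."""
    count = len(scores)
    precision = sum(p for p, _ in scores.values()) / count
    recall = sum(r for _, r in scores.values()) / count
    return precision, recall


# Made-up labels and predictions for illustration.
y_true = ["joy", "joy", "anger", "sadness"]
y_pred = ["joy", "anger", "anger", "sadness"]

print(accuracy(y_true, y_pred))  # 0.75
macro_precision, macro_recall = macro_average(
    per_class_precision_recall(y_true, y_pred)
)
print(round(macro_precision, 3), round(macro_recall, 3))  # 0.833 0.833
```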

Summary Table

| Step | Description |
| --- | --- |
| Data Loading | Read `train.txt` with text and emotion labels. |
| Label Encoding | Mapped emotion labels to integer values. |
| Text Cleaning | Lowercase, remove URLs, digits, emojis, punctuation, and stop words. |
| Vectorization | Converted text into numeric features using Bag-of-Words and TF-IDF. |
| Model Training | Trained Naive Bayes, Logistic Regression, and SVM models. |
| Evaluation | Compared models using accuracy, precision, and recall. |

Notes

This project focuses on text preprocessing and classification for emotion detection. The notebook (`NLP.ipynb`) includes the full pipeline for preparing the data and evaluating models.
