📃: Text Classification for Spam Detection #70

Avdhesh-Varshney · 2024-06-30T12:57:58Z

🔴 Title : Text Classification for Spam Detection
🔴 Aim : Create a text classification system to detect spam messages using machine learning techniques.
🔴 Brief Explanation :

Gather a dataset of text messages labeled as spam or not spam.
Preprocess the text data and extract relevant features.
Train a machine learning model (such as Naive Bayes, SVM, or logistic regression) to classify messages as spam or not spam.
Develop an interface where users can input text messages and receive predictions on whether they are spam or not.
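As a rough illustration of the steps above, here is a minimal, dependency-free sketch of a Naive Bayes spam classifier built on word counts. The toy messages and the names `TRAIN`, `tokenize`, `train`, and `predict` are hypothetical and exist only for this example. A real contribution would load a labeled dataset and would likely use a library implementation instead.

```python
import math
import re
from collections import Counter

# Hypothetical toy messages; a real project would load a labeled dataset instead.
TRAIN = [
    ("Win a free prize now, click here", "spam"),
    ("Free entry in a weekly cash draw", "spam"),
    ("Claim your free cash prize today", "spam"),
    ("Are we still meeting for lunch today", "ham"),
    ("Can you call me when you get home", "ham"),
    ("See you at the meeting tomorrow", "ham"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(examples):
    """Count word occurrences per class and the number of messages per class."""
    word_counts = {}
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(tokenize(text))
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return word_counts, class_counts, vocab


def predict(text, model):
    """Return the class with the highest log posterior, using Laplace smoothing."""
    word_counts, class_counts, vocab = model
    total_msgs = sum(class_counts.values())
    scores = {}
    for label in class_counts:
        # Log prior: fraction of training messages with this label.
        score = math.log(class_counts[label] / total_msgs)
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][tok] + 1) / denom)
        scores[label] = score
    return max(scores, key=scores.get)


model = train(TRAIN)
print(predict("free cash prize", model))
print(predict("lunch meeting tomorrow", model))
```

The interface step would then only need to pass the user's text to `predict` and display the returned label.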

Screenshots 📷

N/A

✅ To be Mentioned while taking the issue :

Full name :
What is your participant role? (Mention the Open Source Program name. Eg. GSSOC, SSOC, JWOC, etc.)

Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

sid7219 · 2024-10-02T07:13:51Z

Full name : Siddharth Gupta
GSSoc'24 Extended

saniyaahemad12 · 2024-10-02T17:22:43Z

Please assign this topic to me

Avdhesh-Varshney · 2024-10-02T18:06:29Z

@sid7219 @saniyaahemad12 Please describe your approach and the dataset you plan to use.

ramu-nukavarapu · 2024-10-03T00:14:43Z

Approach:
Dataset : Use the public "SMS Spam Collection" dataset.
Text preprocessing : Use libraries such as pandas and nltk to tokenize and lowercase the text, remove stop words, and apply stemming or lemmatization.
Feature extraction : Extract features with a TF-IDF vectorizer.
Model training : Train a Naive Bayes classifier, a common baseline for text classification.
User interface : Build it with Streamlit components.

Name : Ramu
Role : GSSoC contributor

Please assign this issue to me so I can work on it!
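The TF-IDF plus Naive Bayes approach described in this comment could look roughly like the following scikit-learn sketch. The toy `messages` and `labels` and the helper `classify` are assumptions made for illustration. A real run would read the SMS Spam Collection file instead, and stop-word removal is delegated here to `TfidfVectorizer` rather than nltk.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy data standing in for the real labeled dataset.
messages = [
    "Win a free prize now, click here",
    "Free entry in a weekly cash draw",
    "Claim your free cash prize today",
    "Are we still meeting for lunch today",
    "Please bring the project report to the office",
    "See you at the meeting tomorrow",
]
labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

# The vectorizer lowercases text and drops English stop words,
# then the TF-IDF features are fed to a multinomial Naive Bayes model.
spam_clf = make_pipeline(
    TfidfVectorizer(lowercase=True, stop_words="english"),
    MultinomialNB(),
)
spam_clf.fit(messages, labels)


def classify(text):
    """Return the predicted label ("spam" or "ham") for one message."""
    return spam_clf.predict([text])[0]


print(classify("Free cash prize, claim now"))
```

Keeping the vectorizer and the model in one pipeline means a Streamlit front end would only have to call `classify` on the raw input string.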

saniyaahemad12 · 2024-10-03T11:34:37Z

Here is my approach to text classification for spam detection:
Data Collection:
  Use a labeled dataset such as the SMS Spam Collection dataset.
Data Preprocessing:
  Clean the text by removing special characters and stop words.
  Tokenize the text and apply stemming or lemmatization.
Feature Extraction:
  Use TF-IDF to convert the text into numerical vectors.
  Optionally, use a Bag of Words model.
Model Selection:
  Train models using Naive Bayes, SVM, and Logistic Regression.
Model Training and Testing:
  Split the data into training and testing sets and apply cross-validation.
Model Evaluation:
  Use accuracy, precision, recall, and F1-score to evaluate performance.
Interface Development:
  Develop a user interface with Tkinter for message input and spam prediction.
If this approach looks good, please assign this issue to me.
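The model selection and evaluation steps in this comment can be sketched with scikit-learn's `cross_validate`, which scores each candidate on several metrics at once. The toy data, the `models` dictionary, and the `results` structure are hypothetical. With so few messages the scores mean little, so this only shows the mechanics.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_validate
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy data: 1 = spam, 0 = not spam.
messages = [
    "Win a free prize now, click here",
    "Free entry in a weekly cash draw",
    "Claim your free cash prize today",
    "Urgent: your account won a cash reward",
    "Free ringtone, reply now to claim",
    "You won a free holiday, call now",
    "Are we still meeting for lunch today",
    "Please bring the project report to the office",
    "See you at the meeting tomorrow",
    "Can you pick up milk on the way home",
    "The lecture is moved to room five",
    "Thanks for the notes, see you at lunch",
]
labels = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0]

models = {
    "naive_bayes": MultinomialNB(),
    "linear_svm": LinearSVC(),
    "logistic_regression": LogisticRegression(max_iter=1000),
}
scoring = ["accuracy", "precision", "recall", "f1"]

results = {}
for name, estimator in models.items():
    # Fit TF-IDF inside the pipeline so each fold only sees its own training text.
    pipe = make_pipeline(TfidfVectorizer(), estimator)
    cv = cross_validate(pipe, messages, labels, cv=3, scoring=scoring)
    results[name] = {metric: cv[f"test_{metric}"].mean() for metric in scoring}

for name, scores in results.items():
    print(name, {metric: round(value, 2) for metric, value in scores.items()})
```

Putting the vectorizer inside the pipeline avoids leaking vocabulary statistics from the held-out fold into training, which would otherwise inflate the reported scores.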

Avdhesh-Varshney · 2024-10-04T09:30:07Z

@ramu-nukavarapu In the PR, please attach a screenshot showing the model's accuracy during training and testing.

ramu-nukavarapu · 2024-10-04T09:49:43Z

@Avdhesh-Varshney Okay.

Please add the hacktoberfest label as well.

aakashmohole · 2024-10-06T17:08:34Z

I would like to work on this issue. Please assign it to me.

Name : Aakash

Avdhesh-Varshney added the Up-for-Grabs ✋ Issues are opened for the contributors label Jun 30, 2024

Avdhesh-Varshney added this to Jarvis Project Guide Jul 1, 2024

Avdhesh-Varshney moved this to Todo in Jarvis Project Guide Jul 1, 2024

Avdhesh-Varshney added Priority: Low Feat: Model Building labels Aug 14, 2024

Avdhesh-Varshney added this to the Models Integration milestone Aug 14, 2024

Avdhesh-Varshney assigned ramu-nukavarapu Oct 4, 2024

Avdhesh-Varshney added Status: Assigned gssoc-ext hacktoberfest-accepted level2 and removed Up-for-Grabs ✋ Issues are opened for the contributors labels Oct 4, 2024

Avdhesh-Varshney moved this from Todo to In Progress in Jarvis Project Guide Oct 4, 2024

Avdhesh-Varshney assigned saniyaahemad12 and unassigned ramu-nukavarapu Oct 13, 2024

jaidh01 added Up-for-Grabs ✋ Issues are opened for the contributors and removed Status: Assigned gssoc-ext hacktoberfest-accepted level2 labels Dec 15, 2024

Avdhesh-Varshney unassigned saniyaahemad12 Jan 1, 2025
