Skip to content

A python and datascience based sms spam and ham detection system.

Notifications You must be signed in to change notification settings

radhikagupta6/Spam_ham_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spam ham detection

Objective:

  • To identify text messages/sms as spam or ham(non-spam).

Challenges:

  • SMSes are limited in length, number of features that can be used for classification is small.
  • Text messages generally include abbreviations, informal language, text-speak, other languages written in english.

Dataset:

  • UCI Machine Learning Repository has a collection of sms – SMS Spam Dataset.
  • It contains: A total of 4827 ham and 747 spam = 5574 messages

Classification technique

  • The NaiveBayes classifier is used here.

File Description

  • 'spam.py' file has the main program. GUI is build with tkinter.
  • Execute the file in the terminal with : 'python spam.py'
  • 'spamDetection.ipynb' has the analysis of the dataset.

About

A python and datascience based sms spam and ham detection system.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published