Year of publication: 1387 (Solar Hijri calendar)

Published at: Second Iran Data Mining Conference

Number of pages: 11

Author(s):

Maryam Mokhtari – Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
Mohammad Saraee – Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
Alireza Haghshenas – Department of Computer Engineering, Iran University of Science & Technology, Tehran, Iran

Abstract:

A ‘scam’ is a fraudulent message sent with criminal intent to Internet users’ mailboxes. Many approaches have been proposed to filter unsolicited messages, known as ‘spam’, from legitimate messages, known as ‘ham’. However, to date no suitable approach has been proposed for detecting scams. Almost all spam filters that use Machine Learning approaches classify scams as ham when scam messages are more similar to the average ham than to the average spam. Such fraudulent messages can be very harmful, as many people around the world lose money by trusting them. In this paper we use Data Mining techniques for scam detection. The Bayesian Classifier, Naïve Bayes and K-Nearest Neighbor, which are commonly used in spam detection, are evaluated experimentally and the results are reported. In addition, a new approach to scam detection is proposed. This approach uses the K-Nearest Neighbor algorithm with a modification to the Document Similarity equation. Moreover, classification is not binary (‘scam’ or ‘not scam’): a Fuzzy Decision is used instead of crisp classes. Scam messages are successfully detected by applying this approach.
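
To make the idea of a K-Nearest Neighbor classifier with a fuzzy decision more concrete, the following is a minimal sketch and not the authors' method. The abstract does not give the modified Document Similarity equation, so plain cosine similarity over word counts is used in its place. The function names (`tokenize`, `cosine_similarity`, `fuzzy_knn`), the similarity-weighted membership rule and the tiny `TRAINING` set are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the paper's data set is not described in the abstract.
TRAINING = [
    ("you have won a lottery send your bank details to claim funds", "scam"),
    ("urgent transfer of funds needed please send bank account details", "scam"),
    ("meeting moved to monday please review the attached agenda", "ham"),
    ("lunch on friday let me know if you can join", "ham"),
    ("cheap pills buy now limited offer click here", "spam"),
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors stored as Counters.

    Assumption: stands in for the paper's modified Document Similarity equation.
    """
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def fuzzy_knn(message, training, k=3):
    """Return a membership degree per class instead of a single crisp label.

    Each of the k most similar training messages contributes its similarity
    to its own class; the contributions are normalised to sum to 1.
    """
    query = Counter(tokenize(message))
    scored = [
        (cosine_similarity(query, Counter(tokenize(text))), label)
        for text, label in training
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    neighbors = scored[:k]

    labels = sorted({label for _, label in training})
    total = sum(similarity for similarity, _ in neighbors)
    if total == 0:
        # No neighbor shares any term with the query: no evidence for any class.
        return {label: 1.0 / len(labels) for label in labels}

    memberships = {label: 0.0 for label in labels}
    for similarity, label in neighbors:
        memberships[label] += similarity / total
    return memberships


if __name__ == "__main__":
    result = fuzzy_knn(
        "please send your bank account details to claim the lottery funds",
        TRAINING,
    )
    for label, degree in result.items():
        print(f"{label}: {degree:.2f}")
```

Returning a membership degree for each class lets a downstream filter pick its own threshold, for example flagging a message for review when the ‘scam’ degree is high but not dominant. This is one plausible reading of a fuzzy decision, and the paper's actual rule may differ.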