Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2012-10-25T18:16:29Z
dc.date.available 2012-10-25T18:16:29Z
dc.date.issued 2005-10
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/22957
dc.description.abstract The problem of unwanted e-mails (or spam messages) has been increasing for years. Different methods have been proposed in order to deal with this problem wich includes blacklists of known spammers, handcrafted rules and machine learning techniques. In this paper we investigate the performance of the k Nearest Neighbours (k-NN) method in spam detection tasks. At this end, a number of different document codifications were tested. Moreover, we study how the vocabulary size reduction affects this task. In the experimental design, different k values were considered and results were analyzed with respect to a public mailing list and personal e-mail collections. The experiments showed that results with public mailing lists tend to be very optimistic and they should not be considered representative of those expected with personal user accounts. en
dc.language en es
dc.subject Electronic mail es
dc.subject spam en
dc.subject anti-spam filtering en
dc.subject Message sending es
dc.subject automated text categorization en
dc.subject Information filtering es
dc.subject machine learning en
dc.subject k-NN en
dc.title Learning to detect spam messages en
dc.type Objeto de conferencia es
sedici.creator.person Gil Costa, Graciela Verónica es
sedici.creator.person Errecalde, Marcelo Luis es
sedici.creator.person Taranilla, María Teresa es
sedici.description.note VI Workshop de Agentes y Sistemas Inteligentes (WASI) es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras en Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/2.5/ar/
sedici.date.exposure 2005-10
sedici.relation.event XI Congreso Argentino de Ciencias de la Computación es
sedici.description.peerReview peer-review es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)