Busque entre los 166596 recursos disponibles en el repositorio
Mostrar el registro sencillo del ítem
dc.date.accessioned | 2012-10-25T18:16:29Z | |
dc.date.available | 2012-10-25T18:16:29Z | |
dc.date.issued | 2005-10 | |
dc.identifier.uri | http://sedici.unlp.edu.ar/handle/10915/22957 | |
dc.description.abstract | The problem of unwanted e-mails (or spam messages) has been increasing for years. Different methods have been proposed in order to deal with this problem wich includes blacklists of known spammers, handcrafted rules and machine learning techniques. In this paper we investigate the performance of the k Nearest Neighbours (k-NN) method in spam detection tasks. At this end, a number of different document codifications were tested. Moreover, we study how the vocabulary size reduction affects this task. In the experimental design, different k values were considered and results were analyzed with respect to a public mailing list and personal e-mail collections. The experiments showed that results with public mailing lists tend to be very optimistic and they should not be considered representative of those expected with personal user accounts. | en |
dc.language | en | es |
dc.subject | Electronic mail | es |
dc.subject | spam | en |
dc.subject | anti-spam filtering | en |
dc.subject | Message sending | es |
dc.subject | automated text categorization | en |
dc.subject | Information filtering | es |
dc.subject | machine learning | en |
dc.subject | k-NN | en |
dc.title | Learning to detect spam messages | en |
dc.type | Objeto de conferencia | es |
sedici.creator.person | Gil Costa, Graciela Verónica | es |
sedici.creator.person | Errecalde, Marcelo Luis | es |
sedici.creator.person | Taranilla, María Teresa | es |
sedici.description.note | VI Workshop de Agentes y Sistemas Inteligentes (WASI) | es |
sedici.subject.materias | Ciencias Informáticas | es |
sedici.description.fulltext | true | es |
mods.originInfo.place | Red de Universidades con Carreras en Informática (RedUNCI) | es |
sedici.subtype | Objeto de conferencia | es |
sedici.rights.license | Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5) | |
sedici.rights.uri | http://creativecommons.org/licenses/by-nc-sa/2.5/ar/ | |
sedici.date.exposure | 2005-10 | |
sedici.relation.event | XI Congreso Argentino de Ciencias de la Computación | es |
sedici.description.peerReview | peer-review | es |