dc.date.accessioned 2014-11-04T20:27:53Z
dc.date.available 2014-11-04T20:27:53Z
dc.date.issued 2014
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/42288
dc.description.abstract Information Quality assessment in Wikipedia has become an ever-growing research line in recent years. However, few efforts have been made in Spanish Wikipedia, despite Spanish being one of the most spoken languages in the world by native speakers. In this respect, we present the first study to automatically assess information quality in Spanish Wikipedia, where Featured Article (FA) identification is evaluated as a binary classification task. Two popular classification approaches, Naive Bayes and Support Vector Machine (SVM), are evaluated with different document representations and vocabulary sizes. The obtained results show that FA identification can be performed with an F1 score of 0.81 when SVM is used as the classification algorithm and documents are represented with a binary codification of the bag-of-words model with reduced vocabulary. en
dc.language en es
dc.subject Wikipedia en
dc.subject information quality en
dc.subject featured article en
dc.subject support vector machine en
dc.title Identifying featured articles in Spanish Wikipedia en
dc.type Objeto de conferencia es
sedici.creator.person Pohn, Lian es
sedici.creator.person Ferretti, Edgardo es
sedici.creator.person Errecalde, Marcelo Luis es
sedici.description.note XI Workshop Bases de Datos y Minería de Datos es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras de Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/2.5/ar/
sedici.date.exposure 2014-10
sedici.relation.event XX Congreso Argentino de Ciencias de la Computación (Buenos Aires, 2014) es
sedici.description.peerReview peer-review es
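
The abstract above describes Featured Article identification as binary classification over a binary bag-of-words representation with a reduced vocabulary. The following is a minimal sketch of that idea, not the authors' implementation. It covers only a Naive Bayes classifier (the SVM from the abstract is not shown), and several details are assumptions made for illustration: the Bernoulli variant of Naive Bayes, whitespace tokenization, vocabulary reduction by document frequency, and every function and class name.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the record does not
    # say how the documents were tokenized.
    return text.lower().split()


def build_vocabulary(docs, size):
    """Keep the `size` terms with the highest document frequency.

    Assumption: the record only says the vocabulary was reduced, not how.
    """
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    # Sort by descending frequency, then alphabetically for determinism.
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:size]]


def binary_vector(doc, vocab):
    """Binary bag-of-words: 1 if the term occurs in the document, else 0."""
    present = set(tokenize(doc))
    return [1 if term in present else 0 for term in vocab]


class BernoulliNB:
    """Bernoulli Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        n_features = len(X[0])
        self.log_prior = {}
        self.prob = {}
        for c in self.classes:
            rows = [x for x, label in zip(X, y) if label == c]
            self.log_prior[c] = math.log(len(rows) / len(X))
            # Smoothed probability that feature j is present in class c.
            self.prob[c] = [
                (sum(r[j] for r in rows) + 1) / (len(rows) + 2)
                for j in range(n_features)
            ]
        return self

    def _score(self, x, c):
        score = self.log_prior[c]
        for xj, p in zip(x, self.prob[c]):
            score += math.log(p if xj else 1 - p)
        return score

    def predict(self, X):
        return [max(self.classes, key=lambda c: self._score(x, c)) for x in X]


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

To use the sketch, build the vocabulary from the training documents only, convert each document with `binary_vector`, fit `BernoulliNB` on vectors labeled 1 (featured) or 0 (not featured), and compare predictions on held-out documents with `f1_score`.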

