Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2016-11-16T12:07:25Z
dc.date.available 2016-11-16T12:07:25Z
dc.date.issued 2016
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/56750
dc.description.abstract Featured Articles (FA) are considered to be the best articles that Wikipedia has to offer and in the last years, researchers have found interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying what issues have to be enhanced or fixed in ordinary articles in order to improve their quality is a recent key research trend. Most of the approaches developed in these research trends have been proposed for the English Wikipedia. However, few efforts have been accomplished in Spanish Wikipedia, despite being Spanish, one of the most spoken languages in the world by native speakers. In this respect, we present a first breakdown of Spanish Wikipedia’s quality flaw structure. Besides, we carry out a study to automatically assess information quality in Spanish Wikipedia, where FA identification is evaluated as a binary classification task. The results obtained show that FA identification can be performed with an F1 score of 0.81, using a document model consisting of only twenty six features and AdaBoosted C4.5 decision trees as classification algorithm. en
dc.format.extent 702-711 es
dc.language en es
dc.subject Featured Articles (FA) es
dc.subject Wikipedia es
dc.subject Quality Flaws Prediction es
dc.title On the Assessment of Information Quality in Spanish Wikipedia en
dc.type Objeto de conferencia es
sedici.creator.person Urquiza, Guido es
sedici.creator.person Soria, Matías es
sedici.creator.person Pérez Casseignau, Sebastián es
sedici.creator.person Ferretti, Edgardo es
sedici.creator.person Gómez, Sergio Alejandro es
sedici.creator.person Errecalde, Marcelo Luis es
sedici.description.note XIII Workshop Bases de datos y Minería de Datos (WBDMD) es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras en Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution 4.0 International (CC BY 4.0)
sedici.rights.uri http://creativecommons.org/licenses/by/4.0/
sedici.date.exposure 2016-10
sedici.relation.event XXII Congreso Argentino de Ciencias de la Computación (CACIC 2016). es
sedici.description.peerReview peer-review es
sedici.relation.isRelatedWith http://sedici.unlp.edu.ar/handle/10915/55718 es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution 4.0 International (CC BY 4.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution 4.0 International (CC BY 4.0)