Upload resources

Upload your works to SEDICI to increase its visibility and improve its impact


Show simple item record

dc.date.accessioned 2019-03-15T15:53:50Z
dc.date.available 2019-03-15T15:53:50Z
dc.date.issued 2018
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/73228
dc.description.abstract For years, and nowadays even more because of the ease of access to information, countless scientific documents that cover all branches of human knowledge are generated. These documents, consisting mostly of text, are stored in digital libraries that are increasingly consenting access and manipulation. This has allowed these repositories of documents to be used for research work of great interest, particularly those related to evaluation of automatic summaries through experimentation. In this area of computer science, the experimental results of many of the published works are obtained using document collections, some known and others not so much, but without specifying all the special considerations to achieve said results. This produces an unfair competition in the realization of experiments when comparing results and does not allow to be objective in the obtained conclusions. This paper presents a text document manipulation tool to increase the exactness of results when obtaining, evaluating and comparing automatic summaries from different corpora. This work has been motivated by the need to have a tool that allows to process documents, split their content properly and make sure that each text snippet does not lose its contextual information. Applying the model proposed to a set of free-access scientific papers has been successful. en
dc.format.extent 481-490 es
dc.language en es
dc.subject automatic summarization en
dc.subject extractive approaches en
dc.subject web scraping en
dc.subject document representation en
dc.subject summaries evaluation en
dc.title Text pre-processing tool to increase the exactness of experimental results in summarization solutions en
dc.type Objeto de conferencia es
sedici.identifier.isbn 978-950-658-472-6 es
sedici.creator.person Villa Monte, Augusto es
sedici.creator.person Corvi, Julieta Pilar es
sedici.creator.person Lanzarini, Laura Cristina es
sedici.creator.person Puente, Crisitina es
sedici.creator.person Cuevas, Alfredo Simón es
sedici.creator.person Olivas, José A. es
sedici.description.note XV Workshop Bases de Datos y Minería de Datos (WBDDM) es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras en Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
sedici.date.exposure 2018-10
sedici.relation.event XXIV Congreso Argentino de Ciencias de la Computación (La Plata, 2018). es
sedici.description.peerReview peer-review es

Download Files

This item appears in the following Collection(s)

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) Except where otherwise noted, this item's license is described as Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)