Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2023-04-18T18:54:20Z
dc.date.available 2023-04-18T18:54:20Z
dc.date.issued 2022
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/151702
dc.description.abstract Research over historical text volumes can be performed by means of automatic tools that help historians achieve more abstract and aggregated points of view. Tasks such as Information Extraction or Text Mining can be performed more efficiently if Machine Learning models are employed. We propose the evaluation of different state-of-the-art models over a new dataset for Named Entity Recognition. The dataset was built over a History texts volume about General Güemes, a national Argentinian independence hero. The results show that some models perform better in terms of precision, recall and f1-score for most types of entities. Specifically, pretrained language models fine-tuned for this particular task show considerably higher performance than classical models based on word embeddings and other kinds of representations and models.Besides, statistical tests are provided to ensure the significance in the differences of the performance values attained. Hence, the contribution of this work is twofold, on the one hand a new corpus and dataset for Named Entity Recognition and a complete statistical assessment of performance values of state-of-the-art models over the generated dataset. en
dc.format.extent 98-109 es
dc.language en es
dc.subject Named Entity Recognition and Classification es
dc.subject Argentinian History es
dc.subject Pretrained Language Models es
dc.title Evaluation of Named Entity Recognition in Historical Argentinian Documents en
dc.type Objeto de conferencia es
sedici.identifier.uri https://publicaciones.sadio.org.ar/index.php/JAIIO/article/download/270/221 es
sedici.identifier.issn 2451-7496 es
sedici.creator.person Darfe, Facundo es
sedici.creator.person Xamena, Eduardo es
sedici.creator.person Orozco, Carlos I. es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Sociedad Argentina de Informática e Investigación Operativa es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
sedici.date.exposure 2022-10
sedici.relation.event Simposio Argentino de Inteligencia Artificial (ASAI 2022) - JAIIO 51 (Modalidad virtual y presencial (UAI), octubre 2022) es
sedici.description.peerReview peer-review es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)