Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2016-04-21T12:12:19Z
dc.date.available 2016-04-21T12:12:19Z
dc.date.issued 2016-04
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/52377
dc.description.abstract The impressive rise of user-generated content on the web in the hands of sites like Twitter imposes new challenges to search systems. The concept of real-time search emerges, increasing the role that efficient indexing and retrieval algorithms play in this scenario. Thousands of new updates need to be processed in the very moment they are generated and users expect content to be “searchable” within seconds. This lead to the develop of efficient data structures and algorithms that may face this challenge efficiently. In this work, we introduce the concept of index entry invalidator, a strategy responsible for keeping track of the evolution of the underlying vocabulary and selectively invalidate and evict those inverted index entries that do not considerably degrade retrieval effectiveness. Consequently, the index becomes smaller and may increase overall efficiency. We introduce and evaluate two approaches based on Time-to-Live and Sliding Windows criteria. We also study the dynamics of the vocabulary using a real dataset while the evaluation is carry out using a search engine specifically designed for real-time indexing and search. en
dc.format.extent 6-13 es
dc.language en es
dc.subject Real time es
dc.subject Indexing methods es
dc.subject Information Search and Retrieval es
dc.title Improving Real Time Search Performance using Inverted Index Entries Invalidation Strategies en
dc.type Articulo es
sedici.identifier.uri http://journal.info.unlp.edu.ar/wp-content/uploads/2015/10/JCST-42-Paper-2.pdf es
sedici.identifier.issn 1666-6038 es
sedici.creator.person Ríssola, Esteban A. es
sedici.creator.person Tolosa, Gabriel Hernán es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Facultad de Informática es
sedici.subtype Articulo es
sedici.rights.license Creative Commons Attribution 3.0 Unported (CC BY 3.0)
sedici.rights.uri http://creativecommons.org/licenses/by/3.0/
sedici.description.peerReview peer-review es
sedici.relation.journalTitle Journal of Computer Science & Technology es
sedici.relation.journalVolumeAndIssue vol. 16, no. 1 es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution 3.0 Unported (CC BY 3.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution 3.0 Unported (CC BY 3.0)