Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2015-12-23T13:50:32Z
dc.date.available 2015-12-23T13:50:32Z
dc.date.issued 2015
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/50429
dc.description.abstract The impressive rise of user-generated content on the web in the hands of sites like Twitter imposes new challenges to search systems. The concept of real-time search emerges, increasing the role that efficient indexing and retrieval algorithms play in this scenario. Thousands of new updates need to be processed in the very moment they are generated and users expect content to be “searchable” within seconds. This lead to the develop of efficient data structures and algorithms that may face this challenge efficiently. In this work, we introduce the concept of index entry invalidator, a strategy responsible for keeping track of the evolu- tion of the underlying vocabulary and selectively invalidóte and evict those inverted index entries that do not considerably degrade retrieval effectiveness. Consequently, the index becomes smaller and may increase overall efficiency. We study the dynamics of the vocabulary using a real dataset and also provide an evaluation of the proposed strategy using a search engine specifically designed for real-time indexing and search. en
dc.language en es
dc.subject Real time es
dc.subject Sorting and searching es
dc.subject Index generation es
dc.title Inverted Index Entry Invalidation Strategy for Real Time Search en
dc.type Objeto de conferencia es
sedici.identifier.isbn 978-987-3806-05-6 es
sedici.creator.person Ríssola, Esteban A. es
sedici.creator.person Tolosa, Gabriel Hernán es
sedici.description.note XII Workshop Bases de Datos y Minería de Datos (WBDDM) es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras en Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/2.5/ar/
sedici.date.exposure 2015-10
sedici.relation.event XXI Congreso Argentino de Ciencias de la Computación (Junín, 2015) es
sedici.relation.isRelatedWith http://sedici.unlp.edu.ar/handle/10915/50028 es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)