Lessons learned from contrasting a BLAS kernel implementations

More, Andres

Buscar material

Busque entre los 171529 recursos disponibles en el repositorio

Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

Red de Universidades con Carreras en Informática (RedUNCI)
→
Eventos
→
CACIC
→
CACIC 2013

Lessons learned from contrasting a BLAS kernel implementations

Autor: More, Andres

2013

Tipo de documento: Objeto de conferencia

Resumen

This work reviews the experience of implementing different versions of the SSPR rank-one update operation of the BLAS library. The main objective was to contrast CPU versus GPU implementation effort and complexity of an optimized BLAS routine, not considering performance. This work contributes with a sample procedure to compare BLAS kernel implementations, how to start using GPU libraries and offloading, how to analyze their performance and the issues faced and how they were solved.

Notas

WPDP- XIII Workshop procesamiento distribuido y paralelo

Información general

Fecha de publicación: octubre 2013

Idioma del documento: Inglés

Evento: XVIII Congreso Argentino de Ciencias de la Computación

Institución de origen: Red de Universidades con Carreras en Informática (RedUNCI)

Palabras claves: BLAS libraries ; Software libraries ; Optimization ; SSPR kernel ; PROCESSOR ARCHITECTURES ; CPU architecture ; GPU architecture ; Performance Analysis and Design Aids ; performance analysis ; performance measurement ; software optimization

Materias: Ciencias Informáticas

Descargar archivos

Documento completo
Descargar archivo (523.1Kb) - PDF

BASE

GoogleScholar

Creado el: 3 de diciembre de 2013

Disponible en SEDICI desde: 3 de diciembre de 2013

Por favor, utilice uno de estos identificadores(URI) para citar o enlazar este ítem:

http://sedici.unlp.edu.ar/handle/10915/31702

Mostrar el registro completo del ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)

CACIC → CACIC 2013

Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)

Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 2.5 Argentina (CC BY-NC-SA 2.5)

Iniciar sesión