REVISTA
Supercomputing Frontiers and Innovations

TODAS

Redirigiendo al acceso original de articulo en 21 segundos...

Inicio / Supercomputing Frontiers and Innovations / Vol: 1 Núm: 1 Par: 0 (2014) / Artículo

ARTÍCULO

TITULO

Model-Driven One-Sided Factorizations on Multicore Accelerated Systems

Jack Dongarra

Azzam Haidar

Jakub Kurzak

Piotr Luszczek

Stanimire Tomov

Asim YarKhan

Resumen

Hardware heterogeneity of the HPC platforms is no longer considered unusual but instead have become the most viable way forward towards Exascale. In fact, the multitude of the heterogeneous resources available to modern computers are designed for different workloads and their efficient use is closely aligned with the specialized role envisaged by their design. Commonly in order to efficiently use such GPU resources, the workload in question must have a much greater degree of parallelism than workloads often associated with multicore processors (CPUs). Available GPU variants differ in their internal architecture and, as a result, are capable of handling workloads of varying degrees of complexity and a range of computational patterns. This vast array of applicable workloads will likely lead to an ever accelerated mixing of multicore-CPUs and GPUs in multi-user environments with the ultimate goal of offering adequate computing facilities for a wide range of scientific and technical workloads. In the following paper, we present a research prototype that uses a lightweight runtime environment to manage the resource-specific workloads, and to control the dataflow and parallel execution in hybrid systems. Our lightweight runtime environment uses task superscalar concepts to enable the developer to write serial code while providing parallel execution. This concept is reminiscent of dataflow and systolic architectures in its conceptualization of a workload as a set of side-effect-free tasks that pass data items whenever the associated work assignment have been completed. Additionally, our task abstractions and their parametrization enable uniformity in the algorithmic development across all the heterogeneous resources without sacrificing precious compute cycles. We include performance results for dense linear algebra functions which demonstrate the practicality and effectiveness of our approach that is aptly capable of full utilization of a wide range of accelerator hardware.

Acceso

PÁGINAS

pp. 85 - 115

NÚMERO

Volumen: 1 Número: 1 Parte: 0 (2014)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Algorithms
Applied Sciences
Journal of Low Power Electronics and Applications

DOI

http://dx.doi.org/10.14529/jsfi140105

Artículos similares

Automated Configuration of NoSQL Performance and Scalability Tactics for Data-Intensive Applications

Acceso

Davy Preuveneers and Wouter Joosen

This paper presents the architecture, implementation and evaluation of a middleware support layer for NoSQL storage systems. Our middleware automatically selects performance and scalability tactics in terms of application specific workloads. Enterprises ... ver más

Revista: Informatics

Survey of Storage Systems for High-Performance Computing

Acceso

Jakob Lüttgau,Michael Kuhn,Kira Duwe,Yevhen Alforov,Eugen Betke,Julian Kunkel,Thomas Ludwig Pág. 31 - 58

In current supercomputers, storage is typically provided by parallel distributed file systems for hot data and tape archives for cold data. These file systems are often compatible with local file systems due to their use of the POSIX interface and semant... ver más

Revista: Supercomputing Frontiers and Innovations

APPLICATION OF COMPUTER SIMULATION IN IMPROVING THE PROCESS OF SCREWS PRODUCTION

Acceso

Julia Siderska, Katarzyna Perkowska Pág. 89 - 96

The aim of this work is to present and discuss the possibility of using computer simulation to improve the production flow of sheet metal screws in the carpentry plant. The paper includes descriptive and schematic characterization of the present producti... ver más

Revista: Transport Economics and Logistics

Pilot Metal Workload in Flight Operation: A case study of Indonesian Civilian Pilot

Acceso

Abadi Dwi Saputra Pág. 37 - 43

This type of activity or work with high stress level and requires more concentration and attention, in this case is the aircraft operation. Thereby mental workload is the most dominant than the physical workload. And this is what should have been a conce... ver más

Revista: Aceh International Journal of Science and Technology

Analyzing Data Properties using Statistical Sampling ? Illustrated on Scientific File Formats

Acceso

Julian Martin Kunkel Pág. 34 - 39

Understanding the characteristics of data stored in data centers helps computer scientists in identifying the most suitable storage infrastructure to deal with these workloads. For example, knowing the relevance of file formats allows optimizing the rele... ver más

Revista: Supercomputing Frontiers and Innovations

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas