Date of Award

2014-01-01

Degree Name

Doctor of Philosophy

Department

Computer Science

Advisor(s)

Rodrigo A. Romero

Abstract

Heterogeneous architectures can improve the performance of applications with computationally intensive operations. Even when these architectures may reduce the execution time of applications, there are opportunities for additional performance improvement as the memory hierarchies of the central processor cores and the coprocessor cores are separate. Applications running on heterogeneous architectures where graphics processing units (GPUs) execute throughput-intense, data-parallel operations may run in a single address space provided by unified virtual addressing or expand the upper bounds of scalability and high performance computing by explicitly partitioning and transferring data across orthogonal host and device address spaces. For explicit handling, applications must allocate space in the GPU global memory, copy input data, invoke kernels, and copy results to the CPU memory. By overlapping inter-memory data transfers and GPU computation steps, applications may further reduce execution time. This research presents a software architecture with a runtime pipeline for GPU input/output scheduling that acts as a bidirectional interface between the GPU computing application and the physical device. The main aim of this system is to reduce the impact of the processor-memory performance gap by exploiting device I/O and computation overlap. Evaluation using application benchmarks shows processing improvements with speedups up to 2.37x with respect to baseline, non-streamed GPU execution. In addition, the presented input/output scheduling system is a high-level, systems abstraction that removes application software complexity while exploiting the input/output and processing concurrency capabilities of the underlying GPU.

Language

Provenance

Received from ProQuest

Copyright Date

2014

File Size

130 pages

File Format

application/pdf

Rights Holder

Julio Cesar Olaya

Recommended Citation

Olaya, Julio Cesar, "Runtime Pipeline I/O Scheduling System for GPU-Based Heterogeneous Architectures" (2014). Open Access Theses & Dissertations. 1311.
https://scholarworks.utep.edu/open_etd/1311

Download

Included in

Computer Sciences Commons

COinS

Open Access Theses & Dissertations

Runtime Pipeline I/O Scheduling System for GPU-Based Heterogeneous Architectures

Date of Award

Degree Name

Department

Advisor(s)

Abstract

Language

Provenance

Copyright Date

File Size

File Format

Rights Holder

Recommended Citation

Included in

Search

Links

Browse

Author Corner

Open Access Theses & Dissertations

Runtime Pipeline I/O Scheduling System for GPU-Based Heterogeneous Architectures

Author

Date of Award

Degree Name

Department

Advisor(s)

Abstract

Language

Provenance

Copyright Date

File Size

File Format

Rights Holder

Recommended Citation

Included in

Share

Search

Links

Browse

Author Corner