DPTM stands for Data Parallel Task Manager. Its objectives are to:
- Utilize parallel processors with minimum changes to existing serial code
- Manage and schedule tasks to insure full utilization of available hardware
- Be compatible with parallel APIs like OpenCL, CUDA, MPSS and more
Specifically, as outlined by Dr. Amir Farbin:
The goal is to establish infrastructure for hybrid multi-core processor (ie CPU) and many-core co-processor (eg a GPU) computing in complex applications consisting of large numbers ofalgorithms, without major redesign and reworking of legacy software. The vision is that Task Parallel (TP) applications running on CPU cores schedule large numbers of Data Parallel (DP) computing tasks on available many-core co-processors, maximally utilizing the all available hard-ware resources. The project will be executed in the context of offline and trigger reconstruction of Large Hadron Collider (LHC) collisions, but will certainly have a broader impact because it is also applicable to particle interaction simulation (ie Geant4), streaming Data Acquisition Systems, statistical analysis, and other scientific applications.
The current version of this project has explored different smaller projects and this is reflected in the folder layout:
- aside - standalone OpenCL programs used to test system stability
- clFFT - simpler version of ‘ClientStandAloneOpenCL’ found in ‘singleFFT’
- matrixMultiplication - used to test different openCL implementations of matrixMultiplication and determine library linking compatibility
- monteCarlo - unaltered monteCarlo example from Intel SDK
- squareArray - used to compare consisitency of squareArray implemented with OpenCL and CUDA
- doc - contains some additional documentation
- kernelProfiler - OpenCL kernel launcher with performance reports
- analysis
- kernelMB
- Platform - Device - contains graphs for time(x,y,z,mb)
- optimal - contains curves for time(mb) for each kernel on each device
- kernelMB
- include - header files
- obj - object files
- raw - contains raw data from dpClientSrc with printFile enabled
- src - source files
- analysis
- toyDPTM - cuFFT implementation with a prototype task manager
- client.cu - makes requests to dptm.cu using IPC
- wrapper_FFT.cu - launches kernel on command from dptm.cu
- dptm.cu - takes client.cu requests and combines them to give to wrapper_FFT.cu
- FFTComparison - FFTW, clFFT and cuFFT comparisons without the use of the task manager
- analysis
- oroduruin - raw data from tests run on the local OSX computer
- AFSuper - raw data from tests run on the local Red Hat computer
- conclusion - contains test results in graphs from pyROOT scripts
- analysis
- utilities - OpenCL programs used to fetch information about the current PC's hardware
Installation begins with cloning the git repository:
mkdir dptm
git github.com:zbhatti/dptm.git dptm
cd dptm
Currently, only Red Hat flavor systems have been tested: RHL6Installation
- NVIDIA GTX 650
- NVIDIA GTX 780
- NVIDIA GTX Titan
- NVIDIA Tesla K20Xm
- AMD Firepro W9100
- Intel Xeon E5-2695
- Intel Xeon Phi x100