parallel-processing-teaching-toolkit/04-GPU-accelerators/04-PyOpenCL/07-multi_kernel at master · javierip/parallel-processing-teaching-toolkit

Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
multi_kernel.py	multi_kernel.py
run.sh	run.sh

Name

Last commit message

Last commit date

About this example

This example shows the use of multiple kernels in a single program. Operation: (vector_a*vector_b) + (vector_c/vector_d)

Requirements

OPENCL and Python.

Run

Open a terminal and type:

sh run.sh

Output

A typical output should look like this one.

Running:  <pyopencl.Platform 'NVIDIA CUDA' at 0x55aaa0ff2890>
On GPU:  <pyopencl.Device 'GeForce GTX 750' on 'NVIDIA CUDA' at 0x55aaa1127490>
--------------------------------------------------------------------------------
CHECK :
--------------------------------------------------------------------------------
[ 0.  0.  0. ...,  0.  0.  0.]
--------------------------------------------------------------------------------
Vector (a*b+c/d)
Vector Size: 50000
Time CPU: 0.174175024033
Time GPU: 0.0157821178436

Extra Resources

OpenCL Programming Guide 1.2 Examples.
NVIDIA toolkit documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

About this example

Requirements

Run

Output

Extra Resources

FilesExpand file tree

07-multi_kernel

Directory actions

More options

Directory actions

More options

Latest commit

History

07-multi_kernel

Folders and files

parent directory

README.md

About this example

Requirements

Run

Output

Extra Resources