I am trying to find the bottleneck in my OpenCL kernel, is it possible to profile OpenCL programms on mac os X? I found gDebugger on http://www.gremedy.com/, but it requires 10.5 or 10.6 to run. AMD SDK supports only Linux and Windows.
Is there a profiler for Mountain Lion?
How detailed must your profiling information be? Is it okay to use the built-in internal profiler?
OpenCL queues can be created with the CL_QUEUE_PROFILING_ENABLE flag.
This way you can see for each kernel you executed:
When it has been
With C++-Bindings, the creation of the queue can look like this:
The extration of the profiling information looks like this:
1) Save the event object (in an array) delivered by the enqueued kernel you want to profile.
2) After execution of the queue, extract the profiling information