searching for answers for why my kernels produce strange error messages or "0" only results I found this answer on SO that mentions that there is a timeout of 5s for kernels running on nvidia gpus? I googled for the timout but I could not find confirming sources or more information.
What do you know about it?
Could the timout cause strange behaviour for kernels with a long runtime?
Thanks!
If you're on Windows Vista or later, the WDDM driver stack will automatically reset the device after about two seconds unless you tweak your TDR timeouts. (Windows can't tell the difference between a GPU running a lengthy kernel and a GPU that's locked up.) Tesla-branded cards running in TCC mode aren't subject to the normal display adapter restrictions and can therefore run longer kernels.
Further googling brought up this in the CUDA_Toolkit_Release_Notes_Linux.txt (Known Issus):
[update] It seems that the official name for this feature is 'watchdog'.