Two Tesla C2050 + Quadro 600 - GPU error. Ideas?

Hello, I've got a remote cluster with two Tesla C2050 cards (compute capability 2.0) plus a Quadro 600 for OS purposes. All drivers are the newest from NVIDIA. The CUDA toolkit is the latest one, 3.2. MATLAB R2010b x64 + PCT + MDCS. The parallel computing test works fine. Windows 7 x64 with 16 GB of RAM. This is a public machine, so other CUDA-compatible software is also being tested on it. MATLAB is run under the system administrator account. It worked some time ago, but now I get this error:
>> gpuDevice
??? Error using ==> feval
An error occurred during: setting up the CUDA device. The CUDA error was: (Unknown code: 10100).
Error in ==> currentDeviceIndex at 7
idx = feval( '_gpu_selectedDeviceIdx' );
Error in ==> deviceProperties at 8
idx = parallel.internal.gpu.currentDeviceIndex;
Error in ==> GPUDevice.GPUDevice>GPUDevice.current at 34
[props, E] = parallel.internal.gpu.deviceProperties();
Error in ==> gpuDevice at 18
dev = parallel.gpu.GPUDevice.current();
I have no idea what to test or check. Please help.

Accepted Answer

Edric Ellis on 17 Feb 2011
That error code usually indicates that CUDA is having a problem connecting to the device. Are you connecting to the remote machines using remote desktop? (If so, you'll need the latest Tesla-specific "TCC" drivers).
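To see whether the failure affects every card or only one of them, it can help to probe each device separately. The following is a minimal sketch, not an official diagnostic: it assumes the `gpuDeviceCount` and `gpuDevice(idx)` functions from Parallel Computing Toolbox and the `Name` and `ComputeCapability` properties of the returned device object. The messages it prints are made up for illustration.

```matlab
% Diagnostic sketch: try to select each CUDA device in turn and
% report which ones MATLAB can actually connect to.
try
    n = gpuDeviceCount;              % number of CUDA devices MATLAB can see
catch err
    % If CUDA cannot be initialised at all, even the count may fail.
    fprintf('gpuDeviceCount failed: %s\n', err.message);
    n = 0;
end
fprintf('MATLAB sees %d CUDA device(s)\n', n);

for idx = 1:n
    try
        dev = gpuDevice(idx);        % select device idx (1-based index)
        fprintf('Device %d: %s (compute capability %s)\n', ...
                idx, dev.Name, dev.ComputeCapability);
    catch err
        % Report the failure but carry on with the remaining devices.
        fprintf('Device %d could not be selected: %s\n', idx, err.message);
    end
end
```

If every index fails with the same CUDA error, the cause is more likely the driver or the way the session is connected than a single card.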
  12 Comments
Jason Ross on 3 Mar 2011
You might want to try downloading the developer driver from the CUDA downloads page. That's what we have and we definitely have the utility.
http://developer.nvidia.com/object/cuda_3_2_downloads.html
Najeeb Ahmad on 7 Mar 2011
Thanks a lot for your reply.
As per your advice, I downloaded and installed the developer driver for Vista and Windows 7 from the link you provided. The file name was devdriver_3.2_winvista-win7_32_263.06_general.exe. However, I still have the folders I mentioned in my previous reply and no NVSMI folder. I tried running the 'gpuDevice' command and still get the same error:
??? Error using ==> feval
An error occurred during: setting up the CUDA device. The CUDA error was: (Unknown code: 10100).
Error in ==> currentDeviceIndex at 7
idx = feval( '_gpu_selectedDeviceIdx' );
Error in ==> deviceProperties at 8
idx = parallel.internal.gpu.currentDeviceIndex;
Error in ==> GPUDevice.GPUDevice>GPUDevice.current at 34
[props, E] = parallel.internal.gpu.deviceProperties();
Error in ==> gpuDevice at 18
dev = parallel.gpu.GPUDevice.current();
Am I doing something wrong? Thank you for your co-operation.


More Answers (1)

Najeeb Ahmad on 8 Mar 2011
The problem was with Windows 7 32-bit, as it doesn't support the TCC driver. I installed Windows 7 64-bit and the associated TCC drivers for the Tesla C1060, and I don't get the error anymore. Moreover, I now have the NVSMI folder in its rightful place, but I never needed to run the nvidia-smi.exe utility to switch the driver into TCC mode, which suggests that the driver is in TCC mode by default.
