Personal computing discussed

Moderators: renee, farmpuma, just brew it!

 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Why is my GPU folding slot failing?

Fri Dec 05, 2014 9:35 am

I'm back folding for TR after getting a GTX 970, but my GPU slot keeps failing. It's an EVGA ACX 1 version, factory overclocked to boost to 1366 MHz. I have a stable gaming overclock of 1451 MHz boost on the GPU, and 1976 MHz on the memory, and this was the case the first time the slot failed. I have tried removing and re-adding the GPU slot. I have tried disabling the overclock entirely. I have tried underclocking the GPU 100 MHz, leaving the memory stock. I have uninstalled F@H (v7.4.4 since before the GPU upgrade) including deleting the data, and reinstalling. I have even completely uninstalled GeForce Experience, the driver, and all other nvidia software such as PhysX before reinstalling everything and trying again.

It seems that unless I wait a day or so to start folding on the GPU again, it will fail before making any progress in the work unit. Otherwise it may make it all the way through one work unit before failing.

The card itself seems fine... It never gets above 72 C and I never have a problem gaming unless I exceed 1451 MHz. My 750 Ti in my other system folds flawlessly. What gives?
 
PainIs4ThaWeak1
Gerbil
Posts: 77
Joined: Fri Jun 26, 2009 11:13 am

Re: Why is my GPU folding slot failing?

Fri Dec 05, 2014 1:31 pm

Could you share your folding log with us from a point-in-time when this is occurring? (Best when log verbosity is set to "5")

Additionally, what driver version is being used?
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Fri Dec 05, 2014 1:56 pm

The driver is the latest, 344.75 I believe. Set to update to beta drivers but I think the current one is WHQL. I'll look into setting log verbosity and capturing the event when I get home from work.
 
Kougar
Minister of Gerbil Affairs
Posts: 2306
Joined: Tue Dec 02, 2008 2:12 am
Location: Texas

Re: Why is my GPU folding slot failing?

Sat Dec 06, 2014 12:01 am

Are you using any F@H flags or no?

Put the card clocks back to defaults, then drop them by 200Mhz and see if that makes the crashing go away. If it does then safe bet the card wasn't fully stable out of the box. If it continues to drop WU's then it's something else.
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Sat Dec 06, 2014 10:57 am

I don't know what the deal is, but since I got home from work it hasn't failed. It was folding on idle with the full overclock... set to Full now. I will look up what the stock clocks are supposed to be and underclock to that if/when it fails again and if it continues to fail I will post the logs.

I don't use any flags.

Weird.

Image
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 7:30 am

Kougar wrote:
Are you using any F@H flags or no?

Put the card clocks back to defaults, then drop them by 200Mhz and see if that makes the crashing go away. If it does then safe bet the card wasn't fully stable out of the box. If it continues to drop WU's then it's something else.


Okay, it failed at 1451 MHz again (saw that comin', right?) so I underclocked it 200 MHz (1166 MHz boost, I believe this is even lower than stock 970) and it has failed again. I had previously set the log verbosity to 5 so I set the view specific to the GPU slot, went back to a point where it was successfully completing a work unit, and copied it from there.


******************************* Date: 2014-12-09 *******************************
14:55:57:WU01:FS01:0x18:Completed 3550000 out of 5000000 steps (71%)
15:06:55:WU01:FS01:0x18:Completed 3600000 out of 5000000 steps (72%)
15:17:59:WU01:FS01:0x18:Completed 3650000 out of 5000000 steps (73%)
15:28:57:WU01:FS01:0x18:Completed 3700000 out of 5000000 steps (74%)
15:39:55:WU01:FS01:0x18:Completed 3750000 out of 5000000 steps (75%)
15:51:05:WU01:FS01:0x18:Completed 3800000 out of 5000000 steps (76%)
16:02:03:WU01:FS01:0x18:Completed 3850000 out of 5000000 steps (77%)
16:13:12:WU01:FS01:0x18:Completed 3900000 out of 5000000 steps (78%)
16:24:10:WU01:FS01:0x18:Completed 3950000 out of 5000000 steps (79%)
16:35:08:WU01:FS01:0x18:Completed 4000000 out of 5000000 steps (80%)
16:46:18:WU01:FS01:0x18:Completed 4050000 out of 5000000 steps (81%)
16:57:16:WU01:FS01:0x18:Completed 4100000 out of 5000000 steps (82%)
17:08:26:WU01:FS01:0x18:Completed 4150000 out of 5000000 steps (83%)
17:19:24:WU01:FS01:0x18:Completed 4200000 out of 5000000 steps (84%)
17:30:22:WU01:FS01:0x18:Completed 4250000 out of 5000000 steps (85%)
17:41:32:WU01:FS01:0x18:Completed 4300000 out of 5000000 steps (86%)
17:52:31:WU01:FS01:0x18:Completed 4350000 out of 5000000 steps (87%)
18:03:41:WU01:FS01:0x18:Completed 4400000 out of 5000000 steps (88%)
18:14:39:WU01:FS01:0x18:Completed 4450000 out of 5000000 steps (89%)
18:25:37:WU01:FS01:0x18:Completed 4500000 out of 5000000 steps (90%)
18:36:48:WU01:FS01:0x18:Completed 4550000 out of 5000000 steps (91%)
18:47:46:WU01:FS01:0x18:Completed 4600000 out of 5000000 steps (92%)
18:58:57:WU01:FS01:0x18:Completed 4650000 out of 5000000 steps (93%)
19:10:12:WU01:FS01:0x18:Completed 4700000 out of 5000000 steps (94%)
19:21:26:WU01:FS01:0x18:Completed 4750000 out of 5000000 steps (95%)
19:32:52:WU01:FS01:0x18:Completed 4800000 out of 5000000 steps (96%)
19:44:06:WU01:FS01:0x18:Completed 4850000 out of 5000000 steps (97%)
19:55:32:WU01:FS01:0x18:Completed 4900000 out of 5000000 steps (98%)
20:06:45:WU01:FS01:0x18:Completed 4950000 out of 5000000 steps (99%)
20:17:59:WU01:FS01:0x18:Completed 5000000 out of 5000000 steps (100%)
20:18:00:WU00:FS01:Connecting to 171.67.108.200:80
20:18:00:WU00:FS01:Assigned to work server 140.163.4.235
20:18:00:WU00:FS01:Requesting new work unit for slot 01: RUNNING gpu:0:GM204 [GeForce GTX 970] from 140.163.4.235
20:18:00:WU00:FS01:Connecting to 140.163.4.235:8080
20:18:01:WU00:FS01:Downloading 4.24MiB
20:18:02:WU00:FS01:Download complete
20:18:02:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10472 run:0 clone:183 gen:23 core:0x18 unit:0x00000023538b3dbb53beb7b653a82efd
20:18:11:WU01:FS01:0x18:Saving result file logfile_01.txt
20:18:11:WU01:FS01:0x18:Saving result file checkpointState.xml
20:18:13:WU01:FS01:0x18:Saving result file checkpt.crc
20:18:13:WU01:FS01:0x18:Saving result file log.txt
20:18:13:WU01:FS01:0x18:Saving result file positions.xtc
20:18:15:WU01:FS01:0x18:Folding@home Core Shutdown: FINISHED_UNIT
20:18:15:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:18:15:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:10471 run:0 clone:306 gen:21 core:0x18 unit:0x0000001c538b3dbb53beaeac18d11116
20:18:16:WU01:FS01:Uploading 11.06MiB to 140.163.4.235
20:18:16:WU01:FS01:Connecting to 140.163.4.235:8080
20:18:16:WU00:FS01:Starting
20:18:16:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:16:WU00:FS01:Started FahCore on PID 8804
20:18:16:WU00:FS01:Core PID:8336
20:18:16:WU00:FS01:FahCore 0x18 started
20:18:16:WU00:FS01:0x18:*********************** Log Started 2014-12-09T20:18:16Z ***********************
20:18:16:WU00:FS01:0x18:Project: 10472 (Run 0, Clone 183, Gen 23)
20:18:16:WU00:FS01:0x18:Unit: 0x00000023538b3dbb53beb7b653a82efd
20:18:16:WU00:FS01:0x18:CPU: 0x00000000000000000000000000000000
20:18:16:WU00:FS01:0x18:Machine: 1
20:18:16:WU00:FS01:0x18:Reading tar file state.xml
20:18:17:WU00:FS01:0x18:Reading tar file system.xml
20:18:17:WU00:FS01:0x18:Reading tar file integrator.xml
20:18:17:WU00:FS01:0x18:Reading tar file core.xml
20:18:17:WU00:FS01:0x18:Digital signatures verified
20:18:17:WU00:FS01:0x18:Folding@home GPU core18
20:18:17:WU00:FS01:0x18:Version 0.0.3
20:18:17:WU00:FS01:0x18:ERROR:exception: Bad platformId size.
20:18:17:WU00:FS01:0x18:Saving result file logfile_01.txt
20:18:17:WU00:FS01:0x18:Saving result file log.txt
20:18:17:WU00:FS01:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:18:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:18:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:10472 run:0 clone:183 gen:23 core:0x18 unit:0x00000023538b3dbb53beb7b653a82efd
20:18:18:WU00:FS01:Uploading 1.95KiB to 140.163.4.235
20:18:18:WU00:FS01:Connecting to 140.163.4.235:8080
20:18:18:WU00:FS01:Upload complete
20:18:18:WU00:FS01:Server responded WORK_ACK (400)
20:18:18:WU00:FS01:Cleaning up
20:18:18:WU00:FS01:Connecting to 171.67.108.200:80
20:18:19:WU00:FS01:Assigned to work server 171.67.108.52
20:18:19:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:19:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:19:WU00:FS01:Downloading 1.52MiB
20:18:21:WU00:FS01:Download complete
20:18:21:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9201 run:616 clone:4 gen:78 core:0x17 unit:0x0000006f6652edc45399ee443e3619ed
20:18:21:WU00:FS01:Starting
20:18:21:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:21:WU00:FS01:Started FahCore on PID 9636
20:18:22:WU00:FS01:Core PID:8772
20:18:22:WU00:FS01:FahCore 0x17 started
20:18:23:WU00:FS01:0x17:*********************** Log Started 2014-12-09T20:18:22Z ***********************
20:18:23:WU00:FS01:0x17:Project: 9201 (Run 616, Clone 4, Gen 78)
20:18:23:WU00:FS01:0x17:Unit: 0x0000006f6652edc45399ee443e3619ed
20:18:23:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:23:WU00:FS01:0x17:Machine: 1
20:18:23:WU00:FS01:0x17:Reading tar file state.xml
20:18:23:WU00:FS01:0x17:Reading tar file system.xml
20:18:23:WU00:FS01:0x17:Reading tar file integrator.xml
20:18:23:WU00:FS01:0x17:Reading tar file core.xml
20:18:23:WU00:FS01:0x17:Digital signatures verified
20:18:23:WU00:FS01:0x17:Folding@home GPU core17
20:18:23:WU00:FS01:0x17:Version 0.0.52
20:18:23:WU00:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:23:WU00:FS01:0x17:Saving result file logfile_01.txt
20:18:23:WU00:FS01:0x17:Saving result file log.txt
20:18:23:WU00:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:23:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:23:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9201 run:616 clone:4 gen:78 core:0x17 unit:0x0000006f6652edc45399ee443e3619ed
20:18:23:WU00:FS01:Uploading 1.95KiB to 171.67.108.52
20:18:23:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:24:WU00:FS01:Upload complete
20:18:24:WU03:FS01:Connecting to 171.67.108.200:80
20:18:24:WU00:FS01:Server responded WORK_ACK (400)
20:18:24:WU00:FS01:Cleaning up
20:18:25:WU03:FS01:Assigned to work server 171.67.108.52
20:18:25:WU03:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:25:WU03:FS01:Connecting to 171.67.108.52:8080
20:18:25:WU03:FS01:Downloading 1.53MiB
20:18:26:WU03:FS01:Download complete
20:18:26:WU03:FS01:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:9201 run:451 clone:2 gen:71 core:0x17 unit:0x000000696652edc45399e7c7826783e5
20:18:26:WU03:FS01:Starting
20:18:26:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 03 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:26:WU03:FS01:Started FahCore on PID 12128
20:18:26:WU03:FS01:Core PID:11432
20:18:26:WU03:FS01:FahCore 0x17 started
20:18:27:WU03:FS01:0x17:*********************** Log Started 2014-12-09T20:18:26Z ***********************
20:18:27:WU03:FS01:0x17:Project: 9201 (Run 451, Clone 2, Gen 71)
20:18:27:WU03:FS01:0x17:Unit: 0x000000696652edc45399e7c7826783e5
20:18:27:WU03:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:27:WU03:FS01:0x17:Machine: 1
20:18:27:WU03:FS01:0x17:Reading tar file state.xml
20:18:27:WU03:FS01:0x17:Reading tar file system.xml
20:18:27:WU03:FS01:0x17:Reading tar file integrator.xml
20:18:27:WU03:FS01:0x17:Reading tar file core.xml
20:18:27:WU03:FS01:0x17:Digital signatures verified
20:18:27:WU03:FS01:0x17:Folding@home GPU core17
20:18:27:WU03:FS01:0x17:Version 0.0.52
20:18:27:WU03:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:27:WU03:FS01:0x17:Saving result file logfile_01.txt
20:18:27:WU03:FS01:0x17:Saving result file log.txt
20:18:27:WU03:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:27:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:27:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:9201 run:451 clone:2 gen:71 core:0x17 unit:0x000000696652edc45399e7c7826783e5
20:18:27:WU03:FS01:Uploading 1.94KiB to 171.67.108.52
20:18:27:WU03:FS01:Connecting to 171.67.108.52:8080
20:18:27:WU03:FS01:Upload complete
20:18:27:WU00:FS01:Connecting to 171.67.108.200:80
20:18:27:WU03:FS01:Server responded WORK_ACK (400)
20:18:28:WU03:FS01:Cleaning up
20:18:28:WU00:FS01:Assigned to work server 140.163.4.235
20:18:28:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 140.163.4.235
20:18:28:WU00:FS01:Connecting to 140.163.4.235:8080
20:18:28:WU01:FS01:Upload complete
20:18:29:WU00:FS01:Downloading 4.24MiB
20:18:29:WU01:FS01:Server responded WORK_ACK (400)
20:18:29:WU01:FS01:Final credit estimate, 46013.00 points
20:18:29:WU01:FS01:Cleaning up
20:18:30:WU00:FS01:Download complete
20:18:30:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:10472 run:0 clone:314 gen:22 core:0x18 unit:0x0000001e538b3dbb53bebadf50c2cbb1
20:18:30:WU00:FS01:Starting
20:18:30:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:30:WU00:FS01:Started FahCore on PID 12176
20:18:30:WU00:FS01:Core PID:11388
20:18:30:WU00:FS01:FahCore 0x18 started
20:18:30:WU00:FS01:0x18:*********************** Log Started 2014-12-09T20:18:30Z ***********************
20:18:30:WU00:FS01:0x18:Project: 10472 (Run 0, Clone 314, Gen 22)
20:18:30:WU00:FS01:0x18:Unit: 0x0000001e538b3dbb53bebadf50c2cbb1
20:18:30:WU00:FS01:0x18:CPU: 0x00000000000000000000000000000000
20:18:30:WU00:FS01:0x18:Machine: 1
20:18:30:WU00:FS01:0x18:Reading tar file state.xml
20:18:31:WU00:FS01:0x18:Reading tar file system.xml
20:18:31:WU00:FS01:0x18:Reading tar file integrator.xml
20:18:31:WU00:FS01:0x18:Reading tar file core.xml
20:18:31:WU00:FS01:0x18:Digital signatures verified
20:18:31:WU00:FS01:0x18:Folding@home GPU core18
20:18:31:WU00:FS01:0x18:Version 0.0.3
20:18:31:WU00:FS01:0x18:ERROR:exception: Bad platformId size.
20:18:31:WU00:FS01:0x18:Saving result file logfile_01.txt
20:18:31:WU00:FS01:0x18:Saving result file log.txt
20:18:31:WU00:FS01:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:32:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:32:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:10472 run:0 clone:314 gen:22 core:0x18 unit:0x0000001e538b3dbb53bebadf50c2cbb1
20:18:32:WU00:FS01:Uploading 1.96KiB to 140.163.4.235
20:18:32:WU00:FS01:Connecting to 140.163.4.235:8080
20:18:32:WU00:FS01:Upload complete
20:18:32:WU00:FS01:Server responded WORK_ACK (400)
20:18:32:WU00:FS01:Cleaning up
20:18:32:WU00:FS01:Connecting to 171.67.108.200:80
20:18:33:WU00:FS01:Assigned to work server 171.67.108.52
20:18:33:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:33:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:33:WU00:FS01:Downloading 1.52MiB
20:18:34:WU00:FS01:Download complete
20:18:34:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9201 run:382 clone:3 gen:62 core:0x17 unit:0x000000666652edc45399e5159cb92fe4
20:18:34:WU00:FS01:Starting
20:18:34:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:34:WU00:FS01:Started FahCore on PID 9308
20:18:34:WU00:FS01:Core PID:12212
20:18:34:WU00:FS01:FahCore 0x17 started
20:18:35:WU00:FS01:0x17:*********************** Log Started 2014-12-09T20:18:34Z ***********************
20:18:35:WU00:FS01:0x17:Project: 9201 (Run 382, Clone 3, Gen 62)
20:18:35:WU00:FS01:0x17:Unit: 0x000000666652edc45399e5159cb92fe4
20:18:35:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:35:WU00:FS01:0x17:Machine: 1
20:18:35:WU00:FS01:0x17:Reading tar file state.xml
20:18:35:WU00:FS01:0x17:Reading tar file system.xml
20:18:35:WU00:FS01:0x17:Reading tar file integrator.xml
20:18:35:WU00:FS01:0x17:Reading tar file core.xml
20:18:35:WU00:FS01:0x17:Digital signatures verified
20:18:35:WU00:FS01:0x17:Folding@home GPU core17
20:18:35:WU00:FS01:0x17:Version 0.0.52
20:18:35:WU00:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:35:WU00:FS01:0x17:Saving result file logfile_01.txt
20:18:35:WU00:FS01:0x17:Saving result file log.txt
20:18:35:WU00:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:35:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:35:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9201 run:382 clone:3 gen:62 core:0x17 unit:0x000000666652edc45399e5159cb92fe4
20:18:35:WU00:FS01:Uploading 1.96KiB to 171.67.108.52
20:18:35:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:35:WU00:FS01:Upload complete
20:18:35:WU00:FS01:Server responded WORK_ACK (400)
20:18:35:WU00:FS01:Cleaning up
20:18:35:WU01:FS01:Connecting to 171.67.108.200:80
20:18:36:WU01:FS01:Assigned to work server 140.163.4.235
20:18:36:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 140.163.4.235
20:18:36:WU01:FS01:Connecting to 140.163.4.235:8080
20:18:36:WU01:FS01:Downloading 4.23MiB
20:18:37:WU01:FS01:Download complete
20:18:37:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10472 run:0 clone:128 gen:19 core:0x18 unit:0x00000020538b3dbb53beb661c1997625
20:18:37:WU01:FS01:Starting
20:18:37:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 01 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:37:WU01:FS01:Started FahCore on PID 9292
20:18:37:WU01:FS01:Core PID:11096
20:18:37:WU01:FS01:FahCore 0x18 started
20:18:38:WU01:FS01:0x18:*********************** Log Started 2014-12-09T20:18:37Z ***********************
20:18:38:WU01:FS01:0x18:Project: 10472 (Run 0, Clone 128, Gen 19)
20:18:38:WU01:FS01:0x18:Unit: 0x00000020538b3dbb53beb661c1997625
20:18:38:WU01:FS01:0x18:CPU: 0x00000000000000000000000000000000
20:18:38:WU01:FS01:0x18:Machine: 1
20:18:38:WU01:FS01:0x18:Reading tar file state.xml
20:18:38:WU01:FS01:0x18:Reading tar file system.xml
20:18:39:WU01:FS01:0x18:Reading tar file integrator.xml
20:18:39:WU01:FS01:0x18:Reading tar file core.xml
20:18:39:WU01:FS01:0x18:Digital signatures verified
20:18:39:WU01:FS01:0x18:Folding@home GPU core18
20:18:39:WU01:FS01:0x18:Version 0.0.3
20:18:39:WU01:FS01:0x18:ERROR:exception: Bad platformId size.
20:18:39:WU01:FS01:0x18:Saving result file logfile_01.txt
20:18:39:WU01:FS01:0x18:Saving result file log.txt
20:18:39:WU01:FS01:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:39:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:39:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:10472 run:0 clone:128 gen:19 core:0x18 unit:0x00000020538b3dbb53beb661c1997625
20:18:39:WU01:FS01:Uploading 1.94KiB to 140.163.4.235
20:18:39:WU01:FS01:Connecting to 140.163.4.235:8080
20:18:39:WU01:FS01:Upload complete
20:18:40:WU01:FS01:Server responded WORK_ACK (400)
20:18:40:WU01:FS01:Cleaning up
20:18:40:WU00:FS01:Connecting to 171.67.108.200:80
20:18:40:WU00:FS01:Assigned to work server 171.67.108.52
20:18:40:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:40:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:40:WU00:FS01:Downloading 1.52MiB
20:18:41:WU00:FS01:Download complete
20:18:42:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9201 run:463 clone:2 gen:65 core:0x17 unit:0x0000006b6652edc45399e83fa9f4b977
20:18:42:WU00:FS01:Starting
20:18:42:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:42:WU00:FS01:Started FahCore on PID 8900
20:18:42:WU00:FS01:Core PID:12052
20:18:42:WU00:FS01:FahCore 0x17 started
20:18:42:WU00:FS01:0x17:*********************** Log Started 2014-12-09T20:18:42Z ***********************
20:18:42:WU00:FS01:0x17:Project: 9201 (Run 463, Clone 2, Gen 65)
20:18:42:WU00:FS01:0x17:Unit: 0x0000006b6652edc45399e83fa9f4b977
20:18:42:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:42:WU00:FS01:0x17:Machine: 1
20:18:42:WU00:FS01:0x17:Reading tar file state.xml
20:18:42:WU00:FS01:0x17:Reading tar file system.xml
20:18:42:WU00:FS01:0x17:Reading tar file integrator.xml
20:18:42:WU00:FS01:0x17:Reading tar file core.xml
20:18:42:WU00:FS01:0x17:Digital signatures verified
20:18:42:WU00:FS01:0x17:Folding@home GPU core17
20:18:42:WU00:FS01:0x17:Version 0.0.52
20:18:42:WU00:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:42:WU00:FS01:0x17:Saving result file logfile_01.txt
20:18:42:WU00:FS01:0x17:Saving result file log.txt
20:18:42:WU00:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:43:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:43:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9201 run:463 clone:2 gen:65 core:0x17 unit:0x0000006b6652edc45399e83fa9f4b977
20:18:43:WU00:FS01:Uploading 1.95KiB to 171.67.108.52
20:18:43:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:43:WU00:FS01:Upload complete
20:18:43:WU00:FS01:Server responded WORK_ACK (400)
20:18:43:WU00:FS01:Cleaning up
20:18:43:WU01:FS01:Connecting to 171.67.108.200:80
20:18:43:WU01:FS01:Assigned to work server 171.67.108.52
20:18:43:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:43:WU01:FS01:Connecting to 171.67.108.52:8080
20:18:43:WU01:FS01:Downloading 1.52MiB
20:18:45:WU01:FS01:Download complete
20:18:45:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9201 run:556 clone:4 gen:62 core:0x17 unit:0x000000586652edc45399ebe8b4a169b2
20:18:45:WU01:FS01:Starting
20:18:45:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:45:WU01:FS01:Started FahCore on PID 11468
20:18:45:WU01:FS01:Core PID:6316
20:18:45:WU01:FS01:FahCore 0x17 started
20:18:45:WU01:FS01:0x17:*********************** Log Started 2014-12-09T20:18:45Z ***********************
20:18:45:WU01:FS01:0x17:Project: 9201 (Run 556, Clone 4, Gen 62)
20:18:45:WU01:FS01:0x17:Unit: 0x000000586652edc45399ebe8b4a169b2
20:18:45:WU01:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:45:WU01:FS01:0x17:Machine: 1
20:18:45:WU01:FS01:0x17:Reading tar file state.xml
20:18:45:WU01:FS01:0x17:Reading tar file system.xml
20:18:46:WU01:FS01:0x17:Reading tar file integrator.xml
20:18:46:WU01:FS01:0x17:Reading tar file core.xml
20:18:46:WU01:FS01:0x17:Digital signatures verified
20:18:46:WU01:FS01:0x17:Folding@home GPU core17
20:18:46:WU01:FS01:0x17:Version 0.0.52
20:18:46:WU01:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:46:WU01:FS01:0x17:Saving result file logfile_01.txt
20:18:46:WU01:FS01:0x17:Saving result file log.txt
20:18:46:WU01:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:46:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:46:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:9201 run:556 clone:4 gen:62 core:0x17 unit:0x000000586652edc45399ebe8b4a169b2
20:18:46:WU01:FS01:Uploading 1.94KiB to 171.67.108.52
20:18:46:WU01:FS01:Connecting to 171.67.108.52:8080
20:18:46:WU01:FS01:Upload complete
20:18:46:WU01:FS01:Server responded WORK_ACK (400)
20:18:46:WU01:FS01:Cleaning up
20:18:46:WU00:FS01:Connecting to 171.67.108.200:80
20:18:46:WU00:FS01:Assigned to work server 171.67.108.52
20:18:46:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:46:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:47:WU00:FS01:Downloading 1.53MiB
20:18:49:WU00:FS01:Download complete
20:18:49:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9201 run:840 clone:2 gen:58 core:0x17 unit:0x000000696652edc45399f711568a4f8b
20:18:49:WU00:FS01:Starting
20:18:49:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:49:WU00:FS01:Started FahCore on PID 3828
20:18:49:WU00:FS01:Core PID:7136
20:18:49:WU00:FS01:FahCore 0x17 started
20:18:50:WU00:FS01:0x17:*********************** Log Started 2014-12-09T20:18:49Z ***********************
20:18:50:WU00:FS01:0x17:Project: 9201 (Run 840, Clone 2, Gen 58)
20:18:50:WU00:FS01:0x17:Unit: 0x000000696652edc45399f711568a4f8b
20:18:50:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:50:WU00:FS01:0x17:Machine: 1
20:18:50:WU00:FS01:0x17:Reading tar file state.xml
20:18:50:WU00:FS01:0x17:Reading tar file system.xml
20:18:50:WU00:FS01:0x17:Reading tar file integrator.xml
20:18:50:WU00:FS01:0x17:Reading tar file core.xml
20:18:50:WU00:FS01:0x17:Digital signatures verified
20:18:50:WU00:FS01:0x17:Folding@home GPU core17
20:18:50:WU00:FS01:0x17:Version 0.0.52
20:18:50:WU00:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:50:WU00:FS01:0x17:Saving result file logfile_01.txt
20:18:50:WU00:FS01:0x17:Saving result file log.txt
20:18:50:WU00:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:50:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:50:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9201 run:840 clone:2 gen:58 core:0x17 unit:0x000000696652edc45399f711568a4f8b
20:18:50:WU00:FS01:Uploading 1.95KiB to 171.67.108.52
20:18:50:WU00:FS01:Connecting to 171.67.108.52:8080
20:18:50:WU00:FS01:Upload complete
20:18:50:WU00:FS01:Server responded WORK_ACK (400)
20:18:50:WU00:FS01:Cleaning up
20:18:50:WU01:FS01:Connecting to 171.67.108.200:80
20:18:51:WU01:FS01:Assigned to work server 171.67.108.52
20:18:51:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GM204 [GeForce GTX 970] from 171.67.108.52
20:18:51:WU01:FS01:Connecting to 171.67.108.52:8080
20:18:51:WU01:FS01:Downloading 1.52MiB
20:18:55:WU01:FS01:Download complete
20:18:55:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9201 run:34 clone:4 gen:56 core:0x17 unit:0x000000596652edc45399d7675961cfc7
20:18:55:WU01:FS01:Starting
20:18:55:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
20:18:55:WU01:FS01:Started FahCore on PID 12204
20:18:55:WU01:FS01:Core PID:6380
20:18:55:WU01:FS01:FahCore 0x17 started
20:18:55:WU01:FS01:0x17:*********************** Log Started 2014-12-09T20:18:55Z ***********************
20:18:55:WU01:FS01:0x17:Project: 9201 (Run 34, Clone 4, Gen 56)
20:18:55:WU01:FS01:0x17:Unit: 0x000000596652edc45399d7675961cfc7
20:18:55:WU01:FS01:0x17:CPU: 0x00000000000000000000000000000000
20:18:55:WU01:FS01:0x17:Machine: 1
20:18:55:WU01:FS01:0x17:Reading tar file state.xml
20:18:55:WU01:FS01:0x17:Reading tar file system.xml
20:18:56:WU01:FS01:0x17:Reading tar file integrator.xml
20:18:56:WU01:FS01:0x17:Reading tar file core.xml
20:18:56:WU01:FS01:0x17:Digital signatures verified
20:18:56:WU01:FS01:0x17:Folding@home GPU core17
20:18:56:WU01:FS01:0x17:Version 0.0.52
20:18:56:WU01:FS01:0x17:ERROR:exception: Bad platformId size.
20:18:56:WU01:FS01:0x17:Saving result file logfile_01.txt
20:18:56:WU01:FS01:0x17:Saving result file log.txt
20:18:56:WU01:FS01:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:56:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:56:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:9201 run:34 clone:4 gen:56 core:0x17 unit:0x000000596652edc45399d7675961cfc7
20:18:56:WU01:FS01:Uploading 1.94KiB to 171.67.108.52
20:18:56:WU01:FS01:Connecting to 171.67.108.52:8080
20:18:56:WU01:FS01:Upload complete
20:18:56:WU01:FS01:Server responded WORK_ACK (400)
20:18:56:WU01:FS01:Cleaning up
******************************* Date: 2014-12-09 *******************************
******************************* Date: 2014-12-10 *******************************
******************************* Date: 2014-12-10 *******************************
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 7:44 am

20:18:16:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia

...Fermi? I guess it is an evolution of that architecture.

I guess this is the most relevant part:

20:18:30:WU00:FS01:Started FahCore on PID 12176
20:18:30:WU00:FS01:Core PID:11388
20:18:30:WU00:FS01:FahCore 0x18 started
[...]
20:18:31:WU00:FS01:0x18:ERROR:exception: Bad platformId size.
[...]
20:18:31:WU00:FS01:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
20:18:32:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
20:18:32:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:10472 run:0 clone:314 gen:22 core:0x18 unit:0x0000001e538b3dbb53bebadf50c2cbb1

It tries 0x17 and 0x18 work units, to no avail.

Looks like a 0x17 unit did complete previously... So, what determines the PID?

11:15:38:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Dysthymia/AppData/Roaming/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 02 -suffix 01 -version 704 -lifeline 4528 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
11:15:38:WU02:FS01:Started FahCore on PID 8932
11:15:38:WU02:FS01:Core PID:8564
11:15:38:WU02:FS01:FahCore 0x17 started
11:15:38:WU02:FS01:0x17:*********************** Log Started 2014-12-08T11:15:38Z ***********************
11:15:38:WU02:FS01:0x17:Project: 9201 (Run 490, Clone 0, Gen 244)
11:15:38:WU02:FS01:0x17:Unit: 0x000001606652edc45399e94a067f83e7
11:15:38:WU02:FS01:0x17:CPU: 0x00000000000000000000000000000000
11:15:38:WU02:FS01:0x17:Machine: 1
11:15:38:WU02:FS01:0x17:Reading tar file state.xml
11:15:38:WU02:FS01:0x17:Reading tar file system.xml
11:15:38:WU02:FS01:0x17:Reading tar file integrator.xml
11:15:38:WU02:FS01:0x17:Reading tar file core.xml
11:15:38:WU02:FS01:0x17:Digital signatures verified
11:15:38:WU02:FS01:0x17:Folding@home GPU core17
11:15:38:WU02:FS01:0x17:Version 0.0.52
11:15:44:WU00:FS01:Upload complete
11:15:44:WU00:FS01:Server responded WORK_ACK (400)
11:15:44:WU00:FS01:Final credit estimate, 34919.00 points
11:15:45:WU00:FS01:Cleaning up
11:16:04:WU02:FS01:0x17:Completed 0 out of 5000000 steps (0%)
11:16:04:WU02:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
11:17:58:WU02:FS01:0x17:Completed 50000 out of 5000000 steps (1%)
11:19:52:WU02:FS01:0x17:Completed 100000 out of 5000000 steps (2%)
11:21:46:WU02:FS01:0x17:Completed 150000 out of 5000000 steps (3%)
Last edited by Dysthymia on Wed Dec 10, 2014 7:52 am, edited 1 time in total.
 
mattshwink
Gerbil Team Leader
Posts: 200
Joined: Wed Jul 16, 2008 7:54 am
Location: Alexandria, VA

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 7:50 am

This seems to be the error: Bad platformId size

Two questions:
1. What is your OS?
2. Is the GTX 970 listed in the GPU.txt?
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 7:59 am

mattshwink wrote:
This seems to be the error: Bad platformId size

Two questions:
1. What is your OS?
2. Is the GTX 970 listed in the GPU.txt?


1. Windows 8.1 Pro, 64-bit.
2. I just searched and found a GPUs.txt in [...]\AppData\Roaming\FAHClient with this snippet:

(Edited to show more of the text file as it seemed ambiguous)

[GeForce GTX 750 Ti]
0x10de:0x1381:2:4:GM107 [GeForce GTX 750]
0x10de:0x1382:2:4:GM107 [GeForce GTX 745]
0x10de:0x1390:2:4:GM107 [GeForce 845M]
0x10de:0x1391:2:4:GM107 [GeForce GTX 850M]
0x10de:0x1392:2:4:GM107 [GeForce GTX 860M]
0x10de:0x1393:2:4:GM107 [GeForce 840M]
0x10de:0x13b3:2:4:GM107 [Quadro K2200M]
0x10de:0x13ba:2:4:GM107 [Quadro K2200]
0x10de:0x13bb:2:4:GM107 [Quadro K620]
0x10de:0x13c0:2:4:GM204 [GeForce GTX 980]
0x10de:0x13c2:2:4:GM204 [GeForce GTX 970]
0x10de:0x13d8:2:4:GM204 [GeForce GTX 970M]
0x10de:0x9490:::Invalid []
0x10de:0x9876:::NV11 [GeForce2 MX/MX 400]
0x10de:0x98de:::0x9876 0x9876
0x10de:0xdf5a:2:2:GF106 [GeForce GT525M]

The file ends there.
 
mattshwink
Gerbil Team Leader
Posts: 200
Joined: Wed Jul 16, 2008 7:54 am
Location: Alexandria, VA

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 9:52 am

Hmmmm..that should be the right place and the 970 shows up. Next step I would try the beta driver and see if that makes a difference.
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 10:14 am

mattshwink wrote:
Hmmmm..that should be the right place and the 970 shows up. Next step I would try the beta driver and see if that makes a difference.


There doesn't seem to be a beta available... looks like what I'm running (344.75) is the latest. Confirmed, searching on Nvidia's site. I am set to check for beta drivers in GeForce Experience.

Well... crap.
 
mattshwink
Gerbil Team Leader
Posts: 200
Joined: Wed Jul 16, 2008 7:54 am
Location: Alexandria, VA

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 4:06 pm

I would try completely uninstalling FAH, then uninstall the video driver, reinstall the video driver, then FAH....
 
PainIs4ThaWeak1
Gerbil
Posts: 77
Joined: Fri Jun 26, 2009 11:13 am

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 4:30 pm

Thing is, he's done what you both (above) have mentioned already. (Though, I not saying that it might not just happen to work for him this time, if he goes about it once again. Who knows.)

Not that this helps any, but I recently had a very similar issue with one of my old 680's.

Folding with 1x 680 in a P45/Intel E8400/750w PSU machine, I would have similar issues. One, maybe two WU would complete, and then return BAD_WORK_UNITS permanently. Reverted to no OCs on all components. Same result.

However, if I took the 680 out of that machine, and folded with it in my main machine (X79/Intel 3930k/2x 970s/1x 680/910w PSU), it would consistently return completed WUs without fail.

I'm not sure why this occurred at all, but maybe this piece of information may lead someone else to your solution.

In the meantime, do you have a second machine to test the card in, under F@H?
 
Kougar
Minister of Gerbil Affairs
Posts: 2306
Joined: Tue Dec 02, 2008 2:12 am
Location: Texas

Re: Why is my GPU folding slot failing?

Wed Dec 10, 2014 6:51 pm

And just to double-check, you have also tried underclocking the RAM as well?

Otherwise I'm out of ideas, sorry. You should post at the F@H forums for assistance as they can tell you what the logs mean: https://foldingforum.org/index.php
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Thu Dec 11, 2014 8:15 am

PainIs4ThaWeak1 wrote:
Not that this helps any, but I recently had a very similar issue with one of my old 680's.

Folding with 1x 680 in a P45/Intel E8400/750w PSU machine, I would have similar issues. One, maybe two WU would complete, and then return BAD_WORK_UNITS permanently. Reverted to no OCs on all components. Same result.

However, if I took the 680 out of that machine, and folded with it in my main machine (X79/Intel 3930k/2x 970s/1x 680/910w PSU), it would consistently return completed WUs without fail.

I'm not sure why this occurred at all, but maybe this piece of information may lead someone else to your solution.

In the meantime, do you have a second machine to test the card in, under F@H?



Kougar wrote:
And just to double-check, you have also tried underclocking the RAM as well?

Otherwise I'm out of ideas, sorry. You should post at the F@H forums for assistance as they can tell you what the logs mean: https://foldingforum.org/index.php


I do have another machine I could try it in, but honestly I'd be more inclined to reformat. I'll try underclocking the RAM, and seeing if I can update my motherboard drivers. And I'll try the forum Kougar suggested as well. I appreciate your efforts guys, thank you.
 
Kougar
Minister of Gerbil Affairs
Posts: 2306
Joined: Tue Dec 02, 2008 2:12 am
Location: Texas

Re: Why is my GPU folding slot failing?

Fri Dec 12, 2014 5:36 am

I strongly would suggest posting in their forum before you wipe and reinstall. I've been folding on GPUs since they started doing it and it's had a plethora of early teething problems back in the day... if you can rule out drivers and leftover files that are left behind after an uninstall, then I highly doubt an OS wipe/reinstall would do anything to help. Just my two cents though!
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Fri Dec 19, 2014 7:16 am

Kougar wrote:
I strongly would suggest posting in their forum before you wipe and reinstall. I've been folding on GPUs since they started doing it and it's had a plethora of early teething problems back in the day... if you can rule out drivers and leftover files that are left behind after an uninstall, then I highly doubt an OS wipe/reinstall would do anything to help. Just my two cents though!


Looks like you were right on that point Kougar. Luckily my laziness prevented any unnecessary expenditure of effort. And though my saying so may cause another failure, everything seems to have been fine since the 347.09 driver release. Mostly 0x15 and one 0x18 work unit so far.
 
BIF
Minister of Gerbil Affairs
Posts: 2458
Joined: Tue May 25, 2004 7:41 pm

Re: Why is my GPU folding slot failing?

Sat Dec 20, 2014 2:06 am

There are x18 work units? Well float my boat!
 
PainIs4ThaWeak1
Gerbil
Posts: 77
Joined: Fri Jun 26, 2009 11:13 am

Re: Why is my GPU folding slot failing?

Mon Dec 22, 2014 8:46 am

BIF wrote:
There are x18 work units? Well float my boat!


Yep, and run like hot garbage on Maxwell currently.
 
LoneWolf15
Gerbil Elite
Posts: 963
Joined: Tue Feb 17, 2004 8:36 am
Location: SW Meecheegan

Re: Why is my GPU folding slot failing?

Tue Jan 06, 2015 11:08 pm

You aren't alone.

I'm having issues where the folding slot fails on my GTX 970 as well. I've also read about it on other forums, including Stanford's folding forum. It seems some work units just don't go well. Reinstalling the FaH client won't help. I actually thought I was having slightly better results with earlier drivers before going to 344.75, but after downgrading, I had other issues so I went back up to the latest.

When the card actually does do any work units, it seems to perform reasonably --there's just no guarantee that it will keep going and not stall, or outright fail. My previous setup with two Radeon R9 280X cards folded flawlessly. I did a complete removal of all GPU folding slots prior to the GTX 970 and created a new one after its install. I've had no other issues with games or Windows applications.
i9-9900K @4.7GHz, GIGABYTE Z390 Aorus Pro WiFi, 2 x 16GB G.Skill RipJaws V PC3000
Corsair 650D, Seasonic 1Kw Platinum PSU
2x HP EX920 1TB NVMe, Samsung 850 Pro 512GB 2.5", NEC 7200 DVDRW
Gigabyte RTX 2080 Super Gaming OC, Dell S2719DGF 27" LCD
 
Dysthymia
Gerbil
Topic Author
Posts: 49
Joined: Thu May 01, 2008 4:54 pm

Re: Why is my GPU folding slot failing?

Sun Feb 22, 2015 9:39 am

Well I've at least learned a few things by now. Updating graphics drivers seemed to allow a good 12 to 24 hours of continuous GPU folding, but now with 347.52 the 970 has been folding just fine for almost two weeks -- with one caveat. I used to use Remote Desktop to keep an eye on it but I eventually learned this causes a GPU_MEMTEST_ERROR. Even connecting then disconnecting hours before a work unit finishes can cause this. Connecting with TeamViewer is a workaround.

A few things have changed, incidentally. My OS suddenly went corrupt on my SSD at the end of December and I went back to Windows 7. I also stopped overclocking the GPU for now.

For BAD_WORK_UNIT and Bad_platformId you can pause all folding, go into the AppData\Roaming\FAHClient folder and delete GPUs.txt, open the work folder and delete the folder there associated with the GPU ("01" for me), then back up a level and go into the cores\web.stanford.edu\~pande\Win32\AMD64 folder and delete the NVIDIA folder. Upon resuming it'll recreate all of those and attempt to re-download a new work unit and start again. On the left side of the Advanced Control window, when a new work unit starts downloading, the Folding Slots GPU and Work Queue listing both show "Download". If one of them changes to "Running" while the other still says "Download", it's going to fail in a few minutes.

If nothing else works, the only other thing that works for me is removing the GPU slot, waiting at least 4 to 6 hours, and adding it back in again. Hope this helps somebody.
 
LoneWolf15
Gerbil Elite
Posts: 963
Joined: Tue Feb 17, 2004 8:36 am
Location: SW Meecheegan

Re: Why is my GPU folding slot failing?

Sun Feb 22, 2015 2:01 pm

I found the same thing regarding Remote Desktop. VNC is the only reliable method for remoting into a system for Folding. I didn't know this because when I had two Radeon R9 280X cards, I could RDP in all day long without my slots failing. It took me some time to find the issue.

Ever since I took care of that, my 970 (and now my 980) is folding fine. I'm still annoyed by nVidia's lack of attention to fixing the bugs in their drivers that prevent working with Core18, but clearly, it's not a priority for them as the issue has been known for many months.
i9-9900K @4.7GHz, GIGABYTE Z390 Aorus Pro WiFi, 2 x 16GB G.Skill RipJaws V PC3000
Corsair 650D, Seasonic 1Kw Platinum PSU
2x HP EX920 1TB NVMe, Samsung 850 Pro 512GB 2.5", NEC 7200 DVDRW
Gigabyte RTX 2080 Super Gaming OC, Dell S2719DGF 27" LCD
 
SoM
Gerbil Elite
Posts: 559
Joined: Wed Jan 26, 2011 11:56 am
Location: Toronto

Re: Why is my GPU folding slot failing?

Sun Feb 22, 2015 2:44 pm

347.52 is the latest drv

Win 10
InWin 303
Asus z170a
i7-6700k - H60
G.Skill 2x16GB 2400
M.2 950 Pro 256GB
EVGA GTX 1070 FTW
EVGA Supernova G2 750w
Acer XG270HU
HD 280pro
 
LoneWolf15
Gerbil Elite
Posts: 963
Joined: Tue Feb 17, 2004 8:36 am
Location: SW Meecheegan

Re: Why is my GPU folding slot failing?

Sun Feb 22, 2015 6:32 pm

SoM wrote:
347.52 is the latest drv

Yes, and at this point, it isn't documented to fix the OpenCL/OpenMM issues with Folding@Home. I've checked the release notes on the past several drivers.

The issue has been reported in the Geforce forums, but nVidia people generally don't comment there, meaning in many cases no-one actually knows whether reporting problems does any good. The most anyone has heard is that "they are aware of the problem". No timeline or ETA on a fix, and no real statement.

https://forums.geforce.com/default/topi ... ver-bug/1/

Folding@Home has restricted Core18 folding projects so that Maxwell cards do not get them for this reason. Note that non-Maxwell Geforce cards and AMD cards can do Core18 projects just fine.
https://foldingforum.org/viewtopic.php?f=74&t=27208
i9-9900K @4.7GHz, GIGABYTE Z390 Aorus Pro WiFi, 2 x 16GB G.Skill RipJaws V PC3000
Corsair 650D, Seasonic 1Kw Platinum PSU
2x HP EX920 1TB NVMe, Samsung 850 Pro 512GB 2.5", NEC 7200 DVDRW
Gigabyte RTX 2080 Super Gaming OC, Dell S2719DGF 27" LCD

Who is online

Users browsing this forum: No registered users and 1 guest
GZIP: On