View Full Version : Quit 101 - Fatal error: NaN detected: (ener[12])


DSantosP
05-12-2006, 22:57
Boas...

O que é que se passa..
É a segunda vez que uma Wu deste projecto o 2125 me dá um erro...

[19:25:23] Completed 18000000 out of 20000000 steps (90)
[20:39:56] Writing local files
[20:39:56] Completed 18200000 out of 20000000 steps (91)
[21:27:45] Quit 101 - Fatal error: NaN detected: (ener[12])
[21:27:57]
[21:27:57] Simulation instability has been encountered. The run has entered a
[21:28:25] state from which no further progress can be made.
[21:28:26] This may be the correct result of the simulation, however if you
[21:28:26] often see other project units terminating early like this
[21:28:26] too, you may wish to check the stability of your computer (issues
[21:28:26] such as high temperature, overclocking, etc.).
[21:28:26] Going to send back what have done.
[21:28:26] logfile size: 638119
[21:28:26] - Writing 638682 bytes of core data to disk...
[21:28:26] Done: 638170 -> 17721 (compressed to 2.7 percent)
[21:28:26] ... Done.
[21:28:27]
[21:28:27] Folding@home Core Shutdown: EARLY_UNIT_END
[21:28:45] CoreStatus = 72 (114)
[21:28:50] Sending work to server


[21:28:58] + Attempting to send results
[21:31:37] + Results successfully sent
[21:31:37] Thank you for your contribution to Folding@Home.
[21:31:56] - Preparing to get new work unit...
[21:31:56] + Attempting to get work packet
[21:31:56] - Connecting to assignment server
[21:32:14] - Successful: assigned to (171.65.103.158).
[21:32:14] + News From Folding@Home: Welcome to Folding@Home
[21:32:18] Loaded queue successfully.
[21:32:26] + Closed connections
[21:32:31]
[21:32:31] + Processing work unit
[21:32:31] Core required: FahCore_82.exe
[21:32:31] Core found.
[21:32:31] Working on Unit 04 [December 5 21:32:31]
[21:32:32] + Working ...
[21:33:01]
[21:33:01] *------------------------------*
[21:33:01] Folding@Home PMD Core
[21:33:01] Version 1.03 (September 7, 2005)
[21:33:01]
[21:33:01] Preparing to commence simulation
[21:33:01] - Looking at optimizations...
[21:33:01] - Created dyn
[21:33:01] - Files status OK
[21:33:04] - Expanded 83183 -> 561874 (decompressed 675.4 percent)
[21:33:04]
[21:33:04] Project: 1808 (Run 4, Clone 58, Gen 90)
[21:33:04]
[21:33:15] Assembly optimizations on if available.
[21:33:15] Entering M.D.
[21:34:11] Protein: p1808_Collagen_Brodsky_refolding
[21:34:11]
[21:34:14] Completed 0 out of 500000 steps (0)
[21:34:24] NaN/Inf detected e[0]
[21:34:24] Going to send back what have done.
[21:34:24] logfile size: 5898
[21:34:24] - Writing 6418 bytes of core data to disk...
[21:34:24] ... Done.
[21:34:24]
[21:34:24] Folding@home Core Shutdown: EARLY_UNIT_END
[21:34:28] CoreStatus = 72 (114)
[21:34:28] Sending work to server


[21:34:28] + Attempting to send results
[21:34:29] + Results successfully sent
[21:34:29] Thank you for your contribution to Folding@Home.
[21:34:33] - Preparing to get new work unit...
[21:34:33] + Attempting to get work packet
[21:34:33] - Connecting to assignment server
[21:34:36] - Successful: assigned to (171.65.103.158).
[21:34:36] + News From Folding@Home: Welcome to Folding@Home
[21:34:36] Loaded queue successfully.
[21:34:42] + Closed connections
[21:34:48]
[21:34:48] + Processing work unit
[21:34:48] Core required: FahCore_82.exe
[21:34:48] Core found.
[21:34:48] Working on Unit 05 [December 5 21:34:48]
[21:34:48] + Working ...
[21:35:01]
[21:35:01] *------------------------------*
[21:35:01] Folding@Home PMD Core
[21:35:01] Version 1.03 (September 7, 2005)
[21:35:01]
[21:35:03] Preparing to commence simulation
[21:35:03] - Looking at optimizations...
[21:35:03] - Created dyn
[21:35:03] - Files status OK
[21:35:03] - Expanded 82197 -> 557656 (decompressed 678.4 percent)
[21:35:03]
[21:35:03] Project: 1814 (Run 1, Clone 699, Gen 50)
[21:35:03]
[21:35:14] Assembly optimizations on if available.
[21:35:14] Entering M.D.
[21:35:18] Protein: p1814_Collagen_POG10more_refolding
[21:35:18]
[21:35:18] Completed 0 out of 500000 steps (0)
[21:35:56] NaN/Inf detected e[0]
[21:35:56] Going to send back what have done.
[21:35:56] logfile size: 5898
[21:35:56] - Writing 6418 bytes of core data to disk...
[21:35:56] ... Done.
[21:35:56]
[21:35:56] Folding@home Core Shutdown: EARLY_UNIT_END
[21:36:00] CoreStatus = 72 (114)
[21:36:01] Sending work to server


[21:36:01] + Attempting to send results
[21:36:05] + Results successfully sent
[21:36:06] Thank you for your contribution to Folding@Home.
[21:36:12] - Preparing to get new work unit...
[21:36:12] + Attempting to get work packet
[21:36:12] - Connecting to assignment server
[21:36:13] - Successful: assigned to (171.65.103.158).
[21:36:13] + News From Folding@Home: Welcome to Folding@Home
[21:36:13] Loaded queue successfully.
[21:37:09] + Closed connections
[21:37:14]
[21:37:14] + Processing work unit
[21:37:14] Core required: FahCore_78.exe
[21:37:14] Core found.
[21:37:16] Working on Unit 06 [December 5 21:37:16]
[21:37:16] + Working ...
[21:37:16]
[21:37:16] *------------------------------*
[21:37:16] Folding@Home Gromacs Core
[21:37:16] Version 1.90 (March 8, 2006)
[21:37:16]
[21:37:16] Preparing to commence simulation
[21:37:16] - Looking at optimizations...
[21:37:16] - Created dyn
[21:37:16] - Files status OK
[21:37:23] - Expanded 1594430 -> 8198125 (decompressed 514.1 percent)
[21:37:23] - Starting from initial work packet
[21:37:25]
[21:37:25] Project: 1862 (Run 14, Clone 33, Gen 12)
[21:37:25]
[21:39:03] Assembly optimizations on if available.
[21:39:03] Entering M.D.
[21:39:30] Protein: p1862_Myosin6_PT_US_TIP3P_bbox
[21:39:30]
[21:39:31] Writing local files
[21:39:51] Extra SSE boost OK.
[21:40:06] Writing local files
[21:40:20] Completed 0 out of 500000 steps (0)
[21:41:36] Quit 101 - Fatal error: NaN detected: (ener[17])
[21:41:36]
[21:41:36] Simulation instability has been encountered. The run has entered a
[21:41:36] state from which no further progress can be made.
[21:41:36] This may be the correct result of the simulation, however if you
[21:41:36] often see other project units terminating early like this
[21:41:36] too, you may wish to check the stability of your computer (issues
[21:41:36] such as high temperature, overclocking, etc.).
[21:41:36] Going to send back what have done.
[21:41:36] logfile size: 7904
[21:41:36] - Writing 8467 bytes of core data to disk...
[21:41:38] ... Done.
[21:41:40]
[21:41:40] Folding@home Core Shutdown: EARLY_UNIT_END
[21:41:52] CoreStatus = 72 (114)
[21:41:52] Sending work to server


[21:41:53] + Attempting to send results
[21:41:55] + Results successfully sent
[21:41:55] Thank you for your contribution to Folding@Home.
[21:42:08] - Preparing to get new work unit...
[21:42:08] + Attempting to get work packet
[21:42:08] - Connecting to assignment server
[21:42:09] - Successful: assigned to (171.65.103.158).
[21:42:10] + News From Folding@Home: Welcome to Folding@Home
[21:42:10] Loaded queue successfully.
[21:42:13] + Closed connections
[21:42:20]
[21:42:20] + Processing work unit
[21:42:20] Core required: FahCore_82.exe
[21:42:20] Core found.
[21:42:22] Working on Unit 07 [December 5 21:42:22]
[21:42:22] + Working ...
[21:42:44]
[21:42:44] *------------------------------*
[21:42:44] Folding@Home PMD Core
[21:42:44] Version 1.03 (September 7, 2005)
[21:42:47]
[21:42:47] Preparing to commence simulation
[21:42:47] - Looking at optimizations...
[21:42:55] - Created dyn
[21:42:55] - Files status OK
[21:42:57] - Expanded 82019 -> 557656 (decompressed 679.9 percent)
[21:42:59]
[21:42:59] Project: 1814 (Run 0, Clone 545, Gen 42)
[21:42:59]
[21:43:00] Assembly optimizations on if available.
[21:43:00] Entering M.D.
[21:43:37] Protein: p1814_Collagen_POG10more_refolding
[21:43:37]
[21:43:37] Completed 0 out of 500000 steps (0)
[21:43:41] NaN/Inf detected e[0]
[21:43:41] Going to send back what have done.
[21:43:41] logfile size: 2425
[21:43:41] - Writing 2945 bytes of core data to disk...
[21:43:41] ... Done.
[21:43:41]
[21:43:41] Folding@home Core Shutdown: EARLY_UNIT_END
[21:43:43] CoreStatus = 72 (114)
[21:43:43] Sending work to server


[21:43:43] + Attempting to send results
[21:43:54] + Results successfully sent
[21:43:54] Thank you for your contribution to Folding@Home.
[21:43:58] - Preparing to get new work unit...
[21:43:58] + Attempting to get work packet
[21:43:58] - Connecting to assignment server
[21:44:05] - Successful: assigned to (171.65.103.158).
[21:44:05] + News From Folding@Home: Welcome to Folding@Home
[21:44:06] Loaded queue successfully.
[21:44:26] + Closed connections
[21:44:31]
[21:44:31] + Processing work unit
[21:44:31] Core required: FahCore_82.exe
[21:44:31] Core found.
[21:44:31] Working on Unit 08 [December 5 21:44:31]
[21:44:31] + Working ...
[21:44:31]
[21:44:31] *------------------------------*
[21:44:31] Folding@Home PMD Core
[21:44:31] Version 1.03 (September 7, 2005)
[21:44:31]
[21:44:31] Preparing to commence simulation
[21:44:31] - Looking at optimizations...
[21:44:32] - Created dyn
[21:44:32] - Files status OK
[21:44:32] - Expanded 83189 -> 561874 (decompressed 675.4 percent)
[21:44:32]
[21:44:32] Project: 1808 (Run 17, Clone 80, Gen 91)
[21:44:32]
[21:44:32] Assembly optimizations on if available.
[21:44:32] Entering M.D.
[21:44:39] Protein: p1808_Collagen_Brodsky_refoldingTambém não sei o que se passou com as outras WU's que vieram depois do erro..
O mais estranho é que tudo isto se passou enquanto gravava um cd, será que o problema evm daí??

Cumps.[[[]]]
DSantosP

P.S. A ultima que aparece ai tá a foldar sem problemas.

Bubu
05-12-2006, 23:26
Quit 101 - Fatal error: NaN detected: (ener[17])

Este erro ta ligado muitas vezes a erros de leitura/escrita de disco e tambem a instabilidades de cpu,OC,temperatura alta...Como tavas a gravar um cd, talvez algum erro do disco...

DSantosP
05-12-2006, 23:30
Obrigado Bubu...

Pá proxima vez que tiver que gravar um cd, já sei que é mais seguro desligar o folding primeiro, não quero perder mais WU's por causa disso.

Cump. [[[]]]:kfold:

Luisakamotor
05-12-2006, 23:33
Sabes porque é que a as Amber não dão erro... Porque não utilizam o SSE.
Há uma flag que força o modo SSE, que vem por defeito, activada no folding. Já vi alguns erros destes no Folding-Community; o Bruce disse que devia a questões de prioridade do cálculo SSE. Por exemplo, o teu programa de cd's deve ter uma prioridade certamente superior ao FAH, e se o FahCore pede os dados e não os recebe em tempo útil, já percebeste...

Por exemplo, nas gráficas...quando inicias algum jogo ele desliga automaticamente, pois não consegue correr cálculos 2D ao mesmo tempo que os 3D(também os usa, mas em menor quantidade)
O mesmo acontece se gravares ou copiares ficheiros relacionados com o FAH, ao mesmo tempo que tens o programa a correr...Há coisa de meio-ano tinha uma WU, que valia mil e tal pontos, e a chegar ao 92ou93 %, ocorreu me esse problema...o vale é que faço backup's de cada WU que recebo...

Cumps...não te preocupes com isso...alguém recebeu a tua WU para a "reprocessar"...
Fica bem...

DSantosP
05-12-2006, 23:56
Luis a primeira parte eu percebi, só não percebi para que servem os backup's, se cada WU só pode ser foldada uma vez, e se houver problemas é enviada para outra pessoa..

Cumps.

Luisakamotor
06-12-2006, 14:48
Tenho por hábito, fazer a guardar a "work" e o ficheiro "queue.dat", dentro de outra com o nome da WU em questão...Isto faço sempre...para quando os projectos Amber ou os 21xx, têm mais prioridade (verificas isso no Server Status do FAH), sei que se ligar o programa vou receber aquelas, então tenho sempre umas WU de reserva, pra se a net for abaixo ou quiser foldar aquele tipo de WU's, escuso de as receber e então foldo as que tenho...

Se fizeres isto, tb por alguma razão, deves certificar-te que tens assegurada uma posição de rotatividade entre elas, senão passam da deadline, como é claro...
Cumps