Did I just lose a WU?!

Kardonxt

2[H]4U
Joined
Apr 13, 2009
Messages
3,679
I finished a 6903 and noticed something about sleep=10000 at the end of the log. I didn't see anything about uploading and the boxen just sat there idle. Thinking to myself "this is a waste." i closed the client and opened it back up hoping that it would start uploading. I did see "done" twice so i assumed it was done doing whatever and was safe to close. To my shock it just started a WU. I didn't see a work unit dled so it must be the same one!?

Anyway to salvage this or at least get rid of the sleep:10000 at the end so it doesn't happen again?

[02:32:15] Completed 248085 out of 250000 steps (99%)
[03:04:08] Completed 250000 out of 250000 steps (100%)
[03:04:31] DynamicWrapper: Finished Work Unit: sleep=10000
[03:04:41]
[03:04:41] Finished Work Unit:
[03:04:41] - Reading up to 121622496 from "work/wudata_01.trr": Read 121622496
[03:04:42] trr file hash check passed.
[03:04:42] - Reading up to 108720724 from "work/wudata_01.xtc": Read 108720724
[03:04:43] xtc file hash check passed.
[03:04:43] edr file hash check passed.
[03:04:43] logfile size: 214465
[03:04:43] Leaving Run
[03:04:47] - Writing 230730677 bytes of core data to disk...
[03:05:57] Done: 230730165 -> 222379466 (compressed to 3.3 percent)
[03:05:58] ... Done.


--- Opening Log file [December 9 03:26:10 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

Folding@Home Client Version 6.34

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/user/fah
Executable: ./fah6
Arguments: -smp -bigbeta

[03:26:10] - Ask before connecting: No
[03:26:10] - User name: Kardon (Team 33)
[03:26:10] - User ID: 2XXXXXXXXXXXXXX
[03:26:10] - Machine ID: 1
[03:26:10]
[03:26:10] Loaded queue successfully.
[03:26:10]
[03:26:10] + Processing work unit
[03:26:10] Core required: FahCore_a5.exe
[03:26:10] Core found.
[03:26:10] Working on queue slot 01 [December 9 03:26:10 UTC]
[03:26:10] + Working ...
[03:26:10]
[03:26:10] *------------------------------*
[03:26:10] Folding@Home Gromacs SMP Core
[03:26:10] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[03:26:10]
[03:26:10] Preparing to commence simulation
[03:26:10] - Ensuring status. Please wait.
[03:26:20] - Looking at optimizations...
[03:26:20] - Working with standard loops on this execution.
[03:26:20] - Previous termination of core was improper.
[03:26:20] - Going to use standard loops.
[03:26:20] - Files status OK
[03:26:27] - Expanded 57246484 -> 71846524 (decompressed 50.4 percent)
[03:26:27] Called DecompressByteArray: compressed_data_size=57246484 data_size=71846524, decompressed_data_size=71846524 diff=0
[03:26:27] - Digital signature verified
[03:26:27]
[03:26:27] Project: 6903 (Run 5, Clone 19, Gen 8)
[03:26:27]
[03:26:27] Entering M.D.
[03:26:41] Mapping NT from 16 to 16
[03:26:48] Completed 0 out of 250000 steps (0%)
 
I'm afraid you lost that wu. Which version of ubuntu are you running?
 
Kardonxt, that doesn't look good.:eek:

Check your current log to see if the send was delayed. Otherwise it is probably lost.:mad:

The sleep thing is normal for the console client, but it's only 10 seconds. That (compressed to 3.3 percent) is highly suspicious tho, maybe a bad WU?

Here's my last console WU before I started playing with v7 beta yesterday;


[11:29:00] Completed 495000 out of 500000 steps (99%)
[11:41:17] Completed 500000 out of 500000 steps (100%)
[11:41:19] DynamicWrapper: Finished Work Unit: sleep=10000
[11:41:29]
[11:41:29] Finished Work Unit:
[11:41:29] - Reading up to 12102120 from "work/wudata_05.trr": Read 12102120
[11:41:29] trr file hash check passed.
[11:41:29] edr file hash check passed.
[11:41:29] logfile size: 69616
[11:41:29] Leaving Run
[11:41:32] - Writing 12205412 bytes of core data to disk...
[11:41:35] Done: 12204900 -> 11314030 (compressed to 92.7 percent)
[11:41:35] ... Done.
[11:41:38] - Shutting down core
[11:41:38]
[11:41:38] Folding@home Core Shutdown: FINISHED_UNIT
[11:41:42] CoreStatus = 64 (100)
[11:41:42] Sending work to server
[11:41:42] Project: 6098 (Run 8, Clone 21, Gen 24)

[11:41:42] + Attempting to send results [December 5 11:41:42 UTC]
[11:45:00] + Results successfully sent
[11:45:00] Thank you for your contribution to Folding@Home.
[11:45:00] + Number of Units Completed: 644

[11:45:04] + -oneunit flag given and have now finished a unit. Exiting.
Folding@Home Client Shutdown.

When in doubt, wait it out....
Dana
 
Last edited:
Kardonxt,

Is your file system ext3 or ext4 ? If ext4 there can be a delay of upto an hour after completing the unit before sending it. The problem does not exist with ext3.

H.
 
not sure what the file system is. I'm just a linux newb using muskys guide. http://hardforum.com/showthread.php?t=1601608

Star you're log looks like what I would expect. I let mine sit for 20 mins and it didn't get that far. For some reason the last 6903 I turned in sat for 2 hours after finishing a WU. I had to delete the work folder and queue to get it going again. I'm not sure why I'm having so much trouble.

I ended up turning the box off last night because i was so discouraged. My hyper 212s should be here in the next day or so anyways. I'll give it another shot when i get them installed and can sleep peacefully without all the fan noise lol.
 
Why don't you type this command into terminal
Code:
df -T
and come back with the results.
 
I suspect EXT4 file system strikes again. Sorry to hear about the lost WU.
 
I was wondering why in the Ubuntu guide on here it said to use EXT3. Now it makes sense.
 
You guys are probably right. The first time I installed linux I followed the guid to a T. I had to reload because of some file corruption due to bad ram settings and I just did it by memory. I know for a fact I did not change the default file system to EXT3 now that i re-read the guide.

Thanks for the help. I'll get her reloaded when I get home tonight. :D
 
Last edited:
Yup.... ext3 is very important...

The only exception we have found is when installed on an SSD... no tpf difference seen between ext3 and 4.
 
Does the OS need to be on ext3 or just fah? I could make a 2gb ram disk and put the fah directory on it easy enough.
 
Just the fah disk needs to be ext3, though I suspect on a ram.disk ext4 would have no problems either.

H.
 
On an SSD, ext4 has no problem as well. on a spinning disk, it does.
 
SSD w/ext4 gets thrashed by the Fahcore and the client the same way == more wear on the SSD. You just don't get to see it as seeks are infinitely faster w/SSDs.

I would recommend against running FAH on ext4 on SSD... call is yours.
 
ya I wouldn't run it on an ssd either. I use Ram disk for my mincraft server for the same reason. It puts ssd speeds to shame and doesn't get worn out.

That said i just reloaded with ext3 to save myself from any future headaches. Thanks for the help every one.
 
Back
Top