Linux client Fedora 12 not getting work

nwrtarget

******FIXED*****
Thanks Tobit for the help


I have tried to get this machine going a couple of times. First I copied over a folding folder I was using on a different machine (which has always worked before), and it couldn't get work. So I downloaded the client and set it up again from scratch. Linux Fedora 12, i7 950, 12 GB of RAM, mild overclock since it is my work desktop machine. Oh, and I fold on several other machines at work, so this isn't a firewall issue. Most of my PPD is running at the office (gotta love small companies).


./fah6 -configonly

Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this agreement.

Folding@Home User Configuration



--- Opening Log file [October 25 00:41:48 UTC]


# Linux Console Edition #######################################################
###############################################################################

Folding@Home Client Version 6.29

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /folding2
Executable: ./fah6
Arguments: -configonly

[00:41:48] Configuring Folding@Home...

User name [Anonymous]? nwrtarget
Team Number [0]? 33
Passkey []? deleted for posting
Ask before fetching/sending work (no/yes) [no]?
Use proxy (yes/no) [no]?
Acceptable size of work assignment and work result packets (bigger units
may have large memory demands) -- 'small' is <5MB, 'normal' is <10MB, and
'big' is >10MB (small/normal/big) [normal]? big
Change advanced options (yes/no) [no]? yes
Core Priority (idle/low) [idle]?
Disable highly optimized assembly code (no/yes) [no]?
Interval, in minutes, between checkpoints (3-30) [15]? 5
Memory, in MB, to indicate (12040 available) [12040]? 3240
Set -advmethods flag always, requesting new advanced
scientific cores and/or work units if available (no/yes) [no]? yes
Ignore any deadline information (mainly useful if
system clock frequently has errors) (no/yes) [no]?
Machine ID (1-16) [1]?
The following options require you to restart the client before they take effect
Disable CPU affinity lock (no/yes) [no]?
Additional client parameters []?
IP address to bind core to (for viewer) []?

[00:43:16] - Ask before connecting: No
[00:43:16] - User name: nwrtarget (Team 33)
[00:43:16] - User ID not found locally
[00:43:16] + Requesting User ID from server
[00:43:16] + Could not connect to Primary Assignment Server for ID
[00:43:16] + Could not connect to Secondary Assignment Server for ID
[00:43:16]
+ Could not get ID from server. Retrying...
[00:43:33] + Could not connect to Primary Assignment Server for ID
[00:43:33] + Could not connect to Secondary Assignment Server for ID
[00:43:33]
+ Could not get ID from server. Retrying...
[00:43:57] + Could not connect to Primary Assignment Server for ID
[00:43:57] + Could not connect to Secondary Assignment Server for ID
[00:43:57]
+ Could not get ID from server. Retrying...
[00:44:28] + Could not connect to Primary Assignment Server for ID
[00:44:28] + Could not connect to Secondary Assignment Server for ID
[00:44:28]
+ Could not get ID from server. Retrying...
 
The only cause I can think of is DNS, but I can resolve the addresses fine (and so can my other machines at the office).

I have been fighting with this for about two weeks.

Oh, and I am running the client as root, so permissions shouldn't be a problem, but I did set them to 777 at one point to be sure.
 
Did you try another distro by any chance?
 
This is from my old config, which I copied in from another running Fedora 12 system. I have been getting this same error on and off for over a week, whenever I try. I can open those pages in a web browser and they return a generic hello message, not a connection refused.

Attempting to get work packet
Passkey found
Will indicate memory of 4000 MB
Connecting to assignment server
Connecting to http://assign.stanford.edu:8080/
Could not CosmHTTPOpen
Could not connect to Assignment Server
Connecting to http://assign2.stanford.edu:80/
Could not CosmHTTPOpen
Could not connect to Assignment Server 2
Couldn't get work instructions.
Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
 
And from the old install that has a client number (which I deleted from this screen copy for the time being).

Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this agreement.

8 cores detected


--- Opening Log file [October 25 01:35:40 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

Folding@Home Client Version 6.29

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /folding
Executable: ./fah6
Arguments: -smp -verbosity 9 -forceasm

[01:35:40] - Ask before connecting: No
[01:35:40] - User name: nwrtarget (Team 33)
[01:35:40] - User ID:
[01:35:40] - Machine ID: 1
[01:35:40]
[01:35:40] Loaded queue successfully.
[01:35:40] - Preparing to get new work unit...
[01:35:40] - Autosending finished units... [October 25 01:35:40 UTC]
[01:35:40] Cleaning up work directory
[01:35:40] Trying to send all finished work units
[01:35:40] + No unsent completed units remaining.
[01:35:40] - Autosend completed
[01:35:40] + Attempting to get work packet
[01:35:40] Passkey found
[01:35:40] - Will indicate memory of 4000 MB
[01:35:40] - Connecting to assignment server
[01:35:40] Connecting to http://assign.stanford.edu:8080/
[01:35:40] - Could not CosmHTTPOpen
[01:35:40] + Could not connect to Assignment Server
[01:35:40] Connecting to http://assign2.stanford.edu:80/
[01:35:40] - Could not CosmHTTPOpen
[01:35:40] + Could not connect to Assignment Server 2
[01:35:40] + Couldn't get work instructions.
[01:35:40] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[01:35:51] + Attempting to get work packet
[01:35:51] Passkey found
[01:35:51] - Will indicate memory of 4000 MB
[01:35:51] - Connecting to assignment server
[01:35:51] Connecting to http://assign.stanford.edu:8080/
[01:35:51] - Could not CosmHTTPOpen
[01:35:51] + Could not connect to Assignment Server
[01:35:51] Connecting to http://assign2.stanford.edu:80/
[01:35:51] - Could not CosmHTTPOpen
[01:35:51] + Could not connect to Assignment Server 2
[01:35:51] + Couldn't get work instructions.
[01:35:51] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
 
This appears to be a network issue local to you, perhaps a firewall setting, since you are not even able to connect to the assignment server. I'm not familiar enough with the specifics of Fedora 12 to know where to lead you, but it is a network issue of some kind.
 
From my web browser here at home, http://assign2.stanford.edu:80/ gives me
OK

When I wget the same page from the command line on the problem box:


[root@jeffi7 folding]# wget http://assign2.stanford.edu:80/
--2010-10-24 20:40:14-- http://assign2.stanford.edu/
Resolving assign2.stanford.edu... 171.64.65.121
Connecting to assign2.stanford.edu|171.64.65.121|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: “index.html”

[ <=> ] 23 --.-K/s in 9.5s

2010-10-24 20:40:24 (2.42 B/s) - “index.html”

[root@jeffi7 folding]# cat index.html
<html><b>OK</b></html>

I get an OK back so it seems to be connecting.

For comparison, this is from one of my other work boxes running Fedora 12, which works fine and has for quite a while.

[19:31:37] Trying to send all finished work units
[19:31:37] + No unsent completed units remaining.
[19:31:37] - Preparing to get new work unit...
[19:31:37] Cleaning up work directory
[19:31:37] + Attempting to get work packet
[19:31:37] Passkey found
[19:31:37] - Will indicate memory of 4000 MB
[19:31:37] - Connecting to assignment server
[19:31:37] Connecting to http://assign.stanford.edu:8080/
[19:31:37] Posted data.
[19:31:37] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[19:31:37] + News From Folding@Home: Welcome to Folding@Home
[19:31:38] Loaded queue successfully.
[19:31:38] Connecting to http://171.64.65.56:8080/
[19:31:38] Posted data.
[19:31:38] Initial: 0000; - Receiving payload (expected size: 764332)
[19:31:40] - Downloaded at ~373 kB/s
[19:31:40] - Averaged speed for that direction ~419 kB/s
[19:31:40] + Received work.
[19:31:40] Trying to send all finished work units
[19:31:40] + No unsent completed units remaining.
[19:31:40] + Closed connections
 
The software firewall is enabled on my desktop but not on the server. Might as well turn it off and see if that fixes it.

Update:
The software firewall is now off and I get the same output. I am pretty sure I disabled SELinux, and I am certain it is in permissive mode at least. Yeah, I am annoyed I can't get this box to fold. It is doing nothing most of the time, even when I am at work! (Terminal sessions don't eat much CPU.)
 
Ok, on Fedora 11 and 12 it looks like you need to open the file /etc/nsswitch.conf and look for the line that begins with 'hosts: '

Code:
hosts:     files mdns4_minimal [NOTFOUND=return] dns

and change it to:

Code:
hosts:      files dns
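
For anyone following along, a quick way to see what the active hosts line says before editing anything. This is only a sketch: `hosts_sources` is a helper name made up for this post, and it assumes the file uses the usual `database: sources` layout with `#` for comments.

```shell
# hosts_sources is a hypothetical helper, not part of any package.
# Prints the lookup order from the first uncommented "hosts:" line.
# Argument: path to an nsswitch-style file (defaults to /etc/nsswitch.conf).
hosts_sources() {
    awk '$1 == "hosts:" { $1 = ""; sub(/^ +/, ""); print; exit }' "${1:-/etc/nsswitch.conf}"
}
```

Calling `hosts_sources` with no argument reads /etc/nsswitch.conf. If mdns4_minimal shows up in the output, that is the entry the change above removes.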
 
I haven't changed it, and the line you mention already reads correctly. (I do notice "bootparams: nisplus [NOTFOUND=return] files" and wonder if it needs to be changed?)

passwd: files
shadow: files
group: files

#hosts: db files nisplus nis dns
hosts: files dns

bootparams: nisplus [NOTFOUND=return] files

ethers: files
netmasks: files
networks: files
protocols: files
rpc: files
services: files

netgroup: nisplus

publickey: nisplus

automount: files nisplus
aliases: files nisplus
 
No, bootparams doesn't need to be changed.
 
Try these two commands as root (or via sudo)

chkconfig nscd on
service nscd start
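
After starting it, it may be worth confirming the daemon is really up, since programs talk to nscd over a Unix socket. A rough check, assuming the socket lives at /var/run/nscd/socket (common on Fedora-style systems, but treat the path as an assumption); `nscd_socket_up` is just a name made up here.

```shell
# nscd_socket_up is a hypothetical helper name.
# Succeeds only if a Unix socket exists at the given path.
# The default path is an assumption; adjust it if your distro differs.
nscd_socket_up() {
    [ -S "${1:-/var/run/nscd/socket}" ]
}
```

For example, `nscd_socket_up && echo "nscd socket found"` prints the message only when the socket is present. `service nscd status` is the more direct check if you have it.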
 
Quote from Folding Forums.

"Ok, how about this:
The fah6 binary is statically linked. Due to the fact that even so some parts of the c library are dynamically loaded at runtime, it is somewhat incompatible with newer versions of glibc. Especially the domain name resolution does not work, which prevents contacting the assignment server and getting any WUs; the symptom of this is the error message "Could not CosmHTTPOpen".
There is a workaround for this which creates a patched version of glibc; that stopped working as of glibc 2.12. Symptoms include segfaults, floating point exceptions or the core exiting with status 0.
But there is another workaround: glibc comes with a caching daemon, called nscd, which communicates with the applications via socket and does the resolutions on their behalf. glibc, namely gethostbyname, automatically uses this daemon if it is running. So to get fah6 working, you just have to enable nscd. For Fedora (and similar systems) this is done by
Code:
chkconfig nscd on
; if you don't want to restart your system, also do
Code:
service nscd start "
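
To check whether a client is hitting this particular symptom, you can count how often the error shows up in its log. A small sketch: `count_cosm_errors` is a made-up helper name, and the log file is whatever your client writes to (FAHlog.txt in the launch directory is assumed in the note below).

```shell
# count_cosm_errors is a hypothetical helper name.
# Prints how many lines of the given log contain the CosmHTTPOpen failure.
# Note: grep exits non-zero when the count is 0, but still prints 0.
count_cosm_errors() {
    grep -c 'Could not CosmHTTPOpen' "$1"
}
```

Something like `count_cosm_errors /folding/FAHlog.txt` before and after enabling nscd should show whether new failures are still being logged, assuming that is where your log lives.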

And it was just that nscd wasn't running! So it was what I had been interpreting as a DNS issue after all. Man, that was an easy fix.

Thanks for the help guys. Another Linux box for the [H]orde
 
Awesome you were able to resolve the issue and thanks to Tobit for helping out. :cool:
 
My work desktop

i7 950 running a mild overclock of 3.26 GHz, since it is my work desktop and has a stock heatsink.
It has 12 GB (3x4) of Mushkin Silverline 1333, which doesn't OC very well (big and slow works in this situation). Above 143 BCLK I lose two sticks; it is at 140 BCLK currently.

Asrock X58 Xtreme
GT240 512 meg DDR5 (work machine with nice dual head)
http://www.newegg.com/Product/Product.aspx?Item=N82E16814125304&cm_re=512_240-_-14-125-304-_-Product
Kingston 64 gig SSD
WD Black 2 TB

Whole wish list at the egg
http://secure.newegg.com/WishList/PublicWishDetail.aspx?WishListNumber=10033189
although the monitor changed slightly and I got the ram from Mushkin directly because it was cheaper and the support is better that way.

It is a work desktop that can virtualize Win 7 Pro along with various other OSes. Its main function is to provide me with terminal sessions to go out to servers. When I need some Windows crap, I either RDP into something or run a virtual machine.

The previous machine I was using was an HP Athlon 64 X2 running at 1.8 GHz or so with 2 GB of RAM max, which barely ran itself. I am not missing it!
 