Hello everybody,
I have been trying to get some work done on my Linux machines but so far ALL my WUs ended in an error.
One of the problem seems to be related to ZIP (Program not found) (http://dg.imp.kiev.ua/slinca/result.php?resultid=234524).
After adding a symbolic link to gzip the errors changed to Signal 11 errors.
Has anyone successfully completed any work on Linux ?
BTW: I'm running 64Bit Linux only.
Regards,
Michael
Hi SLinCA-Yuri,
Thanks for the replay (and the credits )
I'm glad I could help. I'll keep an eye on the project and run sum WUs from time to time to see if anything is changing. Always glad to support another physics project.
Michael
Update:
I've been doing some more testing.
I have removed the link to gzip and compiled and installed a zip from source (into /usr/local/bin).
BOINC and the science applications are running as the user boinc. The zip command was in the search path of the boinc user. The two test-units that I ran (http://dg.imp.kiev.ua/slinca/result.php?resultid=249311, http://dg.imp.kiev.ua/slinca/result.php?resultid=247811) ended in a computation error with a zip not found error message.
Second test:
I added a symbolic link /usr/bin/zip pointing to the newly installed /usr/local/bin/zip. Since then I have run more than ten WUs and all have completed without error and validated.
The only explanations I can think of are:
1) Although the top command shows that the applications are executed with the user boinc, they are run as the user root.
The difference between the two users (root and boinc) is the configuration of the search path. The root user does not have the directory /usr/local/bin in its search path only /usr/bin. The boinc user has both directories in its search path. So, if the zip command is installed into /usr/local/bin the user boinc should find it, the user root should not. After adding the symbolic link to /usr/bin the root user can find the zip command as well. However, I don't believe it's a likely explanation.
2) Your application has a hard-coded path to /usr/local/bin/zip in its call to the zip command which requires zip to be present in /usr/local/bin. Maybe you could check that ?
Anyways, it seems to work now
Regards,
Michael
Edit:
It seems that I was too optimistic
I have a computation error again. However, It's the Signal 11 error instead of the zip error. (http://dg.imp.kiev.ua/slinca/result.php?resultid=265600)
since the client app update yesterday from 32.34 to 32.38
I am getting signal 11 on all linux WUs (x86_64) after many hours of computing
please fix
same zip issue repeating in Win_x86_64 with client 32.38 as happened before
http://dg.imp.kiev.ua/slinca/result.php?resultid=651457
again after many hours of computing
please fix
same client errors again as a few months ago? very frustrating.
Invision Power Board
© Invision Power Services