computing error

Message boards : Number crunching : computing error

Author Message
W.A.R.co
Joined: 5 Jan 12
Posts: 1
Credit: 266
RAC: 0
Message 713 - Posted: 4 Nov 2012, 12:19:31 UTC

All my CAS WUs finish with a computing error.

archeye*
Joined: 3 Oct 12
Posts: 5
Credit: 965
RAC: 0
Message 714 - Posted: 4 Nov 2012, 18:52:24 UTC - in response to Message 713.

Hi,

Same for mine.

I checked the work unit details, and all the returned ones also ended with a computing error.

Someone needs to fix something (before sending more work units out)!

regards,

wenjing wu
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Joined: 13 Sep 10
Posts: 161
Credit: 751,216
RAC: 0
Message 725 - Posted: 8 Nov 2012, 2:23:27 UTC - in response to Message 713.

Thanks for your feedback!
This seems to be happening only on a certain number of hosts.
The majority of the results we get are good.
I will have a look at your hosts.


____________
加油!CAS@home!我们帮助科学家跟时间赛跑!
Go CAS@home! We help scientists to race against time!

wenjing wu
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Joined: 13 Sep 10
Posts: 161
Credit: 751,216
RAC: 0
Message 727 - Posted: 8 Nov 2012, 2:30:31 UTC - in response to Message 714.

Hi, I checked your host. It is the Tsinghua application. I have seen this same error before, and I think I can fix it!


Hi,

I agree for mine too.

I checked the work unit details and all the ones returned also had computing error.

Someone needs to fix something!! (before sending more work units out)

regards,



wenjing wu
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Joined: 13 Sep 10
Posts: 161
Credit: 751,216
RAC: 0
Message 728 - Posted: 8 Nov 2012, 2:37:37 UTC - in response to Message 713.

It looks like all TreeThreader application WUs exit right away on your host.
It would be useful if you could execute the binary file:
ICT_treeThreader_1.00_windows_intelx86

in your project directory from a command-line terminal and see what error message it shows.
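
A minimal sketch of the check suggested above, assuming Python is available on the host. It launches a program, captures whatever it prints, and returns the exit status so it can be shown in hex. The helper name `run_and_report` is hypothetical, and the demo command is only a harmless stand-in for the real binary.

```python
import subprocess
import sys


def run_and_report(cmd):
    """Run cmd (a list of strings); return (exit status, combined output)."""
    proc = subprocess.run(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # merge stderr into stdout so nothing is lost
        text=True,
    )
    # Mask to 32 bits so Windows status values are easy to compare in hex.
    status = proc.returncode & 0xFFFFFFFF
    return status, proc.stdout


# Stand-in command; replace the list with the path to the project binary.
status, output = run_and_report([sys.executable, "-c", "print('self-test ok')"])
print(f"exit status: 0x{status:08x}")
print(output)
```

To try it on the real application, pass something like `["ICT_treeThreader_1.00_windows_intelx86.exe"]` from inside the project directory. On Windows a missing DLL may show up as a dialog box rather than as console text, so the exit status is often the more useful clue.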



all my cas wu finish with computing error


NATE1
Joined: 17 Dec 11
Posts: 31
Credit: 66,877
RAC: 0
Message 731 - Posted: 8 Nov 2012, 4:36:58 UTC - in response to Message 728.

It looks like all TreeThreader application wus exits right away on your host.
It would be useful if you can execute the binary file :
ICT_treeThreader_1.00_windows_intelx86

in your project directory on a command line terminal and see what error msg it pops up..



all my cas wu finish with computing error


It looks like I had a Win 7 host failing all of its WUs the same way. As it turns out, that host was missing

Msvcr71.dll and Msvcr70.dll

which are part of:
Microsoft .NET Framework Version 1.1 Redistributable Package
and
.NET Framework SDK Version 1.1

So go to the following two links, then download and install both:

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217


It will complain about compatibility, but I installed it anyway.
The work units were failing at 3 seconds, and now they are running.

Hope this helps.
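
A rough way to check whether the two DLLs named above are present, sketched in Python. The function name `find_missing` and the folder list are assumptions for illustration. It only compares file names in the listed folders and does not reproduce the full Windows DLL search order.

```python
import os


def find_missing(dll_names, search_dirs):
    """Return the DLL names not found in any of search_dirs (case-insensitive)."""
    present = set()
    for directory in search_dirs:
        if os.path.isdir(directory):
            present.update(name.lower() for name in os.listdir(directory))
    return [name for name in dll_names if name.lower() not in present]


# Assumed locations: on 64-bit Windows, 32-bit system DLLs normally live in SysWOW64.
windir = os.environ.get("WINDIR", r"C:\Windows")
dirs = [os.path.join(windir, "SysWOW64"), os.path.join(windir, "System32")]
print(find_missing(["Msvcr71.dll", "Msvcr70.dll"], dirs))
```

An empty list means both files were found in at least one of the folders. Any name that is printed is a candidate for the installs described above.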

Profile ritterm
Joined: 21 Jul 10
Posts: 17
Credit: 157,228
RAC: 0
Message 743 - Posted: 9 Nov 2012, 15:31:01 UTC
Last modified: 9 Nov 2012, 15:31:15 UTC

Hello,

I'm getting compute errors on all WU's run by two of my four Windows 7 hosts. Here are examples from each:

Result 19914251
Result 19912217

Both return "195 (0xc3) EXIT_CHILD_FAILED" in the stderr output. This appears to be the same error reported by W.A.R.co in the original post, but different from the one reported by archeye. I don't know whether this is the same error as NATE1's, because his/her computers are hidden.

@NATE1: What is the specific type of error you're getting? I would rather not try your solution without knowing if your errors are the same as mine.

@wenjing wu: Thanks for being a responsive admin! Can you provide more information on running the ICT_treeThreader_1.00_windows_intelx86 binary? It looks like it needs additional parameters such as <config>, <query-name>, and <template-list-file>.

Regards,

MarkR
____________

NATE1
Joined: 17 Dec 11
Posts: 31
Credit: 66,877
RAC: 0
Message 746 - Posted: 9 Nov 2012, 22:41:15 UTC - in response to Message 743.
Last modified: 9 Nov 2012, 22:45:18 UTC

Hello, ....
Regards,

MarkR



<core_client_version>7.0.38</core_client_version>
<![CDATA[
<message>
- exit code 195 (0xc3)
</message>
<stderr_txt>
wrapper: starting
03:57:34 (3492): wrapper: running run-treeThreader.exe ()
app exit status: 0xc0000135
03:57:35 (3492): called boinc_finish

</stderr_txt>
]]>
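
For anyone comparing their own stderr output with the one above, here is a small Python sketch that pulls the hex codes out of the text and labels the ones seen in this thread. The table `KNOWN` and the function name `explain_stderr` are made up for this example. 0xC0000135 is the Windows status STATUS_DLL_NOT_FOUND, which fits the missing-DLL explanation. The label for 0xc3 is simply the name quoted earlier in the thread.

```python
import re

# Codes seen in this thread; anything else is reported as "unknown".
KNOWN = {
    0xC0000135: "STATUS_DLL_NOT_FOUND (Windows could not locate a required DLL)",
    0xC3: "EXIT_CHILD_FAILED (BOINC exit code 195, as quoted in this thread)",
}


def explain_stderr(stderr_text):
    """Find hex codes such as 0xc0000135 in the text and label the known ones."""
    found = []
    for match in re.finditer(r"0x[0-9a-fA-F]+", stderr_text):
        code = int(match.group(0), 16)
        found.append((match.group(0), KNOWN.get(code, "unknown")))
    return found


sample = """- exit code 195 (0xc3)
app exit status: 0xc0000135"""
for raw, meaning in explain_stderr(sample):
    print(raw, "->", meaning)
```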

The way I found out was to go to the project directory and just execute the apps.
I forget which app, but one said it could not run because the first file was missing, and another said the second was missing on the computer. Google showed which packages contain the missing DLLs. I downloaded and installed the first, and one of the missing-DLL errors was gone. I downloaded and installed the second, and not only was the second missing-DLL error gone, but the WUs, which had been failing at 3 seconds, ran and verified.

Edit: the reason they are hidden is that someone, or several people, used the information to hack my network and tried to kill off some of my computers last year.

Profile ritterm
Joined: 21 Jul 10
Posts: 17
Credit: 157,228
RAC: 0
Message 747 - Posted: 10 Nov 2012, 1:00:45 UTC - in response to Message 746.
Last modified: 10 Nov 2012, 1:07:50 UTC

NATE1 said...

- exit code 195 (0xc3)

Okay, so it looks like I'm having the same error.

NATE1 also said...

...the way I found out was to go the project dir. and just execute the apps...

And that's what I did, I think. Here's the output I get:

C:\ProgramData\BOINC\projects\casathome.ihep.ac.cn>ICT_treeThreader_1.00_windows_intelx86.exe
usage1: ICT_treeThreader_1.00_windows_intelx86 <config> <query-name> <template-list-file>

I may be doing something wrong, but it looks to me like the app is expecting some other parameters. That's why I asked wenjing wu for some guidance. Also, I get this same behavior on my hosts that aren't having any problems with CAS WU's.

Finally, NATE1 said...

...the reason they are hidden, someone, or someones, use the information to hack my network and tried to kill off some of my computers last year.

I totally understand. I was merely pointing out that I could not compare my error to yours because I could not see your results. :-)
____________

Tex1954
Joined: 23 Apr 11
Posts: 38
Credit: 811,612
RAC: 0
Message 755 - Posted: 11 Nov 2012, 20:19:08 UTC
Last modified: 11 Nov 2012, 20:21:20 UTC

I'm getting two weird problems.

#1: The new apps refuse to run on any of my AMD systems.

#2: The apps seem to complete, but exit badly, generating an App Crash on my Intel system.

Problem signature:
Problem Event Name: APPCRASH
Application Name: ICT_treeThreader_1.00_windows_intelx86.exe
Application Version: 0.0.0.0
Application Timestamp: 502e0498
Fault Module Name: msvcrt.dll
Fault Module Version: 7.0.7601.17744
Fault Module Timestamp: 4eeaf722
Exception Code: 40000015
Exception Offset: 0006680c
OS Version: 6.1.7601.2.1.0.256.48
Locale ID: 1033
Additional Information 1: 9167
Additional Information 2: 91672809c9032b4b34782548664839d6
Additional Information 3: 1e54
Additional Information 4: 1e54c2abbed12bd256369a59ded8746a
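
A short Python sketch for reading a crash report like the one above: it splits the "Key: value" lines into a dictionary and converts the exception code to a number. The function name `parse_signature` is hypothetical. The value 0x40000015 corresponds to the Windows status STATUS_FATAL_APP_EXIT. What triggers it inside the application is not something this sketch can tell.

```python
def parse_signature(text):
    """Turn 'Key: value' lines into a dict, skipping lines without a value."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and value.strip():
            fields[key.strip()] = value.strip()
    return fields


sample = """Problem signature:
Problem Event Name: APPCRASH
Fault Module Name: msvcrt.dll
Exception Code: 40000015
Exception Offset: 0006680c"""

fields = parse_signature(sample)
code = int(fields["Exception Code"], 16)  # the report prints hex without a 0x prefix
print(fields["Fault Module Name"], hex(code))
```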


I have no idea what is going on, but all systems run Win7-64b...

Sigh...

I thought I had all the latest runtimes installed... have to check again I suppose..

8-)

NATE1
Joined: 17 Dec 11
Posts: 31
Credit: 66,877
RAC: 0
Message 756 - Posted: 12 Nov 2012, 0:19:52 UTC - in response to Message 755.

I'm getting two weird problems.



Problem signature:
Problem Event Name: APPCRASH
Application Name: ICT_treeThreader_1.00_windows_intelx86.exe
Application Version: 0.0.0.0
Application Timestamp: 502e0498
Fault Module Name: msvcrt.dll
Fault Module Version: 7.0.7601.17744
Fault Module Timestamp: 4eeaf722
Exception Code: 40000015
Exception Offset: 0006680c
OS Version: 6.1.7601.2.1.0.256.48
Locale ID: 1033
Additional Information 1: 9167
Additional Information 2: 91672809c9032b4b34782548664839d6
Additional Information 3: 1e54
Additional Information 4: 1e54c2abbed12bd256369a59ded8746a


I have no idea what is going on, but all systems run Win7-64b...

Sigh...

I thought I had all the latest runtimes installed... have to check again I suppose..

8-)




Install the 32-bit DLL:
http://support.microsoft.com/kb/259403

Profile ritterm
Joined: 21 Jul 10
Posts: 17
Credit: 157,228
RAC: 0
Message 766 - Posted: 12 Nov 2012, 15:48:37 UTC - in response to Message 731.

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

Arrgh... I installed the first item (.NET Framework v1.1) on the two hosts I'm having problems with. But when I try to install the second item (.NET Framework SDK v1.1), it stops, saying that the .NET Framework v1.1 must be installed first. Didn't I just do that with the first item?
____________

NATE1
Joined: 17 Dec 11
Posts: 31
Credit: 66,877
RAC: 0
Message 767 - Posted: 12 Nov 2012, 16:50:23 UTC - in response to Message 766.
Last modified: 12 Nov 2012, 17:40:13 UTC

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

Arrgh...I installed the first item (.NET Framework v1.1) on the two hosts I'm having problems with. But, when trying to install the second item (.NET Framework SDK v1.1), it stops saying that the .NET Framework v1.1 must be installed first. Didn't I just do that with the first item?



You may have to do a little more work; look here:

http://www.naturalprogramming.com/csbook_download_dotnet_framework_etc.html


Edit: link to a Windows equivalent of the Linux ldd tool:

http://dependencywalker.com/

Also, a little more info:

http://msdn.microsoft.com/en-us/library/abx4dbyh(v=vs.80).aspx

Profile ritterm
Joined: 21 Jul 10
Posts: 17
Credit: 157,228
RAC: 0
Message 768 - Posted: 13 Nov 2012, 2:36:15 UTC - in response to Message 766.
Last modified: 13 Nov 2012, 2:38:34 UTC

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

Arrgh...I installed the first item (.NET Framework v1.1) on the two hosts I'm having problems with. But, when trying to install the second item (.NET Framework SDK v1.1), it stops saying that the .NET Framework v1.1 must be installed first. Didn't I just do that with the first item?

Okay... Even though the .NET v1.1 installer didn't say that I had to, I rebooted both hosts and ran the SDK v1.1 install again. This time the install ran fine on one host (Intel), but still failed on the other (AMD). Both are Win7-64 Home Premium, SP1, and at the same update level as best I can tell. I may just leave it at that... 3 out of 4 isn't bad...
____________

Profile ritterm
Joined: 21 Jul 10
Posts: 17
Credit: 157,228
RAC: 0
Message 769 - Posted: 13 Nov 2012, 22:17:13 UTC - in response to Message 731.
Last modified: 13 Nov 2012, 22:17:31 UTC

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

I can confirm that this worked for me on one of my hosts that previously had compute errors. For some reason, I can't get the SDK package to install on the other host.

@NATE1: Good catch and thanks for the tip! :-)
____________

Tex1954
Joined: 23 Apr 11
Posts: 38
Credit: 811,612
RAC: 0
Message 770 - Posted: 14 Nov 2012, 14:31:14 UTC - in response to Message 769.
Last modified: 14 Nov 2012, 15:05:28 UTC

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

I can confirm that this worked for me on one of my hosts that previously had compute errors. For some reason, I can't get the SDK package to install on the other host.

@NATE1: Good catch and thanks for the tip! :-)


This worked for me on my AMD systems as well. However, I still get App Crash messages sometimes on ALL systems... I've seen this before on another project and they finally got it fixed, but I have no idea how. It only happens when the application completes and exits...

8-)

NATE1
Joined: 17 Dec 11
Posts: 31
Credit: 66,877
RAC: 0
Message 771 - Posted: 14 Nov 2012, 16:05:25 UTC - in response to Message 769.

so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

I can confirm that this worked for me on one of my hosts that previously had compute errors. For some reason, I can't get the SDK package to install on the other host.

@NATE1: Good catch and thanks for the tip! :-)


Good to hear that.

Also, you may want to consider the following:
install version 1.0, then 1.1, and then bring it up to the present version.
M$ has always had this "pain-in-the-butt" habit of not letting you install
one version of an app (software package) unless the previous version is already installed. Then, when you upgrade, files from the previous version are left on your computer, and programs (apps) fail on the new version because they need those older files, which are not in the newest version.

"catch 22"

Profile skgiven
Joined: 11 Sep 10
Posts: 11
Credit: 123,324
RAC: 0
Message 777 - Posted: 16 Nov 2012, 10:04:39 UTC - in response to Message 713.

My tasks seem to be completing, reporting and being credited ok, but such runtime variation!?!

Task      Work unit  Computer  Sent                        Time reported               Status                   Run time (sec)  CPU time (sec)  Credit  Application
19947894  9282026    26712     15 Nov 2012 | 18:13:40 UTC  16 Nov 2012 | 9:15:19 UTC   Completed and validated  39,375.96       0.00            22.87   ICT Protein Structure Prediction(2nd Generation) v1.00
19947893  9282025    26712     15 Nov 2012 | 18:13:40 UTC  15 Nov 2012 | 22:53:59 UTC  Completed and validated  2,117.21        0.00            6.39    ICT Protein Structure Prediction(2nd Generation) v1.00
19946327  9281306    26712     15 Nov 2012 | 12:20:32 UTC  15 Nov 2012 | 22:42:26 UTC  Completed and validated  1,544.18        0.00            16.43   ICT Protein Structure Prediction(2nd Generation) v1.00
19946326  9281305    26712     15 Nov 2012 | 12:20:32 UTC  15 Nov 2012 | 22:43:35 UTC  Completed and validated  1,584.05        0.00            6.66    ICT Protein Structure Prediction(2nd Generation) v1.00
19943731  9280189    26712     15 Nov 2012 | 0:01:12 UTC   15 Nov 2012 | 10:25:56 UTC  Completed and validated  1,326.67        0.00            13.92   ICT Protein Structure Prediction(2nd Generation) v1.00
19943729  9280188    26712     15 Nov 2012 | 0:01:12 UTC   15 Nov 2012 | 10:27:28 UTC  Completed and validated  300.38          230.41          2.77    ICT Protein Structure Prediction(2nd Generation) v1.00
19943727  9280187    26712     15 Nov 2012 | 0:01:12 UTC   15 Nov 2012 | 16:11:47 UTC  Completed and validated  20,740.99       0.00            18.38   ICT Protein Structure Prediction(2nd Generation) v1.00
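
To put a number on the variation, here is a quick Python sketch over the seven run times listed above. It assumes the first of the three numeric values before the application name is the run time in seconds, as in the usual BOINC task list.

```python
import statistics

# Run times in seconds, copied from the seven tasks listed above.
run_times = [39375.96, 2117.21, 1544.18, 1584.05, 1326.67, 300.38, 20740.99]

shortest, longest = min(run_times), max(run_times)
print(f"shortest: {shortest:.2f} s")
print(f"longest:  {longest:.2f} s")
print(f"median:   {statistics.median(run_times):.2f} s")
print(f"longest/shortest ratio: {longest / shortest:.1f}")
```

The longest task ran more than a hundred times as long as the shortest one, while the median sits near 1,600 seconds.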
____________

astroWX
Joined: 27 Sep 10
Posts: 29
Credit: 147,739
RAC: 0
Message 790 - Posted: 18 Nov 2012, 7:13:13 UTC

Test-programs crash on Q9300, Vista_x64, 32-bit boinc 6.2.19:

11/17/2012 10:32:56 PM|CAS@home|Starting task batch_762_1236_1 using LAMMPS version 118
11/17/2012 10:33:21 PM|CAS@home|[error] Can't rename output file batch_762_1236_1_lammps_result
11/17/2012 10:33:26 PM|CAS@home|[error] Can't rename output file batch_762_1236_1_lammps_DumpFiles.zip
11/17/2012 10:33:26 PM|CAS@home|Computation for task batch_762_1236_1 finished
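
If the log is long, the error lines can be picked out with a few lines of Python. This is only a sketch: it assumes the `timestamp|project|message` layout seen in the lines above, and the function name `parse_log_line` is made up.

```python
def parse_log_line(line):
    """Split 'timestamp|project|message' into a dict; the message may contain '|'."""
    timestamp, project, message = line.split("|", 2)
    return {
        "timestamp": timestamp,
        "project": project,
        "message": message,
        "is_error": message.startswith("[error]"),
    }


log = """11/17/2012 10:32:56 PM|CAS@home|Starting task batch_762_1236_1 using LAMMPS version 118
11/17/2012 10:33:21 PM|CAS@home|[error] Can't rename output file batch_762_1236_1_lammps_result
11/17/2012 10:33:26 PM|CAS@home|[error] Can't rename output file batch_762_1236_1_lammps_DumpFiles.zip
11/17/2012 10:33:26 PM|CAS@home|Computation for task batch_762_1236_1 finished"""

entries = [parse_log_line(line) for line in log.splitlines()]
errors = [entry for entry in entries if entry["is_error"]]
for entry in errors:
    print(entry["timestamp"], entry["message"])
```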


A test program is running okay on a similarly-configured Q6600. (Estimated run time started at 600 hours! No problem, boinc is figuring it out: 25% complete after an hour and a half; estimated time is down to 338:38 and is in free-fall.)

ICT tasks fail on an i5 3550 in W7_x64; can't rename the output result files.
____________
Greetings from the US Pacific Northwest.

