Posts by ritterm
log in
1) Message boards : Number crunching : Open checkpoint file error? (Message 1032)
Posted 1881 days ago by Profile ritterm
I've been getting a few of these errors recently on my host 26742 running the ICT tasks. Here's a typical stderr output:

<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 195 (0xc3)
</message>
<stderr_txt>
wrapper: starting
15:10:11 (5928): wrapper: running run-treeThreader.exe ()
Warning: Open checkpoint file error.
app exit status: 0x1
16:27:23 (5928): called boinc_finish

</stderr_txt>
]]>
2) Message boards : News : Updates on Nanotechnology application (Message 941)
Posted 1999 days ago by Profile ritterm
Task 20084369 failed... 6 more days of lost crunching. I'm suspending app 1.25 tasks while I consider aborting them... I don't want to waste any more time. :-(
3) Message boards : News : Updates on Nanotechnology application (Message 940)
Posted 2003 days ago by Profile ritterm
For any others concerned about very long running tasks, Wenjing provide this feedback in a reply to a PM:

Wenjing Wu wrote:
I have talked to the scientists, they confirmed that there is nothing wrong in the job itself which would cause the termination, so I contacted the BOINC developer to see if there is anything might cause this from the client side. I understand your concern about it, and this is definetely something we need to solve to avoild wasting crunchers' CPU time!!


This situation my be overcome by the new application she mentions previously, but, it might be worth letting any 1.25 apps run to completion. I'm letting this task run and hope that it finishes successfully. At the time of this post, it's has just over 6 days of runtime, but is only ~14% complete.
4) Message boards : News : Updates on Nanotechnology application (Message 919)
Posted 2015 days ago by Profile ritterm
Compute errors on these Nano Tech tasks:

20059991
20052196
20043050
20042137

Similar stderr output on all:

close failed: [Errno 9] Bad file descriptor
20:23:00 (4484): wrapper: running parse_result.exe ()
Traceback (most recent call last):
File "lammps_parse_result_1.00_windows_intelx86.py", line 19, in <module>
IOError: [Errno 9] Bad file descriptor
close failed: [Errno 2] No such file or directory
app exit status: 0xff
20:23:05 (4484): called boinc_finish

</stderr_txt>
]]>


8 days of crunching down the drain... :-(
5) Message boards : News : Updates on Nanotechnology application (Message 918)
Posted 2016 days ago by Profile ritterm
Which do you think the BOINC manager will prefer to run, one with deadline next summer or other shorter deadline.

What about the resource share setting? If your setting for CAS is the same as your other projects, won't that even out the amount of work done even if the CAS deadline is far in the future?

On all of my hosts, I have these distant-deadline CAS tasks running just fine alongside other tasks with deadlines ranging from days to weeks in the future.
6) Message boards : Number crunching : Long Running Nano Tech Task (Message 915)
Posted 2017 days ago by Profile ritterm
I missed the information addressing this issue in the topic Updates on Nanotechnology application.
7) Message boards : Number crunching : Nano Tech tasks with deadlines far in the future (Message 914)
Posted 2017 days ago by Profile ritterm
I missed the information addressing this issue in the topic Updates on Nanotechnology application.
8) Message boards : Number crunching : Long Running Nano Tech Task (Message 910)
Posted 2020 days ago by Profile ritterm
My C2Q/Win7-64 host has been working on Task 20042137 for over 40 hours and is showing about 25% complete. Is this expected? If not, should I abort?

Thanks for any feedback.

MarkR
9) Message boards : News : Check out your contribution here for ThreeThreader (Message 894)
Posted 2023 days ago by Profile ritterm
Would you like to see the new protein structure predicted by your computer?

Yes...Thank you very much! It would be nice if all projects provided this kind of feedback to the volunteers. Well done. :-)

Regards,

MarkR
10) Message boards : Number crunching : Nano Tech tasks with deadlines far in the future (Message 892)
Posted 2024 days ago by Profile ritterm
At the time of this post, I have 3 tasks with deadlines 7-8 months into the future.
11) Message boards : Number crunching : Unsent Nano Tech Tasks (Message 886)
Posted 2025 days ago by Profile ritterm
Hey Stranger, we're a long way from home here!

Ahoy, there, BK! Yes, indeed, we are! :D

I'm seeing the same issues as well Mark. I have 30 pendings and only 8 have a wingman. The other 22 are unsent. It appears this project does not give as high a priority to geting WU's resent. These are strange WU's, some finish in well under an hour while others take almost a day. I guess it all contributes to making this an interesting hobby.

Since my first post, two WU's have been resent, returned, and validated, so it seems that the work is moving slowly, at least. It certainly does keep things interesting...
12) Message boards : Number crunching : Unsent Nano Tech Tasks (Message 845)
Posted 2028 days ago by Profile ritterm
I have 10 pending Nano Tech tasks that are waiting for the task to be sent out to a wingman. Each task sent to other hosts has failed by compute error, timed out, or been abandoned. Is there a way of knowing if these are going to be sent back out for completion anytime soon?

Thanks,

MarkR
13) Message boards : Number crunching : computing error (Message 769)
Posted 2044 days ago by Profile ritterm
so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

I can confirm that this worked for me on one of my hosts that previously had compute errors. For some reason, I can't get the SDK package to install on the other host.

@NATE1: Good catch and thanks for the tip! :-)
14) Message boards : Number crunching : computing error (Message 768)
Posted 2044 days ago by Profile ritterm
so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

Arrgh...I installed the first item (.NET Framework v1.1) on the two hosts I'm having problems with. But, when trying to install the second item (.NET Framework SDK v1.1), it stops saying that the .NET Framework v1.1 must be installed first. Didn't I just do that with the first item?

Okay...Even though the .NET v1.1 install didn't say that you had to do this, I rebooted both hosts and ran the install for the SDK v1.1 package again. This time the install ran fine on one host (Intel), but still failed on the other (AMD). Both are Win7-64 Home Premium, SP1, and at the same update level as best I can tell. I may just leave it that...3 out of 4 isn't bad...
15) Message boards : Number crunching : computing error (Message 766)
Posted 2045 days ago by Profile ritterm
so go to the following 2 links and download and install

http://www.microsoft.com/en-us/download/details.aspx?id=26

http://www.microsoft.com/en-us/download/details.aspx?id=16217

Arrgh...I installed the first item (.NET Framework v1.1) on the two hosts I'm having problems with. But, when trying to install the second item (.NET Framework SDK v1.1), it stops saying that the .NET Framework v1.1 must be installed first. Didn't I just do that with the first item?
16) Message boards : Number crunching : computing error (Message 747)
Posted 2048 days ago by Profile ritterm
NATE1 said...
- exit code 195 (0xc3)

Okay, so it looks like I'm having the same error.

NATE1 also said...

...the way I found out was to go the project dir. and just execute the apps...

And that's what I did, I think. Here's the output I get:

C:\ProgramData\BOINC\projects\casathome.ihep.ac.cn>ICT_treeThreader_1.00_windows
_intelx86.exe
usage1: ICT_treeThreader_1.00_windows_intelx86 <config> <query-name> <template-l
ist-file>

I may be doing something wrong, but it looks to me like the app is expecting some other parameters. That's why I asked wenjing wu for some guidance. Also, I get this same behavior on my hosts that aren't having any problems with CAS WU's.

Finally, NATE1 said...

...the reason they are hidden, someone, or someones, use the information to hack my network and tried to kill off some of my computers last year.

I totally understand. I was merely pointing out that I could not compare my error to yours because I could not see your results. :-)
17) Message boards : Number crunching : computing error (Message 743)
Posted 2048 days ago by Profile ritterm
Hello,

I'm getting compute errors on all WU's run by two of my four Windows 7 hosts. Here are examples from each:

Result 19914251
Result 19912217

Both return "195 (0xc3) EXIT_CHILD_FAILED" in the stderr output. This appears to be the same error reported by W.A.R.co in the original post, but different than that reported by archeye. I don't know if this is the same error as NATE1 because his/her computers are hidden.

@NATE1: What is the specific type of error you're getting? I would rather not try your solution without knowing if your errors are the same as mine.

@wenjing wu: Thanks for being a responsive admin! Can you provide more information on running the ICT_treeThreader_1.00_windows_intelx86 binary? It looks like it needs additional parameters such as <config>, <query-name>, and <template-list-file>

Regards,

MarkR