Posts by swiftmallard

1) Message boards : Number crunching : Out of Work (Message 1791)
Posted 2 Sep 2022 by swiftmallard
Post:
Tasks ready to send 0

This is unusual for this project. I will be out of work in about an hour.
It's been like that for quite a while. Some of my computers are out of work for this project over lengthy time periods :-( Too bad :-(


https://quchempedia.univ-angers.fr/athome/forum_thread.php?id=183&postid=1783
2) Message boards : News : Project update (Message 1789)
Posted 30 Aug 2022 by swiftmallard
Post:
This has been the most educational project I have ever crunched, and it has been my pleasure to have taken part.
I thank you, and you have my best wishes for the future!
3) Message boards : Number crunching : Stuck tasks (Message 1777)
Posted 19 Aug 2022 by swiftmallard
Post:
I'm on Windows 8.1 x64, and have just set no new tasks again. There are too many issues here. I'm seeing the "Postponed: VM job unmanageable restarting later" issue on pretty much all work units, but with the long deadline, I've ignored that; they start again after 24 hours. This morning, I saw another running, 100% done, at 26 hours. The project is not suitable for production as it is (and as it was before...).

Agreed, it is useless. One of my Win10 boxes gets many VM postponed unmanageable tasks. If I stop BOINC and restart it, the tasks begin again from 0% - why is there no checkpointing implemented?
If it wasn't a sprint project for BOINCgames I wouldn't be running it.

There has been no Windows developer associated with the project for three(?) years now and little is going to change anytime soon. The last time any Windows development was done, the current VirtualBox version was 5.2.xx and it remains the best version for this project. The issue then arises that many use VB for other projects, projects which benefit from using the most current version. I use ver. 5.2.38 and still get an occasional "Postponed..." message, but not nearly as many as I did using ver. 6.xx. The stuck tasks issue comes from the BOINC Manager losing contact with VirtualBox and is one of the hazards of running a Linux-based project on a Windows machine. This project will accept all the babysitting you care to give it. This issue happens with all of us Windows users and we just deal. In my view, it's just a cost of doing real science.
4) Message boards : Number crunching : High failure rate (Message 1772)
Posted 2 Aug 2022 by swiftmallard
Post:
I am using VB ver 5.2.38 so that makes sense, and with the project no longer having a Windows developer, that's not going to change anytime soon. The part about the COM interface is beyond me. But thank you for confirming it wasn’t just my imagination.
5) Message boards : Number crunching : High failure rate (Message 1770)
Posted 2 Aug 2022 by swiftmallard
Post:
I re-enabled work fetch from the project to see if the earlier issues were just a memory. It downloaded 18 work units. Four jobs failed after a short period (i.e., less than two minutes), with an exit status of a helpful 0x00000000. The remainder started running, but within an hour, all had entered the "Postponed: VM job unmanageable, restarting later." state. "Later" appears to be 24 hours. With the long deadline, this appears to be tolerable; however, it simply makes a mess of the BOINC Manager screen. The exit status for these completed units is also 0x00000000, so clearly, failures are not discriminated against... I enabled work fetch again, and since doing so, four more units have arrived. I'll leave it running and see what happens.

Off topic:

This keeps appearing:

Your connection is not private
Attackers might be trying to steal your information from quchempedia.univ-angers.fr (for example, passwords, messages or credit cards). Learn more
NET::ERR_CERT_DATE_INVALID

Are you crunching on all 8 processors? If so, freeing one up worked for me. I see far fewer of the "Postponed..." messages any more. I also downgraded the VirtualBox version. I don't understand why, but it seemed to help.

I too have seen the certificate invalid message, but BOINC manages to get work after a minute or so.
6) Questions and Answers : Windows : CPU time vs elapsed time (Message 1768)
Posted 28 Jul 2022 by swiftmallard
Post:
I have used Windows & VirtualBox the entire time I have run this project and have never seen an instance where BOINC says the CPU time is greater than the elapsed time.

Application NWChem 0.11 (vbox64_t1)
Name cl9_athome_b3lyp-321gd,batch106,001066086,nwchem,1656511236
State Running
Received 7/28/2022 11:50:02 AM
Report deadline 8/11/2022 11:50:00 AM
Estimated computation size 20,000 GFLOPs
CPU time 03:52:10
CPU time since checkpoint 00:02:39
Elapsed time 03:44:00
Estimated time remaining 00:55:36
Fraction done 80.111%
Virtual memory size 55.16 MB
Working set size 2.00 GB
Directory slots/3
Process ID 4528
Progress rate 21.600% per hour
Executable vboxwrapper_26200_windows_x86_64.exe


Any ideas how this can happen?
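
For reference, the size of the gap in the Properties listing above can be worked out directly. This is only a minimal Python sketch using the two times shown there; the helper name `to_seconds` is made up for illustration, and it assumes plain HH:MM:SS values with no day prefix.

```python
def to_seconds(hms: str) -> int:
    """Convert an HH:MM:SS string (as shown in task Properties) to whole seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


# Values copied from the task Properties listing above.
cpu_time = to_seconds("03:52:10")
elapsed_time = to_seconds("03:44:00")

# A positive gap means CPU time is ahead of elapsed (wall-clock) time.
gap = cpu_time - elapsed_time
print(f"CPU time exceeds elapsed time by {gap // 60} min {gap % 60} s")
```

For this task the gap comes to 490 seconds, i.e. 8 min 10 s.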
7) Questions and Answers : Web site : NET::ERR_CERT_DATE_INVALID (Message 1764)
Posted 22 Jul 2022 by swiftmallard
Post:
7/21/2022 9:52:30 PM | QuChemPedIA@home | Scheduler request failed: Peer certificate cannot be authenticated with given CA certificates

I am unable to get tasks

[edit] OK, I got work on the next try. But this issue should receive some attention. [/edit]
8) Message boards : Number crunching : Host ID 1388 corrupted (Message 1523)
Posted 23 Oct 2021 by swiftmallard
Post:
2324
9) Message boards : Number crunching : surrendered? (Message 1505)
Posted 10 Oct 2021 by swiftmallard
Post:
It won't, but it does allow us to see your system and tasks.
10) Message boards : Number crunching : surrendered? (Message 1503)
Posted 10 Oct 2021 by swiftmallard
Post:
Unhide your computer
11) Message boards : Number crunching : Host ID 1388 corrupted (Message 1452)
Posted 7 Aug 2021 by swiftmallard
Post:
You are right, I was thinking of Windows hosts which need VirtualBox. I am running a Windows 10 PC with VirtualBox and in most cases my host is faster than wingmen with Linux hosts even if they have CPUs much more powerful than my Intel i5. According to logic, it should be the other way around.
Tullio

I have observed the same on my system.
12) Message boards : Number crunching : Host ID 1388 corrupted (Message 1447)
Posted 3 Aug 2021 by swiftmallard
Post:
6671
13) Message boards : Number crunching : Never ending tasks (Message 1442)
Posted 25 Jun 2021 by swiftmallard
Post:
Sometimes my tasks are defined "unmanageable". I exit BOINC Manager killing the running tasks and restart BOINC Manager. They all restart.
Tullio

I get that message sometimes as well, but not nearly as often as when the project was new. Crunching on one fewer core didn't completely eliminate the problem but it helped a lot.
I am suspicious that the VirtualBox version has a role to play here. Since systems can vary widely, what helps for one may not help for others.
14) Message boards : Number crunching : Never ending tasks (Message 1440)
Posted 24 Jun 2021 by swiftmallard
Post:
Crunching this project requires both attentiveness and patience.
It is certainly not a set-it-and-forget-it operation.
15) Message boards : Number crunching : Host ID 1388 corrupted (Message 1433)
Posted 15 Jun 2021 by swiftmallard
Post:
7827
16) Message boards : Number crunching : Host ID 1388 corrupted (Message 1431)
Posted 11 Jun 2021 by swiftmallard
Post:
7656
17) Message boards : Number crunching : Never ending tasks (Message 1422)
Posted 23 May 2021 by swiftmallard
Post:
Check the Properties of the task in question. If the difference between the CPU time and the Elapsed time is more than a few minutes, that WU has stopped processing. It will proceed no further and should be aborted.
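
The check described above can be sketched in a few lines of Python. This is a rough illustration, not a BOINC feature: the function names are hypothetical, the 5-minute tolerance is my own stand-in for "a few minutes", the sample times are invented, and it assumes a stuck task is one whose elapsed time keeps growing while CPU time stays flat.

```python
def to_seconds(hms: str) -> int:
    """Convert an HH:MM:SS string (as shown in task Properties) to whole seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def looks_stalled(cpu_time: str, elapsed_time: str, tolerance_minutes: int = 5) -> bool:
    """Return True when elapsed time is ahead of CPU time by more than the tolerance.

    tolerance_minutes is an assumed value standing in for "a few minutes".
    """
    lag = to_seconds(elapsed_time) - to_seconds(cpu_time)
    return lag > tolerance_minutes * 60


# Hypothetical readings typed in by hand from the Properties window.
print(looks_stalled("03:40:00", "03:44:00"))  # elapsed is only 4 minutes ahead
print(looks_stalled("01:10:00", "26:00:00"))  # elapsed is far ahead of CPU time
```

The times still have to be read from the task Properties by hand; the sketch only does the comparison.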
18) Message boards : Number crunching : Host ID 1388 corrupted (Message 1328)
Posted 29 Jan 2021 by swiftmallard
Post:
4811
6072
19) Message boards : Number crunching : Never ending tasks (Message 1293)
Posted 27 Dec 2020 by swiftmallard
Post:
Yeah, that task should have been aborted long ago.
20) Message boards : Number crunching : Never ending tasks (Message 1291)
Posted 26 Dec 2020 by swiftmallard
Post:
Check the Properties of the task in question. If the difference between the CPU time and the Elapsed time is more than a few minutes, that WU has stopped processing. It will proceed no further and should be aborted.



©2024 Benoit DA MOTA - LERIA, University of Angers, France