Calculations and time

Message boards : Number crunching : Calculations and time
Message board moderation

To post messages, you must log in.

AuthorMessage
Argento
Avatar

Send message
Joined: 4 Oct 19
Posts: 3
Credit: 17,546
RAC: 0
Message 142 - Posted: 16 Oct 2019, 12:33:28 UTC

Hi

I became interested in the project and its goals. Thus, I connected to my BOINC palette. During the WU processing, unclear messages appear in the status of "VM job unmanageable, restarting later", which have already been reported.
There is also a situation when the progress of calculations has been 100% for several hours and the status is still "processed".

Task manager shows the load on the processes of this project, only whether the process is not looped by some error and in fact does not perform correct calculations ...
One of such tasks has been flying for over 40 hours.

Hmm, puzzling.
--
Best regards
Sincelery
Argento
ID: 142 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 12 Oct 19
Posts: 5
Credit: 337,959
RAC: 0
Message 143 - Posted: 16 Oct 2019, 14:07:50 UTC - in response to Message 142.  

Hi

I became interested in the project and its goals. Thus, I connected to my BOINC palette. During the WU processing, unclear messages appear in the status of "VM job unmanageable, restarting later", which have already been reported.
There is also a situation when the progress of calculations has been 100% for several hours and the status is still "processed".

Task manager shows the load on the processes of this project, only whether the process is not looped by some error and in fact does not perform correct calculations ...
One of such tasks has been flying for over 40 hours.

Hmm, puzzling.


For me the 'unmanageable' was cured by running fewer tasks at one time. I also get the 100% and still processing ones too but mine have all finished eventually. Your pc's are hidden so I can't tell how many cpu cores you have or how much memory you have.
ID: 143 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Argento
Avatar

Send message
Joined: 4 Oct 19
Posts: 3
Credit: 17,546
RAC: 0
Message 144 - Posted: 16 Oct 2019, 14:41:02 UTC - in response to Message 143.  

Hi

I became interested in the project and its goals. Thus, I connected to my BOINC palette. During the WU processing, unclear messages appear in the status of "VM job unmanageable, restarting later", which have already been reported.
There is also a situation when the progress of calculations has been 100% for several hours and the status is still "processed".

Task manager shows the load on the processes of this project, only whether the process is not looped by some error and in fact does not perform correct calculations ...
One of such tasks has been flying for over 40 hours.

Hmm, puzzling.


For me the 'unmanageable' was cured by running fewer tasks at one time. I also get the 100% and still processing ones too but mine have all finished eventually. Your pc's are hidden so I can't tell how many cpu cores you have or how much memory you have.


Hidden...? I have not marked this option anywhere. The machine that counts the design is based on the i3 M350 2C / 4T and 8GB RAM. Memory allocated in BM allocated at 90%. The machine is under no load.
The number of active tasks is only three. At the moment, it does not count other projects.
ID: 144 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Werinbert

Send message
Joined: 15 Oct 19
Posts: 2
Credit: 100,073
RAC: 0
Message 146 - Posted: 16 Oct 2019, 15:32:17 UTC - in response to Message 144.  
Last modified: 16 Oct 2019, 15:39:01 UTC

Hidden...? I have not marked this option anywhere. The machine that counts the design is based on the i3 M350 2C / 4T and 8GB RAM. Memory allocated in BM allocated at 90%. The machine is under no load.
The number of active tasks is only three. At the moment, it does not count other projects.

Under "Preferences for this project" there is a check box to allow other people to see the computers that you have hooked up to the project. By default it is unchecked (at least for this project).
ID: 146 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Argento
Avatar

Send message
Joined: 4 Oct 19
Posts: 3
Credit: 17,546
RAC: 0
Message 148 - Posted: 16 Oct 2019, 15:40:42 UTC - in response to Message 146.  

Hidden...? I have not marked this option anywhere. The machine that counts the design is based on the i3 M350 2C / 4T and 8GB RAM. Memory allocated in BM allocated at 90%. The machine is under no load.
The number of active tasks is only three. At the moment, it does not count other projects.

Under "Preferences for this project" there is a check box to allow other people to see the computers that you have hooked up to the project. By default it is unchecked.


Of course. I looked but did not see ;). Corrected. Thank You.
ID: 148 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
swiftmallard
Avatar

Send message
Joined: 13 Oct 19
Posts: 87
Credit: 6,026,455
RAC: 0
Message 149 - Posted: 16 Oct 2019, 17:17:02 UTC

I have a 6 core machine with 12 GB memory but avoid "unmanageable" WUs by only crunching two at a time.
ID: 149 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
damotbe
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Help desk expert

Send message
Joined: 23 Jul 19
Posts: 289
Credit: 464,119,561
RAC: 0
Message 150 - Posted: 16 Oct 2019, 22:01:24 UTC - in response to Message 142.  

Hi

I became interested in the project and its goals. Thus, I connected to my BOINC palette. During the WU processing, unclear messages appear in the status of "VM job unmanageable, restarting later", which have already been reported.
There is also a situation when the progress of calculations has been 100% for several hours and the status is still "processed".

Task manager shows the load on the processes of this project, only whether the process is not looped by some error and in fact does not perform correct calculations ...
One of such tasks has been flying for over 40 hours.

Hmm, puzzling.


Hi, and thank you for your contribution.

It seems that sometimes VM (and boinc_wrapper) did weird things. That's not the only project impacted by these issues (VM job unmanageable and loop without doing anything).

For the runtime, runtime is unpredictable for our task and too much variability. But the good news is that all valid computations are valuable for us and help us a lot.
ID: 150 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
swiftmallard
Avatar

Send message
Joined: 13 Oct 19
Posts: 87
Credit: 6,026,455
RAC: 0
Message 151 - Posted: 16 Oct 2019, 23:54:20 UTC - in response to Message 150.  

For the runtime, runtime is unpredictable for our task and too much variability. But the good news is that all valid computations are valuable for us and help us a lot.

I've noticed that the runtime estimates have decreased by a factor of 12, from 24 days down to 2. That's makes it much easier for me to judge what is doable on my system.
ID: 151 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Sibe Bleuze

Send message
Joined: 31 Dec 19
Posts: 5
Credit: 53,633
RAC: 0
Message 434 - Posted: 14 Jan 2020, 16:28:03 UTC

I'm having the same kind of issue as stated above, a work unit that's 100% with a day and 16 hours on the clock.
It has been 100% (and 'time left' says '---') since it hit 19 hours. Did it hit an error or should I just wait?
It is this task (https://quchempedia.univ-angers.fr/athome/result.php?resultid=756198) if that's any help.
ID: 434 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
swiftmallard
Avatar

Send message
Joined: 13 Oct 19
Posts: 87
Credit: 6,026,455
RAC: 0
Message 435 - Posted: 14 Jan 2020, 17:41:43 UTC - in response to Message 434.  
Last modified: 14 Jan 2020, 17:44:46 UTC

Abort it, the result will be invalid anyway. With the od9 WUs, I abort anything that takes longer than 8 hours.
But be aware that when the next batch of WUs begins, there will be changes in how we need to look at this issue. Hopefully, it will not exist!
ID: 435 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Serg

Send message
Joined: 12 Jan 20
Posts: 12
Credit: 220,914
RAC: 0
Message 444 - Posted: 15 Jan 2020, 4:45:30 UTC - in response to Message 149.  

I have a 6 core machine with 12 GB memory but avoid "unmanageable" WUs by only crunching two at a time.

Same solution for my pc. 2 cores limitation and problem disappears.
4 physical cores total, 8gb ram.
ID: 444 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Calculations and time

©2024 Benoit DA MOTA - LERIA, University of Angers, France