Please Don't Abort WUs

GLeeM

Joined: 24 Mar 20
Posts: 2
Credit: 1,124,000
RAC: 0
Message 860 - Posted: 9 Jun 2020, 19:09:08 UTC

I have not had any WUs that were aborted by my wingman resent since mid-April (if ever).

Will these be "Deleted by Server" without points given? I saw someone had a lot of these in February.

Today, 9 June 2020, I see that one of my wingmen aborted over 500 WUs. I would have aborted mine too, but it is at 20 hours, so I will let it finish on the slim chance that I get points for it.

I took a break for about a month, but I still have 42 WUs at "Validation Pending", most of them from March and April.

Please resend WUs as soon as possible after they are aborted, error out, or are marked invalid.
Jim1348

Joined: 3 Oct 19
Posts: 153
Credit: 32,412,973
RAC: 0
Message 861 - Posted: 9 Jun 2020, 22:45:15 UTC - in response to Message 860.  

The main problem seems to be too many "in progress" work units. I have 11 valid and 73 pending.
Where are they? Lost in a black hole?

The project should shorten the deadline and limit the downloads. Some people are downloading way more than they can handle.
Maybe it is a BOINC scheduler problem, but the Admin should limit it somehow.
GLeeM

Joined: 24 Mar 20
Posts: 2
Credit: 1,124,000
RAC: 0
Message 862 - Posted: 10 Jun 2020, 3:53:06 UTC

Jim1348 - I looked at 16 of your pending WUs; your wingman aborted 12 of them. You might have gotten dozens of WUs with that wingman, sorry!!

I am going to finish the 2 I have left and come back after my pending WUs receive their points.
Jim1348

Joined: 3 Oct 19
Posts: 153
Credit: 32,412,973
RAC: 0
Message 863 - Posted: 10 Jun 2020, 7:01:33 UTC - in response to Message 862.  

GLeeM wrote:
Jim1348 - I looked at 16 of your pending WUs; your wingman aborted 12 of them. You might have gotten dozens of WUs with that wingman, sorry!!

I am going to finish the 2 I have left and come back after my pending WUs receive their points.

Thanks for checking. I will slug it out for a while longer, but will eventually have to do the same thing.
damienh

Joined: 5 Jan 20
Posts: 7
Credit: 100,435,425
RAC: 0
Message 864 - Posted: 10 Jun 2020, 7:57:53 UTC

Hmmmm. Usually it's good practice to abort tasks when moving between projects, so that the project knows that the task will not be completed and can re-allocate it. QuChem is the only project I know of that doesn't appear to support this.

I'll PM damotbe about it.
Luigi R.

Joined: 7 Nov 19
Posts: 31
Credit: 4,245,903
RAC: 0
Message 865 - Posted: 10 Jun 2020, 12:34:28 UTC

I don't believe it is a real problem. The project server will resend aborted/errored WUs at the end of this batch.

@damotbe could answer better than me.


Anyway

Luigi R. wrote:
Why has the server not sent them to a 3rd wingman yet after 10 days? I have the same problem.

damotbe wrote:
I don't know... It's the official code that manages this part.
https://quchempedia.univ-angers.fr/athome/forum_thread.php?id=85&postid=802#802
damotbe
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Help desk expert

Joined: 23 Jul 19
Posts: 289
Credit: 464,119,561
RAC: 0
Message 868 - Posted: 10 Jun 2020, 13:22:21 UTC - in response to Message 865.  

The batches take a long time to complete... I left the default behavior in place. I don't quite understand the current logic, and I can't change it efficiently.
damienh

Joined: 5 Jan 20
Posts: 7
Credit: 100,435,425
RAC: 0
Message 869 - Posted: 10 Jun 2020, 19:21:39 UTC

damotbe's PM said much the same as what Luigi mentioned: the aborted units are expected to be re-sent at the end of this batch. As such, I'd hope that you will receive credit for them at some point in the future ...
Luigi R.

Joined: 7 Nov 19
Posts: 31
Credit: 4,245,903
RAC: 0
Message 933 - Posted: 13 Jul 2020, 20:31:06 UTC
Last modified: 13 Jul 2020, 20:41:16 UTC

Finally, the server has started resending my old inconclusive workunits. :)

©2024 Benoit DA MOTA - LERIA, University of Angers, France