Validate error.

Message boards : Number crunching : Validate error.
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
adrianxw
Avatar

Send message
Joined: 3 Oct 19
Posts: 33
Credit: 197,169
RAC: 0
Message 972 - Posted: 27 Jul 2020, 10:13:54 UTC

>>> Workunit 1379411

This work unit has been set "validate error" by all that have crunched it so far, after several days of CPU time each. Not happy about that.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 972 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
swiftmallard
Avatar

Send message
Joined: 13 Oct 19
Posts: 87
Credit: 6,026,455
RAC: 0
Message 976 - Posted: 27 Jul 2020, 14:02:50 UTC - in response to Message 972.  

>>> Workunit 1379411

This work unit has been set "validate error" by all that have crunched it so far, after several days of CPU time each. Not happy about that.

This happens to all of us, and nobody likes it. The molecule your task was working on was probably unstable and the validate error confirms that.
ID: 976 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Luigi R.

Send message
Joined: 7 Nov 19
Posts: 31
Credit: 4,245,903
RAC: 0
Message 985 - Posted: 28 Jul 2020, 8:11:14 UTC

It happened to me too.
https://quchempedia.univ-angers.fr/athome/workunit.php?wuid=1400377
Only once.
Probably they are isolated cases.
ID: 985 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
damotbe
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Help desk expert

Send message
Joined: 23 Jul 19
Posts: 289
Credit: 464,119,561
RAC: 0
Message 996 - Posted: 3 Aug 2020, 15:20:12 UTC - in response to Message 985.  

The validation is in two stages. Sometimes, in the first step (before comparison) an inconsistency can be detected.
ID: 996 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1128 - Posted: 4 Oct 2020, 13:43:25 UTC

All I see on today's tasks, both mine on Windows 10 and 4 or more wingmen on Linux end with validation errors.
Tullio
ID: 1128 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
HK-Steve

Send message
Joined: 24 Sep 20
Posts: 1
Credit: 1,080,000
RAC: 0
Message 1129 - Posted: 4 Oct 2020, 14:13:31 UTC

Just added a new linux rig, all 11x tasks errored out straight away. Strange things also happening with my other 2x linux rigs
ID: 1129 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1130 - Posted: 4 Oct 2020, 16:06:32 UTC

Got one validated task on my Windows PC. My Linux wingman used a 40 cores CPU against my 6, yet my PC was faster.
Tullio
ID: 1130 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
hsdecalc

Send message
Joined: 6 Oct 20
Posts: 1
Credit: 34,600
RAC: 0
Message 1133 - Posted: 6 Oct 2020, 17:39:15 UTC

Me too. Tasks 2923275, 2923231, 2923499 and a few more, all with "validation error".
NWChem v0.11 (vbox64_t1) windows_x86_64.
Logfile looks good.
ID: 1133 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[VENETO] boboviz

Send message
Joined: 13 Sep 19
Posts: 69
Credit: 399,347
RAC: 0
Message 1175 - Posted: 9 Nov 2020, 9:15:55 UTC

Still validation errors
4140131
4141252
etc...
ID: 1175 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1176 - Posted: 9 Nov 2020, 16:16:33 UTC

I see many "validation inconclusive".
Tullio
ID: 1176 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
reindl

Send message
Joined: 5 Jul 20
Posts: 1
Credit: 501,000
RAC: 0
Message 1177 - Posted: 11 Nov 2020, 16:24:11 UTC
Last modified: 11 Nov 2020, 16:25:41 UTC

I'm getting an increasing number of tasks with "Completed, can't validate". It's mainly affecting tasks with initial replication >=5 quorum 2, like these two. Is this perhaps a misconfiguration issue?
ID: 1177 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 3 Oct 19
Posts: 153
Credit: 32,412,973
RAC: 0
Message 1178 - Posted: 11 Nov 2020, 16:37:21 UTC

I thought a Ryzen 3900X might be better than the Intel i7-9700 I was using, in order to reduce the invalids and validation inconclusives, but it did not make much difference.
I will wait until they figure out what the problem is before jumping in again. Good luck.
ID: 1178 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1179 - Posted: 11 Nov 2020, 17:41:43 UTC

Waiting for validation 40. Inconclusive validation 115.
Tullio
ID: 1179 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[VENETO] boboviz

Send message
Joined: 13 Sep 19
Posts: 69
Credit: 399,347
RAC: 0
Message 1180 - Posted: 12 Nov 2020, 15:32:38 UTC - in response to Message 1179.  

Waiting for validation 40. Inconclusive validation 115.
Tullio


We know that the app is in beta (see here: https://quchempedia.univ-angers.fr/athome/apps.php), but some feedback from admins will be great
Are they working on the problem? Are they working on a new app??
ID: 1180 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matthias Lehmkuhl

Send message
Joined: 12 Oct 20
Posts: 9
Credit: 1,502,000
RAC: 0
Message 1181 - Posted: 13 Nov 2020, 19:44:20 UTC

One additional point
The QuChem Windows Application has a problem with other vbox virtual machines running on the same host
As long as I don't have my Ubuntu Linux running in VBox on my computer the QuChem Windows App works fine
When I have the Ubuntu or Debian Linux running in VBox the QuChem Windows App stopps working with this error
QuChemPedIA@home 13.11.2020 20:04:31 Task cl9_athome_b3lyp-321gd,batch013,000139370,nwchem,1595083754_3 postponed for 86400 seconds: VM job unmanageable, restarting later.

This issue I don't see with a LHC Theory App e.g. "Theory_2390-********-57_1 using Theory version 30006 (vbox64_theory) in slot 5" and other vbox related Windows Apps and a running Ubuntu or Debian Linux VBox machine
So it looks for me like this is a QuChem related issue
And for me it looks like this is one reason for the many Validate error for the QuChem calculations (the QuChem vbox machine is stopped hard without writing a savestate file or using a normal shutdown via acpipowerbutton)
Without an other running vbox machine I get valid results, but with an other vbox machine the results are invalid
Matthias
ID: 1181 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1182 - Posted: 14 Nov 2020, 9:09:16 UTC - in response to Message 1181.  
Last modified: 14 Nov 2020, 9:10:13 UTC

I have noticed the same thing when running also LHC@home tasks alongside QuChem tasks on my Windows 10 PC with 6 processors. Usually it runs two QUChem tasks, the others waiting for memory. So I think it is a memory problem, I have 12 GB RAM on that PC. Maybe I should try running QuChem tasks on another PC which has 24 GB RAM but it is dedicated to running WCG and Rosetta@home tasks on Sars-CoVid'-2.
Tullio
ID: 1182 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Matthias Lehmkuhl

Send message
Joined: 12 Oct 20
Posts: 9
Credit: 1,502,000
RAC: 0
Message 1183 - Posted: 14 Nov 2020, 9:58:21 UTC

on my computers I don't see the issue with not enough memory
My Computers have 16 GB RAM and there are always some GB RAM free
But someone mentioned in the forum that the required free RAM of 2 GB is much to high for the QuChem App

I too didn't see any RAM usage of this hight, Peak working set size is less 250 MB, but Peak swap size is up to 2 GB for my finished results in the last 20 results of my task list

@Tullio
And Boinc only uses a defined % (for me I found 80% configured in boinc is a good solution) of the installed RAM to have enough resources for the OS and other apps I use
Matthias
ID: 1183 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1184 - Posted: 14 Nov 2020, 14:59:03 UTC

I had problems in Einstein@home when I used a GTX !060 Video board with 3 GB RAM. They scolded me and I had to buy another PC with a GTX 1650 which has 4 GB Video RAM. But QuChem does not use the GPU , so I can run Einstein GPU tasks along QuChem tasks.
Tullio
ID: 1184 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[AF>France>Est>Alsace]PFLIEGER...

Send message
Joined: 16 Nov 20
Posts: 21
Credit: 3,661,600
RAC: 0
Message 1208 - Posted: 24 Nov 2020, 20:42:17 UTC

2020-11-24 04:42:26 (10128): Creating new snapshot for VM.
2020-11-24 04:42:32 (10128): Deleting stale snapshot.
2020-11-24 04:42:32 (10128): Checkpoint completed.
2020-11-24 04:51:56 (10128): Status Report: Trickle-Up Event[
2020-11-24 04:52:28 (10128): Creating new snapshot for VM.
2020-11-24 04:52:34 (10128): Deleting stale snapshot.

You see Trickle Up Event in side of 7 tasks at minima from mines
Inside of our universe we go to low energy system
I can not undestand inside of proteins an energy increasing system
May be a program bug or i can not understand because i have too few information
ID: 1208 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
damotbe
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Help desk expert

Send message
Joined: 23 Jul 19
Posts: 289
Credit: 464,119,561
RAC: 0
Message 1209 - Posted: 25 Nov 2020, 7:32:44 UTC - in response to Message 1208.  

these events are pure system messages. It is not related to molecules. "Trickle-Up Event" are messages send to the server.
ID: 1209 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Validate error.

©2024 Benoit DA MOTA - LERIA, University of Angers, France