how to solve problems related to virtualization

Questions and Answers : Windows : how to solve problems related to virtualization
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1552 - Posted: 11 Nov 2021, 8:53:39 UTC

All my invalids are very short lived tasks and as damotbe said in another post they correspond to bad molecules.
Tullio
ID: 1552 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PDW

Send message
Joined: 3 Oct 19
Posts: 43
Credit: 40,548,179
RAC: 0
Message 1553 - Posted: 11 Nov 2021, 9:07:34 UTC - in response to Message 1550.  

bonjour j'utilise windows 10 21h1 cpu i7 9750h, 32gigas ram et oracle vo 6.128
j ai utilisé la 6.122 cette nuit et le probleme est identique.
merci a tous pour votre aide

It is a different problem, but now you are not having many errors for "VERR_NEM_INIT_FAILED (VERR_NEM_VM_CREATE_FAILED)"

Some tasks have worked and validated so you have all that is necessary for it to work.
How many are you trying to run at the same time ?
I would stop new tasks from running, once those running have failed or completed I would then go into VirtualBox and remove any orphaned VMs to clear the environment.
For good measure I would reboot.
Then I would allow just a single task to run and see what happens, there are many ways to do this, if you don't know ask.
If that still fails what is the error message in the Stderr ouput ?

Running VMs is hard work for the disk io especially if you are spinning up quite a few of them at the same time, even on an SSD. If the computer has a high CPU usage this can also cause problems for VMs.
ID: 1553 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pascal768

Send message
Joined: 28 Aug 20
Posts: 7
Credit: 639,000
RAC: 0
Message 1554 - Posted: 11 Nov 2021, 9:26:34 UTC - in response to Message 1553.  

ok merci
j ai réglé quchem en taches illimitées mais avec 2 cpu.
j ai un pc avec un i9 10900 et je n'ai pas ce problème et l'installation Windows est identique.
on va attendre et on verra comme ça pour quchem.
j'ai déja nettoyé les machines oracle invalides.
ID: 1554 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PDW

Send message
Joined: 3 Oct 19
Posts: 43
Credit: 40,548,179
RAC: 0
Message 1555 - Posted: 11 Nov 2021, 9:27:27 UTC - in response to Message 1552.  

All my invalids are very short lived tasks and as damotbe said in another post they correspond to bad molecules.
Tullio

I don't get any short lived tasks.
I am running native linux but if it was a bad molecule wouldn't I get them too ?

If they are bad molecules why can others hosts get valid results with them ?
For example this task you ran: https://quchempedia.univ-angers.fr/athome/workunit.php?wuid=2954031
You got a Validate error in less than a minute, initially it said "ERROR: Vboxwrapper lost communication with VirtualBox, rescheduling task for a later time" and rescheduled it to start again. It re-started about an hour later but as soon as it started the VM it shut it down. It does not look like any actual work was done to a molecule, good or bad.
Two other hosts successfully ran that task to completion and got credit.
The two other failures (on native linux) are due to bad hosts, I reported 8613 previously but it is still churning out bad results, 9415 I didn't report for blacklisting as it wasn't getting new work.

I would say the bad molecule tasks are those that run to completion by multiple hosts but get a status of "Completed, validation inconclusive" and never validate, eventually having too many attempts.

Your short lived tasks look like your host/environment can't cope with what it is being asked to do. Some get to work but more of them fail to start properly.
ID: 1555 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1556 - Posted: 11 Nov 2021, 9:44:49 UTC

I am number 30 in the RAC ranking list. Before the 30 September stop due to an expired security certificate I was even better and I am now climbing back again. Thie means my results are good even if my CPU, an Intel i5 9400f is vastly inferior to the majority of Linux hosts.
Tullio
ID: 1556 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
pascal768

Send message
Joined: 28 Aug 20
Posts: 7
Credit: 639,000
RAC: 0
Message 1557 - Posted: 11 Nov 2021, 9:46:20 UTC

si j'installais quchem avec boinc sous linux avec une machine virtuelle sous linux mint.
quchem fonctionne en natif sous linux.
je mets 16 gigas de ram taches illimitées cpu illimités
qu 'en pensez vous.
merci
ID: 1557 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1558 - Posted: 11 Nov 2021, 10:00:37 UTC
Last modified: 11 Nov 2021, 10:01:10 UTC

i have a Linux Virtual Machine on a Windows 10 host running SuSE Tumbleweed, which is a development version with kernel 5.14.14. But it is updated very frequently and I have to reboot it, so I am not running any project using VirftualBox on it, only Einstein@home.
Tullio
ID: 1558 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PDW

Send message
Joined: 3 Oct 19
Posts: 43
Credit: 40,548,179
RAC: 0
Message 1559 - Posted: 11 Nov 2021, 10:03:50 UTC - in response to Message 1556.  

I am number 30 in the RAC ranking list. Before the 30 September stop due to an expired security certificate I was even better and I am now climbing back again. Thie means my results are good even if my CPU, an Intel i5 9400f is vastly inferior to the majority of Linux hosts.
Tullio

If you were the only person in a race you would finish first.
Does not make you a good runner.

There is not a lot of work being done here.
You have the hosts that should be blacklisted but aren't, producing thousands of invalid results.
You have those running VBox and producing many short running invalid results, difficult to quantify how many of these, but a lot.
You have the bad molecules that run for a long time that have to run 8 times but get no credit.

Chances of getting good value for the time and cost of running this project are low.
I'd like to get to 50M but currently babysitting a single host running a few tasks now and then to reduce wasted time/effort.
ID: 1559 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PDW

Send message
Joined: 3 Oct 19
Posts: 43
Credit: 40,548,179
RAC: 0
Message 1560 - Posted: 11 Nov 2021, 10:07:22 UTC - in response to Message 1557.  

si j'installais quchem avec boinc sous linux avec une machine virtuelle sous linux mint.
quchem fonctionne en natif sous linux.
je mets 16 gigas de ram taches illimitées cpu illimités
qu 'en pensez vous.
merci

I think the translation is telling me you want to run unlimited cores in the VM ?
You need to leave some spare CPU capacity for the OS and running the VM.

I haven't made note of how much memory tasks are using natively but if you only run say 10 cores in the VM then 16 Gb would be enough RAM.
ID: 1560 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Tullio

Send message
Joined: 5 Sep 20
Posts: 103
Credit: 2,142,600
RAC: 0
Message 1561 - Posted: 11 Nov 2021, 10:19:59 UTC

I am runing 6 BOINC projects on 4 different CPUs,3 using Windows 10 and 11, one Linux and a Linux Virtual Machinwe. I am an old UNIX user in my professional life, and I was surprised to see that on QuChem most, if not all, Windows tasks using VirtualBox are faster than their Linux wingmen. So I stuck to Windows on ths project.
Tullio
ID: 1561 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Previous · 1 · 2

Questions and Answers : Windows : how to solve problems related to virtualization

©2024 Benoit DA MOTA - LERIA, University of Angers, France