Work request errors

Message boards : Number crunching : Work request errors


Profile m4rtyn
Joined: 1 May 09
Posts: 18
Credit: 6,545
RAC: 0
Message 532 - Posted: 5 Jun 2009, 10:14:48 UTC
Last modified: 5 Jun 2009, 10:25:37 UTC

Since attaching to DD@home I've started getting the following errors:

05/06/2009 11:03:49|DrugDiscovery|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 0 completed tasks
05/06/2009 11:03:54|DrugDiscovery|Scheduler request succeeded: got 0 new tasks
05/06/2009 11:04:20||[error] Proposed work request 1497676.363229 bigger than max 349057.745280
05/06/2009 11:04:24|Milkyway@home|Sending scheduler request: Requested by user. Requesting 349058 seconds of work, reporting 0 completed tasks
05/06/2009 11:04:29|Milkyway@home|Scheduler request succeeded: got 0 new tasks
05/06/2009 11:04:29|Milkyway@home|Message from server: Not sending work - last request too recent: 46 sec
05/06/2009 11:04:49||[error] Proposed work request 1497962.999277 bigger than max 349057.745280
05/06/2009 11:04:49|Hydrogen@Home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 0 completed tasks
05/06/2009 11:04:54|Hydrogen@Home|Scheduler request succeeded: got 0 new tasks
05/06/2009 11:04:54||[error] Proposed work request 1498000.761948 bigger than max 349057.745280
05/06/2009 11:04:59|Milkyway@home|Sending scheduler request: Requested by user. Requesting 349058 seconds of work, reporting 0 completed tasks
05/06/2009 11:05:01||[error] Proposed work request 1498070.256351 bigger than max 349057.745280
05/06/2009 11:05:04|Milkyway@home|Scheduler request succeeded: got 0 new tasks
05/06/2009 11:05:04|Milkyway@home|Message from server: Not sending work - last request too recent: 35 sec

The errors seem to affect all projects, and the only way to prevent them is to detach from DD@home, but they come back as soon as I reattach.
Also, uploads have become troublesome again:

05/06/2009 11:09:47|DrugDiscovery|Started upload of run_1244161897123672440_2_0
05/06/2009 11:09:52|DrugDiscovery|[error] Error on file upload: Server is out of disk space
05/06/2009 11:09:52|DrugDiscovery|Temporarily failed upload of run_1244161897123672440_2_0: transient upload error
05/06/2009 11:09:52|DrugDiscovery|Backing off 1 hr 35 min 32 sec on upload of run_1244161897123672440_2_0
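
For anyone wanting to see how far over the limit the work requests quoted earlier in this post are, here is a minimal sketch that picks the "Proposed work request" errors out of the client messages. It assumes the `timestamp|project|message` layout seen in the lines above; `parse_line` and `WorkRequestError` are hypothetical names made up for this example and are not part of BOINC.

```python
import re
from typing import NamedTuple, Optional


class WorkRequestError(NamedTuple):
    timestamp: str
    requested: float  # seconds of work the client proposed
    maximum: float    # maximum quoted in the same message


# Matches the error text exactly as it appears in the quoted log lines.
PATTERN = re.compile(
    r"Proposed work request (?P<req>[\d.]+) bigger than max (?P<max>[\d.]+)"
)


def parse_line(line: str) -> Optional[WorkRequestError]:
    """Return the parsed error, or None if the line is some other message."""
    # Assumed layout: timestamp|project|message (project is empty for these errors).
    parts = line.strip().split("|", 2)
    if len(parts) != 3:
        return None
    timestamp, _project, message = parts
    match = PATTERN.search(message)
    if match is None:
        return None
    return WorkRequestError(
        timestamp, float(match.group("req")), float(match.group("max"))
    )


sample = [
    "05/06/2009 11:03:54|DrugDiscovery|Scheduler request succeeded: got 0 new tasks",
    "05/06/2009 11:04:20||[error] Proposed work request 1497676.363229 bigger than max 349057.745280",
    "05/06/2009 11:04:49||[error] Proposed work request 1497962.999277 bigger than max 349057.745280",
]

errors = [e for e in map(parse_line, sample) if e is not None]
for e in errors:
    print(f"{e.timestamp}: requested {e.requested / e.maximum:.2f}x the maximum")
```

In the lines quoted here the proposed request is a little over four times the maximum each time, and the maximum itself never changes.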
m4rtyn
******************************* ******************************

Profile Jack Shultz
Joined: 10 Apr 09
Posts: 503
Credit: 120,150
RAC: 0
Message 535 - Posted: 5 Jun 2009, 10:35:35 UTC - in response to Message 532.  

Apparently these workunits require a little more space than you have allocated. I am still trying to find ways to reduce the footprint. We are running mdrun 3 times in these batches; breaking these up into separate runs would be one way to reduce the disk consumption.

Here is a look at what is going on under the hood, if you will.
Note that the files enclosed in pound symbols (#) are backups created if the program is stopped manually or encounters trouble computing something. There are also a couple of binaries that we may not need. Look at the trajectory files, trr and xtc. These are generating the bulk of our data.

 Volume in drive C has no label.
 Volume Serial Number is 4676-77D7

 Directory of c:\ProgramData\BOINC\slots\0

06/05/2009  06:14 AM    <DIR>          .
06/05/2009  06:14 AM    <DIR>          ..
06/04/2009  08:32 PM             9,825 #mdout.mdp.1#
06/04/2009  08:34 PM             9,904 #mdout.mdp.2#
06/04/2009  08:55 PM             4,568 #run.edr.1#
06/04/2009  10:39 PM           144,064 #run.edr.2#
06/04/2009  08:54 PM            10,513 #run.log.1#
06/04/2009  10:39 PM            68,212 #run.log.2#
06/04/2009  08:55 PM         1,686,740 #run.trr.1#
06/04/2009  10:39 PM        18,554,140 #run.trr.2#
06/04/2009  08:55 PM           380,416 #run.xtc.1#
06/04/2009  10:39 PM        13,816,564 #run.xtc.2#
06/04/2009  08:32 PM         1,127,906 #topol.top.1#
06/04/2009  08:32 PM         1,280,000 7za.exe
03/26/2009  11:17 AM           469,504 bash.exe
06/04/2009  08:32 PM                 0 boinc_lockfile
06/04/2009  08:32 PM           168,522 box.gro
06/04/2009  08:52 PM                14 checkpoint.txt
06/04/2009  08:32 PM           168,462 conf.gro
05/07/2009  07:31 PM           144,896 cp.exe
06/04/2009  08:32 PM           591,360 cygblas.dll
11/19/2008  06:32 AM           800,768 cygfftw3-3.dll
11/09/2008  09:36 PM         1,000,960 cygiconv-2.dll
12/31/2008  05:12 AM            31,744 cygintl-8.dll
03/01/2009  01:37 AM           242,688 cygncurses-8.dll
11/29/2008  11:31 AM           158,208 cygreadline6.dll
06/04/2009  08:32 PM         1,873,811 cygwin1.dll
06/04/2009  08:32 PM           157,076 eiwit.pdb
06/04/2009  08:34 PM           180,352 em.edr
06/04/2009  08:34 PM         1,581,279 em.gro
06/04/2009  08:34 PM            87,844 em.log
06/04/2009  08:32 PM               155 em.mdp
06/04/2009  08:32 PM         2,599,776 em.tpr
06/04/2009  08:34 PM         1,686,904 em.trr
05/07/2009  11:41 AM    <DIR>          gromacs
06/04/2009  08:32 PM        10,402,767 gromacs.zip
06/04/2009  08:32 PM               104 gromacs_0.007_windows_intelx86.exe
06/05/2009  06:14 AM             4,201 init_data.xml
06/04/2009  08:32 PM             2,224 job.xml
06/04/2009  08:52 PM             9,869 mdout.mdp
06/04/2009  08:32 PM            59,545 posre.itp
06/04/2009  08:52 PM         1,688,108 pr.cpt
06/04/2009  08:52 PM            40,752 pr.edr
06/04/2009  08:52 PM         2,424,567 pr.gro
06/04/2009  08:52 PM            30,115 pr.log
06/04/2009  08:32 PM               327 pr.mdp
06/04/2009  08:34 PM         3,036,680 pr.tpr
06/04/2009  08:52 PM        43,855,240 pr.trr
06/04/2009  08:49 PM         1,688,108 pr_prev.cpt
03/27/2009  02:08 PM            20,873 README.TXT
05/07/2009  07:59 PM           125,952 rm.exe
06/04/2009  10:28 PM         1,687,856 run.cpt
06/05/2009  06:14 AM             9,832 run.edr
06/05/2009  06:15 AM            10,496 run.log
06/04/2009  08:32 PM               286 run.mdp
06/04/2009  08:52 PM         2,740,376 run.tpr
06/05/2009  06:14 AM         5,060,220 run.trr
06/05/2009  06:14 AM         2,789,464 run.xtc
06/04/2009  10:13 PM         1,687,856 run_prev.cpt
06/04/2009  08:32 PM         1,581,297 solvated.gro
06/05/2009  06:14 AM            80,824 stderr.txt
06/04/2009  08:32 PM         1,932,675 step15b.pdb
06/04/2009  08:32 PM         1,932,686 step15c.pdb
06/04/2009  08:32 PM         1,127,937 topol.top
06/04/2009  08:32 PM               102 unzip.exe
06/04/2009  08:32 PM               100 zip.exe
              63 File(s)    133,068,614 bytes
               3 Dir(s)  252,894,363,648 bytes free
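
To put a number on how much of the slot directory the trajectory files take up, the listing above can be totalled by file extension. This is only a rough sketch: it assumes the Windows `dir` row layout shown above, treats `#name.ext.N#` as a backup of `name.ext`, and `size_by_extension` is a hypothetical helper written for this example.

```python
import re
from collections import defaultdict

# date, time, AM/PM, size with thousands separators, file name
ROW = re.compile(r"^\d{2}/\d{2}/\d{4}\s+\d{2}:\d{2} [AP]M\s+([\d,]+)\s+(.+)$")


def size_by_extension(listing: str) -> dict:
    """Sum file sizes per extension from pasted `dir` output."""
    totals = defaultdict(int)
    for line in listing.splitlines():
        match = ROW.match(line.strip())
        if match is None:  # skips <DIR> rows, headers and the summary lines
            continue
        size = int(match.group(1).replace(",", ""))
        name = match.group(2)
        if name.startswith("#") and name.endswith("#"):
            # "#run.trr.2#" -> "run.trr": drop the markers and backup counter
            name = re.sub(r"\.\d+$", "", name.strip("#"))
        ext = name.rsplit(".", 1)[-1] if "." in name else "(none)"
        totals[ext] += size
    return dict(totals)


# A few rows copied from the listing above.
SAMPLE = """
06/05/2009  06:14 AM    <DIR>          .
06/04/2009  10:39 PM        18,554,140 #run.trr.2#
06/04/2009  10:39 PM        13,816,564 #run.xtc.2#
06/04/2009  08:32 PM        10,402,767 gromacs.zip
06/04/2009  08:52 PM        43,855,240 pr.trr
06/05/2009  06:14 AM         5,060,220 run.trr
06/05/2009  06:14 AM         2,789,464 run.xtc
"""

totals = size_by_extension(SAMPLE)
for ext, size in sorted(totals.items(), key=lambda item: -item[1]):
    print(f"{ext:>6}  {size:>12,}")
```

Adding up every .trr and .xtc row in the full listing by hand gives roughly 88 MB of the 133,068,614 bytes reported, which fits the point that the trajectory files make up the bulk of the data.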
