gLite 3.1

glite-LB - Update to version 3.1.8-0


Date 25.02.2009
Priority Normal

Description



Logging and Bookkeeping (LB) update to support the new functionality introduced by the latest WMS update.
Full release notes:
https://twiki.cnaf.infn.it/cgi-bin/twiki/view/EgeeJra1It/ReleaseNotes1841
  • Added a recovery procedure for the WM. This feature is enabled by setting EnableRecovery = true in the WorkloadManager section of the configuration file (https://twiki.cnaf.infn.it/cgi-bin/twiki/view/EgeeJra1It/WMSConfFile). Upon restart, old requests are reconsidered and the LB is queried to determine exactly where to resume their processing, so that no operation is performed more than once.
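
As an illustration of the option named above, the fragment below sketches how the setting might appear in the WMS configuration file. It assumes the file uses the ClassAd-style syntax common to gLite WMS configuration; the surrounding layout and any neighbouring attributes are assumptions, and only the section name WorkloadManager and the attribute EnableRecovery come from this release note.

```
[
    WorkloadManager = [
        // Other WorkloadManager attributes are omitted here.
        EnableRecovery = true;
    ];
]
```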


Please also have a look at the list of known issues.

This update fixes various bugs. The full list is given below.

Fixed bugs

Number Description
 #13494 ARC job submitter.
 #16308 A subscription update is not working (doesn't send the new correct expiration time)
 #21909 /etc/cron.d/glite-wms-check-daemons.cron needs to redirect stderr to /dev/null
 #23443 Documentation out of date
 #24690 Multiple retrieval of job output - Unable to perform job purge
 #26885 Job wrongly kept in ICE cache with status UNKNOWN
 #27215 WM to set the maximum output sandbox size
 #27797 Mixed int and string in Parameters attribute generates wrong jdl
 #27899 VO override does not work with JdlDefaultAttributes
 #28235 Previously used CEs are not considered at all in the resubmission
 #28249 The ICE's command line dumpICECache opens the ICE's database in ReadWrite mode that is wrong (should be readonly)
 #28498 org.glite.wms-utils.classad contains a non-portable module
 #28637 Delegation IDs not found when CREAM persistent storage is cleared
 #28642 User environment breaks WMS wrapper
 #28657 Unexpected exception thrown by ICE
 #29182 The purger under some particular circumstance segfaults
 #29538 ICE doesn't catch an exception raised by a voms function
 #30308 created .mpi file in MPICH job wrapper causes jobs to fail
 #30518 glite-wms-wm crashes during resubmission
 #30816 A collection with pending jobs can be processed multiple times
 #30896 WMS must limit number of files per sandbox
 #30900 MinPerusalTimeInterval default is too low
 #31006 more signals from the Batch System (especially LSF) to be trapped by the jobwrapper
 #31026 Jobwrapper: redirection to /dev/null for both stdout and stderr should be carefully avoided when applicable
 #31278 WMS should prevent non-SDJ jobs from being scheduled on SDJ CEs
 #32078 Problem with GangMatching statement involving GlueSEStatus
 #32345 WMProxy forward request to WM when dirmanager sigseg
 #32366 glite-WMS does not yet support worker node monitoring
 #32528 The BDII information purchasing sometimes times out
 #32962 FQAN comparator does not work properly
 #32980 Maradona file should be removed at resubmission
 #33026 "no compatible resources" problem on SL4 WMS
 #33140 boost::timer overflows too quickly
 #33378 The WM startup script should create the jobdir input directory if required
 #34508 Any collection submitted while the WMS is down is not recovered upon WM startup
 #34510 When a collection is aborted the "Abort" event should be logged for the sub-nodes as well
 #35156 glite-wms-purgeStorage.sh hardcodes proxy file name
 #35250 DAG: glite_wms_wmproxy_dirmanager does not extract links from tar.gz
 #35544 org.glite.wms-utils.jobid fails build because of gcc-4 strictness
 #35878 org.glite.wms.common extra qualification relates to gcc-4 strictness
 #36145 Jobdir support to be enabled in the glite-wms-planner
 #36341 Possible bug in ICE when exiting for suicidal patch. Db can return an empty string into JobCacheIterator::refresh()
 #36466 gethostname is called repeatedly and often and could cause troubles
 #36496 WMProxy Server: any-user does not work
 #36536 The glite wms purge storage library should rely on LBProxy while logging CLEAR events.
 #36551 any exception raised while reading from the input will cause the WM to exit
 #36558 WMProxy Server: should log user id on syslog
 #36870 glite-wms-brokerinfo-access files RPM build, spec file using deprecated Copyright
 #36876 A method of creamJob can return an empty string for the most long-lived user proxy. This can cause a failure in an LB method
 #36902 Cron job to renew host-proxy
 #36907 Incomplete error message reported by ICE when lease creation fails
 #36913 utility to convert filelist to jobdir
 #36962 ICE fails to build with the new WMS Purger interface
 #37659 ICE uses an ENDLINE line terminator for log4cpp calls that is not portable to the new log4cpp 1.0 version. Must be removed.
 #37674 Pointer returned by edg_wll_GetSequenceCode() is not checked for non-nullness
 #37756 ICE should not resubmit jobs which have been killed by CREAM due to expiring proxy
 #37862 Wrong default value for the GLITE_LOCATION variable in glite-wms-ice script
 #37916 There's an unused and useless method in an ICE class
 #38359 some issues in the limit for the output sandbox in the WMS jobwrapper
 #38366 Recovery doesn't work with a list-match request
 #38509 The WM's recovery procedure hangs if no relevant events are found for a given request
 #38739 WMProxy Server: doesn't allow exec if there's only user DN in gacl file
 #38816 Suicidal patch bug
 #38828 A suicidal patch related issue
 #39215 The purger needs some refinements
 #39217 JDL API C++: Parametric jobs are not well formed
 #39501 Wrong message logged by ICE when job proxy files disappear
 #39641 User proxy mixup for job submissions too close in time
 #39903 Fermilab proxy cannot submit to WMS SL4, they are ok with SL3
 #40967 Problems in script glite_wms_wmproxy_load_monitor
 #42587 Error processing DAG dependencies while generating the ISB for final node
 #42590 The WM terminates unexpectedly when handling a cancel request
 #44761 WM: segmentation fault during recovery
 #44762 WM: segmentation fault while processing remnants of aborted collections
 #44763 When a collection is aborted the "Abort" event should be logged for waiting, submitted or done-failed sub-nodes
 #45391 submit requests for pending collections are deleted by the recovery on wm exit

Updated rpms

Name Version Full RPM name Description
glite-LB 3.1.8-0 glite-LB-3.1.8-0.i386.rpm gLite metapackage (glite-LB)
glite-wms-utils-exception 3.1.3-2.slc4 glite-wms-utils-exception-3.1.3-2.slc4.i386.rpm org.glite.wms-utils.exception v. 3.1.3
glite-wms-utils-jobid 3.1.5-1.slc4 glite-wms-utils-jobid-3.1.5-1.slc4.i386.rpm org.glite.wms-utils.jobid v. 3.1.5

The RPMs can be updated using yum.
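
The exact yum command is not reproduced on this page. As a minimal sketch, assuming the gLite 3.1 yum repository is already configured on the host and that updating by package name is sufficient, the command could be assembled from the package names in the table above. The helper name build_update_cmd is hypothetical, and the script only prints the command so that it can be reviewed before being run.

```shell
#!/bin/sh
# Package names taken from the "Updated rpms" table above.
PKGS="glite-LB glite-wms-utils-exception glite-wms-utils-jobid"

# Hypothetical helper: print the update command instead of running it,
# so that it can be reviewed before being executed as root.
build_update_cmd() {
    echo "yum update $PKGS"
}

build_update_cmd
```

To apply the update, run the printed command as root.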

Service reconfiguration after update

Not needed.

Service restart after update

Not needed.

How to apply the fix

  1. Update the RPMs (see above)
  2. Update configuration (see above)
  3. Restart the service if necessary (see above)