Using and Administering
LoadLeveler stores the accounting information that it
collects in a file called history in the spool directory of the
machine that initially scheduled this job, the schedd machine. Data on
parallel jobs are also stored in the history files.
Resource information collected on the LoadLeveler job is constrained by the
capabilities of the wait3 system call. Information for processes which
fork child processes will include data for those child processes as long as
the parent process waits for the child process to terminate. Complete
data may not be collected for jobs which are not composed of simple
parent/child processes. For example, if you have a LoadLeveler job
which invokes an rsh command to execute a function on another machine, the
resources consumed on the other machine will not be collected as part of the
LoadLeveler accounting data.
LoadLeveler accounting uses the following types of files:
- The local history file which is local to each schedd machine is where job
resource information is first recorded. These files are usually named
history and are located in the spool directory of each schedd
machine, but you may specify an alternate name with the HISTORY
keyword in either the global or local configuration file. For more
information, refer to the Step 9: Define Job Accounting.
- The global history file is a combination of the history files from some or
all of the machines in the LoadLeveler cluster merged together. The
command llacctmrg is used to collect files together into a global
file. As the files are collected from each machine, the local history
file for that machine is reset to contain no data. The file is named
globalhist.YYYYMMDDHHmm. You may specify the
directory in which to place the file when you invoke the llacctmrg
command or you can specify the directory with the GLOBAL_HISTORY
keyword in the configuration file. The default value set up in the
sample configuration file is the local spool directory:
- GLOBAL_HISTORY = $(SPOOL) (optional)
-
[ Top of Page | Previous Page | Next Page | Table of Contents | Index ]