Your organization may have a policy of charging users or groups of users for the amount of resources that their jobs consume. You can do this using LoadLeveler's accounting feature. Using this feature, you can produce accounting reports that contain job resource information for completed serial and parallel jobs. You can also view job resource information on jobs that are continuing to run.
Information on completed serial and parallel jobs is gathered using the UNIX wait3 system call. Information on non-completed serial and parallel jobs is gathered in a platform-dependent manner by examining data from the UNIX process.
Accounting information on a completed serial job is determined by accumulating resources consumed by that job on the machine(s) that ran the job. Similarly, accounting information on completed parallel jobs is gathered by accumulating resources used on all of the nodes that ran the job.
You can also view resource consumption information on serial and parallel jobs that are still running by specifying the -x option of the llq command. In order to enable llq -x, you must turn on detailed accounting (the ACCT keyword with the A_DETAIL flag) in the configuration file, as described below.
LoadLeveler can collect job resource usage information for every machine on which a job may run. A job may run on more than one machine because it is a parallel job or because the job is vacated from one machine and rescheduled to another machine.
To enable recording of resources by machine, you need to specify ACCT = A_ON A_DETAIL in the configuration file.
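For example, adding the following line to the configuration file turns accounting on and requests detailed, per-machine recording:

ACCT = A_ON A_DETAIL

After the daemons are reconfigured (for example, with llctl -g reconfig), llq -x reports the resources that still-running jobs have consumed so far.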
The machine's speed is part of the data collected. With this information, an installation can develop a chargeback program that charges more or less for the resources a job consumes on different machines. For more information on a machine's speed, refer to the machine stanza information. See "Step 1: Specify Machine Stanzas".
In addition to collecting job resource information based upon machines used, you can gather this information based upon an event or time that you specify. For example, you may want to collect accounting information at the end of every work shift or at the end of every week or month. To collect accounting information on all machines in this manner, use the llctl command with the capture parameter:
llctl -g capture eventname
eventname is any string of continuous characters (no white space is allowed) that defines the event about which you are collecting accounting data. For example, if you were collecting accounting data on the graveyard work shift, your command could be:
llctl -g capture graveyard
This command allows you to obtain a snapshot of the resources consumed by active jobs up to and including the moment when you issued the command. If you want to capture this type of information on a regular basis, you can set up a crontab entry to invoke this command regularly. For example:
# sample crontab for accounting
# shift crontab 94/8/5
#
# Set up three shifts: first, second, and graveyard shift.
# Crontab entries indicate the end of shift.
#
#M  H  d m day command
#
00 08 * * * /u/loadl/bin/llctl -g capture graveyard
00 16 * * * /u/loadl/bin/llctl -g capture first
00 00 * * * /u/loadl/bin/llctl -g capture second
For more information on the llctl command, refer to llctl - Control LoadLeveler Daemons. For more information on the collection of accounting records, see llq - Query Job Status.
If your installation is interested in keeping track of resources used on an account basis, you can require all users to specify an account number in their job command files. They can specify this account number with the account_no keyword which is explained in detail in "Job Command File Keywords".
LoadLeveler validates this account number by comparing it against a list of account numbers specified for the user in the user stanza in the administration file.
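For example, a user's job command file might contain a line like the following (the account number shown is illustrative):

# @ account_no = 297

and that user's stanza in the administration file might list the account numbers the user is allowed to specify (this sketch assumes the account keyword in the user stanza holds that list):

jane: type = user
      account = 297 317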
Account validation is under the control of the ACCT keyword in the configuration file. The routine that performs the validation is called llacctval. You can supply your own validation routine by specifying the ACCT_VALIDATION keyword in the configuration file. The following are passed as character string arguments to the validation routine: the user name, the user's login group, the account number specified in the job command file, and the account numbers obtained from the user's stanza in the administration file.
The account validation routine must exit with a return code of zero if the validation succeeds. If it fails, the return code is a non-zero number.
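As an illustration, a replacement routine can be as simple as the following shell script sketch (the script name and its logic are placeholders; it merely checks the requested account number against the list from the user's stanza):

#!/bin/sh
# myacctval (sketch): arguments are the user name, the login group, the
# account number requested in the job command file, and the valid account
# numbers from the user's stanza.
user=$1
group=$2
requested=$3
shift 3
for acct in "$@"
do
    if [ "$requested" = "$acct" ]; then
        exit 0     # validation succeeds
    fi
done
exit 1             # non-zero exit: validation fails

To use such a routine, point the ACCT_VALIDATION keyword at the script's full path.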
LoadLeveler stores the accounting information that it collects in a file called history in the spool directory of the machine that initially scheduled this job, the schedd machine. Data on parallel jobs are also stored in the history files.
Resource information collected on the LoadLeveler job is constrained by the capabilities of the wait3 system call. Information for processes which fork child processes will include data for those child processes as long as the parent process waits for the child process to terminate. Complete data may not be collected for jobs which are not composed of simple parent/child processes. For example, if you have a LoadLeveler job which invokes an rsh command to execute a function on another machine, the resources consumed on the other machine will not be collected as part of the LoadLeveler accounting data.
LoadLeveler accounting uses two types of files: the local history file that each schedd machine maintains in its spool directory, and the global history file that is produced by merging local history files from the machines in the cluster (for example, with the llacctmrg command).
You can produce three types of reports using either the local or global history file: the short, long, and extended versions. As their names imply, the short version is a brief listing of the resources used by LoadLeveler jobs; the long version provides more comprehensive detail, with resource usage summarized; and the extended version provides the same comprehensive detail, with resource usage reported in full. If you do not specify a report type, you receive the short version by default.
The short report displays the number of jobs along with the total CPU usage according to user, class, group, and account number. The extended version of the report displays all of the data collected for every job. See the llsummary command, llsummary - Return Job Resource Information for Accounting, for examples of the short and extended versions of the report.
For information on the accounting Application Programming Interfaces, refer to Chapter 11. "LoadLeveler APIs".
The following sample scenario walks you through the process of collecting account data. You can perform all of the steps or just the ones that apply to your situation.
Edit the configuration file according to the following table:
Edit this keyword: | To:
---|---
GLOBAL_HISTORY | Specify a directory in which to place the global history files.
ACCT | Turn accounting and account validation on and off and specify detailed accounting.
ACCT_VALIDATION | Specify the account validation routine.
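For example, after this step the accounting portion of the configuration file might read as follows (the directory, the path to the validation routine, and the A_VALIDATE flag for turning validation on are shown as illustrative values for your installation):

GLOBAL_HISTORY = /u/loadl/archive
ACCT = A_ON A_VALIDATE A_DETAIL
ACCT_VALIDATION = /u/loadl/bin/myacctval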
You can accomplish this step using either the llacctmrg command or the graphical user interface:
A window appears prompting you to enter a directory name where the file will be placed. If no directory is specified, the directory specified with the GLOBAL_HISTORY keyword in the global configuration file is the default directory.
The window closes and you return to the main window.
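From the command line, the equivalent step is the llacctmrg command; for example, the following collects the local history files into the named directory (the directory shown is illustrative):

llacctmrg /u/loadl/archive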
You can accomplish this step using either the llsummary command or the graphical user interface:
Note: If you want to receive an extended accounting report, select the extended cascading button.
A window appears prompting you to enter the report information.
The window closes and you return to the main window. The report appears in the Messages window if no output data file was specified.
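From the command line, the equivalent step is the llsummary command; as a sketch, the following produces the default short report from a merged history file (the file name is a placeholder for the global history file produced in the previous step):

llsummary /u/loadl/archive/globalhist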
The Build a Job window appears.
The window closes and you return to the main window.
To specify the weight associated with a machine, use the speed keyword in that machine's machine stanza in the administration file.
Also, if your cluster contains machines of differing speeds and you want LoadLeveler accounting information to be normalized for these differences, specify cpu_speed_scale=true in each machine's machine stanza.
For example, suppose you have a cluster of two machines, called A and B, where Machine B is three times as fast as Machine A. Machine A has speed=1.0, and Machine B has speed=3.0. Suppose a job runs for 12 CPU seconds on Machine A. The same job runs for 4 CPU seconds on Machine B. When you specify cpu_speed_scale=true, the accounting information collected on Machine B for that job shows the normalized value of 12 CPU seconds rather than the actual 4 CPU seconds.
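For this example, the machine stanzas in the administration file might look like the following (the stanza names are illustrative):

A: type = machine
   speed = 1.0
   cpu_speed_scale = true

B: type = machine
   speed = 3.0
   cpu_speed_scale = true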