The modeling phase includes:
The solver phase includes:
The analysis of solution data is a complex issue. The number of ways the user can employ the functions of the IBM Optimization Library Stochastic Extensions is so great that it is difficult to anticipate all possible user requirements. The Solver provides access to:
The callable modules provide complete access to all solution data, including dual information.
All information concerning a particular instance of a stochastic program is organized as a structured data type called Stoch_Data. This structure is initialized by a call to subroutine ekks_InitializeApplication, which returns a pointer to the Stoch_Data data type. Every Callable Module uses this pointer to access the data of the stochastic program. Multiple instances of stochastic programs can appear in the same program module, each with their own unique Stoch_Data pointer.
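For instance, two stochastic programs can be handled side by side. The following is a minimal sketch in the elided style used by the generic driver below; the omitted arguments (...) stand for the remaining parameters of each call, which are documented in the Reference Section:

Stoch_Data *stoch_data_a, *stoch_data_b;

/* Each instance gets its own Stoch_Data pointer... */
ekks_InitializeApplication(&rc, &stoch_data_a, ...);
ekks_InitializeApplication(&rc, &stoch_data_b, ...);

/* ...and every later call selects an instance by passing its pointer. */
ekks_CreateCore(&rc, stoch_data_a, ...);
ekks_CreateCore(&rc, stoch_data_b, ...);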
The generic IBM Stochastic Extensions driver consists of five subroutine calls:
#include "ekkstoch.h" Stoch_Data *stoch_data; ekk_Initialize(); ekks_InitializeApplication(&rc,&stoch_data,...); ekks_CreateCore(&rc,stoch_data,... ); for(s=0;s<max_s;s++) { ekks_AddScenario(&rc,stoch_data,s,...); } ekks_BendersLSolve(&rc,stoch_data,osl_dspace,...); for(s=0;s<max_s;s++) { ekks_GetSolution(&rc,stoch_data,s,soln,...); printf( ... , soln); }The call to ekks_InitializeApplication initializes the stoch_data pointer to the Stoch_Data object. The calls to ekks_CreateCore (declare core file) and ekks_AddScenario (add a scenario) pass the problem data to the Stoch_Data object. A call to ekks_BendersLSolve declares and solves the stochastic problem with a Benders decomposition method implemented in the IBM Optimization Library subroutines. The subroutine ekks_GetSolution accesses scenario solutions one-by-one after which they may be processed by the user (for example, printed).
Scenario data is defined relative to the core data. For details of this convention, see the documentation of the ekks_AddScenario subroutine.
SPL-file storage is managed by IBM Stochastic Extensions I/O routines. Users have access to this information through special-purpose data-access routines. The SPL file may be saved (ekks_PutStochasticData), in which case it contains all information necessary to restore (ekks_GetStochasticData) the stochastic program.
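A save/restore cycle is then a pair of calls. A sketch in the same elided style, assuming the SPL file name is among the omitted arguments:

/* Save: the SPL file receives everything needed to rebuild the program. */
ekks_PutStochasticData(&rc, stoch_data, ...);

/* Later, possibly in a different driver: restore from the SPL file. */
ekks_InitializeApplication(&rc, &stoch_data, ...);
ekks_GetStochasticData(&rc, stoch_data, ...);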
The data required for the definition of a stochastic program is organized
into a structure of type Stoch_Model. A Stoch_Model can
be thought of as a selection of a subset of nodes from a Stoch_Data.
(They are not the same sort of structure, since the node data for the submodel
is obtained by reference to the original Stoch_Data structure.)
A stochastic model is created by passing a list of nodes to the
appropriate IBM Stochastic Extensions subroutine, ekks_DescribeModel.
By default, a Stoch_Data structure is itself considered to be
a stochastic model containing all possible nodes.
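A sketch of the pattern, in the elided style of the generic driver; the node-list variables here are hypothetical, and the actual calling sequence of ekks_DescribeModel is given in the Reference Section:

Stoch_Model *stoch_model;             /* receives the new submodel       */
int node_list[] = { 0, 1, 4, 5 };     /* hypothetical: selected node IDs */

/* Remaining arguments elided; see the Reference Section. */
ekks_DescribeModel(&rc, stoch_data, &stoch_model, node_list, ...);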
The data pointed to by the structures Stoch_Data and Stoch_Model is internal and protected from the user, but all stochastic programming data is available to the user via the structures usrCoreData, usrStoch, usrNode, and usrNodeData, as referenced in the Data Structures Reference. These must be filled by calls to the special subroutines ekks_GetCoreData, ekks_GetScenarioTree, and/or ekks_GetNodeData. The sample driver programs exsdst.c and exsolv.c show examples of user data access.
Calls to user access subroutines can have a Stoch_Data or a Stoch_Model pointer in the argument list. In the latter case, the list of nodes returned will be those that appeared in the model's original list, plus the list of appropriately marked virtual nodes. Offsets and probabilities refer to the relevant model.
The node data retrieved by a call to ekks_GetNodeData
contains only the stochastic data. It must be combined with the appropriate
core data obtained by ekks_GetCoreData.
It is important to note that the data may have been sorted during the
input phase, so particular items may have been moved out of their original
positions. The usrCoreData
structure has pointers to the sort keys used to translate the row and column
indices from external (i.e., user) order to internal order, and back.
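The intended pattern, sketched with hypothetical names; the elided arguments and the actual sort-key fields of usrCoreData are given in the Reference Section and the Data Structures Reference:

usrCoreData core_data;
usrNodeData node_data;

/* Fill the user structures from the protected internal data. */
ekks_GetCoreData(&rc, stoch_data, &core_data, ...);
ekks_GetNodeData(&rc, stoch_data, node_number, &node_data, ...);

/* Hypothetical field name "row_sort_key": translate an internal row
   index back to the user's original row index. */
int user_index = core_data.row_sort_key[internal_index];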
$$\min_{x_1}\; c_1^T x_1 + \mathrm{E}_2\, Q_2(x_1),$$

subject to:

$$x_1 \in \mathbb{R}^{n_1}, \qquad l_1 \le x_1 \le u_1, \qquad A_1 x_1 = b_1;$$
where the functions $Q_2, Q_3, \ldots, Q_T$ are defined recursively by:

$$Q_2(x_1) = \min_{x_2}\; c_2^T x_2 + \mathrm{E}_3\, Q_3(x_1, x_2),$$

subject to:

$$x_2 \in \mathbb{R}^{n_2}, \qquad l_2 \le x_2 \le u_2, \qquad A_{2,1} x_1 + A_{2,2} x_2 = b_2;$$

$$Q_3(x_1, x_2) = \min_{x_3}\; c_3^T x_3 + \mathrm{E}_4\, Q_4(x_1, x_2, x_3),$$

subject to:

$$x_3 \in \mathbb{R}^{n_3}, \qquad l_3 \le x_3 \le u_3, \qquad A_{3,1} x_1 + A_{3,2} x_2 + A_{3,3} x_3 = b_3;$$
and so on, for $Q_t$, $t = 4, \ldots, T-1$. In each of the foregoing equations, $\mathrm{E}_t$ represents the expectation with respect to the random variables in period $t$. Finally, $Q_T$ is defined by:
$$Q_T(x_1, \ldots, x_{T-1}) = \min_{x_T}\; c_T^T x_T,$$

subject to:

$$x_T \in \mathbb{R}^{n_T}, \qquad l_T \le x_T \le u_T, \qquad A_{T,1} x_1 + A_{T,2} x_2 + \cdots + A_{T,T} x_T = b_T.$$
The data defining this problem may be arranged in an LP formulation for a single realization of the random variables:
objective:

$$\sum_{t=1}^{T} c_t^T x_t,$$

constraints:

$$x_t \in \mathbb{R}^{n_t} \quad \text{for } t = 1, \ldots, T,$$

$$l_t \le x_t \le u_t \quad \text{for } t = 1, \ldots, T,$$

$$A_1 x_1 = b_1 \in \mathbb{R}^{m_1}, \quad \text{and}$$

$$\sum_{j=1}^{t} A_{t,j}\, x_j = b_t \in \mathbb{R}^{m_t} \quad \text{for } t = 2, \ldots, T.$$
All entries of the matrices $A_{t,j}$ and vectors $c_t$, $l_t$, $u_t$, $b_t$ may be random (although in practice all but a few entries will be deterministic). The indices $t = 1, \ldots, T$ signify the periods of the problem; to each period $t$ there corresponds a decision vector $x_t \in \mathbb{R}^{n_t}$.
The lower block-triangular constraint system expresses the typical feature
of these problems--the decisions of the prior periods constrain the decision
of the current period explicitly (since those decisions are known) but the
decision of the current period is affected only implicitly by the feasibility
and costs of possible future (recourse) decisions.
The proposed format is most easily understood by considering the above multistage LP at gradually increasing levels of detail. First we regard it as a purely deterministic problem and create an input file following the MPS convention, ignoring (non-random) zero entries. This file identifies an objective vector, upper and lower bounds, a right hand side, and a block lower-triangular matrix, all expressed in the usual column-row format; we call this the core file.
Next we note the period structure of the problem, that certain decisions are made at certain times. Here it is necessary only to specify which rows and columns from the core file correspond to which periods. It is most simply done by indicating the beginning column and row for each period. This is done in the time file. Such a system relies on the proper sequencing of the core file--we require that the list of row names is in order from first period to last period and that the block lower-triangular matrix has been entered in column order: first period columns first, and last period columns last.
Finally there remains the specification of the distributions of the random entries; this takes place in the stoch file. The simplest case occurs when all random entries are mutually independent. We consider two general types of dependency among the random entries: blocks and scenarios. A block is a random vector whose realizations are observed in a single, fixed period. A scenario is a more general type of stochastic structure which models dependencies across periods.
The organization of the data files is similar to the MPS format. Each data file contains a number of sections, some of which are optional. A header line, or header, marks the beginning of each section and may contain keywords to inform the user that the data to follow should be treated in a special way. Each header line is divided into two fields delimited by specific columns.
1...5.........15........25.............40........50.............
NAME          problem name
* this is a comment line
ROWS
 N  OBJ
 L  ROW1
 E  ROW2
    -etc-
COLUMNS
1...5.........15........25.............40........50.............
    COL1      ROW1      value          ROW2      value
    COL1      ROW3      value          ROW4      value
    -etc-
RHS
    RHS       ROW1      value          ROW2      value
    -etc-
BOUNDS
 LO BND       COL1      value
    -etc-
RANGES
    -etc-
ENDATA

Integer variables may be indicated by using delimiters as described in the MPS/MIP standard, for those users who want to use mixed-integer programming techniques.(1)
1...5.........15........25.............40........50.............
TIME          problem name
PERIODS
    COL1      ROW1                     PERIOD1
    COL6      ROW3                     PERIOD2
    COL8      ROW19                    PERIOD3
ENDATA

In this example, columns COL1 through COL5 are PERIOD1 decision variables and COL6 and COL7 are PERIOD2 variables; rows ROW1 and ROW2 are PERIOD1 constraints and ROW3 through ROW18 are PERIOD2 constraints. All remaining rows and columns belong to PERIOD3.
1...5.........15........25.............40........50.............
INDEP         DISCRETE
    COL1      ROW8      6.0            PERIOD2   0.5
    COL1      ROW8      8.0            PERIOD2   0.5
    RHS       ROW8      1.0            PERIOD2   0.1
    RHS       ROW8      2.0            PERIOD2   0.5
    RHS       ROW8      3.0            PERIOD2   0.4

In this example the entry COL1/ROW8 takes value 6.0 with probability 0.5 and 8.0 with probability 0.5; and the right hand side of ROW8 takes values (1.0, 2.0, 3.0) with probabilities (0.1, 0.5, 0.4), respectively. Of course the probabilities associated with an entry must total one.
1...5.........15........25.............40........50.............
INDEP         UNIFORM
    COL1      ROW8      8.0            PERIOD2   9.0

In this example the random entry COL1/ROW8 is uniformly distributed over the interval [8.0, 9.0].
1...5.........15........25.............40........50.............
INDEP         NORMAL
    COL1      ROW8      0.0            PERIOD2   1.0

In this example the random variable is normal with mean 0.0 and variance 1.0.
1...5.........15........25.............40........50.............
BLOCKS        DISCRETE
 BL BLOCK1    PERIOD2   0.5
    COL1      ROW6      83.0
    COL2      ROW8      1.2
 BL BLOCK1    PERIOD2   0.2
    COL1      ROW6      83.0
    COL2      ROW8      1.3
 BL BLOCK1    PERIOD2   0.3
    COL1      ROW6      84.0
    COL2      ROW8      1.2
ENDATA

In this example the block, called BLOCK1, is the vector made up of the entries COL1/ROW6 and COL2/ROW8. It takes values (83.0, 1.2) with probability 0.5, (83.0, 1.3) with probability 0.2, and (84.0, 1.2) with probability 0.3.
1...5.........15........25.............40........50.............
BLOCKS        LINTR
 BL V_BLOCK   PERIOD2
    COL1      ROW8
    COL3      ROW6
 RV U1        NORMAL    0.0            1.0
    COL1      ROW8      11.0
    COL3      ROW6      21.0
 RV U2        UNIFORM   -1.0           1.0
    COL1      ROW8      12.0
    COL3      ROW6      22.0
 RV U3        CONSTANT
    COL3      ROW6      24.0

This example illustrates the file structure. In this case the block V_BLOCK, made up of the entries COL1/ROW8 and COL3/ROW6, is a linear transformation of the independent random variables u1, u2, and u3: COL1/ROW8 = 11.0 u1 + 12.0 u2 and COL3/ROW6 = 21.0 u1 + 22.0 u2 + 24.0, where u1 is normal with mean 0.0 and variance 1.0, u2 is uniform on [-1.0, 1.0], and the CONSTANT term contributes 24.0. Note that the "names" U1, U2 and U3 are irrelevant and may be left blank.
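The following is a self-contained sketch of the transformation this block encodes, with the distributions sampled by standard methods; it illustrates the mathematics only, and the variable names are not part of the library:

#include <math.h>
#include <stdio.h>
#include <stdlib.h>

int main(void)
{
    double two_pi = 6.283185307179586;

    /* u1 ~ Normal(0,1) via Box-Muller; u2 ~ Uniform(-1,1). */
    double r1 = 1.0 - drand48();      /* in (0,1], safe for log() */
    double r2 = drand48();
    double u1 = sqrt(-2.0 * log(r1)) * cos(two_pi * r2);
    double u2 = -1.0 + 2.0 * drand48();

    /* One realization of the LINTR block: each entry is a linear
       combination of u1 and u2 plus the CONSTANT contribution,
       with the coefficients read from the file above. */
    double col1_row8 = 11.0 * u1 + 12.0 * u2;
    double col3_row6 = 21.0 * u1 + 22.0 * u2 + 24.0;

    printf("COL1/ROW8 = %g, COL3/ROW6 = %g\n", col1_row8, col3_row6);
    return 0;
}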
To describe the process paths in the discrete-state case, note that the process can assume only finitely many values. A path of the process up to a given time stage may be followed by a finite collection of possible values of the process in the next time period. We think of these as nodes in the next period. Following Lane and Hutchinson and Raiffa, we construct an event tree representation of the trajectories: represent the (unique) value of the process in the first period by a single node, connected by oriented arcs to nodes representing the values of the process in period 2. These are the descendant nodes of the first-period node, in the terminology of Birge. Each node of period 2 is connected to its descendant nodes at period 3 by individual arcs oriented in the direction of the period 3 nodes. This construction is continued by connecting each ancestor with its descendant nodes in the next period. Each node of this tree has a single entering arc and multiple departing arcs representing the possible next-period events.
A single trajectory of the process thus corresponds to a path from the period 1 node to a single period T node, composed of arcs oriented in the direction of increasing time, and moreover corresponds uniquely to a single node in the last period T. There are only finitely many paths linking nodes in period 1 to nodes in period T, and to specify the distribution on the paths one need only attach a definite probability to each path. Hence specifying the process distribution in terms of path probabilities can be effected in this context by assigning probabilities to the period T nodes of the event tree; see Figure 1.1 below.
The topology of this event tree represents the information about the process. To each arc we attach the probability that the terminal node occurs given the initial node has occurred--that is, the conditional probability of the process distribution over a period conditional on the history over all previous periods. These arc probabilities can be computed from the path probabilities by summing the probabilities of the paths visiting the terminal node, and then dividing by the sum of the probabilities of the paths visiting the initial node.
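In symbols, with $P(\pi)$ denoting the probability of path $\pi$: for an arc $a$ leading from node $u$ to node $v$,

$$p(a) \;=\; \frac{\sum_{\pi \ni v} P(\pi)}{\sum_{\pi \ni u} P(\pi)},$$

where the sums run over the paths visiting $v$ and $u$, respectively.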
Conversely, if to each arc in the tree we know the conditional probability that its terminal node occurs given that its initial node has occurred, then the probability of any given path is simply the product of the arc probabilities along the path.
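A small self-contained illustration of this product rule; the tree layout and probabilities here are invented for the example:

#include <stdio.h>

#define NNODES 7

/* A small event tree stored in arrays: parent[i] is the ancestor of node i,
   and arc_prob[i] is the conditional probability attached to the arc
   entering node i.  Node 0 is the period-1 root (no entering arc). */
static const int    parent[NNODES]   = { -1, 0, 0, 1, 1, 2, 2 };
static const double arc_prob[NNODES] = { 1.0, 0.6, 0.4, 0.5, 0.5, 0.25, 0.75 };

/* Path probability = product of arc probabilities from root to leaf. */
static double path_probability(int leaf)
{
    double p = 1.0;
    for (int n = leaf; parent[n] >= 0; n = parent[n])
        p *= arc_prob[n];
    return p;
}

int main(void)
{
    /* Nodes 3..6 are the last-period nodes; their path probabilities
       must sum to one. */
    double total = 0.0;
    for (int leaf = 3; leaf < NNODES; leaf++) {
        double p = path_probability(leaf);
        printf("path to node %d has probability %.4f\n", leaf, p);
        total += p;
    }
    printf("total = %.4f\n", total);   /* 0.30 + 0.30 + 0.10 + 0.30 = 1.00 */
    return 0;
}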
A decision at a given node is made only on the basis of information collected up to and including that node. The uncertainty faced by the decision maker is
represented by the collection of paths that branch from this node. Thus in the following Figure, the decisions occur at the nodes
and the scenarios branch after the node.
[Figure 1.1. An event tree.]
In a language more specific to our application, a "path" in the tree analogy is a single "scenario". The nodes visited by the path correspond to certain values assumed by certain entries of the matrices in the core file. Thus a scenario is completely specified by a list of column/row names and values, and a probability value. Once a single given scenario is described, then other scenarios that branch from it may be described by indicating in which period the branch has occurred, and then listing the subsequent column/row names and values. It is best to work through the example shown in the preceding Figure.
There are two types of data lines. The first, signified by SC in the code field, gives the name of the scenario in the first name field and its probability in the first numeric field, and then gives the name of the scenario from which the branch occurred and the name of the period in which the branch occurred --i.e., the first period in which the two scenarios differ-- in the second and third name fields, respectively. A scenario that originates in period one is indicated by ROOT in the name field. The next data lines give the column/row values assumed by the scenario.
1...5.........15........25.............40........50.............
SCENARIOS     DISCRETE
 SC SCEN1     ROOT      0.5            PERIOD1
    COL1      ROW2      1.0
    COL2      ROW3      1.0
    COL3      ROW4      1.0
    COL4      ROW5      1.0
 SC SCEN2     SCEN1     0.2            PERIOD3
    COL3      ROW4      1.0
    COL4      ROW5      1.0
 SC SCEN3     SCEN2     0.2            PERIOD4
    COL4      ROW5      0.0
 SC SCEN4     SCEN1     0.1            PERIOD2
    COL2      ROW3      0.0
    COL3      ROW4      0.0
    COL4      ROW5      0.0
ENDATA

This is a description of the distribution of four entries: COL1/ROW2, COL2/ROW3, COL3/ROW4, and COL4/ROW5, which for convenience we denote here as d1, d2, d3, and d4, respectively. Note that in PERIOD4 there are two nodes for the "state" d4 equals 0.0 and two for d4 equals 1.0, and similarly in PERIOD3 two nodes for d3 equals 1.0. This is because a node is distinguished by the information that one has collected concerning the path up to and including that node. Thus in PERIOD3 the two nodes are distinguished because in scenario SCEN1 one knows that the final state is d4 equals 1.0, whereas in SCEN2 the outcome of d4 is in doubt.
The solution procedure consists of a nested L-shaped, or Benders, decomposition phase and, if needed, a Simplex method phase. The decomposition may not terminate in a reasonable number of iterations; for this reason it is often useful to follow Benders with the Simplex method. This is done automatically if the number of major cycles exceeds Benders_Major_Iterations.
In the decomposition phase, the problem is decomposed into subproblems according to one of several user-specified criteria: Problem_Size, Maximum_Subproblems, or Cut_Stage.
If there are more subproblems than the maximum number of subproblems allowed, Maximum_Subproblems, then the subtrees are divided among the subproblems as evenly as possible. In the case of a cut at a time stage Cut_Stage, the first subproblem is joined to the master problem, thus the master consists of the nodes less than Cut_Stage together with the subtrees collected into the first subproblem. This "augmented master" tends to produce better proposals; for example, the master nodes might be unbounded without the influence of some downstream nodes. If the tree is not decomposed by time stage, then no augmented master is generated.
In nested decomposition, subproblems can generate both cuts and proposals, depending on their position in the subproblem tree. The implementation requires that all child subproblems report OPTIMAL before the subproblem will generate a cut for its parent, and that a subproblem generates a proposal for its children only when it has received a new proposal from its parent. (This is the so-called Fast Forward, Fast Back protocol.) Only optimal cuts and proposals are generated.
Changing the number Maximum_Subproblems can have a big effect on the performance of Benders decomposition. For example, 100 subproblems will generate 100 different cuts, whereas 1 subproblem will generate 1 cut from the same proposal. The difference is the level of aggregation: the 1 cut from 1 subproblem is in fact the aggregation of the 100 cuts from the 100 subproblems. Generally speaking, aggregated cuts contain less information than the collection of disaggregated cuts, but disaggregated cuts make the master problem grow much faster.
Major iterations of Benders decomposition are performed until either a relative optimality gap of size less than Optimality_Gap is attained, or the maximum number of iterations Benders_Major_Iterations is performed. A major iteration consists of:
Experience suggests that the solution progress is most affected by the Problem_Size and the maximum number of major iterations. Large subproblems take a long time to solve. A large number of small subproblems will spend a long time resolving root proposals.
The parameters not yet described affect the solution procedures applied within the Benders decomposition. These procedures are implemented in IBM Optimization Library subroutines, and the parameters refer to the calling sequences of these subroutines, which are documented in the Optimization Library Users Guide and Reference. A method or type setting of zero means that the procedure will not be called. For example, the parameter EKKCRSH_method refers to the parameter passed to EKKCRSH, which is called at the initial solution of any problem, master or subproblem. If this parameter is set to zero, then EKKCRSH is not called (not recommended!). The actual instantiation is if (icrsh) ekkcrsh(iret,osl_dspace,icrsh);, and similarly for the other procedures.