Utilities
Utilities
Utilities
Paolo Bruni
Marcelo Antonelli
Tom Crocker
Davy Goethals
Rama Naidoo
Bart Steegmans
ibm.com/redbooks
International Technical Support Organization
August 2001
SG24-6289-00
Take Note! Before using this information and the product it supports, be sure to read the general
information in “Special notices” on page 299.
This edition applies to Version 7 of IBM DATABASE 2 Universal Database Server for z/OS and OS/390 (DB2
for z/OS and OS/390 Version 7), Program Number 5675-DB2, and the DB2 Version 7 Utilities Suite, Program
Number 5697-E98.
When you send information to IBM, you grant IBM a non-exclusive right to use or distribute the information in
any way it believes appropriate without incurring any obligation to you.
Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
The team that wrote this redbook. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
Special notice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
IBM trademarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
Comments welcome. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
Part 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Chapter 1. Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1 Contents of this redbook . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Brief overview of all DB2 Utilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.1 Online utilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.2 Stand-alone utilities. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Utilities at work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Major changes with DB2 V7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.4.1 Functional enhancements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.4.2 New packaging of utilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
iv DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
4.3.1 Monitoring space growth . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
4.3.2 Compression dictionary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
4.4 Real-Time Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
4.4.1 Real-Time Statistics tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
4.4.2 Description of statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
4.4.3 Impact of utilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
4.4.4 Maintaining in-memory statistics in data sharing . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.4.5 Statistics accuracy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.4.6 DSNACCOR stored procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
4.5 DSNUTILS and Control Center . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
4.6 Data administration tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
4.6.1 DB2 Administration Tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
4.6.2 DB2 Automation Tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
Contents v
7.4 Planning for Online REORG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
7.5 Shadow data sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
7.6 Phases of Online REORG. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
7.6.1 UNLOAD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
7.6.2 RELOAD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
7.6.3 SORT and BUILD as compared to SORTBLD . . . . . . . . . . . . . . . . . . . . . . . . . . 170
7.6.4 LOG phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
7.6.5 SWITCH . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
7.6.6 BUILD2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
7.7 Controlling Online REORG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
7.7.1 DRAIN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
7.7.2 DRAIN_WAIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177
7.7.3 RETRY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
7.7.4 RETRY_DELAY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 178
7.7.5 MAXRO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179
7.7.6 LONGLOG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
7.7.7 DELAY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
7.7.8 TIMEOUT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
7.7.9 DEADLINE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.7.10 Recommendations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.8 Further options . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.8.1 LISTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
7.8.2 SORTKEYS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
7.8.3 SORTDATA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
7.8.4 NOSYSREC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
7.8.5 DFSORT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
7.8.6 KEEPDICTIONARY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
7.8.7 REUSE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
7.8.8 PREFORMAT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
7.9 DISCARD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185
7.10 UNLOAD EXTERNAL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187
7.11 Index REORG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
7.12 Catalog reorganization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
7.13 Inline Utilities with REORG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
7.13.1 Inline COPY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
7.13.2 Inline RUNSTATS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
vi DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Chapter 9. Recovering data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209
9.1 RECOVER using the LISTDEF command . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210
9.2 Parallel RECOVER . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
9.2.1 Restriction for parallel RECOVER. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212
9.3 LOGONLY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213
9.3.1 Copy taken outside DB2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213
9.3.2 DB2 object names with REORG and FASTSWITCH . . . . . . . . . . . . . . . . . . . . . 215
9.3.3 Recovering a DB2 object from Concurrent COPY . . . . . . . . . . . . . . . . . . . . . . . 215
9.3.4 Recovering without an image copy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215
9.3.5 Recovering a single piece of a multi-linear page set. . . . . . . . . . . . . . . . . . . . . . 216
9.3.6 LOGONLY recovery improvements in V7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
9.4 Log Apply considerations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
9.5 Fast Log Apply . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217
9.5.1 Fast Log Apply buffers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218
9.5.2 Sort the log records. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218
9.5.3 When is Fast Log Apply bypassed? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
9.6 Index recovery from COPY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
9.7 Modify enhancement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221
9.8 REPORT RECOVERY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222
9.9 DB2 Change Accumulation Tool . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224
9.10 QUIESCE . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225
9.11 CHECK . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
9.11.1 CHECK INDEX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
9.11.2 CHECK DATA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228
9.11.3 CHECK LOB . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
Contents vii
11.3.8 REPORT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
11.4 RUNSTATS and LOB table spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
11.5 New statistics collected . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265
11.6 Removing HISTORY statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
11.7 RUNSTATS performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271
Appendix B. Sample JCL for copying Catalog and Directory objects . . . . . . . . . . . . 287
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
viii DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Figures
x DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Tables
xii DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Examples
xiv DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
9-5 COPY table space and all indexes . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
9-6 RECOVER of the table space and indexes in parallel . . . . . . . . . . . . . 220
9-7 RECOVER table space and indexes output . . . . . . . . . . . . . . . . . . . . 220
9-8 REPORT RECOVERY sample job . . . . . . . . . . . . . . . . . . . . . . . . . . . 222
9-9 REPORT RECOVERY sample output . . . . . . . . . . . . . . . . . . . . . . . . . 223
9-10 Sample JCL to create new full image copy . . . . . . . . . . . . . . . . . . . . . 224
9-11 QUIESCE with a list of table spaces . . . . . . . . . . . . . . . . . . . . . . . . . . 225
9-12 QUIESCE using the LISTDEF command. . . . . . . . . . . . . . . . . . . . . . . 225
9-13 QUIESCE of table space and all indexes . . . . . . . . . . . . . . . . . . . . . . 226
9-14 Sample CHECK INDEX JCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
9-15 CHECK INDEX job output . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
9-16 Sample CHECK DATA JCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 229
9-17 Sample CHECK LOB JCL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
9-18 CHECK LOB job output . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 230
10-1 Specifying lists in the COPY utility . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235
10-2 Parallel COPY using LISTDEF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236
10-3 Output from Parallel COPY using LISTDEF . . . . . . . . . . . . . . . . . . . . 237
10-4 -DISPLAY UTILITY for Parallel COPY . . . . . . . . . . . . . . . . . . . . . . . . 238
10-5 Specifying CHANGELIMIT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
10-6 Report output from COPY utility. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
10-7 COPYTOCOPY and LISTDEFs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 244
10-8 COPYTOCOPY using LASTFULLCOPY . . . . . . . . . . . . . . . . . . . . . . . 245
10-9 Output job for Concurrent COPY with error . . . . . . . . . . . . . . . . . . . . . 247
10-10 Output from incremental copy after a Concurrent COPY. . . . . . . . . . . 248
10-11 Example of RECOVER using a Concurrent COPY . . . . . . . . . . . . . . . 248
10-12 Copy statement without FILTERDDN . . . . . . . . . . . . . . . . . . . . . . . . . 250
10-13 Copy statement using FILTERDDN . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
10-14 Complete JCL using FILTERDDN . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
10-15 Job output using FILTERDDN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251
11-1 Example of LISTDEF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 255
11-2 RUNSTATS with LISTDEF job output . . . . . . . . . . . . . . . . . . . . . . . . . 255
11-3 RUNSTATS using KEYCARD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
11-4 RUNSTATS KEYCARD job output . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
11-5 RUNSTATS using FREQVAL . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
11-6 RUNSTATS using FREQVAL sample output. . . . . . . . . . . . . . . . . . . . 258
11-7 Multiple FREQVAL statements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
11-8 RUNSTATS SAMPLE 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259
11-9 RUNSTATS SAMPLE 25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260
11-10 RUNSTATS SAMPLE 100 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260
11-11 RUNSTATS REPORT sample output . . . . . . . . . . . . . . . . . . . . . . . . . 262
11-12 RUNSTATS with LISTDEF for LOB table spaces . . . . . . . . . . . . . . . . 263
11-13 RUNSTATS for LOB table spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263
11-14 RUNSTATS output for LOB table spaces . . . . . . . . . . . . . . . . . . . . . . 263
11-15 New statistics SPACE and EXTENTS . . . . . . . . . . . . . . . . . . . . . . . . . 267
11-16 Data set listing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
11-17 MODIFY STATISTICS by date using SPACE . . . . . . . . . . . . . . . . . . . 269
11-18 Output job for table space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269
B-1 Copying the Catalog and Directory . . . . . . . . . . . . . . . . . . . . . . . . . . . 287
C-1 DDL to define a shadow catalog in data base DSHADOW . . . . . . . . . 289
C-2 Unload DSNDB06 using the UNLOAD utility . . . . . . . . . . . . . . . . . . . . 290
C-3 Sample output of SYSPUNCH data set for SYSVIEWS . . . . . . . . . . . 291
C-4 Load data from DSNDB06 using the LOAD utility . . . . . . . . . . . . . . . . 292
C-5 RUNSTATS on shadow catalog. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 292
Examples xv
D-1 JCL to create the conversion table . . . . . . . . . . . . . . . . . . . . . . . . . . . 293
D-2 Output from the generation job . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 294
D-3 Contents of CUNUNI01 in SYS1.PARMLIB. . . . . . . . . . . . . . . . . . . . . 295
D-4 Activate the new UNICODE table . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295
D-5 UNI=ALL output in SYSLOG . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 296
xvi DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Preface
IBM has continuously enhanced the functionality, performance, availability, and ease of use of
DB2 Utilities. Starting with DB2 for MVS/ESA Version 4, the progression of changes has
increased. DB2 for z/OS and OS/390 Version 7 introduces new functions and a new way of
packaging the utilities.
This IBM Redbook is the result of a project dedicated to the new DB2 Version 7 Utilities Suite
product. It provides information on introducing Wildcarding in operational scenarios,
optimizing concurrent execution of utilities for a shorter night window, information for
triggering utilities execution, and considerations on partitioning.
It also describes the new functions of LOAD (Cross Loader and Online RESUME) and
REORG, as well as the new UNLOAD and COPTYTOCOPY utilities, and it evaluates the
impact of several options when using LOAD, REORG, RECOVER, COPY, and RUNSTATS.
This redbook concentrates on the recent enhancements. It implicitly assumes a basic level of
familiarity with the utilities as provided by DB2 for OS/390 Version 6.
Marcelo Antonelli is a DB2 Specialist in Brazil. He has 16 years of experience working with
DB2 in OS/390. Marcelo holds a graduate degree in System Analysis from PUCC in
Campinas, Sao Paulo state. His areas of expertise include database design, performance,
and DRDA connectivity. Marcelo currently supports an outsourcing commercial account, as
well as assisting IBM technical professionals on DB2.
Davy Goethals is a systems engineer in Belgium. He works for Sidmar, a steel producing
company. He has experience as a DB2 system administrator since DB2 Version1 in 1985. He
participated with Sidmar in multiple DB2 ESP and QPP programs, including the ESP for DB2
V7. His areas of expertise include OS/390 systems and Business Intelligence.
Bart Steegmans is a DB2 Product Support Specialist in IBM Belgium who has recently
joined the ITSO. He has over 12 years of experience in DB2. Before joining IBM in 1997, Bart
worked as a DB2 system administrator at a banking and insurance group. His areas of
expertise include DB2 performance and backup and recovery.
Corinne Baragoin
Yolanda Cascajo Jimenez
Yvonne Lyon
International Technical Support Organization, San Jose Center
Roy Cornford
Dan Courter
Craig Friske
John Garth
Koshy John
Laura Kunioka-Weis
Fung Lee
Roger Miller
Mai Nguyen
Mary Petras
Jim Ruddy
Akira Shibamiya
Yoichi Tsuji
Grace Tzeng
Tom Ulveman Jensen
Mary Beth White
IBM Silicon Valley Laboratory
Rich Conway
Vasilis Karras
International Technical Support Organization, Poughkeepsie Center
Michael Parbs
IBM Australia
xviii DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Special notice
This publication is intended to help managers and professionals understand and evaluate the
performance and applicability to their environment of the new functions introduced by IBM
DATABASE2 Universal Database Server for z/OS and OS/390 Version 7 and DB2 Version 7
Utilities Suite. The information in this publication is not intended as the specification of any
programming interfaces that are provided by IBM DB2 UDB Server for z/OS and OS/390
Version 7 and the DB2 Version 7 Utilities Suite. See the PUBLICATIONS section of the IBM
Programming Announcement for IBM DB2 UDB Server for z/OS and OS/390 Version 7 and
DB2 Version 7 Utilities Suite for more information about what publications are considered to
be product documentation.
IBM trademarks
The following terms are trademarks of the International Business Machines Corporation in the
United States and/or other countries:
e (logo)® Redbooks
IBM ® Redbooks Logo
S/390 OS/390
DRDA DB2
MVS/ESA
Comments welcome
Your comments are important to us!
We want our IBM Redbooks to be as helpful as possible. Send us your comments about this
or other Redbooks in one of the following ways:
Use the online Contact us review redbook form found at:
ibm.com/redbooks
Send your comments in an Internet note to:
redbook@us.ibm.com
Mail your comments to the address on page ii.
Preface xix
xx DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Part 1
Part 1 Introduction
In this part we introduce the DB2 Utilities, summarize their evolution, and describe the
contents of this redbook.
Chapter 1. Introduction
In this chapter we provide:
A description of the contents of this redbook
A brief overview of all DB2 Utilities, both online and stand-alone
A summary of the DB2 Utilities enhancements with V4, V5, V6, and V7
The main enhancements available with V7
The changes in their packaging that occurred with DB2 V7
Part 2, “Planning for DB2 Utilities” on page 15 includes overall considerations on functions
related to how to plan for, and when to execute the DB2 Utilities. These considerations are
distributed across four chapters describing the topics:
Wildcarding and Templates
Inline executions and parallelism
Triggering and controlling executions
Managing partitions
Part 3, “Executing DB2 Utilities” on page 107 goes into the execution of the new DB2 Utilities
and the new functions by describing the functionality and providing examples of usage for:
Loading data
Reorganizing data
Unloading data
Recovering data
Copying data
Gathering statistics
4 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
CHECK INDEX
The CHECK INDEX online utility tests whether indexes are consistent with the data they
index, and issues warning messages when an inconsistency is found. You execute
CHECK INDEX after a conditional restart or a point-in-time recovery on all table spaces
whose indexes may not be consistent with the data. It should also be used before CHECK
DATA to ensure that the indexes used by CHECK DATA are valid.
CHECK LOB
The CHECK LOB online utility can be run against a LOB table space to identify any
structural defects in the LOB table space and any invalid LOB values. You run the CHECK
LOB online utility against a LOB table space that is marked CHECK pending (CHKP) to
identify structural defects. If none are found, the CHECK LOB utility turns the CHKP status
off.
COPY
The COPY online utility creates up to four image copies of any table space, table space
partition, data set of a linear table space, index space, index space partition.
COPYTOCOPY
The COPYTOCOPY online utility, new with DB2 V7, makes image copies from an image
copy that was taken by the COPY utility. This includes Inline Copies made by the REORG
or LOAD utilities. Starting with either the local primary or recovery site primary copy,
COPYTOCOPY can make up to three copies of one or more of the local primary, local
backup, recovery site primary, and recovery site backup copies.
The copies are used by the RECOVER utility when recovering a table space or index
space to the most recent time or to a previous time. These copies can also be used by
MERGECOPY, UNLOAD, and possibly a subsequent COPYTOCOPY execution.
DIAGNOSE
The DIAGNOSE online utility generates information useful in diagnosing problems. It is
intended to be used only under the direction of your IBM Support Center.
LOAD
The LOAD online utility is used to load one or more tables of a table space. The LOAD
utility loads records into the tables and builds or extends any indexes defined on them. If
the table space already contains data, you can choose whether you want to add the new
data to the existing data or replace the existing data. The loaded data is processed by any
edit or validation routine associated with the table, and any field procedure associated with
any column of the table.
MERGECOPY
The MERGECOPY online utility merges image copies produced by the COPY utility or
Inline Copies produced by the LOAD or REORG utilities. It can merge several incremental
copies of a table space to make one incremental copy. It can also merge incremental
copies with a full image copy to make a new full image copy.
MODIFY RECOVERY
The MODIFY online utility with the RECOVERY option deletes records from the
SYSIBM.SYSCOPY catalog table, related log records from the SYSIBM.SYSLGRNX
directory table, and entries from the DBD. You can remove records that were written
before a specific date, or you can remove records of a specific age. You can delete records
for an entire table space, partition, or data set.
Chapter 1. Introduction 5
MODIFY STATISTICS
The MODIFY STATISTICS online utility, new with DB2 V7, deletes unwanted statistics
history records from the corresponding catalog tables. You can remove statistics history
records that were written before a specific date, or you can remove records of a specific
age. You can delete records for an entire table space, index space, or index.
QUIESCE
The QUIESCE online utility establishes a quiesce point (the current log RBA or log record
sequence number (LRSN)) for a table space, partition, table space set, or list of table
spaces and table space sets, and records it in the SYSIBM.SYSCOPY catalog table. An
established QUIESCE point improves the probability of a successful RECOVER or COPY.
You should run QUIESCE frequently between regular executions of COPY to establish
regular recovery points for future point-in-time recovery.
REBUILD INDEX
The REBUILD INDEX online utility reconstructs indexes from the table that they reference.
RECOVER
The RECOVER online utility recovers data to the current state, or to its state at a previous
point-in-time, by restoring a copy, then applying log records.
REORG INDEX
The REORG INDEX online utility reorganizes an index space to improve access
performance and reclaim fragmented space. You can specify the degree of access to your
data during reorganization, and collect inline statistics using the STATISTICS keyword.
REORG TABLESPACE
The REORG TABLESPACE online utility reorganizes a table space to improve access
performance and reclaim fragmented space. In addition, the utility can reorganize a single
partition or range of partitions of a partitioned table space. You can specify the degree of
access to your data during reorganization, and collect inline statistics using the
STATISTICS keyword. If you specify REORG TABLESPACE UNLOAD EXTERNAL, the
data is unloaded in a format that is acceptable to the LOAD utility of any DB2 subsystem.
You can also delete rows during the REORG by specifying the DISCARD option.
REPAIR
The REPAIR online utility repairs data. The data can be your own data, or data you would
not normally access, such as space map pages and index entries.
REPORT
The REPORT online utility provides information about table spaces. Use REPORT
TABLESPACESET to find the names of all the table spaces and tables in a referential
structure, including LOB table spaces. Use REPORT RECOVERY to find information
necessary for recovering a table space, index, or a table space and all of its indexes. The
REPORT utility also provides the LOB table spaces associated with a base table space.
RUNSTATS
The RUNSTATS online utility gathers summary information about the characteristics of
data in table spaces, indexes, and partitions. DB2 records this information in the DB2
catalog and uses it to select access paths to data during the bind process. It is available to
the database administrator for evaluating database design and to aid in determining when
table spaces or indexes must be reorganized.
STOSPACE
The STOSPACE online utility updates DB2 catalog columns that indicate how much space
is allocated for storage groups and related table spaces and indexes.
6 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
UNLOAD
The UNLOAD online utility unloads data from one or more source objects to one or more
BSAM sequential data sets in external formats. The source can be DB2 table spaces or
DB2 image copy data sets.
The following new utility statements are to be used in conjunction with other utilities:
– TEMPLATE
The TEMPLATE utility control statement lets you allocate data sets, without using JCL
DD cards, during the processing of LISTDEF list. In its simplest form, the TEMPLATE
control statement defines the data set naming convention. The control statement can
optionally contain allocation parameters that define data set size, location, and
attributes.
– LISTDEF
The LISTDEF utility control statement defines a list of objects and assigns a name to
the list. A complete LISTDEF control statement must include:
• The name of the list
• An INCLUDE clause, optionally followed by additional INCLUDE or EXCLUDE
clauses to either include or exclude objects from the list
• The type of objects the list contains, either TABLESPACES or INDEXSPACES
• The object to be used in the initial catalog lookup
The search for objects can begin with databases, table spaces, index spaces, tables,
indexes or other lists. The resulting list will only contain TABLESPACES,
INDEXSPACES, or both.
– OPTIONS
The OPTIONS utility control statement specifies processing options that are applicable
across many utility executions in a job step. By specifying various options, you can:
• Preview utility control statements
• Override library names for LISTDEF lists or TEMPLATEs
• Specify how to handle errors during list processing
• Alter the return code for warning messages
• Restore all default options
– EXEC SQL
The EXEC SQL utility control statement declares cursors or executes dynamic SQL
statements.
Chapter 1. Introduction 7
– Create conditional restart control record to control the next start of the DB2 subsystem
– Change the VSAM catalog name entry in the BSDS
– Modify the communication record in the BSDS
– Modify the value for the highest-written log RBA value (relative byte address within the
log) or the highest-offloaded RBA value
DSNJU004
The Print Log Map (DSNJU004)utility lists the following information:
– Log data set name, log RBA association, and log LRSN for both primary and
secondary copy of all active and archive log data sets
– Active log data sets that are available for new log data
– Status of all conditional restart control records in the bootstrap data set
– Contents of the queue of checkpoint records in the bootstrap data set
– The communication record of the BSDS, if one exists
– Contents of the quiesce history record
– System and utility timestamps
– Contents of the checkpoint queue
DSN1CHKR
The DSN1CHKR utility verifies the integrity of DB2 directory and catalog table spaces.
DSN1CHKR scans the specified table space for broken links, broken hash chains, and
records that are not part of any link or chain. Use DSN1CHKR on a regular basis to
promptly detect any damage to the catalog and directory.
DSN1COMP
The DSN1COMP utility estimates space savings to be achieved by DB2 data compression
in table spaces. This utility can be run on the following types of data sets containing
uncompressed data:
– DB2 full image copy data sets
– VSAM data sets that contain DB2 table spaces
– Sequential data sets that contain DB2 table spaces (for example,DSN1COPY output)
DSN1COMP does not estimate savings for data sets that contain LOB table spaces or
index spaces.
DSN1COPY
With the DSN1COPY stand-alone utility, you can copy:
– DB2 VSAM data sets to sequential data sets
– DSN1COPY sequential data sets to DB2 VSAM data sets
– DB2 image copy data sets to DB2 VSAM data sets
– DB2 VSAM data sets to other DB2 VSAM data sets
You can copy DSN1COPY sequential data sets to other sequential data sets. Using
DSN1COPY, you can also print hexadecimal dumps of DB2 data sets and databases,
check the validity of data or index pages (including dictionary pages for compressed data),
translate database object identifiers (OBIDs) to enable moving data sets between different
systems, and reset to 0 the log RBA that is recorded in each index page or data page.
DSN1COPY is compatible with LOB table spaces, when you specify the LOB keyword,
and omit the SEGMENT and INLCOPY keywords.
DSN1LOGP
The DSN1LOGP utility formats the contents of the recovery log for display.The two
recovery log report formats are:
8 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
– A detail report of individual log records. This information helps IBM Support Center
personnel analyze the log in detail. (This book does not include a full description of the
detail report.)
– A summary report helps you:
• Perform a conditional restart
• Resolve indoubt threads with a remote site
• Detect problems with data propagation
You can specify the range of the log to process and select criteria within the range to limit
the records in the detail report. For example, you can specify:
– One or more units of recovery identified by URID
– A single database
By specifying a URID and a database, you can display recovery log records that
correspond to the use of one database by a single unit of recovery.
DSN1PRNT
With the DSN1PRNT stand-alone utility, you can print:
– DB2 VSAM data sets that contain table spaces or index spaces (including dictionary
pages for compressed data)
– Image copy data sets
– Sequential data sets that contain DB2 table spaces or index spaces
Using DSN1PRNT, you can print hexadecimal dumps of DB2 data sets and databases.
If you specify the FORMAT option, DSN1PRNT formats the data and indexes for any page
that does not contain an error that would prevent formatting. If DSN1PRNT detects such
an error, it prints an error message just before the page and dumps the page without
formatting. Formatting resumes with the next page. Compressed records are printed in
compressed format. DSN1PRNT is especially useful when you want to identify the
contents of a table space or index. You can run DSN1PRNT on image copy data sets as
well as table spaces and indexes. DSN1PRNT accepts an index image copy as input
when you specify the FULLCOPY option.
DSN1PRNT is compatible with LOB table spaces, when you specify the LOB keyword,
and omit the INLCOPY keyword.
DSN1SDMP
Under the direction of the IBM Support Center, you can use the IFC Selective Dump
(DSN1SDMP) utility to:
– Force dumps when selected DB2 trace events occur.
– Write DB2 trace records to a user-defined MVS data set.
Chapter 1. Introduction 9
Table 1-1 Major changes
Utility DB2 V4 DB2 V5 DB2 V6 DB2 V7
LOAD PI improvements. LOAD and Inline Collect inline statistics. Family Cross Loader,
Path length reduction. COPY (COPYDDN). Building indexes in Online Load Resume,
Optional removal of parallel. Partition Parallelism
work data sets for SORTKEYS also used within a job.
indexes (SORTKEYS). for parallel index build.
RELOAD phase
performance.
PREFORMAT option.
REORG NPI improvements. REORG and Inline Collect inline statistics. Fast switch, BUILD2
Catalog REORG. COPY (COPYDDN). Building indexes in phase parallelism,
Path length reduction. New SHRLEVEL parallel. Drain and Retry.
SORTDATA NONE, REFERENCE, Discard and faster
CHANGE option UNLOAD.
(Online REORG). Faster Online REORG.
Optional removal of Threshold for
work data sets for execution.
indexes (SORTKEYS). SORTKEYS also used
NOSYSREC for parallel index build.
PREFORMAT option.
RECOVER Usage of DFSMS Use inline copies from Fast Log Apply.
CONCURRENT LOAD and REORG. RECOVER INDEX
copies. RECOVER INDEX from copies
RECOVER INDEX UNLOAD phase (vs. REBUILD).
restartability. performance (like Recovery of table
RECOVER INDEX SORTDATA). space and indexes
SYSUT1 optional for with single log scan.
performance choice. PARALLEL
RECOVER.
RUNSTATS Modified hashing New SAMPLE option RUNSTATS executed Statistics History.
technique for CPU to specify % of rows to inline with REORG,
reduction. use for non-indexed LOAD, and RECOVER
column statistics. or Rebuild index.
New KEYCARD option Parallel table space
for correlated key and index.
columns.
New FREQVAL option
for frequent value
statistics with
non-uniform
distribution.
QUIESCE TABLESPACESET
support.
10 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Utility DB2 V4 DB2 V5 DB2 V6 DB2 V7
ALL Partition BSAM striping for work Avoid delete and New packaging,
independence (NPI) data sets redefine of data sets dynamic utility jobs.
from type 2 indexes. (except for COPY).
BSAM I/O buffers.
Type 2 index
performance.
Other functions not included in the table, but worth mentioning, are the support for very large
tables (DSSIZE) and the support for pieces for non-partitioning indexes.
Chapter 1. Introduction 11
Cross Loader
A new extension to the point-in-time utility which allows you to load output from a SELECT
statement. The input to the SELECT can be anywhere within the scope of DRDA connectivity.
These utilities initially provided a basic level of services to allow customers to manage data.
Some customers have preferred to obtain such functions, however, from independent
software vendors that have developed utility and tools offerings that offered additional
performance, function, and features beyond that contained in the DB2 core utilities. With
recent releases of DB2 for OS/390, in response to clear customer demand, IBM has invested
in the improvement of the performance and functional characteristics of these utilities, as we
have seen in the previous section.
With DB2 V7, most of the online utilities have been separated from the base product and they
are now offered as separate products licensed under the IBM Program License Agreement
(IPLA), and the optional associated agreements for Acquisition of Support. This combination
of agreements provides to the users equivalent benefits to the previous traditional ICA license
(see Figure 1-1).
12 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Packaging of Utilities Redbooks
Utilities Suite
(5655-E98)
These products are installed separately from DB2 V7 when accessing user data; however, all
of them are already available within DB2 when accessing the DB2 catalog, directory, or the
sample database. You can install one or both of them, and give them a test run during the
money-back guaranteed 30-day period. Verify the benefits they bring to your database
operations.
The following utilities, and all stand-alone utilities (DSN1...), are considered core utilities, and
are included and activated with DB2 V7: CATMAINT, DIAGNOSE, LISTDEF, OPTIONS,
QUIESCE, REPAIR, REPORT, TEMPLATE, and DSNUTILS.
Chapter 1. Introduction 13
14 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Part 2
In this chapter we introduce the concepts and discuss in detail the use of Wildcards and
Templates, while specific implementation details are provided in the appropriate utility
chapters.
2.2 Wildcards
This section shows how to use the LISTDEF statements to provide Wildcarding, and the
benefits of their use.
DB2 will automatically generate a list of objects that matches the supplied pattern, and pass
the list to one or more specific utilities for processing.
A list-name is allocated to the LISTDEF as part of the control statement. If both the SYSIN
and SYSLISTD are included, and the same list-name is found in both, then the instream
LISTDEF will be used.
Objects can be specified within the LISTDEF at database through to index level. Which
objects are returned depends upon whether table spaces or indexes have been requested.
The various results are shown in Table 2-1.
18 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Table 2-1 Object results table by keyword
Object Type TABLESPACE keyword results INDEXSPACE keyword results
Database All table spaces within database All index spaces within database
Table Table space containing the table All index spaces related to the table
Index Table space containing related table Index space containing the index
List Table spaces from the expanded Index spaces from the expanded
referenced list referenced list
Objects are added to the list via the keyword INCLUDE and removed from the built list via
EXCLUDE. If an EXCLUDE attempts to remove an object that has not been included, then
the EXCLUDE is ignored.
Lists generated by the LISTDEF statement are ordered, but are not sorted, and do not
contain duplicates. If restart is required, then a checkpoint list is used at restart.
Following are examples comparing the use of LISTDEF and the method used prior to DB2
Version 7. The Version 6 JCL (see Example 2-1) will have to be altered for every new object
that is required to be quiesced in database DBA.
Example 2-1 Version 6 and before
.....
//SYSIN DD *
QUIESCE TABLESPACE DBA.X
TABLESPACE DBA.Y
TABLESPACE DBA.Z
As shown in Example 2-3, there is the ability to EXCLUDE objects from the list of objects
passed on to the utility.
Example 2-3 Excluding objects
//SYSIN DD *
LISTDEF DBA INCLUDE TABLESPACE DBA.*
EXCLUDE TABLESPACE DBA.Z
QUIESCE LIST DBA
Further examples are displayed under the utilities that are able to accept LISTDEF
statements. For a full list of utilities that can use LISTDEF, refer to Figure 2-6 on page 31.
Be aware that each LISTDEF containing a wildcard will result in calls to the DB2 Catalog to
satisfy the list. Thus, jobs may run longer than necessary when a large number of LISTDEFs
are present in one job and are not referenced. Therefore, while including all LISTDEFs in
every utility job would appear simpler, care should be taken with this approach.
A LISTDEF statement will only be expanded once; therefore, it can be referenced by many
utilities in the job step without incurring the cost of expansion multiple times.
20 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Here is a simple example of this: Five table spaces and their unique indexes reside in the two
databases DBX and DBY. The partitioned table space PTS1 has an additional non-partitioned
index IXP1NP. After the previous recovery of a table space set to a prior point-in-time, since
the indexes have not been recovered to the same point-in-time, a rebuild of all indexes on all
tables in all table spaces of this table space set must be completed. Normally, if no COPY
enabled indexes are involved, the first INCLUDE clause of the LISTDEF statement shown in
Figure 2-1 contains the statements needed to fulfill this task.
LISTDEF expansion
TS0 PTS1 TS2 TS3 TS4
RI RI RI
U NU U U U
DBX IXP1 IXP1NP IX2 DBY IX3 IX4
//SYSIN DD *
V6 REBUILD INDEX (ALL) TABLESPACE DBY.TS3
REBUILD INDEX (ALL) TABLESPACE DBY.TS4
REBUILD INDEX authid.IXP1 PART 1
REBUILD INDEX authid.IXP1 PART 2
REBUILD INDEX authid.IXP1 PART 3
REBUILD INDEX authid.IXP1NP
REBUILD INDEX (ALL) TABLESPACE DBX.TS2
/*
//SYSIN DD *
V7 LISTDEF RBDLIST
INCLUDE INDEXSPACES TABLESPACE DBY.T* RI
EXCLUDE INDEXSPACE DBX.IXP*
INCLUDE INDEXSPACE DBX.IXP* PARTLEVEL
REBUILD INDEX LIST RBDLIST
/*
An example of the complete recovery job with LISTDEF is listed in Example 2-4.
Example 2-4 Sample LISTDEF - Recovery job
LISTDEF RECLIST INCLUDE TABLESPACE DBY.TS% RI
LISTDEF RBDLIST INCLUDE INDEXSPACES TABLESPACE DBY.T% RI
RECOVER LIST RECLIST TOLOGPOINT X'xxxxxxxxxxxxxxxx'
REBUILD INDEX LIST
RBDLIST
A further example is one where we assume that all partitioned indexes are to be rebuilt, per
partition, and that they reside in database DBY. If all partitioned index space names in DBX
start with ‘IXP’, then Figure 2-1 shows an alternative way, possible with V7. Again, the
advantage is that you do not have to change the DB2 V7 job, if table spaces or indexes are
dropped or added — as long as established naming conventions are followed.
1. An initial catalog lookup is performed to find and list the explicitly specified object or
objects which match the specified pattern.In our example, in the first INCLUDE clause,
table spaces are searched which match the pattern DBY.T*. The two matching table
spaces, DBY.TS3 and DBY.TS4, are listed in the example at the right side of step 1.
2. Related objects are added or removed depending on the presence of the RI, BASE, LOB,
or ALL keywords. Two types of relationships are supported, referential and auxiliary
relationships. More specifically:
– If you specify RI, then the TABLESPACESET process is invoked and referentially
related objects are added to the list. As a result, all referentially connected table
spaces — by their tables — are included in the list. Figure 2-2 shows these four table
spaces.
– If you specify LOB or ALL, then auxiliary related LOB objects are added to the list.
– If you specify BASE or ALL, then auxiliary related base objects are added to the list.
– If you specify LOB, then base objects are excluded from the list.
– If you specify BASE, then LOB objects are excluded from the list.
3. This step consists of two sub-steps:
a. All related objects of the requested type are searched and added to the list.This step
can be skipped if the list built so far already consists of objects of the requested type,
either table spaces or index spaces. Otherwise, the two cases must distinguished:
• If a list of table spaces is required, that is, the type specification is TABLESPACES,
but the initial catalog lookup has been done with another type of object, for
example, with index spaces or indexes, then related table spaces are added to the
list. Obviously, those table spaces are related, in which the base tables reside of the
indexes or index spaces already in the list.
22 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
• If a list of index spaces is required, that is, the type specification is INDEXSPACES,
but the initial catalog lookup has been done with another type of object, for
example, with table spaces or tables, then the related index spaces are added to
the list.
• In Figure 2-2, in the first INCLUDE clause, the four table spaces have five index
spaces all together. These index spaces are added to the list.
b. A filtering process takes place:
• If INCLUDE TABLESPACES is specified, then all objects which are not table spaces
are removed from the list. Analog rules apply for INCLUDE INDEXSPACES,
EXCLUDE TABLESPACES, and EXCLUDE INDEXSPACES.
• The specification of COPY YES or NO is also taken into account in this step.
• In Figure 2-2, in the first INCLUDE clause, INCLUDE INDEXSPACES has been
specified. Therefore, all table spaces are removed from the list. The list now
contains five index spaces only, as you can see in the diagram.
4. If the keyword INCLUDE is specified, the objects of the resulting list are added to the list
derived from the previous INCLUDE or EXCLUDE clauses, if not in the list already. In
other words, an INCLUDE of an object already in the list is ignored; the list contains no
duplicates. If the keyword EXCLUDE is specified, the objects of the resulting list are
removed from the list derived from the previous INCLUDE or EXCLUDE clause. An
EXCLUDE of an object not in the list is ignored. In Figure 2-2, for the first INCLUDE clause
(A), there is no previous list to merge with, therefore, this step is skipped.
All four steps are explained for the first sample INCLUDE clause (A). Next, the EXCLUDE
clause (B) and then the second INCLUDE clause (C) are processed. This is processed in a
similar way as described above:
Step 1 is similar to step 1 (already explained) of the first INCLUDE clause (A).
Step 2 is skipped, as no relationship type was specified, neither RI, ALL, BASE, nor LOB.
Step 3 is skipped, as spec-type INDEXSPACES is assumed if obj-type is INDEXSPACE.
Therefore, there is no need to look for related objects, nor to filter out objects of another
type.
In step 4 for the EXCLUDE clause (B), two objects are removed from the list generated by
the first INCLUDE clause (A).
In step 4 for the second INCLUDE clause (C), four objects are added to the list generated
after the EXCLUDE clause (B).
Note: The purpose of the last example is to conceptually present the resolution of a
LISTDEF statement in order to help with defining LISTDEF statements properly. The
purpose is not to precisely document DB2’s implementation of the sequence of the
expansion steps (which is in fact different) in order to be generally applicable, not only for
the example. DB2 may perform step 2 after step 3, for example, if you specify INCLUDE
TABLESPACES INDEX authid.ix RI. Furthermore the consideration of the PARTLEVEL
keyword may be postponed to the end, that is, after step 3.
A sample of the utility output is shown in Example 2-5, to show the expansion of the LISTDEF.
Group logically related objects into the same LISTDEF to ensure consistency.
Use partitioned data set (PDS) members as input to reduce maintenance across JCL and to
ensure consistency.
Ensure that all databases are referenced by at least one LISTDEF, or are explicitly coded.
Revisit LISTDEFs periodically to ensure the integrity and completeness of the definitions.
Naming standards for user defined objects would assist in this process and allow for even
less maintenance of LISTDEF.
Use in conjunction with Templates. See 2.3, “Templates” on page 25 for the full benefits.
24 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
2.2.7 Current restrictions in the use of Wildcards
The use of Wildcarding cannot be used against the catalog or directory objects. If either of
these are required in a LISTDEF statement, then these have to be explicitly coded.
Unless the OPTION statement is used, if one object fails due to restriction, then all objects in
the list are not processed. By using OPTION EVENT(ITEMERROR, SKIP) this can be
avoided.
2.3 Templates
Templates are a method of defining data set masks that are to be used by utilities to
dynamically allocate data sets required for successful execution of the utility. They can be
used for work data sets as well as utility data sets, for instance, image copy data sets. When
used in conjunction with LISTDEF, this provides a powerful mechanism for executing utilities,
allows faster development of job streams, and requires fewer modifications when the
underlying list of database objects changes.
Templates allow the writing of data sets to both disk and tape units and incorporates all the
features normally coded within the DD statement. This also allows for the stacking of data
sets on tapes automatically. The Template statement executes entirely in the UTILINIT phase
in preparation for the subsequent utility. Output from the Template control statement is a
dynamic allocation Template with an assigned name for later reference.
The use of Templates also introduces the concept of automatic intelligent data set sizing. For
this to be efficient, RUNSTATS have to be current.
Templates are initiated through the use of the TEMPLATE keyword in the utility job stream
before the utility commands. Like LISTDEF, the Templates can either be coded within the
SYSIN ddname, or in a data set allocated to the job stream via the default SYSTEMPL
ddname. This is the default and can be overridden by using the OPTIONS keyword.
The minimum requirement for a Template statement is a name, for reference by the utility
command, and the data set naming mask. If no sizing information is supplied, then DB2
calculates the space required for the data set. If the Template supplies any other parameters,
then these parameters take precedence; see Example 2-6 and Example 2-7.
Templates can be used to write to either disk or tape. Support is included for stacking multiple
files onto a single tape via the STACK option; see Example 2-8. Support for DF/SMS is also
included.
There can be more than one Template per job stream (see Example 2-9). Each one must
have a unique name, and a utility can reference more than more than one Template, if that
utility supports multiple output ddnames, for example, COPY.
26 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If the DISPOSITION option is not specified in the Template definition, DB2 will use the
disposition defaults for the utility concerned. DB2 will also take into account any restart
implications. For an example using REORG, see Figure 2-3.
D IS P O S IT IO N w ith th e T E M P L A T E
E x a m p le : R E O R G
s ta tu s n o rm a l te rm in a tio n a b n o rm a l te rm in a tio n
Further examples of Templates are given under the relevant utility sections.
Note: The DISP field in TEMPLATES does NOT support the typical default MVS syntax,
such as DISP=(,,KEEP). In Templates, ALL three parameters must be specified.
The use of Templates allows for the standardization of data set names across the DB2
instance, and allows easy identification of the data set type created, if the relevant variables
are used in construction of the name, for example, &IC for image copy.
A full list of the currently allowable variables that can be used within the Template statement is
provided in Table 2-3.
PART JDATE or JU
(five digits) (Julian date yyyyddd)
LIST JDAY
(name of the list) (ddd)
SEQ TIME
(sequence number of (hhmmss)
the object in the list)
HOUR
MINUTE
SECOND or SC
The input values used in these formulas mostly come from the DB2 catalog tables. The
high-used RBA is read at open time and it is maintained by the buffer manager. It is the most
current value, updated before it is even written to the ICF catalog.
All these formulas are documented in the standard DB2 manuals. Figure 2-4 is an example of
the specific formulas for the data sets for the REORG utility, in cases where you use:
REORG UNLOAD CONTINUE SORTDATA KEEPDICTIONARY SHRLEVEL NONE
SHRLEVEL REFERENCE
28 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Space allocation with the TEMPLATE
If the SPACE option is not specified in the TEMPLATE
definition, DB2 will calculate the space needed.
Figure 2-4 Automatic data set sizing for the REORG utility
Use variables to make data set names meaningful; for example, use &LR and &IC to identify
image copy data sets.
Define Templates twice, once for tape (where applicable) and once for disk, to allow for
quicker interchanging between devices.
Note: Ensure that the PTF for APAR PQ45835 has been applied to fully enable the STACK
option feature.
Use PREVIEW mode to review the output from LISTDEF and Templates.
Use the GDG option to ensure that GDGs are utilized consistently.
Data sets whose size is calculated as too large for the DYNALLOC interface must be
allocated using DD cards. Message DSNU1034I is issued if this condition is met.
Notes:
Apply the PTF for PQ45835 for problems with COPY when using LISTDEF and
TEMPLATE.
Apply the PTF for PQ50223 for problems when loading partitions using TEMPLATE.
For example, primary and backup copies are required per partition for all partitioned table
spaces in the database DBX whose names start with PT. This leads to the LISTDEF
statement shown in Figure 2-5. Not knowing how many table spaces exist which match that
list definition, and not knowing how many partitions these table spaces have, the Template is
used for dynamically allocating the needed copy data sets. As the Template includes the
variable &PRIBAC, this Template can be used in place of both DD names, the one for the
primary copy and the one for the backup copy.
At job execution, DB2 first processes the LISTDEF statement, then the Template definition is
read and then stored. When DB2 has read the COPY statement, the list COPYLIST is
generated and DB2 copies each item on the list, assuming that there are no restrictions. At
this time, DB2 knows all the details for the variables and can substitute them into the
Template variables stored previously.
With DB2 V6, data set allocations are done at the beginning of the job step. With Templates
and lists in V7, the dynamic allocations are executed when the utility is processing the objects
for which the allocation is needed.
If several executions of CHECK INDEX, REBUILD INDEX, RUNSTATS INDEX utilities are
invoked for multiple indexes for a list of objects, performance can be improved. For instance,
in the case of two table spaces with three indexes each, consider this coding for the
LISTDEF:
LISTDEF myindexes INCLUDE INDEXSPACES TABLESPACE db1.ts1
INCLUDE INDEXSPACES db2.ts2
DB2 generates CHECK INDEX statements for all indexes of the same table space as follows:
CHECK INDEX db1.ix1,db1.ix2,db1ix3
CHECK INDEX db21.ix1,db2.ix2,db2ix3
Thus the table space scans are reduced to two instead of a possible six.
30 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
T E M P L A T E a n d L IS T D E F c o m b in e d
//C O P Y P R I1 D D D S N = D B X .P T S 1 .P 0 0 0 0 1 .P .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
V6 //C O P Y P R I2 D D D S N = D B X .P T S 1 .P 0 0 0 0 2 .P .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
//C O P Y P R I3 D D D S N = D B X .P T S 1 .P 0 0 0 0 3 .P .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
//C O P Y S E C 1 D D D S N = D B X .P T S 1 .P 0 0 0 0 1 .B .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
//C O P Y S E C 2 D D D S N = D B X .P T S 1 .P 0 0 0 0 2 .B .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
//C O P Y S E C 3 D D D S N = D B X .P T S 1 .P 0 0 0 0 3 .B .D 2 0 0 0 1 9 9 ,
// D S IP = ...,U N IT = ...,S P A C E = ...
//S Y S IN DD *
C O P Y TA B L E S P A C E D B X .P T S 1 D S N U M 1 C O P Y D D N (C O P Y P R I1 ,C O P Y S E C 1 )
C O P Y TA B L E S P A C E D B X .P T S 1 D S N U M 2 C O P Y D D N (C O P Y P R I2 ,C O P Y S E C 2 )
C O P Y TA B L E S P A C E D B X .P T S 1 D S N U M 3 C O P Y D D N (C O P Y P R I3 ,C O P Y S E C 3 )
/*
//* n o n e o f th e D D s ta te m e n ts a b o v e
V7 //S Y S IN DD *
L IS T D E F C O P Y L IS T IN C L U D E TA B L E S P A C E D B X .P T * PARTLEVEL
TEM PLATE CO PYTM PL D S N ( & D B ..& T S ..P & P A R T..& P R IB A C ..D & J D A T E . )
C O P Y L IS T C O P Y L IS T C O P Y D D N ( C O P Y T M P L ,C O P Y T M P L )
/*
Figure 2-5 TEMPLATE and LISTDEF combined (V6 shown for comparison)
The use of LISTDEF and Templates go hand-in-hand. The ability to generate lists gives rise
to the need for dynamic allocation, therefore, the use of both together is recommended,
where their use is supported by the utility (see Figure 2-6). However, this does not mean that
you cannot use them independently. For example, if naming standards do not lend
themselves to the use of LISTDEF, then Templates can still be used to provide dynamic
allocations. Example 2-10 provides an example using the new UNLOAD utility.
Example 2-10 Using LISTDEF and Templates together
TEMPLATE ULDDDN
DSN(DB2V710G.&DB..&TS..UNLOAD)
UNIT(SYSDA) SPACE(45,45) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
TEMPLATE PNHDDN
DSN(DB2V710G.&DB..&TS..PUNCH)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
UNLOAD TABLESPACE U7G01T11.TSCUST
FROMCOPY DB2V710G.U7G01T11.TSCUST.D2001145.T022853L
PUNCHDDN PNHDDN UNLDDN ULDDDN
FROM TABLE PAOLOR1.CUSTOMER
(C_CUSTKEY,C_NAME,C_ADDRESS)
WHEN ( C_CUSTKEY > 1000 )
The benefits of using LISTDEF and Templates in this situation is that data sets are only
allocated at the time that they are required, therefore in the above scenario, there are no
extraneous data sets to delete. If Templates are not being used, then it is the users
responsibility to ensure that the data sets are managed correctly.
32 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Default values for disposition vary depending upon the utility and whether the utility is
restarting. Default values are shown in Table 2-4 and Table 2-5.
Table 2-4 Data dispositions for dynamically allocated data sets
ddname CHECK CHECK COPY LOAD MERGE REBUILD REORG REORG UNLOAD
DATA INDEX OR COPY INDEX INDEX TABLE
CHECK SPACE
LOB
SYSREC Ignored Ignored Ignored OLD, Ignored Ignored Ignored NEW, NEW,
KEEP, CATLG, CATLG,
KEEP CATLG CATLG
SYSDISC Ignored Ignored Ignored NEW, Ignored Ignored Ignored NEW, Ignored
CATLG, CATLG,
CATLG CATLG
SYSPUNCH Ignored Ignored Ignored Ignored Ignored Ignored Ignored NEW, NEW,
CATLG, CATLG,
CATLG CATLG
SYSCOPY Ignored Ignored NEW, NEW, NEW, Ignored Ignored NEW, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSCOPY2 Ignored Ignored NEW, NEW, NEW, Ignored Ignored NEW, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSRCPY1 Ignored Ignored NEW, NEW, NEW, Ignored Ignored NEW, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSRCPY2 Ignored Ignored NEW, NEW, NEW, Ignored Ignored NEW, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSUT1 NEW, NEW, Ignored NEW, Ignored NEW, NEW, NEW, Ignored
DELETE, DELETE, DELETE, DELETE, DELETE, DELETE,
CATLG CATLG CATLG CATLG CATLG CATLG
SORTOUT NEW, Ignored Ignored NEW, Ignored Ignored NEW, NEW, Ignored
DELETE, DELETE, DELETE, DELETE,
CATLG CATLG CATLG CATLG
SYSMAP Ignored Ignored Ignored NEW, Ignored Ignored Ignored Ignored Ignored
CATLG,
CATLG
SYSERR NEW, Ignored Ignored NEW, Ignored Ignored Ignored Ignored Ignored
CATLG, CATLG,
CATLG CATLG
SYSREC Ignored Ignored Ignored OLD, Ignored Ignored Ignored MOD, MOD,
KEEP, CATLG, CATLG,
KEEP CATLG CATLG
SYSDISC Ignored Ignored Ignored MOD, Ignored Ignored Ignored MOD, Ignored
CATLG, CATLG,
CATLG CATLG
SYSPUNCH Ignored Ignored Ignored Ignored Ignored Ignored Ignored MOD, MOD,
CATLG, CATLG,
CATLG CATLG
SYSCOPY Ignored Ignored MOD, MOD, MOD, Ignored Ignored MOD, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSCOPY2 Ignored Ignored MOD, MOD, MOD, Ignored Ignored MOD, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSRCPY1 Ignored Ignored MOD, MOD, MOD, Ignored Ignored MOD, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSRCPY2 Ignored Ignored MOD, MOD, MOD, Ignored Ignored MOD, Ignored
CATLG, CATLG, CATLG, CATLG,
CATLG CATLG CATLG CATLG
SYSUT1 MOD, MOD, Ignored MOD, Ignored MOD, MOD, MOD, Ignored
DELETE, DELETE, DELETE, DELETE, CATLG, DELETE,
CATLG CATLG CATLG CATLG CATLG CATLG
SORTOUT MOD, Ignored Ignored MOD, Ignored Ignored MOD, MOD, Ignored
DELETE, DELETE, DELETE, DELETE,
CATLG CATLG CATLG CATLG
SYSMAP Ignored Ignored Ignored MOD, Ignored Ignored Ignored Ignored Ignored
CATLG,
CATLG
SYSERR MOD, Ignored Ignored MOD, Ignored Ignored Ignored Ignored Ignored
CATLG, CATLG,
CATLG CATLG
34 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
2.5 OPTIONS
The utility control statement OPTIONS is provided to complement the functionality offered by
LISTDEF and Templates. OPTIONS provide a general mechanism to enter options that affect
utility processing across multiple utility executions in a job step. Figure 2-7 illustrates the
functions that can be utilized by specifying the respective keywords in the OPTIONS
statement. OPTIONS should be coded within the SYSIN in front of the utility control
statements to which the options apply. Any subsequent OPTIONS statements entirely
supersede any prior statements.
and:
Utility activation key KEY
Example: OPTIONS LISTDEFDD MYLISTS EVENT ( ITEMERROR, SKIP, WARNING, RC8 )
COPY LIST COPYLIST COPYDDN ( COPYTMPL, COPYTMPL )
Reset rule: An OPTIONS statement replaces any prior OPTIONS statement
OPTIONS(PREVIEW) does not check the validity of the TEMPLATE data set names, their
syntax, or whether the data sets names are unique.
For a preview example, see the previous job output in Example 2-5 on page 24 showing the
LISTDEF expansion.
PREVIEW can also be initiated through DSNUTILS, and the use of “ANY” suppresses all
dynamic allocations done by DSNUTILS. Using PREVIEW, this list can be saved before the
actual utility is run, and if there is an error, the job output will show the list item where it failed.
Since DSNUTILS does not provide for storing the reduced list, this saved list can be edited by
removing the already processed items and starting again with this reduced list.
DSNUPROC
EXEC
DSNUPROC,SYSTEM=ssnn,UID=utility-id,UTPROC=restart-parm
'PREVIEW'
DSNUTILB
EXEC
PGM=DSNUTILB,PARM='ssnn,utility-id,restart-parm'
DSNUTILS
CALL DSNUTILS
(utility-id,restart-parm,utstmt,retcode,utility-name,....)
'PREVIEW' 'ANY'
36 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Restoring defaults can be accomplished as follows:
OPTIONS OFF
Note: No other keywords may be specified with the OPTIONS OFF setting.
As usual for utility control statements, parentheses are required for a list, in contrast to a
single item, to be specified.
In this chapter we will focus on parallel executions, which is one of the techniques that DB2
uses to shorten the elapsed times of utilities.
During the three last versions of DB2, different forms of parallelism have been introduced.
Consider the following:
Inline executions, which is the execution of different types of utilities in parallel
subtasks instead of executing them serially. Examples are COPY and RUNSTATS
running embedded and in parallel with LOAD and REORG utilities.
Execution of one type of utility on different DB2 objects in parallel subtasks. Examples
are COPY/RECOVER of a list of table spaces in parallel or rebuilding the different indexes
of a table space in parallel during LOAD or REORG TABLESPACE.
Execution of a utility on different partitions of a partitioned table space, or parts of a
partitioned index in parallel subtasks. Examples are rebuilding the parts of a partitioned
index in parallel, unloading the partitions of a partitioned table space in parallel, and
loading the partitions of a partitioned table space in parallel.
Before, some of this parallelism could be achieved by the users by submitting different jobs in
parallel, sometimes ending up with considerable contentions. Today, all these parallel
executions are done in one job, using parallel subtasks.
Please note that we cover only those aspects which are common for the different utilities.
For more details and examples, we refer to the chapters on the individual utilities. Some
performance figures are shown to give an idea what reduction in elapsed times can be
expected by using parallelism in DB2 utilities.
Before DB2 V5, when executing utilities, such as REORG or LOAD of a table space, without
logging, the DB2 object is left in a restrictive state, COPYPEND, and an image copy is
required to make the object fully available to the applications. To ensure that the correct
access paths are chosen, and the other statistics are current, the RUNSTATS utility is also
recommended to be run to update the statistics of the table. These utilities added to the time
and resource (CPU, I/O) required to carry out the original utility. DB2 addressed these issues
with the introduction of the Inline COPY and RUNSTATS functionality. These enhancements
ensured, when the original utility completed, that the object was fully available to the
application, and that the statistics were current.
Starting with Version 5, DB2 introduced the ability to take image copies of a table space in
parallel with some utilities. With Version 6 this ability was extended to include the gathering of
statistics. These features are known as “inline” utilities, and are run as subphases of the
originating utility.
40 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 3-2 shows the Inline COPY at partition level.
Note: If COPYDDN is specified without the REPLACE option, the LOAD is terminated.
Although DB2 V6 introduced image copies of indexes (see 10.1.2, “Copying indexes” on
page 232), DB2 does not support the concept of Inline COPY of an index. Not having an
image copy of an index defined with the COPY YES attribute will put the index in the ICOPY
state, but in contrast with the COPYP status, this is not a restrictive state.
The inline image copy data sets are not exactly the same as the data sets produced by the
COPY utility:
Logically they are the same, except that:
– Data pages may be out of sequence and/or repeated. If repeated, the last occurrence
is the correct page.
– Space map pages may be out of sequence and repeated.
– Compression dictionary, if built, occurs twice. The second one is the correct one.
– Additional space may be required due to the duplication of pages that occurs because
the image copy data set contains the full copy plus all the incremental copies.
SYSCOPY is updated with ICTYPE = F and SHRLEVEL = R. The originating utility can be
found in STYPE (R for LOAD REPLACE LOG YES, S for LOAD REPLACE LOG NO,
W for REORG LOG NO, X for REORG LOG YES). If STYPE is blank or contains another
value, the image copy is not an Inline COPY.
DSN1COPY and DSN1PRNT require extra options to process inline copies. See “Using
inline copies with DSN1COPY” on page 45 and “Using inline copies with DSN1PRNT” on
page 46.
Inline COPY data sets from the LOAD utility are NOT recommended for use with
RECOVER TOCOPY, because they are taken during the LOAD phase before any
DISCARD phases; therefore copies may contain unique index and/or RI violations. They
can be used in point-in-time recovery. If a RECOVERY TOCOPY is executed, then all
indexes are placed in recovery pending status, and dependent table spaces, if any, are
placed in check pending status. RECOVER INDEX and CHECK DATA must be run.
The benefits of taking an inline image copy is that the elapsed time is reduced when
compared with running the utility followed by a copy, and that the time unavailable to the
applications is also shorter. Table 3-1 shows the performance measurements of LOAD and
REORG with Inline COPY on a DB2 table with two indexes, one partitioning index and one
non-partitioning index. The figures are taken from the introduction of Inline COPY as at
Version 5 DB2.
CPU Elap CPU Elap CPU Elap CPU Elap CPU Elap
Time Time Time Time Time Time Time Time Time Time
LOAD 926 1067 955 1302 954 1086 +3 +1.8 -0.1 -16.6
REORG 829 1262 858 1497 856 1257 +3.3 -0.4 -0.2 -16.0
Details are given in percent, comparing utility with Inline COPY figures to:
The utility alone (Delta alone)
The utility and COPY run as two separate jobs (Delta +Copy)
Full Image copy on the same table space took 29 seconds CPU time and 235 seconds
elapsed.
In the + COPY column, the full image copy numbers are added to the utility’s figures to
show the entire duration for the two jobs
These figures show that considerable savings can be made in elapsed time, while consuming
virtually the same amount of CPU time, by the use of Inline COPY.
Two other new columns in SYSCOPY are JOBNAME and AUTHID, which together will enable
the originating job of a utility to be traced. In Example 3-3, you can see the SYSCOPY entry
for the originating LOAD REPLACE utility (ICTYPE = ‘R’) and the corresponding Inline COPY
(ICTYPE=’F’). The Inline COPY will have STYPE=’R’.
42 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Inline copies produced during the LOAD phase of the LOAD utility can contain data that was
removed afterwards during the INDEXVAL and ENFORCE phases (duplicates and RI
violating rows).
During the LOG phase of REORG, incremental image copies are taken. These incremental
copies are appended to the image copy data set created during the RELOAD phase. The full
image copy, which results from a successful Online REORG, will probably contain duplicate
pages, and therefore it will be bigger than other full image copies of this table space. The later
pages will contain the most recent changes.
The DB2 RECOVER utility will take this into account and apply the most recent pages, but
there are some restrictions regarding the stand-alone utilities DSN1COPY and DSN1PRNT.
In an Inline COPY taken by the REORG utility, there is a full image copy and multiple
incremental copies in the data set. RECOVER process the data pages in order, and any
duplicate pages are replaced as the utility progresses. As the incremental image copies
contain any change pages, these are used to replace the corresponding pages in the table
spaces. Thus, at the end the table, spaces are correctly recovered.
For an Inline COPY taken with the LOAD REPLACE LOG NO option, the image copy is taken
during the LOAD phase of the utility. This means that any rows discarded during the
INDEXVAL or ENFORCE stage are included in the copy. Recovery to a point-in-time applies
the image copy and then has to process the discarded rows. See Figure 3-1 for an example
timeline.
If the Inline COPY is taken at time T1, and then discard processing removes a row at time T2,
RECOVER ensures that if a recovery to time T3 is required, then the row discarded is not
present when the recovery is complete. The method of ensuring data integrity is that the
discard processing is logged even when LOG NO is specified. This ensures that data integrity
is not compromised. See Example 3-4 for an example of a LOAD. After this LOAD was
completed, a REPORT RECOVERY was run to show that logging had taken place between
the image copy and the QUIESCE point. This logging is due to the discard processing being
logged; see Example 3-5. This report shows that there was logging between the copy being
taken and the QUIESCE point.
T0 T1 T2 T3
LOAD LOAD INDEXVAL/ Recovery
REPLACE phase ends
LOG NO ENFORCE to this point
image copy
with inline taken discard 1 required
copy starts row
Example 3-4 LOAD REPLACE LOG NO with Inline COPY and discards
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = TEMP
0DSNU050I DSNUGUTC - LOAD REPLACE COPYDDN(SYSCOPY) SORTKEYS SORTDEVT SYSDA SORTNUM 3 LOG NO
DSNU650I ( DSNURWI - INTO TABLE SYSADM.TABEMP
DSNU350I ( DSNURRST - EXISTING RECORDS DELETED FROM TABLESPACE
DSNU400I DSNURBID - COPY PROCESSED FOR TABLESPACE DB1.TABEMP
NUMBER OF PAGES=16
AVERAGE PERCENT FREE SPACE PER PAGE = 25.87
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=00:00:16
DSNU304I ( DSNURWT - (RE)LOAD PHASE STATISTICS - NUMBER OF RECORDS=9 FOR TABLE SYSADM.TABEMP
DSNU302I DSNURILD - (RE)LOAD PHASE STATISTICS - NUMBER OF INPUT RECORDS PROCESSED=9
DSNU300I DSNURILD - (RE)LOAD PHASE COMPLETE, ELAPSED TIME=00:00:18
DSNU042I DSNUGSOR - SORT PHASE STATISTICS -
NUMBER OF RECORDS=9
ELAPSED TIME=00:00:00
DSNU345I DSNURBXD - UNIQUE INDEX KEY DUPLICATES KEY FROM INPUT RECORD
'1' LOADED AT RID X'0000000201',
INDEX = SYSADM.TABI,
TABLE = SYSADM.TABEMP,
RECNO = 9,
RID = X'0000000203'
DSNU348I ( DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=1 FOR INDEX SYSADM.TABI PART 1
DSNU348I ( DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=4 FOR INDEX SYSADM.TABI PART 2
DSNU348I ( DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=1 FOR INDEX SYSADM.TABI PART 3
DSNU348I ( DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=1 FOR INDEX SYSADM.TABI PART 4
DSNU258I DSNURBXD - BUILD PHASE STATISTICS - NUMBER OF INDEXES=1
DSNU343I DSNURBXD - BUILD PHASE STATISTICS - 2 DUPLICATE KEY ERRORS ENCOUNTERED
DSNU259I DSNURBXD - BUILD PHASE COMPLETE, ELAPSED TIME=00:00:00
DSNU428I DSNURELD - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE DB1.TABEMP
DSNU355I DSNURVIX - INDEXVAL PHASE STATISTICS - 2 DUPLICATE KEY ERRORS CORRECTED BY DELETING 2 DATA ROWS
DSNU356I DSNURVIX - INDEXVAL PHASE COMPLETE, ELAPSED TIME=00:00:00
DSNU399I DSNURRIS - LOAD UTILITY ERROR SUMMARY REPORT
44 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 3-5 REPORT RECOVERY showing logging during discard processing
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = TEMP
0DSNU050I DSNUGUTC - REPORT RECOVERY TABLESPACE DB1.TABEMP DSNUM ALL
DSNU581I ( DSNUPREC - REPORT RECOVERY TABLESPACE DB1.TABEMP
DSNU593I ( DSNUPREC - REPORT RECOVERY ENVIRONMENT RECORD:
' MINIMUM RBA: 000000000000
' MAXIMUM RBA: FFFFFFFFFFFF
' MIGRATING RBA: 000000000000
DSNU582I ( DSNUPPCP - REPORT RECOVERY TABLESPACE DB1.TABEMP SYSCOPY ROWS
TIMESTAMP = 2001-06-06-18.35.39.288433, IC TYPE = *S*, SHR LVL = , DSNUM = 0000, START LRSN =000001F433B3
DEV TYPE = , IC BACK = , STYPE = , FILE SEQ = 0000, PIT LRSN = 000000000000
LOW DSNUM = 0000, HIGH DSNUM = 0000
JOBNAME = SYSADM1 , AUTHID = SYSADM , COPYPAGESF = -1.0E+00
NPAGESF = -1.0E+00 , CPAGESF = -1.0E+00
DSNAME = ROY.ROYEMP , MEMBER NAME =
TIMESTAMP = 2001-06-06-18.35.42.362598, IC TYPE = F , SHR LVL = R, DSNUM = 0000, START LRSN =000001F53BA6
DEV TYPE = 3380 , IC BACK = , STYPE = S, FILE SEQ = 0000, PIT LRSN = 000000000000
LOW DSNUM = 0000, HIGH DSNUM = 0000
JOBNAME = SYSADM1 , AUTHID = SYSADM , COPYPAGESF = 1.6E+01
NPAGESF = 4.8E+01 , CPAGESF = 4.8E+01
DSNAME = SYSADM.COPY.G0017V00 , MEMBER NAME =
TIMESTAMP = 2001-06-06-18.35.45.133665, IC TYPE = *Q*, SHR LVL = , DSNUM = 0000, START LRSN =000001F648C8
DEV TYPE = , IC BACK = , STYPE = W, FILE SEQ = 0000, PIT LRSN = 000000000000
LOW DSNUM = 0000, HIGH DSNUM = 0000
JOBNAME = SYSADM1 , AUTHID = SYSADM , COPYPAGESF = -1.0E+00
NPAGESF = -1.0E+00 , CPAGESF = -1.0E+00
DSNAME = ROY.ROYEMP , MEMBER NAME =
DSNU583I ( DSNUPPLR - SYSLGRNX ROWS FROM REPORT RECOVERY FOR TABLESPACE DB1.TABEMP
UCDATE UCTIME START RBA STOP RBA START LRSN STOP LRSN PARTITION MEMBER ID
060601 18351187 000001F27B30 000001F2CCF9 000001F27B30 000001F2CCF9 0001 0000
060601 18351192 000001F29E35 000001F2CCF9 000001F29E35 000001F2CCF9 0002 0000
060601 18351195 000001F2B374 000001F2CCF9 000001F2B374 000001F2CCF9 0003 0000
060601 18351198 000001F2C850 000001F2CCF9 000001F2C850 000001F2CCF9 0004 0000
060601 18351259 000001F319AF 000001F35000 000001F319AF 000001F35000 0001 0000
060601 18351263 000001F31D23 000001F35000 000001F31D23 000001F35000 0002 0000
060601 18351264 000001F32094 000001F35000 000001F32094 000001F35000 0003 0000
060601 18351266 000001F323CC 000001F35000 000001F323CC 000001F35000 0004 0000
060601 18353047 000001F4866C 000001F4866C 000001F4866C 000001F4866C 0001 0000
060601 18353048 000001F4866C 000001F4866C 000001F4866C 000001F4866C 0002 0000
060601 18353049 000001F4866C 000001F4866C 000001F4866C 000001F4866C 0003 0000
060601 18353052 000001F4866C 000001F4866C 000001F4866C 000001F4866C 0004 0000
060601 18353937 000001F4E7D3 000001F4E7D3 000001F4E7D3 000001F4E7D3 0001 0000
060601 18353942 000001F4EA89 000001F4EA89 000001F4EA89 000001F4EA89 0002 0000
060601 18353946 000001F4ECFD 000001F4ECFD 000001F4ECFD 000001F4ECFD 0003 0000
060601 18353956 000001F4EF71 000001F4EF71 000001F4EF71 000001F4EF71 0004 0000
060601 18354201 000001F5320A 000001F5320A 000001F5320A 000001F5320A 0001 0000
060601 18354212 000001F5347E 000001F5347E 000001F5347E 000001F5347E 0002 0000
060601 18354221 000001F536F2 000001F536F2 000001F536F2 000001F536F2 0003 0000
060601 18354235 000001F53966 000001F53966 000001F53966 000001F53966 0004 0000
060601 18354281 000001F56736 000001F5AEE8 000001F56736 000001F5AEE8 0001 0000
To avoid this problem, a new parameter has been added to DSN1COPY, INLCOPY, which
specifies that the input is an Inline COPY.
As with DSN1COPY, a new parameter has been added to DSN1PRNT, INLCOPY, which
specifies that the input is an Inline COPY.
The main objective of inline statistics is to reduce the total elapsed time of a REORG, LOAD
or RECOVER/REBUILD index followed by statistics gathering. This is achieved by the
elimination of additional scans of the data for gathering the statistics. By having the statistics
gathering function within the utility job, administrative and scheduling procedures can be
simplified. The collection is initiated by the inclusion of the STATISTICS keyword, and all
RUNSTATS options are supported, including the HISTORY functionality. See Chapter 11,
“Gathering statistics” on page 253, introduced in DB2 Version 7.
46 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
These are some restrictions for inline statistics gathering:
LOAD with Inline statistics is only valid for REPLACE or RESUME NO options.
A utility working on a part or part range of a table space cannot collect statistics on NPIs.
Table space statistics collected during REORG TABLESPACE SHRLEVEL CHANGE do
not reflect the changes to the originating table space after the UNLOAD phase. If a large
number of updates are expected during the Log Apply phase, and accurate statistics are
required, then it is better to run a RUNSTATS afterwards.
Index statistics gathered during REORG INDEX SHRLEVEL CHANGE do not reflect the
changes after the BUILD phase.
Inline statistics cannot be used during a REORG of the directory or catalog table spaces
containing links.
Utilities restarted using RESTART(CURRENT) may not have inline statistics gathered.
Using RESTART(PHASE) allows statistic gathering.
Rows discarded during the LOAD process may or may not be reflected in the statistics. If a
large number of discards are expected and accurate statistics are required, then the use
of inline statistics is not recommended. A RUNSTATS utility should be run afterwards
instead.
Performance measurements
The LOAD and REORG utilities were run on a V6 system against a 5-million-row table with 10
partitions, each partition containing approximately 500,000 rows with 26 columns and a row
length of 118 bytes. The table has two indexes, the partitioning index and a non-partitioning
index. See DB2 UDB for OS/390 Version 6 Performance Topics, SG24-5351-00, for more
details.
The LOAD and the REORG are run followed by the RUNSTATS utility and compared against
the same utility with inline statistics specified.
Note: All times are in seconds; bracketed figures are elapsed times for the LOAD utility
As shown, there is a 34% reduction in the elapsed time using inline statistics. Notice also that
the elapsed time of the LOADs are very similar, 603 seconds versus 597 seconds; this is due
to the elimination of an additional scan of the data for the statistics collection. Also worthwhile
noting is that this is not at the expense of a significant increase in CPU time, which is almost
stable.
Note: All times are in seconds; bracketed figures are elapsed times for the REORG utility
As shown, a 23% reduction in elapsed time is achieved. Again, also worth noting is the
similarity of the elapsed times for the REORGs, 993 seconds versus 966 seconds; this is
again due to the elimination of additional scans of the data. As with LOAD, there is no
significant difference in the amount of CPU that is consumed.
As previously possible with RECOVER, DB2 V6 introduced the ability to also COPY a group
of objects in one COPY statement. This is done by specifying a LIST of table spaces, index
spaces, or indexes after the COPY/RECOVER statement. With DB2 V7 you can also specify
a previously defined listdef-name. One of the benefits of the list processing is to create a
consistent backup for the objects in the list:
48 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
With COPY SHRLEVEL REFERENCE, DB2 will first drain all writers on all objects of the
list before starting the actual copying. All image copies will have the same RBA/LRSN
value in SYSIBM.SYSCOPY, and the copy set results in a good recovery point if you need
to PIT RECOVER these objects back to this RBA/LRSN.
With COPY SHRLEVEL CHANGE, DB2 will drain the writers object by object, and the
RBA/LRSN values will be different for each object. The set will not be consistent for PIT
recovery.
Furthermore you can reduce the elapsed time for COPY and RECOVER when you specify a
list of objects, by processing the objects in the list in parallel:
You do this by specifying the number of objects to process in parallel with the new option
PARALLEL (num-objects).
C op y P arallel (3)
O b ject 1
O b ject 2
O b ject 3
O b ject 4
O b ject 5
P ip e P ip e
P ip e
W rite W rite
Copy Copy C op y
W rite
S u b task 1 S u b task 2
D atasets D atasets D atasets S u btask 3
In this example, COPY creates 3 subtask sets. Each subtask set consists of two
subtasks, one for reading the table/index and one for writing to the image copies. The
objects 1, 2, and 3 are assigned to subtask set 1, 2, and 3, respectively. Object 3 finishes
first, so object 4 is assigned to subtask set 3. Object 1 finishes next, and object 5 is
assigned to subtask set 1. While the read subtask is putting pages into a pipe, the write
subtask is pulling them out and writing them to the output data set.
In this example, RECOVER creates 3 subtask sets. A subtask set consists of two subtasks,
one for reading the image copies and one for writing to the object space. Objects 1, 2, and 3
are assigned to subtask set 1, 2, and 3, respectively. Object 3 finishes first, so object 4 is
assigned to subtask set 3. Then, object 1 finishes next and object 5 is assigned to subtask set
1. While the read subtask is putting pages into the pipe, the write subtask is pulling them out
and writing them to the object space(s).
Parallelism is only achieved in the RESTORE phase. The Log Apply processing does not
begin until all objects have been restored. During the Log Apply processing, the log will only
be scanned once, and recovery will use the fast Log Apply feature, as explained in 9.5, “Fast
Log Apply” on page 217.
50 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
3.2.4 Performance measurements
The COPY and RECOVER utilities were run on a V6 system against a 5-million-row table with
10 partitions, each partition containing approximately 500,000 rows with 26 columns and a
row length of 118 bytes. See DB2 UDB for OS/390 Version 6 Performance Topics,
SG24-5351-00 for more details.
11 234 12 58 0% -75%
There is a 75% elapsed time reduction when using the keyword PARALLEL. The number of
objects that are copied in parallel was 10. Due to I/O contention on the underlying table
spaces and image copy data sets, a 10 times improvement of elapsed times was not
achieved. There was not a significant difference in CPU times.
14 249 14 71 0% -71%
A 71% elapsed time reduction is achieved by using the PARALLEL keyword. All 10 partitions
were restored in parallel. Due to I/O contention on the underlying table spaces and image
copy data sets, a 10 times improvement of elapsed times was not achieved. There was not a
significant difference in CPU times.
Note: In this chapter, REORG means REORG TABLESPACE unless otherwise specified.
PIB was introduced in DB2 V6 with the use of the SORTKEYS option. Because SORTKEYS
already existed in DB2 V5, we will first try to explain the difference between SORTKEYS in
DB2 V5 and DB2 V6.
Ta b le S p a c e
In d e x 1
(R E O R G ) In d e x 2
U n lo a d
S o rt B u ild
In d e x 3
(R e )lo a d
keys
In d e xe s b u ilt
s e ria lly
K e y s p a s s e d th ro u g h S o rt E xits
M o s t I/O o f k e y s e lim in a te d
The table space can be partitioned or non-partitioned. In case of a partitioned table space,
the building of the partitioned index during LOAD or REORG is not different from the building
of the other indexes. It is done serially with the other indexes.
SORTKEYS eliminates the need for intermediate workfiles (SYSUT1or SORTOUT) when
building the indexes, by passing the extracted keys in memory to the sort process. However,
SYSUT1 and SORTOUT are still required for the LOAD utility, which uses them for processing
errors detected during the RELOAD, VALIDATE, or ENFORCE phase, and in passing foreign
keys from the SORT phase to the ENFORCE phase.
52 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
3.3.2 SORTKEYS with DB2 V6 and DB2 V7
DB2 Version 6 introduced multiple pairs of sort and build subtasks so that indexes are built in
parallel, thereby further improving the elapsed time of the LOAD and REORG utilities. This is
illustrated in Figure 3-5 for REORG.
All this is done in the new SORTBLD phase, which replaces the SORT and BUILD phases.
The sort subtask sorts the extracted keys, while the build subtask builds the index. This is
also done partially in parallel, because the index build starts as soon as the sort subtask
passes the first sorted keys. Moreover the SYSUT1 and SORTOUT work data sets are not
used during the SORTBLD because the key/rid pairs are now directly piped in memory
between the subtasks, eliminating considerable I/O processing.
Ta b le S p a ce
Ind ex 1
(R E O R G ) Inde x 2
U n lo a d SO RTBLD
S o rt B u ild
In dex 3
(R e)load
ke ys In d e xe s b u ilt
in p a ra lle l
Figure 3-5 SORTKEYS with DB2 V6, V7, and Parallel Index Build
The table space can be partitioned or non-partitioned. In case of a partitioned table space,
the building of the partitioned index during LOAD or REORG is not different from the building
of the other indexes. Building of the partitioned index is always done on the entire index level,
even if the table space has no NPIs. This in contrast with the REBUILD INDEX reported in
3.4.1, “Rebuilding the partitioning index of a partitioned table space using PIB” on page 58.
Each sort subtask uses its own sort work data set and its own message data set (not shown).
If inline statistics are also collected, there is a separate RUNSTATS subtask for each build
task that collects the sorted keys and updates the catalog in parallel. When collecting
statistics on the table space, an additional subtask runs in parallel with the RELOAD phase.
LOAD and REORG Parallel Index Build is automatically selected if the following conditions
are true:
For LOAD, SORTKEYS n must be specified, n being an estimation of the total number of
keys to be sorted. If you specify n equal to zero, PIB will not be activated.
For REORG, SORTKEYS must be specified.
There are different ways to influence the number of parallel subtasks. See 3.7,
“Considerations for running parallel subtasks” on page 67 for more details.
Important: Although you cannot specify SORTKEYS for REORG SHRLEVEL CHANGE, it
always operates as if the SORTKEYS parameter was specified. See also 7.6.3, “SORT
and BUILD as compared to SORTBLD” on page 170.
DB2 V7 improves the performance of the BUILD2 phase by updating the NPIs in parallel.
Optimally, the number of subtasks will be equivalent to the number of NPIs, with one subtask
handling all updates for all logical partitions on the same NPI.
This parallelism is only activated when specifying SORTKEYS for REORG SHRLEVEL
REFERENCE. For SHRLEVEL CHANGE, it is always activated.
Even with one NPI, a performance improvement can be noticed in V7, because of internal
design changes in the handling of the log writes.
The measurements taken compare LOAD and REORG SHRLEVEL NONE with Parallel Index
Build using 2, 3, and 6 indexes in DB2 Version 6 against DB2 Version 5 (level SUP9904).
54 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Table 3-7 LOAD utility with 3 indexes
DB2 Version 5 with DB2 Version 6 with Delta(%)
SORTKEYS SORTKEYS
In the example with two indexes (see Table 3-6), there is a reduction of 3%. This increases to
18% with three indexes (see Table 3-7), and 36% with six indexes (see Table 3-8).
From the figures from both the LOAD and REORG utilities, it can be deduced that the more
indexes are created, the greater the savings in elapsed time. This is at the expense of greater
CPU, due to the increase in subtasks processing the indexes. In the cases above, the number
of parallel tasks was equivalent to twice the number of indexes (one for sort and one for build).
Table 3-12 summarizes the comparison of the BUILD2 phase between DB2 Version 6 and
DB2 Version 7 for the REORG utility using SHRLEVEL REFERENCE, SORTKEYS,
NOSYSREC and Inline COPY, when reorganizing 3 partitions.
Table 3-12 REORG 3 partitions with 1 NPI
DB2 V6 DB2 V7 Delta (%)
Although no parallelism was used, running REORG against a range of partitions, with only
one NPI in V7, shows improvements of 33% CPU time and 36% elapsed time, because of the
different handling of log writes.
The number of NPIs was increased to five and the same test was run. The results are shown
in Table 3-13.
Table 3-13 REORG 3 partitions with 5 NPI
DB2 V6 DB2 V7 Delta (%)
The BUILD2 phase now returns an 83% saving in elapsed time when reorganizing with 5
NPIs. The relative increase in Delta CPU time, from -33% to -29%, is due to the managing of
the five parallel subtasks used to build the NPIs in parallel. From this, it can be concluded that
the greater the number of NPIs, the greater the savings in elapsed time, although this is at the
cost of CPU time.
56 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
3.3.4 Dropping NPIs before REORG
Because REBUILD INDEX can unload all the keys in parallel when rebuilding a single NPI
(see 3.4, “REBUILD INDEX Parallel Index Build” on page 57), dropping the NPIs before the
REORG and rebuilding them afterwards with REBUILD INDEX can still be quicker than using
SORTKEYS in DB2 Version 6 and DB2 Version 7, when reorganizing large partitioned table
spaces. However, you have to consider that:
The elapsed time of a REORG can become less important by using Online REORG
(SHRLEVEL REFERENCE or CHANGE), allowing concurrent access by applications.
This is a far more complex process than submitting a single REORG job with the
SORTKEYS option.
Dropping and recreating indexes may require extra rebinds and further unavailability.
For examples of LOAD jobs using PIB, see 6.2, “Parallel Index Build” on page 111.
The method used by PIB is similar to the methods used by the LOAD and REORG utilities
that are described in 3.3, “LOAD and REORG Parallel Index Build” on page 52.
However, the main difference is that REBUILD INDEX can also execute the UNLOAD phase
in parallel for partitioned table spaces. The utility starts multiple subtasks, ideally one per
partition, to unload the data; these are then passed to individual SORT subtasks and BUILD
subtasks.
3.4.1 Rebuilding the partitioning index of a partitioned table space using PIB
This case applies if we only rebuild the partitioning index of a partitioned table space, or if we
do a REBUILD INDEX(ALL) and the partitioned table space contains no NPIs. In this case, if
we specify SORTKEYS, the partitions will be unloaded in parallel, and the individual parts of
the partitioned index will be built in parallel during the SORTBLD phase.
Figure 3-6 shows three unload subtasks piping the index keys to three sort subtasks and then
onto three build subtasks. In this example, inline statistics were not gathered (otherwise there
would be three more subtasks).
SO RTBLD In dex
p artitio n
S ort B uild 2
In dex
pa rtitio n
3
In de xe s b u ilt
in p a ra lle l
UN LO AD
When building a non-partitioning index, the unload and sort run in parallel, and the sort
subtasks pipe the data to a single merge subtask which in turn pipes the data to a single build
task. If inline statistics are collected, then there will be a single statistics subtask. This is
shown in Figure 3-7.
58 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
P a rtitio n P a rtitio n P a rtitio n
1 2 3
Ta b le S p a c e
keys In d e x
S o rt
M e rg e B u ild
UN LO AD
With this enhancement, it is no longer required to submit separate REBUILD INDEX PART n
jobs to unload the keys, then merge the SYSUT1 outputs in order to create a large NPI. DB2
can now easily create a large NPI using the REBUILD INDEX.
Unlike the case where the partitioned table space has no NPIs, the building of the partitioned
index is done on the entire index level. This because building the parts of the partitioned index
in parallel would not decrease the total elapsed time of the utility job.
Figure 3-8 shows three unload subtasks piping the index keys to three sort subtasks and then
onto three build subtasks. In this example, inline statistics were not gathered (otherwise there
would be three more subtasks).
SO RTB LD In d e x 2
S o rt B u ild NPI 1
In d e x 3
NPI 2
In d e x e s b u ilt
in p a ra lle l
U N LO AD
Figure 3-8 Rebuilding all indexes of a partitioned table space with PIB using SORTKEYS
You can force DB2 to build the parts of the partitioned index in parallel as well, by specifying in
the REBUILD INDEX command a list of all the individual index parts with the PART keyword,
or by using a list defined in a LISTDEF with the PARTLEVEL keyword. However, this
approach is not recommended, as it will increase the CPU time and not reduce the elapsed
time of the whole utility job.
Figure 3-9 shows three sort subtasks and three build subtasks for building three indexes in
parallel. In this example, inline statistics were not gathered (otherwise there would be three
more subtasks).
60 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Ta b le S p a ce
Ind ex 1
Inde x 2
SO RTBLD
S ort B u ild
In dex 3
U nload
ke ys
In d e xe s bu ilt
in p a ra lle l
Figure 3-9 Rebuilding all indexes of a non-partitioned table space with PIB using SORTKEYS
Using SORTKEYS shows an 81% reduction in elapsed time. There are 30 parallel tasks with
SORTKEYS, 10 for UNLOAD, 10 for SORT and 10 for BUILD, used to process each index
partition in parallel.
Using SORTKEYS shows a 69% reduction in elapsed time. There are 22 parallel tasks with
SORTKEYS, 10 for UNLOAD, 10 for SORT, 1 for MERGE, and 1 for BUILD.
Using SORTKEYS shows a 66% reduction in elapsed time. There were only 6 parallel tasks
used with SORTKEYS, 2 for UNLOAD, 2 for SORT, and 2 for BUILD.
There are different ways to influence the number of parallel subtasks. See 3.7,
“Considerations for running parallel subtasks” on page 67 for more details.
Partition parallelism with the LOAD Utility enables the LOAD utility to load partitions in
parallel. This can be used with or without PIB (SORTKEYS n). Both of these features reduce
the elapsed time of the LOAD utility.
62 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
To activate parallel partition loading, the following rules apply:
Separate input data sets have to be supplied per partition.
The syntax of the LOAD has to specify each partition to be loaded via the INTO TABLE
option; see Figure 3-10.
The need to supply separate input files is made easier with the TEMPLATE functionality
described in Chapter 2, “Wildcarding and Templates” on page 17, and it is recommended that
this feature be used for loading partitions in parallel.
The existing message DSNU397I explains the constraints on number of tasks. Furthermore,
there is a new message:
DSNU364I PARTITIONS WILL BE LOADED IN PARALLEL, NUMBER OF TASKS = nnn
See 3.7, “Considerations for running parallel subtasks” on page 67 for more considerations
when introducing multiple parallel subtasks.
SYS RELOAD
REC1 Part 1
PI
D
REC2
IL
BU
key/RID pairs SYS SORT SORT BUILD NPI1
UT1 OUT
BU
SYS
Part 3
IL
RELOAD SORTWKnn
D
REC3
NPI2
SYS
REC4
RELOAD Part 4
For examples of LOAD jobs using partition parallelism without PIB, see 6.3.1, “Partition
parallelism without Parallel Index Build” on page 119.
By using parallel partition LOAD and PIB together, DB2 will initiate subtasks for the LOAD
phase and the SORTBLD phase. For loading partitions in parallel, the optimum is to initiate
one subtask per partition to be loaded and two for each index build (sort and build), although
this is not always possible due to constraints, such as virtual memory, DB2 threads available,
and processors available. See 3.7, “Considerations for running parallel subtasks” on page 67
for more details.
64 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Error/Map
SYS RELOAD
REC1 Part 1
SYS
REC2
RELOAD Part 2
SORT SORTBLD
SW02WKnn
SW02WKxx NPI1
BUILD
SYS
REC3
RELOAD Part 3
SYS
REC4
RELOAD Part 4
For examples of LOAD jobs using partition parallelism with PIB, see 6.3.2, “Partition
parallelism with Parallel Index Build” on page 123.
Table 3-17 shows the results of loading the table utilizing parallel partition loading only, but not
using PIB, because the SORTKEYS n clause was omitted.
Table 3-17 LOAD partition parallelism without PIB
LOAD TABLE PARALLEL PARTITION Delta (%)
LOAD
Table 3-18 shows the comparison of running LOAD utility for the whole table with the LOAD
utility running in parallel for all partitions and building NPIs in parallel by specifying
SORTKEYS n with an estimated value greater than 0.
For tables without NPIs, the parallel partition LOAD with SORTKEYS n improves up to 48.2%
in elapsed time compared to loading the whole table with SORTKEYS n, and with only 9.1%
CPU time degradation. Managing the parallel subtasks to load partitions in parallel
contributes to an increase in CPU time.
Table 3-18 LOAD partition parallelism with PIB
LOAD TABLE PARALLEL PARTITION Delta (%)
LOAD
The use of SORTKEYS n and parallel partitions leads to great improvements in the
performance of the LOAD utility, and it is strongly recommended that these feature are used
where there is more than one NPI index on the table. The number of NPIs is very important in
the use of SORTKEYS n and it is this factor that enables the LOAD utility to drive down the
cost of the utility
See 3.7, “Considerations for running parallel subtasks” on page 67 for other considerations.
When unloading a partitioned table space into individual data sets (one per partition), the
UNLOAD utility automatically activates multiple parallel subtasks. Optimally, the number of
subtasks will be equal to the number of partitions to be unloaded, but most of the times it will
be limited to the number of CPUs available.
There are different ways to influence the number of parallel subtasks. See 3.7,
“Considerations for running parallel subtasks” on page 67 for more details.
The parallelism of the UNLOAD utility is enabled by using the PART option in the UNLOAD
statement or using the PARTLEVEL option in the LISTDEF command. See 8.4.2, “Unload by
partition and parallelism” on page 200 for more details.
66 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
3.6.1 Performance measurements
The following UNLOAD utilities were run on a DB2 V7 system against a 16 columns table with
approximately 6 million rows per partition. The processor is a IBM 7-way G6 with 7 CPUs
available. See “DB2 for z/OS and OS/390 Version 7 Performance Topics SG24-6129-00” for
more details.
200 191
150
100 86 86 89
50
0
1 4 7 14
Number of partitions
The elapsed times for unloading 1 to 4 partitions are the same. There is no extra cost in
elapsed time for unloading the 3 extra partitions.
The elapsed time only increase a little bit, when unloading 7 partitions in parallel. The
increase in elapsed time is due to handling more subtasks and partitions. The elapsed times
for unloading 1 or 7 partitions in parallel are almost the same.
The elapsed time for unloading 14 partitions in parallel shows that the elapsed time is more
than double, because the UNLOAD utility could only start 7 subtasks, which have to unload
more than one partition each, and the overhead to handle more partitions than subtasks
increased too.
If you specify both sort workfile DD statements and UTPRINT data sets, the number of sort
subtasks equals the smallest number of data sets specified.
With many utilities now running parallel subtasks, the following items are worth noting:
Ensure that IDBACK is large enough to accommodate all the parallel threads that are
required to run concurrently. The number of threads required per utility varies from utility to
utility, also between phases within the same utility, and is dependent upon the options
specified. It is recommended to increase IDBACK to a number that is unlikely to be
reached to avoid any failures. If this is problematic, then a scheduling product, such as
Tivoli Operation Planning and Control (OPC), could also be used to control the number of
jobs running concurrently, and thus the number of threads within DB2. CTHREAD may
also need increasing in line with IDBACK.
Virtual storage requirements will rise dependent upon the number of subtasks being run
by the utility. Ensure that the REGION parameter is large enough.
The amount of real storage available will have to be considered. DB2 will limit the number
of subtasks dependent upon available virtual storage. If REGION=0M is coded on the
JOBCARD, then the subtasks can have as much virtual storage as is required. With tens
of subtasks not uncommon, this could cause problems with real storage exhaustion
(excessive paging).
Increasing the number of parallel subtasks will also increase the CPU consumption of the
utility jobs.
Sufficient DASD will have to be available to support the multiple sort and work data sets.
Using templates for the sort and work data sets is the best way to enable maximum
parallelism.
68 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
4
Every DB2 site has one method or another to trigger their utilities on DB2 objects during their
“housekeeping” window. Some of these trigger limits are within the utility control statements,
while others are external to the utilities. In most cases, either ISV products or in-house written
programs are used to query the catalog and generate the utility statements.
We describe the statistical fields within the catalog in detail and show how these fields can be
used to determine the limits for utilities. We also cover the new history catalog tables and their
use in calculating trends in growth in DB2 objects, such as determining the compression and
decompression dictionary build.
Two sections deal with stored procedures and DB2 tools. In these sections, we show
examples of how the stored procedure and tools can be used to control the utility functions on
DB2 objects.
LEAFDIST Limit
9990 9989
Leaf Page 8 Leaf Page 9 Leaf Page 10 Leaf Page 11 Leaf Page 9998 Leaf Page 9999
There is a navigation from root page to leaf page 8 which extends to leaf page 9998. This
gives a physical page gap of 9990.
Similarly, the page gaps are obtained for other leaf page navigations, giving page gaps of
9989, 9988, and 9989.
LEAFDIST is calculated as indicated in Figure 4-1.
If the partition index space is reorganized without the PART keyword, then the entire index is
reorganized. The LEAFDISTLIMIT is compared to the LEAFDIST value of each individual
partition index, and REORG INDEX is triggered if any one of the partition LEAFDIST values
exceeds LEAFDISTLIMIT.
The optimum value for LEAFDIST is 0 when the index space is fully organized and
FREEPAGE is set to 0. DB2 for OS/390 V5 Administration Guide, SC26-8957-03,
recommends a threshold value of 200. The same value is set as a default for the REORG
INDEX utility. Thus, we recommend using LEAFDISTLIMIT 200 in REORG INDEX to trigger
REORG.
70 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DB2 V7 introduced two new statistical fields, LEAFNEAR and LEAFFAR, which provide better
measurements of index disorganization. These two statistical fields are discussed in detail in
4.2.1, “REORG index with LEAFNEAR and LEAFFAR” on page 77.
OFFPOSLIMIT keyword
This basically indicates the degradation of the clustering index of a table within the table
space. When rows are accessed by cluster index, the performance will be affected due to
high synchronous I/O associated data pages not within the prefetch range. When the
CLUSTERATIO is 100% it indicates that data in the table space is in cluster order. In
Figure 4-2 we have a table space on the left that is unclustered. The table space on the right
is in cluster order after the REORG.
Clustering
TS IX TS
R3 K1 R1
K2 R2
deleted K3 R3
K4 R4
R2
R4
deleted
R1
deleted
INDREFLIMIT keyword
When an update is done to a row, the resulting length of the row may be longer than the
original record. In such cases, if the row cannot be fit into the original page, DB2 places the
record in another page. However, the original RID in the index still points to the former page.
In such a case, an entry is made in the original data page which is a pointer to the new page
where the updated records are stored. This is termed as indirect row reference. The diagram
in Figure 4-3 is an example of such an indirect reference. There could be an extra I/O involved
to read the extra page into buffer pool.
Row 2
Row 3
Row 4
ID Map ID Map
Two values in SYSTABLEPART provide quantitative values for the measurement of indirect
addressing — NEARINDREF and FARINDREF:
NEARINDREF is the number of indirect references to overflow rows on another page
within 64 pages.
FARINDREF is the number of indirect references to overflow rows outside the 64 page
limit.
When considering the statistics, both values indicate disorganization of the data. When rows
are placed in another page than their home page, lock avoidances cannot be used, so DB2
has to take a lock. That can lead to performance degradation, especially in a data sharing
environment.
When the INDREFLIMIT is specified in REORG TABLESPACE, the REORG utility will
calculate the value as:
INDREFLIMIT = ((NEARINDREF + FARINDREF)/CARDF)*100
If this value exceeds the specified value in REORG, then REORG is performed. The default
value is 10.
72 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
We recommend that you run REORG TABLESPACE, if the percent of indirect row references
is greater than 10% for a non-data sharing environment, and for a data sharing environment if
the percent of indirect row references is greater than 5%. You may also want to consider
changing the PCTFREE value to allow more expansion without overflowing.
Note: If the partitioned table space is reorganized without the PART keyword, then the
limits are calculated for each partition. REORG is triggered when the limits exceed any one
of the partitions.
Note: When the table space is reorganized, so are the associated index spaces. If you
have a job stream that generates REORG statements, first you generate reorganization of
table spaces, followed by index spaces. Your generator job should be able to remove
reorganization of index spaces when the associated table space is reorganized.
COPY scans the header page of the table space for space map bits and the image copy bit
LRSN.
pv1 > 0 and %changed pages < pv1 Yes, incremental image copy
pv1 < %changed pages < pv2 Yes, incremental image copy
pv1 = pv2 AND %changed pages > or = pv1 Yes, full image copy
COPY scans the spacemap pages of the table space for modified page indicators. This will
recall table spaces that are migrated even with the REPORTONLY option.
Note: CHANGELIMIT option is not available for COPY INDEXSPACE. Furthermore, COPY
does not support incremental image copy for index space.
CHANGELIMIT pv1 0 < %changed pages < pv1 COPY TS incremental copy
pv1=0
CHANGELIMIT (pv1,pv2) pv1 < %changed pages < pv2 COPY TS incremental copy
pv1=pv2=0
SYSCOPY:
COPYPAGESF is the number of pages written to the copy data set. This value is needed
by the RECOVER utility to know the number of pages in the image data set
(full/incremental).
NPAGESF is the number of pages in the table space or index at the time of Inline COPY.
74 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
CPAGESF is the number of changed pages.
JOBNAME is a column whose value indicates the job name of the utility.
AUTHID is a column whose value indicates the authorization ID of the utility.
SYSINDEXES:
SPACEF is the number of kilobytes of DASD allocated to the index. The value is -1 if
statistics have not been gathered.
SYSINDEXPART:
SPACEF is the number of kilobytes of DASD allocated to the index partition. The value is
-1 if statistics have not been gathered.
DSNUM is the number of data sets. The value is -1 if statistics have not been gathered.
EXTENTS is the number of data set extents. The value is -1 if statistics have not been
gathered.
LEAFNEAR is the number of leaf pages physically near previous leaf page for successive
active leaf pages. The value is -1 if statistics have not been gathered.
LEAFFAR is the number of leaf pages located physically far away from previous leaf pages
for successive (active leaf) pages accessed in an index scan. The value is -1 if statistics
have not been gathered.
PSEUDO_DEL_ENTRIES is the number of pseudo deleted entries in the index. These
entries are marked as “logically” deleted but still physically remain in the index. For a
non-unique index, the value is the number of RIDs that are pseudo deleted. For an unique
index the value is the number of both keys and RIDs that are pseudo deleted.The value is
-1 if statistics have not been gathered.
SYSTABLEPART:
SPACEF is the number of kilobyte DASD allocated to the table space partition. The value
is -1 if statistics have not been gathered.
DSNUM is the number of data sets. The value is -1 if statistics have not been gathered.
EXTENTS is the number of data set extents. The value is -1 if statistics have not been
gathered.
SYSTABLES:
NPAGESF is the number of pages used by the table.The value is -1 if statistics have not
been gathered.
SPACEF is the number of kilobyte DASD allocated to the table. The value is -1 if statistics
have not been gathered.
AVGROWLEN is the average length of rows for the tables in the table space. If the table
space is compressed, the value is the compressed row length. If the table space is not
compressed, the value is the uncompressed row length. The value is -1 if statistics have
not been gathered.
The space statistics are kept in the catalog database DSNDB06 in the two table spaces
SYSDBASE and SYSHIST, as shown in Figure 4-5.
SYSTABLEPART:
CARD, CARDF
FARINDREF, NEARINDREF
PAGESAVE
PERCACTIVE, PERCDROP
SPACE, SPACEF, PQTY, SQTY, SECQTYI, DSNUM, EXTENTS
SYSINDEXPART:
CARDF
FAROFFPOSF, NEAROFFPOSF, CLUSTERATIOF
LEAFFAR, LEAFNEAR, LEAFDIST
PSEUDO_DEL_ENTRIES
SPACE, SPACEF, PQTY, SQTY, SECQTYI, DSNUM, EXTENTS
SYSLOBSTATS:
FREESPACE, AVGSIZE
ORGRATIO
SYSTABLES:
AVGROWLEN
In addition to the new columns, DB2 V7 also created a new table space, SYSHIST, with
historical statistical information, which contains tables that are near duplicates to the actual
tables in SYSDBASE and SYSSTATS. Table 4-4 provides a list these tables and the
corresponding tables in SYSHIST.
Table 4-4 New tables in SYSHIST
Table space Table name HIST table name in SYSHIST
SYSTABLEPART SYSTABLEPART_HIST
SYSINDEXES SYSINDEXES_HIST
SYSINDEXPART SYSINDEXPART_HIST
SYSCOLUMNS SYSCOLUMNS_HIST
SYSTABSTATS SYSTABSTATS_HIST
SYSLOBSTATS SYSLOBSTATS_HIST
SYSINDEXSTATS SYSINDEXSTATS_HIST
Next we discuss the impact of the new columns and the new HIST tables and their usage in
triggering the RUNSTATS, REORG, and COPY utilities.
76 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
RUNSTATS
RUNSTATS must be run regularly to update the tables with the relevant statistics. Statistics
collected by RUNSTATS can be divided into two major categories.
First, there are access path or optimizer statistics, which are used by the optimizer to select
the best access path to the data.
We will limit our discussion to the space statistics. When RUNSTATS is run with UPDATE
SPACE, then only those fields listed in Figure 4-4 on page 76 are updated with space
statistics. Additionally, if the HISTORY SPACE is specified, then the corresponding history
tables in SYSHIST are also updated with the relevant space statistics. Figure 4-5 provides a
pictorial view of all the tables that are updated with the RUNSTATS command. Refer to
Chapter 11, “Gathering statistics” on page 253 for details.
SYSLOBSTATS SYSLOBSTATS_HIST
SYSTABLES SYSTABLES_HIST
SYSINDEXPART SYSINDEXPART_HIST
SYSTABLEPART SYSTABLEPART_HIST
SYSDBASE SYSHIST
LEAFDIST measures the average gap between leaf pages. For many cases, this is a good
measure. However, in a very large index, where the index is growing and requires page splits,
LEAFDIST exaggerates the disorganization.
In V7, LEAFFAR and LEAFNEAR were introduced to measure physical leaf disorganization,
or the number of physical leaf pages not in an optimal position. As shown in Figure 4-6, there
is a logical and physical view of an index. The logical view is the tree shown at the top (2
levels in our example). If we were accessing the data by scanning for keys "FRISKE" through
"JACKSON", the 4 leaf pages would be scanned as shown.
This same scanning of pages looking at physical page access is shown at the bottom of
Figure 4-6. The first page is physical page 78, and the other leaf pages are located at
physical locations 79, 13, and 14. A jump backward or ahead more than 1 page represents
non-optimal physical ordering. As in other statistics, LEAFNEAR represents this jump within
64 pages and LEAFFAR represents the jump outside the same interval.
In our example, there are 2 big jumps: page 1 to page 78, and page 79 to page 13. These 2
jumps are recorded in LEAFFAR.
When considering the statistics, the LEAFFAR value is more significant and likely to affect
performance than the LEAFNEAR value, but both these values indicate disorganization of the
leaf pages, which can lead to performance degradation.
logical view
Root Page 2
DAR-JAK
0-32 65-96
After reorganizing the index, the leaf pages are in the optimal physical position as in
Figure 4-7. For small indexes, LEAFNEAR and LEAFFAR will be 0 after reorganization. For
large indexes, LEAFNEAR may not be 0 after reorganization. This is because space map
pages are periodically placed throughout the page set, and the jump over space map pages
shows up as a count in LEAFNEAR (space pages in a table space are also the reason why
NEAROFFPOS may not get to zero after reorganization of a table space).
78 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The percentage of leaf pages in disorganization can be calculated based on the values of
LEAFFAR in SYSINDEXPART and NLEAF in SYSINDEXES as follows:
Physical leaf disorganization = (LEAFFAR/NLEAF)*100
You should consider REORG INDEX if the percentage of leaf disorganization is greater than
10%. Review changing PCTFREE or FREEPAGE to allow keys to be inserted without splitting
as often.
When using DB2 V7 we recommend that you use the above formula to determine when to run
the REORG INDEX utility, instead of using the LEAFDISTLIMIT keyword for REORG INDEX
utility. The usage of LEAFDIST should be discontinued after V6.
logical view
Root Page 2
DAR-JAK
0-32 65-96
Figure 4-7 Logical and physical view after reorganization
A threshold limit can be set on the estimated percentage of changed pages to trigger
RUNSTATS on the table space or index space.
Some DB2 database administrators are reluctant to update the access path statistics too
frequently. In such cases, the DBA may consider running RUNSTATS with the UPDATE
SPACE UPDATE HISTORY option only. The performance of RUNSTATS can be enhanced
with the SAMPLE keyword on large objects; refer to Chapter 11, “Gathering statistics” on
page 253 for details.
There are upper bounds to the number of extents. The size of each extent may influence the
limit of maximum volumes per data set (59 per data set and 133 volumes per STOGROUP).
80 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
With DB2 V7, with improvements to parallel data set open, FASTSWITCH for online REORG,
and the disk capabilities provided by PAV, the requirement to reduce the number of extents is
certainly not so critical in terms of performance. However, it is good practice to keep the
environment under control in terms of DSMAX and VTOC size.
For simple table spaces, dropping a table results in that tables data rows remaining. The
amount of dead space for a simple table space can be tracked directly with PERCDROP in
SYSTABLEPART.
For an index, deleted keys are marked as pseudo deleted. Actual cleaning up will not occur
except during certain processes like before a page split.
You can calculate the percentage of RIDs that are pseudo deleted based on the values of
PSEUDO_DEL_ENTRIES and CARDF in SYSINDEXPART:
(PSEUDO_DEL_ENTRIES/CARDF)*100
You can then determine if you should run REORG INDEX to physically remove the pseudo
deleted entries from the index.
To minimize the CPU cost of an index scan and reclaim space, it is important to remove
pseudo deleted entries. Every time a SQL statement makes a scan of an index, it has to scan
all entries in the index, including pseudo deleted entries, that have not yet been removed.
We recommend that you run REORG INDEX, if the percentage of pseudo deleted entries is
greater than 10%.
LOB table spaces can accumulate dead space that is reclaimed during reorganization. If this
dead space is not regularly reclaimed, it may result in acquiring more extents than needed to
add additional LOBs.
For LOB table spaces, an updated LOB will be written without reclaiming the old version of
the LOB immediately.
When FREESPACE approaches zero for a LOB table space, it is a good time to reclaim the
space, but there are no direct statistics indicating how much will be reclaimed. You can
calculate the percentage of free space for LOB table spaces based on the values of
FREESPACE in SYSLOBSTATS and SPACEF in SYSTABLEPART:
(FREESPACE/SPACEF)*100
We recommend that you reorganize LOB table spaces, if the percent of free space is less
than 10%.
LOBs AVGSIZE
AVGSIZE is the average size of all LOBs in the table space, or the total number of LOB bytes
divided by the number of LOBs. For example, if you had an employee column for a picture of
the employee saved as a LOB, the format used (GIF, JPEG, and so on) will affect the size.
The typical value of size within the column is indicated by AVGSIZE.
AVGSIZE does not contribute to triggering of any utility. Instead it can be used to estimate the
data set sizes of work and sort data sets in LOB data set management.
LOBs ORGRATIO
ORGRATIO is a measure of fragmentation or non-optimal organization of the LOBs stored in
the table space. A value of 1.0 is optimal.
A LOB column always has an auxiliary index which locates the LOB within the LOB table
space. Access path is not an issue, because LOB access is always via a probe using the
auxiliary index. However, performance can be affected if LOBs are scattered into more
physical pieces than necessary, thus involving more I/O to materialize; see Figure 4-8.
Our example shows 4 LOBs as accessed from the auxiliary index. These LOBs are stored in
chunks of pages. A chunk is 16 contiguous pages of a LOB. If the size of a LOB is smaller
than a chunk, then it is expected to fit in 1 chunk. If the size is greater than a chunk, then it is
optimized to fit into the minimum number of chunks (LOB size/chunk size). If, because of
fragmentation within chunks, LOBs are split up to store in more chunks than optimally, the
ORGRATIO will increase.
In our example, Figure 4-8, the first LOB (from rowid 1) has an extra chunk because the LOB
is small enough to store in one chunk instead of two. Likewise, the fourth LOB can be stored
in two chunks instead of three. The formula is shown for ORGRATIO.
82 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
LOBs Orgratio (fragmented)
Auxiliary Index
rowid 1
rowid 2
rowid 3
rowid 4
After reorganization of the LOB space, Figure 4-9, the LOBs are placed in the optimal number
of chunks. Since there are no extra chunks allocated, the ORGRATIO is 1.0 according to the
formula shown.
Auxiliary Index
rowid 1
rowid 2
rowid 3
rowid 4
CARDF and SPACEF can be tracked regularly for space growth of an object. Assuming
constant growth over time, these numbers can be used to derive growth trends to plan for
future needs. Consider the following sample SQL:
SELECT
MAX(CARDF), MIN(CARDF), ((MAX(CARDF)-MIN(CARDF))*100)/MIN(CARDF),
(DAYS(MAX(STATSTIME))-DAYS(MIN(STATSTIME)))
FROM SYSIBM.SYSTABLEPART_HIST
WHERE DBNAME='DB' AND TSNAME='TS';
Assuming that the number of rows is constantly increasing, so that the highest number is the
latest, the query shows the percentage of rows added over a specific time period. This could
be extrapolated to scale on a monthly or yearly basis.
COPY utility
The space growth and SYSCOPY entries of a DB2 object can be used in determining the
requirement for an image copy. If the space growth is greater than a pre-determined
percentage growth value since the last full image copy, then the COPY utility can be triggered
to take a full image copy of the object.
84 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Regularly building the dictionary is expensive, and it should be avoided. The PAGESAVE
value in SYSTABLEPART_HIST can be used to monitor the effectiveness of the dictionary. If
the following situation occurs:
(max(pagesave)-min(pagesave))*100 /max(pagesave) > threshold_limit
Then you should remove the keyword KEEPDICTIONARY from the REORG or LOAD utility.
The utility will build a new dictionary, or else code the keyword in the utility command. We
recommend a threshold_limit of 10%.
R eal-Tim e S tatistics
DB2
C atalo g
C C /390
or
DB2 DSNACCOR m o n ito r
R eal-Tim e p ro g ram
S tatistics
tab le s
Note: RTS are introduced with the PTFs for APAR PQ48447 and PQ48448, DSNACCOR
is introduced with the PTF for APAR PQ46859.
The first planned users of this function are the DB2 Control Center for OS/390, the SAP’s
Control Center Management System, and DB2 tools such as the DB2 Automation Tool.
User programs can also take advantage of RTS rather than using the more static catalog
statistics.
Table definition
The statistics are contained within a user created database called DSNRTSDB, and a
segmented table space called DSNRTSTS. The RTS tables are:
SYSIBM.SYSTABLESPACESTATS
It contains table space statistics, one row per table space or partition
SYSIBM.SYSINDEXSPACESTATS
It contains index space statistics, one row per index space or index partition
The tables must be defined with row level locking and CCSID EBCDIC. A dedicated buffer
pool will improve the efficiency while updating statistics.
Appendix A, “Tables for Real-Time Statistics” on page 275 shows the sample DDL to create
these objects and gives a description of the columns in these tables. These definitions are
contained in a new member of the DB2 sample programs library
DSN710.SDSNSAMP(DSNTESS).
DB2 will always generate in-memory statistics for each table space and index space in DB2.
Statistics are maintained for each partition if the page set is partitioned. DB2 will only
externalize these statistics when:
The required database, table space, tables, and indexes are created.
The objects are created with the specified object names, schema name, and the specified
attributes.
The RTS database is started in RW mode; immediately after creation, this database is
stopped.
86 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DSNACCOR
By default, the DSNACCOR stored procedure:
Queries both RTS tables
Applies algorithms, using ROT thresholds and limits to
the objects in the tables
Reports back any objects that require REORG,
RUNSTATS or COPY
call
DSNACCOR
(defaults)
SYSIBM.INDEXSPACESTATS
When externalizing in-memory statistics, DB2 inserts a row for each partition or
non-partitioned page set. If a row already exists it will be updated. The absolute statistical
values are replaced with the new values, and incremental values are summed with the
in-memory statistics. Active statistics blocks are ordered in memory in clustered order, and
then inserted using the clustering index for optimal performance.
When the Statistics Manager attempts to update the RTS, objects might be in lock contention
or other resource-not-available conditions that prohibit the updates from succeeding. The
strategy for the Statistics Manager is to ensure that it does NOT fail and cause DB2 to crash,
or cause the update requester to fail. The Statistics Manager will note the failure with a
console message and attempt the externalizing the next update cycle.
DROP of any table space or index will cause their statistics rows to be deleted. If the statistics
table rows cannot be deleted for any reason, the DROP is NOT failed. Therefore, after DROP,
rows may exist in the statistics tables that do not belong to a table space or index (orphaned
rows). The orphaned rows can be removed by using SQL, or the orphaned rows can remain
in the statistics tables. If, subsequently, a table space or index is created with the same DBID
and PSID, DB2 will re-initialize the orphaned row before updating any values in that row.
DB2 performs locking based upon the LOCKSIZE of the DSNRTSDB.DSNRTSTS table
space. Reading of the statistics tables uses isolation level CS and “current data yes”
semantics.
Utilities that reset page sets to empty can invalidate other DB2 members in-memory statistics.
The member that resets a page set will notify the other DB2 members that a reset has
occurred and the in-memory statistics are invalidated. If the notify process fails, the utility
running on the "resetting" member will not fail. The appropriate timestamp (ReorgLastTime,
StatsLastTime or CopyLastTime) is set to NULL to indicate that the table statistics are
unknown. The timestamp is set to NULL for each utility, as stated in the above utility sections.
In data sharing, in a coexistence environment the statistics can be inaccurate until all DB2
members are updated to Version 7.
If the statistic values are in question, accurate values can be reestablished by a REORG,
RUNSTATS, and COPY on the suspect objects.
88 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Real-Tim e Statistics in m em ory collection
Real-Tim e
D B2A Statistics DB2B
tables
To override either the scope of the data queried or the type of recommendation evaluated, the
caller will pass along the appropriate parameter with the call to the stored procedure.
Parameters can also be passed that override the default threshold settings. Calls to the
stored procedure that do not provide parameters will employ default parameters (or
thresholds) where appropriate.
Here we show a simple example to reorganize a table space from the CC. Once in the CC,
connect to the DB2 subsystem by clicking on the database alias. CC will invoke a logon
window for userid and password, which are passed to the host for validation. You will be
presented with a window of all DB2 objects, buffer pool, alias, databases, and so on.
Click on the databases, which will expand to a list of all databases in the DB2 subsystem.
Select the DB2 objects in a database by clicking on the database that you wish to run the
utility. Then click on the table spaces. You will be presented with table spaces within the
database.
In our example, we clicked on database U7G01T11 and obtained a list of table spaces, as
shown in Figure 4-13. Then we right-clicked on table space TSLINEI and got a drop-down
menu which consisted of all utilities that can be run on the table space.
We wanted to reorganize the table space TSLINEI, so we clicked on Reorganize. The next
window (see Figure 4-14) shows the REORG options for TSLINEI. Among all the options,
please note the option Specify indicators for INDREFLIMIT and OFFPOSLIMIT, which can
be used to trigger the REORG of the table space TSLINEI.
90 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Figure 4-14 REORG basic options
92 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The window shown in Figure 4-16 contains the data set allocation definitions when
TEMPLATE are used for dynamic allocation of work and sort data sets. Finally, the stored
procedure DSNUTILS is generated with the Show SQL button. This stored procedure is run
on the host when the OK button is clicked.
Similarly, all other utilities can be generated on the Control Center and run on the host as
utility stored procedures. The complete procedure to set up a stored procedure environment
is described in the redbook DB2 UDB for OS/390 Version 6 Management Tools Package,
SG25-5759. Briefly, you need to:
Define the workload environment in OS/390 WLM; the default is WLMENV.
Start the DB2 stored procedure that is managed by the workload manager (for example,
DB2GWLM).
Perform a RACF access setup in the DSNR class.
Figure 4-17 shows the administration menu. Choose Option 3 to get the next panel.
94 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Figure 4-18 shows how you can query your databases.
Choosing Option 7 and database DSNDB06, we get the list of indexes with large leaf page
distances reported in Figure 4-19.
Select the index SYSIBM.DSHHKX01 which has the LEAFDISTLIMIT = 318. This index is a
candidate for REORG. DB2ADM will generate the required JCL to run the REORG utility.
Review the JCL, modify LEAFDISTLIMIT as appropriate, and submit the job.
Other options in Figure 4-18 can be selected to trigger the RUNSTATS, REORG, and
STOSPACE utilities on DB2 objects.
Now, without relying on repeated manual interventions, every DB2 administrator is able to add
maximum value to the enterprise by extracting full performance from even the most heavily
used database environment.
In this section we will concentrate only on the utility part of the tool and show some ISPF
panels that are used to define the profiles and trigger values for the utilities. For further details
on this tool please see http://www.ibm.com/software/data/db2imstools/details/#admin
on the Web, or refer to DB2 Automation Tool for z/OS, V1R1, User's Guide, SC27-1189-00.
Option 2 from the main panel, Figure 4-20, requires a stored utility profile name.
If the profile is not available, then the profile create panel is displayed, as shown in
Figure 4-21.
96 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Figure 4-21 Utility profile display panel
Enter a profile name, which is stored in the tools control tables (defined as part of the
installation), as shown in Figure 4-22.
The utilities profile options panel where you select the utilities is displayed, as shown in
Figure 4-23.
Figure 4-24 is for the COPY utility. The CHANGELIMIT values can be set to trigger image copy.
98 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Figure 4-25 is for RUNSTATS. Although there are no trigger limits, the options for the UPDATE
keyword can still be set.
Figure 4-26 is for the REORG table space. The INDREFLIMIT and OFFPOSLIMIT can be set
to trigger the reorganization of the table space.
100 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
5
In this chapter we examine the reasons for partitioning, and the best ways to manage them.
The following releases introduced more and more partition independence, for DB2 Utilities
execution, and for SQL, providing for a better distribution of the data and therefore less
contention, and for parallel access and better performance. Another reason for partitioning is
to increase the amount of data that can be stored in a single (partitioned) table space. As
shown in Table 5-1, the limit has increased from 64 GB to 16 TB between Version 4 and
Version 6.
Table 5-1 Number of partition and partitioned table space sizes
DB2 Version Number of partitions Maximum size Total maximum size
V4* 1 to 16 4 GB 64 GB
17 to 32 2 GB 64 GB
33 to 64 1 GB 64 GB
Note: *** Requires DSSIZE parameter, DFSMS/MVS 1.5 and SMS managed table space
The partitioning of table spaces results in smaller “parts” of data, that is, many smaller data
sets combining up to one larger table space. SQL can exploit this feature automatically by
only scanning the relevant partitions for data that is required to match the WHERE clause,
and by running the SQL in parallel against separate partitions. If possible, this is done
automatically and does not require any extra coding in the SQL statement. This can increase
performance of SQL by only reading from individual partitions rather than the whole table
space, thus reducing I/O times.
The placement of individual partitions, and their partitioning index partitions, is possible to
reduce contention across devices and channels. This can be achieved by either using user
managed data sets or by coding ACS routines to handle individual partition data sets. Be
aware that in DB2 Version 7 the data set names can switch naming standards following a
REORG using FASTSWITCH. See 7.6.5, “SWITCH” on page 171.
Utilities also exploit the features of partitioning. Utilities can act upon one, all, or a range of
partitions, which can greatly reduce the elapsed time of a utility. With partitioning, many
utilities can start parallel subtasks to further increase the savings in elapsed time. In previous
releases of DB2, any non-partitioning indexes on a partitioned table space caused contention
when running parallel invocations of the same utility against different partitions. This
contention is vastly reduced by utility parallelism and subtasks, thus eliminating the need to
drop and recreate non-partitioning indexes (NPIs) before utilities such as REORG.
Partitioning helps enable true 24x7 high availability and should be used for all large table
spaces where continuous availability is an issue.
102 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
5.2 Altering partitions
Over time, the data in a partitioned table space can become skewed, which results in some
partitions being larger than other partitions. This can lead to degradation of performance in
the larger partitions. To change this, the key values in the partitioning index have to be altered
to reflect the new distribution of data.
Prior to DB2 V6, the sequence for altering the key values in a partitioning index was as
follows:
1. Stop the table space and start it in read-only mode.
2. Unload the entire table space.
3. Drop the table space.
4. Redefine the table space with new limitkey definitions.
5. Redefine any indexes and non-partitioning indexes (NPIs).
6. Reload the unloaded data.
7. Run RUNSTATS.
8. Rebind affected plans and packages.
9. Reissue necessary grants for authorization to the original objects.
10.Redefine any synonyms, views, and referential constraints.
11.Make new image copies for recoverability.
This is a complicated process and leads to the whole table space being unavailable during
the process, which could be lengthy, depending upon the size of the table.
Version 6 offers the ability to ALTER the key range of a partitioning index. When ALTERING
the key values, DB2 sets a restrictive status of REORGP (REORG pending) on the affected
partitions. Access to the data is only restricted to the partitions with this status; no other
partitions are affected and are accessible. To remove the REORGP status, a REORG has to
be run against all the affected partitions (it is recommended that this be a REORG
SHRLEVEL NONE).
R EO R G P
A LT E R IN D E X B IR TH _Y E A R P A R T 3 V A LU E S '1966'
This enhancement could be used to allow extra partitions for future growth and then altering
key values to move rows into the new partition. This will assist database administrators to
expand the range of values in the last partition without affecting the availability of the other
partitions.
Partitioning can also be used to prepare for future growth of data. For example, a table space
may be partitioned on YEAR, as per Figure 5-1, and may contain partitions for future years.
These partitions can be created small (48/48) and increased when required, that is, when the
data for that year starts being produced. Using this method enables the physical design of a
database to recognize future growth, while deferring disk usage until it is required.
Partitioning is generally transparent to SQL, but partitioning has also some indirect impact on
SQL due to the restriction that was placed on UPDATE statements. The restriction was that
the columns of a partitioning index could NOT be updated. This option to remove this
restriction was introduced in DB2 Version 6 with the ZPARM field PARTKEYU in panel
DSNTIPB.
Any attempt to update a key that violates the PARTKEYU ZPARM results in an SQLCODE
-904 error message.
Be aware that using YES or SAME will allow users to update primary keys that could
invalidate logical design rules.
When planning to partition, it is essential that the application code be fully understood. If there
is updating of the “soon-to-be” partitioning key, then these programs will have to be changed,
or the PARTKEYU ZPARM field set to YES. If the change is made without application
checking, applications that worked before will stop working.
If the application cannot be changed, and the value of PARTKEYU is to remain NONE, then
an artificial key must be chosen to act as the partitioning key and be built into the design of
the database.
104 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
5.4 Utilities and partitioning
Most utilities have been able to operate at the partition level for some time. Recent
enhancements to utilities have allowed single utilities to act in parallel against multiple
partitions. The problematic area of non-partitioning indexes (NPIs) has also been addressed
by the use of parallelism and subtasks. In this section, we look at how to use partitions with
utilities.
In early releases of DB2, it was advised to drop NPIs before running utilities, run multiple
occurrences of a utility in parallel, and then recreate the NPIs. In very large table spaces, it
was even advised to set multiple RECOVER INDEX (now renamed REBUILD INDEX) jobs
running in parallel, stop them after the UNLOAD phase, sort and merge the SYSUT1 files,
and then feed the data set back into a restarted REBUILD (RECOVER) INDEX utility. This
method reduced the SORT time, which for very large sorts, tends to increase exponentially.
With partitioning and parallelism, this advice is no longer current. By the use of SORTKEYS,
the REBUILD INDEX will start multiple subtasks, optimally three subtasks (one unload, one
SORT, and one to build the index), for each partition of a partitioning index. If an NPI exists,
there will be a MERGE step added, and a single subtask, per NPI, to build the index. This
reduces contention on the NPI and reduces total elapsed time. The total number of subtasks
is determined by the amount of available resources. See 3.4, “REBUILD INDEX Parallel Index
Build” on page 57, for more details on the REBUILD INDEX function.
The SORTKEYS keyword enables the parallel index build of indexes. For example, each load
task takes input from a sequential data set and loads the data into a corresponding partition.
The utility then extracts index keys and passes them in parallel to the sort task that is
responsible for sorting the keys for that index. If there is too much data to perform the sort in
memory, the sort product writes the keys to the sort work data sets. The sort tasks pass the
sorted keys to their corresponding build task, each of which builds one index. If the utility
encounters errors during the load, DB2 writes error and error mapping information to the error
and map data sets.
Each partition can be operated upon independently from the others by most utilities, thus
removing the need to run utilities against all the data when only a subset requires acting
upon. For example, a partitioned table space may be partitioned on YEAR, once the year has
passed this data becomes stable. Therefore, it is not necessary to run REORGs on it. If the
table space was not partitioned, all data would be REORGed on every run of REORG. With
partitioning, only the partition in use needs to be REORGed.
Independence of partitions is a critical factor that is further enhanced by DB2 Version 7 with
the ability to run parallel utilities (both automatically or manually). Contention has also been
reduced on the non-partitioned indexes by using subtasks to build the indexes. Parallel index
build allows for a partitioning index to have its partitions build in parallel by multiple subtask. It
can unload multiple partitions in parallel, something that had to be manually organized prior to
Version 6.
The LOAD utility can now operate on partitions in parallel as well as individual partitions or
the whole table space.
PIECESIZE was introduced to allow DBAs to decide how big an NPI data set could become
before creating an additional data set for an index. The default is 2 GB, which means that a
data set over 2 GB would be in two or more data sets. By setting the PIECESIZE smaller, the
NPI will occupy more data sets. These, like partitions, can then be intelligently placed to
ensure that there is little contention across devices and channels.
106 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Part 3
Part 3 Executing
DB2 Utilities
In this part, we describe the functionality of the DB2 Utilities and provide examples of their
use:
Chapter 6, “Loading data” on page 109
Chapter 7, “Reorganizing data” on page 161
Chapter 8, “Unloading data” on page 191
Chapter 9, “Recovering data” on page 209
Chapter 10, “Copying data” on page 231
Chapter 11, “Gathering statistics” on page 253.
In this chapter, we will mainly focus on new LOAD options introduced since DB2 V5, such as
the following:
Inline COPY and Inline RUNSTATS
Sortkeys
Partition parallelism
Preformat
Reuse
Cross Loader
Online Resume
DB2 V5 introduced the ability to create a full image copy while loading the table space with
the LOAD utility. This inline image copy is created during the RELOAD phase. The table
space is fully accessible to applications after the LOAD and an additional full image is no
longer necessary to guarantee the recoverability of the table space. So the unavailability time
of the table space during LOAD is reduced.
The inline image copy is triggered by the presence of the COPYDDN (and optionally
RECOVERYDDN) keyword of the LOAD utility. It can only be used with LOAD REPLACE.
After a LOAD RESUME YES with LOG NO or LOAD RESUME NO with LOG NO the table
space is still put into the restrictive COPYP status, and a full image copy with SHRLEVEL
REFERENCE is still needed to make the table space available to writing applications and to
make the table space recoverable.
You can neither create an inline image copy of an index space, nor can you create an inline
incremental image copy. Also, you cannot take an inline image copy of a LOB table space. An
inline image copy is always recorded in SYSCOPY as a full image copy with SHRLEVEL
REFERENCE, although it is not recommended for use with RECOVERY TOCOPY.
DB2 V6 introduced the ability to update the statistics while loading the table space, thus
eliminating the need for an extra RUNSTATS utility and reducing the total elapsed time.
The inline statistics is triggered by the STATISTICS option of the LOAD utility. It can be used
with LOAD REPLACE and LOAD RESUME NO, except on LOB table spaces. All RUNSTATS
options are supported, including the new HISTORY functionality introduced in DB2 V7. With
LOAD RESUME YES, you still have to run a RUNSTATS utility after the LOAD if you want to
make the statistics current. The same applies for LOB table spaces.
If any parallelism is used during the LOAD (see 6.2, “Parallel Index Build” on page 111 and
6.3, “Partition parallelism within one LOAD” on page 118), the statistics will be gathered in
parallel as well, both for table space and index statistics. This will occur as additional subtasks
associated with the reload and/or the build subtasks.
110 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
See 3.1.2, “Inline RUNSTATS” on page 46 for more details.
6.1.3 Recommendations
Use the aforementioned inline utilities as much as possible for reducing the elapsed time and
unavailability of your table spaces during LOAD:
Use LOAD REPLACE with LOG NO and COPYDDN (/RECOVERYDDN): This is the most
effective way to LOAD or RELOAD your table space. After this, the table space will be fully
available to all applications and fully recoverable.
Use LOAD REPLACE and LOAD RESUME NO with STATISTICS: This is the most
effective way to keep your statistics current after LOAD or RELOAD.
In contrast with REBUILD INDEX, if a partitioned table space only contains the partitioned
index, the parts of the partitioned index will never be built in parallel during LOAD. However,
the partitioned index and the non-partitioned indexes will be built in parallel.
Ta b le S p a c e
In d e x 1
SO RTBLD In d e x 2
S o rt B u ild
In d e x 3
LO AD
keys
In d e x e s b u ilt
in
p a ra lle l
M u ltip le S o rt/B u ild ta s k p a irs p ro v id e p a ra lle lis m
PIB is activated by the SORTKEYS n option, n being an estimation of the total number of
keys to be sorted. If you specify n = 0, PIB will not be activated. DB2 will pass a value based
on n to DFSORT (or other SORT product used) to allow the sort subtasks to allocate their
proper work data sets. If the estimation is far from the real number of keys to be sorted, the
SORT might fail. See DB2 Universal Database for OS/390 and z/OS Utility Guide and
Reference, SC26-9945-00, regarding how to calculate an appropriate value of n.
The indexes are built in parallel by multiple pairs of sort and build subtasks (optimally one pair
per index). All this is done in the new SORTBLD phase, which replaces the SORT and BUILD
phases. The sort subtask sorts the extracted keys, while the build subtask builds the index.
This is also done partially in parallel, because the index build starts as soon as the sort
subtask passes the first sorted key. Moreover the SYSUT1 and SORTOUT work data sets are
no longer used because the key/RIDs pairs are now directly piped in memory between the
subtasks, eliminating considerable I/O processing.
Optimally with PIB, the number of subtasks pairs is equal to the number of indexes to be built.
DB2 determines the degree of parallelism, that is, the number of subtask pairs based on the
following factors:
Number of indexes to be built
Available virtual storage in the batch region
Number of CPUs available
See 3.7, “Considerations for running parallel subtasks” on page 67 for more information.
If you specify both sort workfile DD statements and UTPRINT data sets, the number of
subtask pairs equals the smallest number of data sets specified.
See also 3.3, “LOAD and REORG Parallel Index Build” on page 52.
In Example 6-1 we show the loading of a non-partitioned table space using PIB. The input
data set is allocated in the JCL with a DD card SYSREC, and contains about 50000 records.
We use templates to specify the Inline COPY data sets and work data sets (with SPACE
parameters because DB2 is unable to estimate them properly). We also specify Inline
RUNSTATS. The table space contains 3 indexes. We activate the PIB by specifying
SORTKEYS 150000 (150000 = 3 indexes * 50000 keys/index).
112 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS TABLE(ALL) INDEX(ALL KEYCARD)
INTO TABLE PAOLOR3.EXAMPN
WHEN(1:2 = X'000E')
(CASE_KEY POSITION(003:006) INTEGER
,SEVERITY_KEY POSITION(007:010) INTEGER
,RECIEVED_VIA_KEY POSITION(011:014) INTEGER
,TYPE_KEY POSITION(015:018) INTEGER
,ASSIGNED_TO_KEY POSITION(019:022) INTEGER
,TOD_KEY POSITION(023:026) INTEGER
,PRODUCT_KEY POSITION(027:030) INTEGER
,CUSTOMER_KEY POSITION(031:034) INTEGER
,STATUS_KEY POSITION(035:038) INTEGER
,RESOLUTION_KEY POSITION(039:042) INTEGER
,REASON_CLOSED_KEY POSITION(043:046) INTEGER
,CLOSED_BY_KEY POSITION(047:050) INTEGER
,TIME_CREATED_KEY POSITION(051:054) INTEGER
,TIME_CLOSED_KEY POSITION(055:058) INTEGER
,NEW_CASE_VOLUME POSITION(060:063) INTEGER
NULLIF(059)=X'FF'
,OPEN_CASE_VOLUME POSITION(065:068) INTEGER
NULLIF(064)=X'FF'
,CLOSED_CASE_VOLUME POSITION(070:073) INTEGER
NULLIF(069)=X'FF'
,CLOSED_ONFIRSTCONT POSITION(075:078) INTEGER
NULLIF(074)=X'FF'
,DAYS_TO_RESOLUTION POSITION(080:083) INTEGER
NULLIF(079)=X'FF'
,AVERAGE_DAYS_TO_RE POSITION(085:088) INTEGER
NULLIF(084)=X'FF'
,CLOSED_BY_OTHERS POSITION(090:093) INTEGER
NULLIF(089)=X'FF'
,CASE_SUBJECT POSITION(095:196) VARCHAR
NULLIF(094)=X'FF'
,CASE_NOTE POSITION(198:0454) VARCHAR
NULLIF(197)=X'FF'
,LAST_COMMENT POSITION(456:712) VARCHAR
NULLIF(455)=X'FF' )
In Example 6-2 we LOAD the same table space by specifying the input data set with a
template and adding discard processing. For this, we have to add a template for the input,
discard, error, and mapping data sets.
Example 6-2 LOAD of non-partitioned table space with PIB; discard processing
// EXEC PGM=DSNUTILB,PARM='DB2G,PAOLOR3'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//UTPRINT DD SYSOUT=*
//SYSIN DD *
TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..UNLOAD)
TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..DISCARD)
SPACE(10,10) CYL
TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR)
SPACE(10,10) CYL
TEMPLATE TSYSMAP DSN(PAOLOR3.&DB..&TS..SYSMAP)
SPACE(10,10) CYL
TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..P)
An extract of the job output is shown in Example 6-3. Here you can see that DB2 used 9
subtasks for building the 3 indexes in parallel (3 pairs of sort, build and 3 statistics) during the
SORTBLD phase. Inline COPY and inline statistics are produced.
114 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 6-3 Job output LOAD of a non-partitioned table space with PIB
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = PAOLOR3
DSNU050I DSNUGUTC - TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..UNLOAD)
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..DISCARD) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSMAP DSN(PAOLOR3.&DB..&TS..SYSMAP) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..P) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TRECOVER DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..R) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LOAD DATA REPLACE LOG NO SORTKEYS 150000 INDDN(TSYSREC) DISCARDDN(TSYSDISC) ERRDDN(TSYSERR)
MAPDDN(TSYSMAP) COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER) WORKDDN(TSYSUT1,TSORTOUT) STATISTICS TABLE(ALL) INDEX(ALL
KEYCARD)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMPN WHEN(1:2=X'000E')
DSNU650I -DB2G DSNURWI - (CASE_KEY POSITION(3:6) INTEGER,
DSNU650I -DB2G DSNURWI - SEVERITY_KEY POSITION(7:10) INTEGER,
DSNU650I -DB2G DSNURWI - RECIEVED_VIA_KEY POSITION(11:14) INTEGER,
DSNU650I -DB2G DSNURWI - TYPE_KEY POSITION(15:18) INTEGER,
DSNU650I -DB2G DSNURWI - ASSIGNED_TO_KEY POSITION(19:22) INTEGER,
DSNU650I -DB2G DSNURWI - TOD_KEY POSITION(23:26) INTEGER,
DSNU650I -DB2G DSNURWI - PRODUCT_KEY POSITION(27:30) INTEGER,
DSNU650I -DB2G DSNURWI - CUSTOMER_KEY POSITION(31:34) INTEGER,
DSNU650I -DB2G DSNURWI - STATUS_KEY POSITION(35:38) INTEGER,
DSNU650I -DB2G DSNURWI - RESOLUTION_KEY POSITION(39:42) INTEGER,
DSNU650I -DB2G DSNURWI - REASON_CLOSED_KEY POSITION(43:46) INTEGER,
DSNU650I -DB2G DSNURWI - CLOSED_BY_KEY POSITION(47:50) INTEGER,
DSNU650I -DB2G DSNURWI - TIME_CREATED_KEY POSITION(51:54) INTEGER,
DSNU650I -DB2G DSNURWI - TIME_CLOSED_KEY POSITION(55:58) INTEGER,
DSNU650I -DB2G DSNURWI - NEW_CASE_VOLUME POSITION(60:63) INTEGER NULLIF(59)=X'FF',
DSNU650I -DB2G DSNURWI - OPEN_CASE_VOLUME POSITION(65:68) INTEGER NULLIF(64)=X'FF',
DSNU650I -DB2G DSNURWI - CLOSED_CASE_VOLUME POSITION(70:73) INTEGER NULLIF(69)=X'FF',
DSNU650I -DB2G DSNURWI - CLOSED_ONFIRSTCONT POSITION(75:78) INTEGER NULLIF(74)=X'FF',
DSNU650I -DB2G DSNURWI - DAYS_TO_RESOLUTION POSITION(80:83) INTEGER NULLIF(79)=X'FF',
DSNU650I -DB2G DSNURWI - AVERAGE_DAYS_TO_RE POSITION(85:88) INTEGER NULLIF(84)=X'FF',
DSNU650I -DB2G DSNURWI - CLOSED_BY_OTHERS POSITION(90:93) INTEGER NULLIF(89)=X'FF',
DSNU650I -DB2G DSNURWI - CASE_SUBJECT POSITION(95:196) VARCHAR NULLIF(94)=X'FF',
DSNU650I -DB2G DSNURWI - CASE_NOTE POSITION(198:454) VARCHAR NULLIF(197)=X'FF',
DSNU650I -DB2G DSNURWI - LAST_COMMENT POSITION(456:712) VARCHAR NULLIF(455)=X'FF')
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSREC
DDNAME=SYS00001
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.UNLOAD
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSUT1
DDNAME=SYS00002
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.SYSUT1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSORTOUT
DDNAME=SYS00003
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.SORTOUT
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSDISC
DDNAME=SYS00004
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.DISCARD
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSERR
DDNAME=SYS00005
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.SYSERR
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSMAP
DDNAME=SYS00006
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.SYSMAP
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSCOPY
DDNAME=SYS00007
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.D2001166.T213106.P
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TRECOVER
DDNAME=SYS00008
DSN=PAOLOR3.PAOLOR3N.PAOLOR3N.D2001166.T213106.R
DSNU350I -DB2G DSNURRST - EXISTING RECORDS DELETED FROM TABLESPACE
DSNU395I DSNURPIB - INDEXES WILL BE BUILT IN PARALLEL, NUMBER OF TASKS = 9
DSNU400I DSNURBID - COPY PROCESSED FOR TABLESPACE PAOLOR3N.PAOLOR3N
NUMBER OF PAGES=2693
AVERAGE PERCENT FREE SPACE PER PAGE = 7.99
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=00:00:13
DSNU610I -DB2G DSNUSUTP - SYSTABLEPART CATALOG UPDATE FOR PAOLOR3N.PAOLOR3N SUCCESSFUL
DSNU610I -DB2G DSNUSUTB - SYSTABLES CATALOG UPDATE FOR PAOLOR3.EXAMPN SUCCESSFUL
DSNU610I -DB2G DSNUSUCO - SYSCOLUMNS CATALOG UPDATE FOR PAOLOR3.EXAMPN SUCCESSFUL
DSNU610I -DB2G DSNUSUTS - SYSTABLESPACE CATALOG UPDATE FOR PAOLOR3N.PAOLOR3N SUCCESSFUL
DSNU620I -DB2G DSNURDRT - RUNSTATS CATALOG TIMESTAMP = 2001-06-15-17.31.11.762304
In Example 6-4 we show the loading of a partitioned table space with 3 partitions using PIB.
Here we are not using the partition parallelism introduced in DB2 V7 as explained in 6.3,
“Partition parallelism within one LOAD” on page 118. The 3 input data sets are concatenated
in the JCL to form one single sequential data set allocated to the DD card SYSREC
containing about 600000 records. We use templates to specify the Inline COPY data sets and
work data sets. We also use Inline RUNSTATS. The table space contains 2 indexes (the
partitioned index and one NPI). We activate the PIB by specifying SORTKEYS 1200000
(1200000 = 2 indexes * 600000 keys/index). We verified that DB2 will use 6 subtasks for
building the 2 indexes in parallel (2 pairs of sort, build and 2 statistics) during the SORTBLD
phase. Inline COPY and inline statistics are produced.
116 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER)
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS TABLE(ALL) INDEX(ALL KEYCARD)
INTO TABLE PAOLOR3.EXAMPP
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
An extract of the job output is shown in Example 6-5. Here you can see that DB2 used 6
subtasks for building the 2 indexes in parallel (2 pairs of sort, build and 2 statistics) during the
SORTBLD phase. Inline COPY and inline statistics are produced.
Example 6-5 Job output LOAD of partitioned table space with PIB; no parallelism
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = PAOLOR3
DSNU050I DSNUGUTC - TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..DISCARD) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSMAP DSN(PAOLOR3.&DB..&TS..SYSMAP) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..P) SPACE(100,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TRECOVER DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..R) SPACE(100,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LOAD DATA REPLACE LOG NO SORTKEYS 1200000 DISCARDDN(TSYSDISC) ERRDDN(TSYSERR) MAPDDN(
TSYSMAP) COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER) WORKDDN(TSYSUT1,TSORTOUT) STATISTICS TABLE(ALL) INDEX(ALL KEYCARD)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMPP
DSNU650I -DB2G DSNURWI - (PS_PARTKEY POSITION(3:6) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPKEY POSITION(7:10) INTEGER,
DSNU650I -DB2G DSNURWI - PS_AVAILQTY POSITION(11:14) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPLYCOST POSITION(15:18) FLOAT(21),
DSNU650I -DB2G DSNURWI - PS_COMMENT POSITION(19:219) VARCHAR)
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSUT1
DDNAME=SYS00001
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.SYSUT1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSORTOUT
DDNAME=SYS00002
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.SORTOUT
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSDISC
DDNAME=SYS00003
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.DISCARD
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSERR
DDNAME=SYS00004
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.SYSERR
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSMAP
DDNAME=SYS00005
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.SYSMAP
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSCOPY
DDNAME=SYS00006
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.D2001167.T005804.P
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TRECOVER
DDNAME=SYS00007
DSN=PAOLOR3.PAOLOR3P.PAOLOR3P.D2001167.T005804.R
DSNU350I -DB2G DSNURRST - EXISTING RECORDS DELETED FROM TABLESPACE
DSNU395I DSNURPIB - INDEXES WILL BE BUILT IN PARALLEL, NUMBER OF TASKS = 6
DSNU400I DSNURBID - COPY PROCESSED FOR TABLESPACE PAOLOR3P.PAOLOR3P
NUMBER OF PAGES=26152
AVERAGE PERCENT FREE SPACE PER PAGE = 6.47
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=00:00:36
DSNU610I -DB2G DSNUSUTP - SYSTABLEPART CATALOG UPDATE FOR PAOLOR3P.PAOLOR3P SUCCESSFUL
DSNU610I -DB2G DSNUSUPT - SYSTABSTATS CATALOG UPDATE FOR PAOLOR3.EXAMPP SUCCESSFUL
DSNU610I -DB2G DSNUSUPC - SYSCOLSTATS CATALOG UPDATE FOR PAOLOR3.EXAMPP SUCCESSFUL
DSNU610I -DB2G DSNUSUTB - SYSTABLES CATALOG UPDATE FOR PAOLOR3.EXAMPP SUCCESSFUL
DSNU610I -DB2G DSNUSUCO - SYSCOLUMNS CATALOG UPDATE FOR PAOLOR3.EXAMPP SUCCESSFUL
DSNU610I -DB2G DSNUSUTS - SYSTABLESPACE CATALOG UPDATE FOR PAOLOR3P.PAOLOR3P SUCCESSFUL
DSNU620I -DB2G DSNURDRT - RUNSTATS CATALOG TIMESTAMP = 2001-06-15-20.58.11.634612
In 6.3, “Partition parallelism within one LOAD” on page 118, we show that in DB2 V7,
parallelism can be increased by also loading the partitions in parallel.
Prior to V7, one solution was to make the table spaces partitioned and try to load the
partitions in parallel. The only way of doing this was by submitting different jobs in parallel,
each job loading a different partition or set of partitions. This worked fine if there were no
non-partitioned indexes (NPIs). If there were NPIs, you ended up with considerable
contention problems. For that reason, many customers first dropped the NPIs and then
recreated them afterwards.
In DB2 V7 an improvement has been made allowing partitions to be loaded in parallel in the
same job and reducing the NPI contention. This single job will now read one input data set
per partition and launch multiple subtasks, optimally one for each partition.
In order to allow each partition to be loaded from a separate data set, with discards
(optionally) written to discard data sets for that partition, the LOAD syntax allows the
specification of the INDDN and DISCARDDN keywords as part of the INTO TABLE PART
specification.
Make sure that the individual input data sets only contain records for the corresponding
partition, as all other records will be discarded.
Optimally the number of load subtasks will be equal to the number of partitions to be loaded.
DB2 determines the degree of parallelism, that is, the number of load subtasks, based on the
following factors:
Number of CPUs
Available virtual storage in the batch region
Available number of DB2 threads
See also 3.7, “Considerations for running parallel subtasks” on page 67.
118 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
It is still recommended to submit multiple parallel jobs per partition if there are no NPIs. This
to have the parts of the partitioned index built in parallel, which is not the case when using the
new partition parallel LOAD function.
There are two ways you can deploy this partition parallel LOAD: with or without Parallel Index
Build (PIB).
Error/Map
SYS RELOAD
REC1 Part 1
PI
D
REC2
IL
BU
key/RID pairs SYS SORT SORT BUILD NPI1
UT1 OUT
BU
SYS
Part 3
IL
RELOAD
D
SORTWKnn
REC3
NPI2
SYS
REC4
RELOAD Part 4
See also 3.5.1, “LOAD partition parallelism without PIB” on page 63.
In Example 6-6 we show the loading of the same partitioned table space as in Example 6-4
on page 116, but now using partition parallelism without PIB. The 3 input data sets are
now allocated to a separate DD card in the JCL, one per partition. We use templates to
specify the Inline COPY data sets and work data sets. We also ask for Inline RUNSTATS. The
3 partitions will be loaded in parallel and the indexes will be built serially, avoiding NPI
contention. We do not specify the SORTKEYS option.
In Example 6-7 we LOAD the same table space by specifying the input data sets with a
template (using the &PART variable) and adding discard processing. For this, we have to add
a template for the input, discard, error, and mapping data sets.
Example 6-7 Partition parallelism without PIB using a template for input data sets
// EXEC PGM=DSNUTILB,PARM='DB2G,PAOLOR3'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//UTPRINT DD SYSOUT=*
//SYSIN DD *
TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..U&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..D&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR)
SPACE(10,10) CYL
TEMPLATE TSYSMAP DSN(PAOLOR3.&DB..&TS..SYSMAP)
SPACE(10,10) CYL
TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..P)
SPACE(100,10) CYL
120 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
TEMPLATE TRECOVER DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME..R)
SPACE(100,10) CYL
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1)
SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT)
SPACE(10,10) CYL
LOAD DATA REPLACE LOG NO
COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER)
ERRDDN(TSYSERR) MAPDDN(TSYSMAP)
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS TABLE(ALL) INDEX(ALL KEYCARD)
INTO TABLE PAOLOR3.EXAMPP PART 1
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMPP PART 2
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMPP PART 3
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
An extract of the job output is shown in Example 6-8. Here you can see that DB2 used 3
subtasks for loading the 3 partitions in parallel. The 2 indexes are build serially. Inline COPY
and inline statistics are produced.
122 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DSNU610I -DB2G DSNUSUCD - SYSCOLDIST CATALOG UPDATE FOR PAOLOR3.I_EXAMPP_1 SUCCESSFUL
DSNU610I -DB2G DSNUSUIP - SYSINDEXPART CATALOG UPDATE FOR PAOLOR3.I_EXAMPP_2 SUCCESSFUL
DSNU610I -DB2G DSNUSUIX - SYSINDEXES CATALOG UPDATE FOR PAOLOR3.I_EXAMPP_2 SUCCESSFUL
DSNU610I -DB2G DSNUSUCO - SYSCOLUMNS CATALOG UPDATE FOR PAOLOR3.EXAMPP SUCCESSFUL
DSNU610I -DB2G DSNUSUCD - SYSCOLDIST CATALOG UPDATE FOR PAOLOR3.I_EXAMPP_2 SUCCESSFUL
DSNU620I -DB2G DSNURDRI - RUNSTATS CATALOG TIMESTAMP = 2001-06-15-21.24.53.477255
DSNU258I DSNURBXD - BUILD PHASE STATISTICS - NUMBER OF INDEXES=2
DSNU259I DSNURBXD - BUILD PHASE COMPLETE, ELAPSED TIME=00:00:31
DSNU010I DSNUGBAC - UTILITY EXECUTION COMPLETE, HIGHEST RETURN CODE=0
E rr o r/M a p
SYS RELO AD
REC1 P a rt 1
SYS
REC2
RELO AD P a rt 2
SORT SO RTBLD
S W 02W Knn
S W 02W K xx N P I1
B U IL D
SYS
REC3
RELO AD P a rt 3
SYS
REC4
RELO AD P a rt 4
As already explained in 6.2, “Parallel Index Build” on page 111, this is done in the new
SORTBLD phase, which replaces the SORT and BUILD phases. Moreover, the SYSUT1 and
SORTOUT work data sets are no longer used, because the key/RIDs pairs are now directly
piped in memory between subtasks, thus eliminating considerable I/O processing. Because
each NPI is built by its own build task, NPI contention is eliminated.
Partition parallelism with PIB is activated by the SORTKEYS n option, n being an estimation
of the number of keys to be sorted. If you specify n equals to zero, PIB will not be activated.
DB2 V5 introduced the SORTKEYS option to use Sort and eliminate multiple I/O accesses to
workfiles when building the indexes. DB2 V6 used the same option to activate PIB as well. In
V7 this feature can also be used in combination with partition parallel LOAD.
See also 3.5.2, “LOAD partition parallelism with PIB” on page 64.
124 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
An extract of the job output is shown in Example 6-10. Here you can see that DB2 used 3
subtasks for loading the 3 partitions in parallel. DB2 used also 6 subtasks for building the 2
indexes in parallel (2 pairs of sort, build and 2 statistics) during the SORTBLD phase. Inline
COPY and inline statistics are produced.
Note: All examples shown in 6.2, “Parallel Index Build” on page 111 and 6.3, “Partition
parallelism within one LOAD” on page 118, were run on a system that was not tuned for
optimal performance. They are shown for example only. Actions for tuning I/O and buffer
pools are needed to avoid bottlenecks and achieving maximum parallelism if you start
using Parallel Index Build and Partition Parallelism.
In Example 6-11 we show how to use templates to create inline copies on the partition level
instead of having one single Inline COPY data set on the table space level. This is done to
avoid contention on the single Inline COPY data set from the different load subtasks.
Example 6-11 Partition parallelism with inline copies on the partition level
TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..U&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..D&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR)
SPACE(10,10) CYL
TEMPLATE TSYSMAP DSN(PAOLOR3.&DB..&TS..SYSMAP)
SPACE(10,10) CYL
TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..T&TIME..P&PART.)
SPACE(100,10) CYL
126 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
TEMPLATE TRECOVER DSN(PAOLOR3.&DB..&TS..T&TIME..R&PART.)
SPACE(100,10) CYL
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1)
SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT)
SPACE(10,10) CYL
LOAD DATA LOG NO SORTKEYS 1200000
ERRDDN(TSYSERR) MAPDDN(TSYSMAP)
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS TABLE(ALL) INDEX(ALL KEYCARD)
INTO TABLE PAOLOR3.EXAMPP PART 1 REPLACE
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMPP PART 2 REPLACE
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMPP PART 3 REPLACE
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
COPYDDN(TSYSCOPY) RECOVERYDDN(TRECOVER)
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
6.4 Preformat
The PREFORMAT option of the LOAD utility was introduced in DB2 V5. It will preformat a
table space and its associated index spaces during LOAD time and prepare it for INSERT
processing. It reduces execution time delays due to formatting and eliminates the need to
preformat new pages in a table space during INSERT processing. However, when the
preformatted space is utilized and DB2 has to extend the table space, normal data set
extending and preformatting will occur.
Preformatting causes all free pages in the VSAM cluster (between the high used RBA and
high allocated RBA) to be preformatted. This includes secondary extents that may have been
allocated. Preformatting occurs after the data has been loaded and the indexes are built.
Preformatting can operate on the entire table space and its index spaces, or on a partition of
a partitioned table space and its corresponding partition index space. Note that preformatting
an entire partitioned table space at the table space level can inhibit concurrent processing of
separate partitions. In this case you should specify PART integer PREFORMAT to serialize
on the partition level, as illustrated in Example 6-12.
Preformatting is only recommended for table spaces and index spaces that are part of high
INSERT applications where normal formatting can cause unacceptable delays or inconsistent
elapsed times. It is not recommended for:
Tables that are only populated by the LOAD REPLACE utility (for example, table spaces
that are refreshed every day with LOAD REPLACE). In that case, PREFORMAT will only
cause an additional elapsed time of the LOAD processing.
Tables that are mainly used for query processing. The additional preformatted pages can
cause table space scans to read additional empty pages, extending the elapsed time for
these queries.
Use the PREFORMAT option with care. General use of this option in all your LOAD jobs is not
recommended. The best results may be seen when the final size of the table space is known,
and when there is a high ratio of inserts to read operations.
Note: DB2 V7 introduced asynchronous INSERT preformatting. This new function will
asynchronously preformat the next range of new pages during INSERT processing if the
current page is within a predefined range from the end of the formatted pages.
6.5 Reuse
REUSE is a feature of the LOAD utility introduced in DB2 V5. It can only be used with the
LOAD REPLACE option and with STOGROUP defined data sets, also called DB2-managed
data sets. Normally, if you do not specify REUSE, LOAD REPLACE of a DB2-managed data
set will delete and redefine the underlying VSAM data sets. With the REUSE option, the
underlying VSAM data sets will not be deleted and redefined, but only logically reset; that
means the high used RBA is reset back to zero.
The logical reset time of a VSAM data set is much shorter than the physical delete and
redefine time. Savings of 66% in elapsed time and 30% in CPU time have been measured on
a LOAD REPLACE of an empty table space with 128 partitions. But bear in mind that the
logical reset of a VSAM data set will not reclaim any secondary extents.
The REUSE option can operate on the entire table space and its index spaces, or on a
partition of a partitioned table space and its corresponding partition index space. In the last
case you should specify PART integer REPLACE REUSE as illustrated in Example 6-13.
Example 6-13 Specifying the REUSE option during LOAD REPLACE
Logically reset and reuse of an entire table space and associated index spaces:
LOAD DATA REPLACE REUSE INTO TABLE table name
Logically reset and reuse of a table space partition and associated index partition:
LOAD DATA INTO TABLE table name PART 1 REPLACE REUSE
128 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
We recommend the use of REUSE option:
If you want to preserve the allocated space on the disk volumes between consecutive
LOAD REPLACE operations (for I/O tuning reasons or to avoid space problems in case of
reallocation).
To reduce CPU and elapsed times, if you have partitioned table spaces with a lot of
underlying VSAM data sets.
To increase the throughput by avoiding service task contention, if you have a lot of
concurrent LOAD REPLACE utilities run in parallel. Since DB2 V6, DB2 can have up to 20
service tasks run in parallel.
Do not use the REUSE option if you want to do any of the following:
Reduce the number of secondary extents.
Activate altered PRIQTY and SECQTY values.
Move the data sets to another STOGROUP.
LOAD SELECT
LOCAL DB2,
DRDA, or
Data DataJoiner
Conversion DB2 family
Oracle
Sybase
Informix
IMS
VSAM
DB2 for SQL Server
OS/390 NCR Teradata
and z/OS
The data is read from the source location with a dynamic SQL statement and loaded in a
table at the target location by the LOAD utility. It is a single job process that replaces the
typical sequence of jobs of unloading, file transfer, and loading the data. The source data can
be on the local system, on a remote DRDA server, or on any system accessible via Data
Joiner.
The Cross Loader was introduced in DB2 V7 after GA with PTF UQ55541 for APAR PQ45268
and PTF UQ55542 for APAR PQ46759. Therefore, you should check if these PTFs, and all
their prerequisites, are applied to your DB2 system before trying to use the Cross Loader.
A typical Cross Loader example consists of the definition of the dynamic SQL statement via
the EXEC SQL DECLARE CURSOR utility statement, followed by a LOAD utility statement
referring to this cursor. This is illustrated in Example 6-14. In this example we LOAD an
existing summary table called EMPSUMMARY with data coming from the local sample table
DSN8710.EMP. The aggregation is done in the SQL statement of the CURSOR definition.
EXEC SQL is a new DB2 V7 utility statement explained in more detail in 6.6.1, “Using SQL
statements in the utility input stream” on page 130
INCURSOR is a new DB2 V7 LOAD option explained in more detail in 6.6.2, “Using the Cross
Loader” on page 136
The EXEC SQL statement requires no extra privileges other than the ones required by the
SQL statements itself.
130 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
You can only put one SQL statement between the EXEC SQL and ENDEXEC keywords. The
SQL statement can be any dynamic SQL statement that can be used as input for the
EXECUTE IMMEDIATE statement, as listed in Example 6-15.
COMMIT,ROLLBACK operation
(See also “Thread behavior and commit scope of the EXEC SQL utility statement” on
page 133).
In Example 6-16 we create a new table in the default database DSNDB04 with the same
layout as SYSIBM.SYSTABLES.
Example 6-16 Create a new table with the same layout as SYSIBM.SYSTABLES
EXEC SQL
CREATE TABLE PAOLOR3.SYSTABLES LIKE SYSIBM.SYSTABLES
ENDEXEC
In Example 6-17 we give this table a comment and allow everybody read access. Note the
two separate steps.
In the same way, we are able to create indexes on this table, create views on it, and so on.
All this is done in the utility input stream.
In Example 6-19 we show how this cursor can now be used in a LOAD statement (Cross
Loader).
The SQL statement in the declare cursor definition can be any valid SQL statement including
joins, unions, data conversions, aggregations, special registers, and UDFs. The source data
can be on a local server or remote server using DRDA access or Data Joiner. See 6.6.2,
“Using the Cross Loader” on page 136 for more details.
The SQL statement placed after the EXEC SQL keyword is parsed and checked for errors
during its execution. It is not checked during the UTILINIT phase of the utility. If an invalid SQL
statement is found during the execution of the EXEC SQL, the utility job immediately ends
with return code 8. If the SQL statement is valid, but fails during execution (with a negative
SQL code), the utility also immediately ends with return code 8 as well. So be aware that if
you have syntax errors in an EXEC SQL statement and the utility job gets started, the
previous EXEC SQL statements and utility statements are already executed before the utility
ends. You might have to remove these from the input stream before rerunning the job.
Normally, a utility that encounters an SQL error during the EXEC SQL statement execution
always ends with return code 8 and never abends with ABEND04E. So the utility is not in a
stopped state, the utility is not restartable with the RESTART option and a TERMINATE
UTILITY command is NOT necessary. But be aware that all previous EXEC SQL and utility
statements are executed successfully and might have to be removed first before rerunning the
job.
Currently it is impossible to influence the behavior of the utility job after a failing EXEC SQL
statement. An OPTION to allow to discard the failing EXEC SQL, and to continue the utility
step when the EXEC SQL failed, is currently not available in DB2 V7.
132 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If the utility input stream contains both EXEC SQL statements and other utility statements,
and a utility statement failed so that DB2 put the utility in the stopped state, the utility step is
restartable with the RESTART keyword. During restart, all the non-select dynamic SQL
statements from EXEC SQL statements already executed, are skipped, except the ones with
SQL SET statements. All the declare cursor definitions within EXEC SQL statements already
executed, are reactivated so that they can be used in the following LOAD statements.
This can be illustrated with Example 6-20. This job contains one non-select dynamic SQL
statement (CREATE TABLE), one cursor definition, and one utility LOAD statement. If the
CREATE TABLE fails with a negative SQL code, the utility will immediately end with return
code 8 and the utility will not be restartable with the RESTART keyword. If the CREATE
TABLE executes successfully, but the DECLARE CURSOR fails, the utility will also end with
return code 8, but the table will have been created. If both CREATE TABLE and DECLARE
CURSOR execute successfully, but the LOAD statement fails so that DB2 puts the utility in
the stopped state (for example because of a resource unavailable condition) the utility will be
restartable. During restart the CREATE TABLE statement will be skipped but the DECLARE
CURSOR statement will be re-executed, so that it can be used in the LOAD statement.
Thread behavior and commit scope of the EXEC SQL utility statement
The EXEC SQL statement runs in a thread that is separate from the utility thread. This implies
that the EXEC SQL SET CURRENT register operations do influence all following EXEC SQL
statements in the utility stream input, but they do not influence the real utility statements like
LOAD, REORG, and so on.
We can illustrate this with the utility job in Example 6-21. This job runs with USER=PAOLOR3.
So the DB2 primary AUTHID is PAOLOR3. We change the current SQLID with EXEC SQL to
PAOLOR4 and try to see what table creator is used if we do not prefix our tables in the
following EXEC SQL statements and other utility statements.
Example 6-21 JCL for testing the thread behavior of EXEC SQL
//PAOLOR3A JOB (ACCOUNT),'PAOLOR3',NOTIFY=PAOLOR3,USER=PAOLOR3
//*
// EXEC PGM=DSNUTILB,PARM='DB2G,MEPL'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//SYSIN DD *
EXEC SQL
SET CURRENT SQLID = 'PAOLOR4'
ENDEXEC
EXEC SQL
CREATE TABLE MYTABLE LIKE SYSIBM.SYSTABLES
In the first EXEC SQL, the current SQLID is set to PAOLOR4. In the following EXEC SQL,
the table MYTABLE is created without specifying a table creator. We verified in the DB2
catalog that this table is created with CREATOR = PAOLOR4, which proves that the previous
current SQLID was used in this EXEC SQL. If we try to LOAD this table with the Cross Loader
we have to specify PAOLOR4.MYTABLE otherwise the LOAD fails, because the LOAD utility
thread does not use the current SQLID set in the previous EXEC SQL. Instead, its current
SQLID is still equal to the primary AUTHID, PAOLOR3, which it inherited from the USER
keyword in the JCL.
Each block of EXEC SQL and ENDEXEC is a separate unit-of-work. Each block can only
contain a single SQL statement. The unit-of-work is always committed when the SQL
statement is executed successfully. Although the COMMIT and ROLLBACK statements are
accepted, these statements do not perform any function.
134 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
We verified the above behavior with the utility job shown in Example 6-23. In the first EXEC
SQL we create a new table. This is followed by an EXEC SQL ROLLBACK to try to undo the
CREATE statement. In the third EXEC SQL we populate this table with an INSERT statement
and try to undo the INSERT in the fourth EXEC SQL. If the ROLLBACK statement in the
second EXEC SQL would undo the CREATE, we expect the third EXEC SQL to fail.
Example 6-23 JCL for verifying the commit scope of EXEC SQL
//PAOLOR3A JOB (ACCOUNT),'PAOLOR3',NOTIFY=PAOLOR3,USER=PAOLOR3
// EXEC PGM=DSNUTILB,PARM='DB2G,MEPL'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//SYSIN DD *
EXEC SQL
CREATE TABLE PAOLOR3.SYSTABLESR LIKE SYSIBM.SYSTABLES
ENDEXEC
EXEC SQL
ROLLBACK
ENDEXEC
EXEC SQL
INSERT INTO PAOLOR3.SYSTABLESR
SELECT * FROM SYSIBM.SYSTABLES
WHERE TYPE = 'T'
AND CREATOR LIKE 'PAOLOR%'
ENDEXEC
EXEC SQL
ROLLBACK
ENDEXEC
The result of this job is shown in Example 6-24. As you can see, all four EXEC SQL
statements executed successfully. We verified with SPUFI that the table
PAOLOR3.SYSTABLESR was successfully created and populated with 35 rows. So the
EXEC SQL ROLLBACK statements did not influence the previous EXEC SQL statements.
So, although you can code COMMIT and ROLLBACK statements after the EXEC SQL
keyword, they do not influence the behavior of previous EXEC SQL commands, and their use
seems to be meaningless in this context.
EXEC SQL eliminates additional job steps in a job to execute dynamic SQL statements
before, between or after the regular utility steps. It can simplify the JCL coding by eliminating
this dynamic SQL applications like DSNTIAD or DSNTEP2 from the JCL stream and it
enables to merge different utility steps, separated by dynamic SQL applications, into one
single utility step. But be aware of the restrictions imposed by the EXEC SQL statement.
136 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Declaring a cursor for use with the Cross Loader
The following rules apply to the cursor you DECLARE in the EXEC SQL statement:
You must always declare the cursor before the LOAD statement. Otherwise you get a
message DSNU1190I CURSOR NOT DECLARED.
The cursor name can only be declared once within the whole utility input stream. It can be
referred in multiple LOAD statements for LOADING different target tables with the same
data. If you declare an existing cursor, you get the error message DSNU1189I CURSOR
ALREADY DECLARED.
The table being loaded cannot be part of the select statement. So you cannot LOAD into
the same table where you defined the cursor.
The column names in the result set of the select statement must be identical to the
column names in the table being loaded. This can be achieved by using the AS clause in
the SQL statement. If you have column names in the result set which do not match any
column name in the target table you get the error message DSNU053I FIELD 'colname' NOT
FOUND or the error message DSNU329I FIELD 'colnumber' IS NOT DEFAULTABLE.
Pay special attention to derived column names which are the result of column functions
such as SUM or AVG.
You are able to skip unwanted columns in the result set with the LOAD IGNOREFIELDS
YES option, which skips any columns in the cursor result set that are not present in the
target table being load. However use this IGNOREFIELDS option with care, as it also
skips misspelled columns you wanted to LOAD.
The sequence of the columns in the result set may differ from the sequence of the
columns in the table being loaded. DB2 matches the columns by their names and not by
their sequence.
The number of columns in the cursor can be less than the number of columns in the target
table. All missing columns are loaded with their default values. If the missing columns are
defined in the target table as NOT NULL without default, the LOAD fails with message
DSNU323I COLUMN 'colname' IS OMITTED.
If the data types in the target table do not match with the data types in the cursor, DB2
tries to convert as much as possible between compatible data types. Examples are from
CHAR to VARCHAR or from INTEGER to FLOAT. If the conversion fails, you get
messages like DSNU390I INVALID CONVERSION FOR FIELD ‘columnname’ (conversion error
detected before the actual LOAD during the matching process) or DSNU334I INPUT FIELD
'columnname' INVALID FOR 'tablename', ERROR CODE cc (conversion error detected during
the LOAD). You might use DB2 supplied built-in functions or own developed UDFs in the
SQL statement to force more sophisticated conversions. An example is the CHAR function
which allows you to convert from INTEGER to CHARACTER.
If the encoding scheme of the source data in the cursor and the target table differ, DB2
automatically converts the encoding schemes. An example may be conversion from
EBCDIC to UNICODE or from ASCII to EBCDIC. Remember that referencing tables with
more than one encoding scheme (ASCII,EBCDIC or UNICODE) in a single SQL
statement is not supported and finishes with SQLCODE -873.
If the target table contains UDTs, you do not have to specify the appropriate casting
functions in the cursor SQL statement to LOAD the table. If the source data in the cursor
contains UDTs, you also do not have to specify casting functions in the select list of the
cursor SQL statement. But additional WHERE clauses in the cursor SQL statement might
require casting functions, or you could end up with SQLCODE -401.
If your table contains LOB columns, the maximum length is 32 KB. You cannot use the
Cross Loader if you want to transfer LOB columns larger than 32 KB. Instead you should
use a program with embedded SQL. If a table contains LOB columns, there is at least one
Apart from the aforementioned rules, the SQL statement in the declare cursor definition
can be any valid SQL statement including joins, unions, conversions, aggregations,
special registers, UDFs. The source data can be local or remote using DRDA access or
Data Joiner. Remote tables are always referred by their three-part-name
LOCATION.CREATOR.NAME or by an ALIAS name CREATOR.NAME. You cannot use an
explicit CONNECT statement to connect to the remote location.
You can use any option of the LOAD utility except the following:
SHRLEVEL CHANGE: There is no support for online LOAD in the Cross Loader. If you
specify SHRLEVEL CHANGE, you get the message DSNU070I KEYWORD OR OPERAND
'SHRLEVEL CHANGE' INVALID WITH INCURSOR.
FORMAT: You cannot specify the UNLOAD or SQL/DS formats.
FLOAT(IEEE): The cursor always returns FLOAT(S390).
EBCDIC,ASCII,UNICODE: The Cross Loader always automatically converts the encoding
schemes. The target table must be created with the correct CCSID.
NOSUBS: You must accept substitution characters in a string.
CONTINUEIF: You cannot treat each input record as a part of a larger record.
WHEN: You cannot use the WHEN option to filter the result set of the cursor. Instead, filter
the result set by using the appropriate WHERE clauses in the select statement.
Field specifications: You cannot specify field specifications in the LOAD Statement. If
you specify field specifications, you get the message DSNU331I FIELD LISTS ARE NOT
ALLOWED WITH INCURSOR KEYWORD.
138 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The same cursor can be reused multiple times in the utility job step. It can be referenced in
different LOAD statements to LOAD the same data in different target tables. It can also be
reused in one LOAD statement containing multiple INTO TABLE clauses to LOAD the same
data in different target tables (in the same table space).
If a cursor is used more than once in the utility job step, the data is transferred more than
once as well. Each time you refer to a cursor name in a LOAD statement, the SQL statement
is re-executed. There is no buffering nor reuse of the previous transferred data. So the result
sets can differ if the source data is being updated or if you use time dependent functions like
CURRENT TIMESTAMP.
It is recommended that you LOAD your data in clustering sequence. If loading from a
sequential data set this can be done by first sorting the sequential data set in clustering
sequence. With the Cross Loader this can be achieved by sorting the result set of the cursor
by using an ORDER BY statement on the clustering columns.
You can LOAD partitions in parallel by specifying a different cursor for each partition.
Example 6-26 Cross Loader example with AS clause and default columns
EXEC SQL
CREATE TABLE PAOLOR3.EXAMP2
(COL1 TIMESTAMP NOT NULL WITH DEFAULT,
COL2 VARCHAR(18),
COL3 VARCHAR(18),
COL4 VARCHAR(18),
COL5 VARCHAR(18),
COL6 VARCHAR(18))
ENDEXEC
EXEC SQL
DECLARE C2 CURSOR FOR
SELECT DBNAME AS COL4,
TSNAME AS COL5,
CREATOR AS COL2,
NAME AS COL3,
CHAR(OBID) AS COL6
FROM SYSIBM.SYSTABLES
WHERE TYPE = 'T'
ENDEXEC
TEMPLATE TSYSCOPY
DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME.)
SPACE(100,10) CYL
LOAD DATA REPLACE REUSE LOG NO
INCURSOR C2
COPYDDN(TSYSCOPY)
STATISTICS
INTO TABLE PAOLOR3.EXAMP2
In Example 6-27 we show the use of the IGNOREFIELDS YES LOAD option to ignore
columns in the cursor that are not present in the target table. We also created a clustering
unique index on the target table PAOLOR3.EXAMP3. We added the ORDER BY clause in the
cursor definition to LOAD the table in clustering sequence. We have to add templates
TSYSUT1 and TSORTOUT for the WORKDDN work data sets because of the index.
140 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
CREATE UNIQUE INDEX PAOLOR3.XEXAMP3
ON PAOLOR3.EXAMP3 (CREATOR,NAME)
USING STOGROUP SYSDEFLT
CLUSTER
ENDEXEC
EXEC SQL
DECLARE C3 CURSOR FOR
SELECT *
FROM SYSIBM.SYSTABLES
WHERE TYPE = 'T'
ORDER BY CREATOR,NAME
ENDEXEC
TEMPLATE TSYSCOPY
DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME.)
SPACE(100,10) CYL
TEMPLATE TSYSUT1
DSN(PAOLOR3.&DB..&TS..&UTILID..SYSUT1)
SPACE(100,10) CYL
TEMPLATE TSORTOUT
DSN(PAOLOR3.&DB..&TS..&UTILID..SORTOUT)
SPACE(100,10) CYL
LOAD DATA REPLACE LOG NO
INCURSOR C3
COPYDDN(TSYSCOPY)
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS
INTO TABLE PAOLOR3.EXAMP3
IGNOREFIELDS YES
We will now LOAD a table containing UDT columns. In Example 6-28 we first create and
populate a table containing a UDT column. This UDT is defined as CHAR(8) WITH
COMPARISONS. We do not have to specify the appropriate casting function in the select list
of the cursor SQL statement to LOAD the table.
Example 6-28 Populating a table containing a UDT column with the Cross Loader
EXEC SQL
CREATE DISTINCT TYPE PAOLOR3.TABLE_OWNER AS CHAR(8) WITH COMPARISONS
ENDEXEC
EXEC SQL
CREATE TABLE PAOLOR3.EXAMP4
(COL1 PAOLOR3.TABLE_OWNER,
COL2 VARCHAR(18),
COL3 CHAR(8),
COL4 CHAR(8))
ENDEXEC
EXEC SQL
CREATE UNIQUE INDEX PAOLOR3.XEXAMP4
ON PAOLOR3.EXAMP4 (COL1,COL2)
USING STOGROUP SYSDEFLT
ENDEXEC
EXEC SQL
DECLARE C4 CURSOR FOR
SELECT CREATOR AS COL1,
NAME AS COL2,
DBNAME AS COL3,
TSNAME AS COL4
FROM SYSIBM.SYSTABLES
WHERE TYPE = 'T'
In Example 6-29 we copy a subset of this table back to PAOLOR3.EXAMP3 with the Cross
Loader, converting the UDT back to the standard CHAR data type. We do not have to specify
any casting function in the select list of the cursor but we have to specify it in the WHERE
clause to create the subset. We only want to copy the rows corresponding with COL1 equals
to ‘SYSIBM’ and rename the CREATOR value to ‘SYSIBMX’ to prevent duplicate values in
PAOLOR3.EXAMP3. As we cannot use an Inline COPY with LOAD RESUME(YES), we have
to take the image copy with an additional COPY statement.
142 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
In Example 6-30 we populate a table containing a LOB column with the Cross Loader.
Remember that the maximum LOB size that can be transferred by the Cross Loader is 32 KB.
We first create the target base table, auxiliary table, and corresponding table spaces and
indexes with the EXEC SQL statement. We declare the ROWID as NOT NULL GENERATED
ALWAYS and do not copy the ROWID value from the source table. After the LOAD we collect
STATISTICS on all table spaces using a LISTDEF with the ALL LOB indicator keyword to
include both base and LOB table spaces.
144 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
CLOSED_BY_KEY ,
TIME_CREATED_KEY ,
TIME_CLOSED_KEY )
USING STOGROUP SGLP0002
PRIQTY 720 SECQTY 720 ;
ENDEXEC
EXEC SQL
DECLARE C7 CURSOR FOR
SELECT * FROM CIC_DMPC.DB2INST2.CIC_FACT_TABLE
ENDEXEC
TEMPLATE TSYSCOPY DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME.)
SPACE(10,10) CYL
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
LOAD DATA REPLACE SORTKEYS 40000
INCURSOR C7
WORKDDN(TSYSUT1,TSORTOUT)
COPYDDN(TSYSCOPY)
STATISTICS
INTO TABLE PAOLOR3.EXAMP7
In Example 6-32 we copy aggregated information from the same datamart to the host. The
result set is a star join between the fact table and some dimension tables of the datamart.
In Example 6-33 we load a partitioned table space with the Cross Loader. In this case we
only use one cursor. The source and target tables are identical.
Example 6-33 Loading a partitioned table space with the Cross Loader
EXEC SQL
CREATE TABLESPACE TSEXAMP9
IN PAOLOR3L
USING STOGROUP SGLP0002
PRIQTY 50000 SECQTY 5000
BUFFERPOOL BP0
NUMPARTS 3
ENDEXEC
EXEC SQL
CREATE TABLE PAOLOR3.EXAMP9
(PS_PARTKEY INTEGER NOT NULL ,
PS_SUPPKEY INTEGER NOT NULL ,
PS_AVAILQTY INTEGER NOT NULL ,
PS_SUPPLYCOST REAL NOT NULL ,
PS_COMMENT VARCHAR(199) FOR SBCS DATA NOT NULL )
IN PAOLOR3L.TSEXAMP9
ENDEXEC
EXEC SQL
CREATE UNIQUE INDEX PAOLOR3.XEXAMP9
ON PAOLOR3.EXAMP9
(PS_PARTKEY ASC ,
PS_SUPPKEY ASC ,
PS_SUPPLYCOST ASC )
USING STOGROUP SGLP0002
PRIQTY 1200 SECQTY 400
CLUSTER
(PART 1 VALUES(27000) ,
PART 2 VALUES(54000) ,
PART 3 VALUES(80000) )
BUFFERPOOL BP2
ENDEXEC
EXEC SQL
DECLARE C9 CURSOR FOR
SELECT *
FROM PAOLOR4.PARTSUPP2
ORDER BY 1
ENDEXEC
TEMPLATE TSYSCOPY
DSN(PAOLOR3.&DB..&TS..D&DATE..T&TIME.)
146 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
SPACE(100,10) CYL
TEMPLATE TSYSUT1
DSN(PAOLOR3.&DB..&TS..&UTILID..SYSUT1)
SPACE(100,10) CYL
TEMPLATE TSORTOUT
DSN(PAOLOR3.&DB..&TS..&UTILID..SORTOUT)
SPACE(100,10) CYL
LOAD DATA REPLACE LOG NO SORTKEYS
INCURSOR C9
COPYDDN(TSYSCOPY)
WORKDDN(TSYSUT1,TSORTOUT)
STATISTICS
INTO TABLE PAOLOR3.EXAMP9
In Example 6-34 we load the same partitioned table space using partition parallelism.
Therefore we declare one different cursor per partition, transferring the data belonging to that
particular partition.
The classic LOAD drains all access to the table space which prevents the data to be available
for SQL applications. Therefore many DB2 sites wrote their own INSERT programs for
loading data, thus avoiding the drain and allowing concurrent SQL access. Another reason for
writing INSERT programs is to use triggers which are fired by the INSERT statements and
can be used for additional controls on the correctness of the input data.
With the new Online LOAD Resume, you can LOAD data with minimal impact on SQL
applications and without writing and maintaining INSERT programs. It behaves externally as
a LOAD, but it works internally like a mass INSERT. Therefore, triggers will be fired as well.
The index building, duplicate key and referential constraint checking will be handled by SQL
insert processing. The records which fail the insert will be written to the discard data set.
Because the Online LOAD Resume internally works like a SQL INSERT, this kind of LOAD is
slower than the classic LOAD. However you have to trade off performance for availability and
data integrity forced by triggers.
SHRLEVEL CHANGE always requires RESUME YES and LOG YES and invokes a
completely different processing. Although the syntax is like the classical LOAD (and the input
is provided as before), internally each input record is placed in the table space via a
mechanism comparable to an individual INSERT.
However, this INSERT is handled by the Data Manager and not by the RDS. Therefore, most
of the SQL overhead is omitted.
148 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
M ixture of LO AD and INSERT
es
... ,'CONFOR' );
oc
...
001 MOTOR CORP.
002 ENGINE MASTER
STAR
SPEEDY Pr
003 MOBILES, LTD. CONFOR Serialization: Claim s (not drains)
... Logging: LO G YES only
Triggers: Fire
Security: RI: Parent key m ust exist
LO AD (not INSERT) privilege Duplicate keys: First accepted
Tim eout of the LO AD: Clustering: Preserved
Like utility (not like SQ L appl.) Free space: Used (not provided)
INSERT behavior
Following are some of the consequences of INSERT processing.
Serialization
Even though full INSERT statements are not generated, LOAD RESUME YES SHRLEVEL
CHANGE will functionally operate like SQL INSERT. Whereas the classic LOAD drains the
table space, thus inhibiting any SQL access, these INSERTs act like normal INSERTs by
using claims when accessing an object. That is, they behave like any other SQL application
and can run concurrently with other, even updating, SQL applications. The records are stored
one after each other in the same sequence as they occur in the input sequential data set.
Logging
Only LOG YES is allowed. All inserts are logged. After loading, the table space or partition is
not put in the COPYP restrictive state and no COPY is required afterwards.
Because all inserts are logged, they can be captured by DpropR for propagation.
Triggers
Triggers are fired like with normal SQL INSERT processing
When you load a self-referencing table the foreign key value in each input record:
Must already exist as primary key value in the table
Must be provided as primary key value within this record
After loading, the table space will never be put in the CHECKP restrictive state and no
CHECK DATA is required afterwards.
Duplicate keys
When uniqueness of a column is required, INSERTs are accepted as long as they provide
different values for this column. Following INSERTs with the same values are not accepted.
This is different from the classic LOAD procedure, which discards all records having the same
value for such a column.
You may be forced to change your handling of the discarded records accordingly, when you
change existing LOAD jobs to SHRLEVEL CHANGE.
Clustering
Whereas the classic LOAD utility with RESUME YES stores the new records (in the sequence
of the input) at the end of the already existing records; the new Online LOAD Resume tries to
insert the records in available free pages as close to clustering order as possible; additional
free pages are not created. As you probably insert a lot of rows, these are likely to be stored
out of the clustering order (OFFPOS records).
So, a REORG may be needed after the classic LOAD, as the clustering may not be
preserved, but also after the new Online LOAD Resume, as OFFPOS records may exist. A
RUNSTATS with SHRLEVEL CHANGE UPDATE SPACE followed by a conditional REORG is
recommended.
Free space
Furthermore the free space, obtained either by PCTFREE or by FREEPAGE, is used by these
INSERTs of the Online LOAD Resume, in contrast to the classical LOAD, which loads the
pages thereby providing these types of free space.
UTILITY behavior
Online LOAD Resume also behaves as a utility:
Phases
The phases of Online LOAD Resume are:
UTILINIT, RELOAD, DISCARD, REPORT, UTILTERM
The DISCARD and the REPORT phase are performed like in the classic LOAD. Input records
which fail the insert will be written to the discard data set. Error information is stored in the
error data set.
150 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Incompatible LOAD options
SHRLEVEL CHANGE is incompatible with these options:
LOG NO, ENFORCE NO, KEEPDICTIONARY, SORTKEYS, STATISTICS, COPYDDN,
RECOVERYDDN, PREFORMAT, REUSE, and PART REPLACE
Locking
Lock contention with SQL applications is avoided due to a intelligent management of the
commit scope. DB2 manages the commit scope dynamically monitoring the current locking
situation.
If the utility gets a timeout, it will be handled like a utility timeout and not like an SQL
application timeout. So you will get messages like DSNT500I RESOURCE UNAVAILABLE
REASON 00c9008e and the utility will be put in a stopped state. The job step will be restartable.
Online LOAD RESUME YES cannot run against a table space that is in COPY pending,
CHECK pending or RECOVER pending status. Similarly, it cannot run against an index space
that is in CHECK pending or RECOVER pending status.
Restart
During RELOAD, internal commit points are set, therefore, RESTART(CURRENT) is possible
as with the classic LOAD.
Partition parallelism
Partition parallelism is supported.
Security
Online LOAD Resume requires the LOAD privilege and not the INSERT privilege.
Example 6-35 Creating an input load data set with the UNLOAD utility
// EXEC PGM=DSNUTILB,PARM='DB2G,UNLOAD'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//UTPRINT DD SYSOUT=*
//SYSIN DD *
TEMPLATE TUNLDDN
DSN(PAOLOR3.&DB..&TS..UNLOAD)
TEMPLATE TPUNCH
DSN(PAOLOR3.&DB..&TS..PUNCH)
UNLOAD TABLESPACE PAOLOR3L.TSEXAMP7
UNLDDN TUNLDDN
PUNCHDDN TPUNCH
In Example 6-36 we first empty the table with a mass DELETE statement and than repopulate
it using the Online LOAD Resume utility. The input data set is specified by a template
TSYSREC. Although there are no SORT, BUILD, and INDEXVAL phases with Online LOAD
Resume, the WORKDDN data sets are mandatory if the table has indexes.
152 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
INDDN(TSYSREC)
WORKDDN(TSYSUT1,TSORTOUT)
INTO TABLE PAOLOR3.EXAMP7
WHEN(1:2 = X'000E')
(CASE_KEY POSITION(003:006) INTEGER
,SEVERITY_KEY POSITION(007:010) INTEGER
,RECIEVED_VIA_KEY POSITION(011:014) INTEGER
,TYPE_KEY POSITION(015:018) INTEGER
,ASSIGNED_TO_KEY POSITION(019:022) INTEGER
,TOD_KEY POSITION(023:026) INTEGER
,PRODUCT_KEY POSITION(027:030) INTEGER
,CUSTOMER_KEY POSITION(031:034) INTEGER
,STATUS_KEY POSITION(035:038) INTEGER
,RESOLUTION_KEY POSITION(039:042) INTEGER
,REASON_CLOSED_KEY POSITION(043:046) INTEGER
,CLOSED_BY_KEY POSITION(047:050) INTEGER
,TIME_CREATED_KEY POSITION(051:054) INTEGER
,TIME_CLOSED_KEY POSITION(055:058) INTEGER
,NEW_CASE_VOLUME POSITION(060:063) INTEGER
NULLIF(059)=X'FF'
,OPEN_CASE_VOLUME POSITION(065:068) INTEGER
NULLIF(064)=X'FF'
,CLOSED_CASE_VOLUME POSITION(070:073) INTEGER
NULLIF(069)=X'FF'
,CLOSED_ONFIRSTCONT POSITION(075:078) INTEGER
NULLIF(074)=X'FF'
,DAYS_TO_RESOLUTION POSITION(080:083) INTEGER
NULLIF(079)=X'FF'
,AVERAGE_DAYS_TO_RE POSITION(085:088) INTEGER
NULLIF(084)=X'FF'
,CLOSED_BY_OTHERS POSITION(090:093) INTEGER
NULLIF(089)=X'FF'
,CASE_SUBJECT POSITION(095:196) VARCHAR
NULLIF(094)=X'FF'
,CASE_NOTE POSITION(198:0454) VARCHAR
NULLIF(197)=X'FF'
,LAST_COMMENT POSITION(456:712) VARCHAR
NULLIF(455)=X'FF' )
As a result, 48420 records will be inserted into the empty table. See Example 6-37 for an
extract of the job output.
In Example 6-38 we empty the table space and duplicate the input data to force duplicate
records in the input. In this case the Online LOAD Resume will insert the first 48420 records
and reject another 48420 records which will be reported. We had to add a DFSPARM data set
to increase the SORT storage workarea from its default value of 4096K to a value of 24000K.
Otherwise the sort failed with message ICE046A 0 SORT CAPACITY EXCEEDED - RECORD
COUNT 34767 during the REPORT phase.
Example 6-38 Online LOAD Resume duplicating the input data set
// EXEC PGM=DSNUTILB,PARM='DB2G,PAOLOR3'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//UTPRINT DD SYSOUT=*
//DFSPARM DD *
OPTION MAINSIZE=24000K
//SYSREC DD DSN=PAOLOR3.PAOLOR3L.TSEXAMP7.UNLOAD,DISP=SHR
// DD DSN=PAOLOR3.PAOLOR3L.TSEXAMP7.UNLOAD,DISP=SHR
//SYSIN DD *
EXEC SQL
DELETE FROM PAOLOR3.EXAMP7
ENDEXEC
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1)
SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT)
SPACE(10,10) CYL
LOAD DATA SHRLEVEL CHANGE RESUME(YES)
WORKDDN(TSYSUT1,TSORTOUT)
INTO TABLE PAOLOR3.EXAMP7
WHEN(1:2 = .................
154 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
See Example 6-39 for an extract of the job output.
Example 6-39 Online LOAD Resume job output duplicate records in the input
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = PAOLOR3
DSNU050I DSNUGUTC - EXEC SQL DELETE FROM PAOLOR3.EXAMP7 ENDEXEC
..........................................................
DSNU050I DSNUGUTC - TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LOAD DATA SHRLEVEL CHANGE RESUME(YES) WORKDDN(TSYSUT1,TSORTOUT)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMP7 WHEN(1:2=X'000E')
DSNU650I -DB2G DSNURWI - (CASE_KEY POSITION(3:6) INTEGER,
............................................................
DSNU320I -DB2G DSNURWI - RESUME(YES) WAS SPECIFIED FOR EMPTY TABLESPACE
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSUT1
DDNAME=SYS00001
DSN=PAOLOR3.PAOLOR3L.TSEXAMP7.SYSUT1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSORTOUT
DDNAME=SYS00002
DSN=PAOLOR3.PAOLOR3L.TSEXAMP7.SORTOUT
DSNU1117I -DB2G DSNURBWF - UNIQUE INDEX KEY DUPLICATES KEY OF INDEXED RECORD
AT RID X'0000000201',
INDEX = PAOLOR3.XEXAMP7,
TABLE = PAOLOR3.EXAMP7,
RECNO = 48421
DSNU1117I -DB2G DSNURBWF - UNIQUE INDEX KEY DUPLICATES KEY OF INDEXED RECORD
AT RID X'0000000202',
INDEX = PAOLOR3.XEXAMP7,
TABLE = PAOLOR3.EXAMP7,
RECNO = 48422
........................................................................
repeating this message for every duplicate input record
........................................................................
DSNU1118I -DB2G DSNURWT - (RE)LOAD PHASE STATISTICS - 48420 DUPLICATE KEY ERRORS ENCOUNTERED
DSNU1114I -DB2G DSNURWT - (RE)LOAD PHASE STATISTICS - NUMBER OF RECORDS LOADED =48420 FOR TABLE PAOLOR3.EXAMP7
DSNU302I DSNURILD - (RE)LOAD PHASE STATISTICS - NUMBER OF INPUT RECORDS PROCESSED=96840
DSNU300I DSNURILD - (RE)LOAD PHASE COMPLETE, ELAPSED TIME=00:01:31
DSNU399I DSNURRIS - LOAD UTILITY ERROR SUMMARY REPORT
This behavior is different from the LOAD SHRLEVEL NONE where all 96840 records would
have been loaded during the RELOAD phase and all 96840 records deleted again during the
INDEXVAL phase, as illustrated in the job output in Example 6-40.
Example 6-40 LOAD SHRLEVEL NONE job output duplicate records in the input
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = PAOLOR3
DSNU050I DSNUGUTC - EXEC SQL DELETE FROM PAOLOR3.EXAMP7 ENDEXEC
DSNU1180I DSNUGSQL - SQLCODE = 000, SUCCESSFUL EXECUTION
................................................................
DSNU050I DSNUGUTC - TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LOAD DATA SHRLEVEL NONE RESUME(YES) WORKDDN(TSYSUT1,TSORTOUT)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMP7 WHEN(1:2=X'000E')
DSNU650I -DB2G DSNURWI - (CASE_KEY POSITION(3:6) INTEGER,
..................................................................
DSNU320I -DB2G DSNURWI - RESUME(YES) WAS SPECIFIED FOR EMPTY TABLESPACE
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSUT1
DDNAME=SYS00001
DSN=PAOLOR3.PAOLOR3L.TSEXAMP7.SYSUT1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSORTOUT
Online LOAD Resume and LOAD SHRLEVEL NONE will both reject duplicate records in the
input data set that already existed in the table before the LOAD. In this case there is no
difference between both LOAD methods.
If you want to process the rejected records you can save them in a DISCARD file. This is
done using the DISCARDDN option and the ERRDDN option as shown in Example 6-41.
Here we use templates for both options. Remember the possible different number of
discarded records between LOAD SHRLEVEL CHANGE and LOAD SHRLEVEL NONE. You
may be forced to change your handling of the discarded records accordingly, when you
change existing LOAD jobs to SHRLEVEL CHANGE.
Example 6-41 Using a DISCARD data set with Online LOAD Resume
TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..UNLOAD)
TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..DISCARD)
SPACE(10,10) CYL
TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR)
SPACE(10,10) CYL
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1)
SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT)
SPACE(10,10) CYL
LOAD DATA SHRLEVEL CHANGE RESUME(YES)
INDDN(TSYSREC) DISCARDDN(TSYSDISC)
WORKDDN(TSYSUT1,TSORTOUT)
ERRDDN(TSYSERR)
INTO TABLE PAOLOR3.EXAMP7
WHEN(1:2 = .................
156 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Online LOAD Resume of a partitioned table space using parallelism
We will now show how you can use partition parallelism with Online LOAD Resume. This is
the same table space as in Example 6-32 on page 145. We first created the input data set by
unloading the existing table with the new V7 UNLOAD utility. The JCL and syntax of the
unload utility is shown in Example 6-42. Each partition is unloaded in parallel to a separate
unload data set. This is achieved by using the &PART. variable in the template for the unload
data sets.
As shown in Example 6-43, the 3 partitions are unloaded in 3 unload data sets using 2
parallel tasks.
Example 6-44 Online LOAD Resume of a partitioned table space with parallelism
// EXEC PGM=DSNUTILB,PARM='DB2G,PAOLOR3'
//STEPLIB DD DSN=DSN710.SDSNLOAD,DISP=SHR
//SYSPRINT DD SYSOUT=*
//UTPRINT DD SYSOUT=*
//SYSIN DD *
EXEC SQL
DELETE FROM PAOLOR3.EXAMP9
ENDEXEC
TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..U&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..D&PART.)
SPACE(10,10) CYL
TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR)
SPACE(10,10) CYL
TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1)
SPACE(10,10) CYL
TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT)
SPACE(10,10) CYL
LOAD DATA SHRLEVEL CHANGE RESUME(YES)
WORKDDN(TSYSUT1,TSORTOUT)
ERRDDN(TSYSERR)
INTO TABLE PAOLOR3.EXAMP9
PART 1 INDDN(TSYSREC) DISCARDDN(TSYSDISC)
WHEN(1:2 = X'0018')
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMP9
PART 2 INDDN(TSYSREC) DISCARDDN(TSYSDISC)
WHEN(1:2 = X'0018')
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
INTO TABLE PAOLOR3.EXAMP9
PART 3 INDDN(TSYSREC) DISCARDDN(TSYSDISC)
WHEN(1:2 = X'0018')
(PS_PARTKEY POSITION(03:006) INTEGER
,PS_SUPPKEY POSITION(07:010) INTEGER
,PS_AVAILQTY POSITION(11:014) INTEGER
,PS_SUPPLYCOST POSITION(15:018) FLOAT(21)
,PS_COMMENT POSITION(19:219) VARCHAR )
158 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The resulting job output can be seen in Example 6-45. As you can see, 3 parallel tasks were
used.
Example 6-45 Online LOAD Resume job output with parallelism
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = PAOLOR3
DSNU050I DSNUGUTC - EXEC SQL DELETE FROM PAOLOR3.EXAMP9 ENDEXEC
DSNU1184I DSNUGSQL - SQLCODE = 100, NOT FOUND: ROW NOT FOUND FOR FETCH, UPDATE, OR DELETE, OR THE RESULT OF A QUERY
IS AN EMPTY TABLE
DSNU1198I DSNUGSQL - SQLSTATE = 02000 SQLSTATE RETURN CODE
DSNU1195I DSNUGSQL - SQLERRP = DSNXRSTD SQL PROCEDURE DETECTING ERROR
DSNU1196I DSNUGSQL - SQLERRD = -160 0 0 1155167978 0 0 SQL DIAGNOSTIC INFORMATION
DSNU1196I DSNUGSQL - SQLERRD = X'FFFFFF60' X'00000000' X'00000000' X'44DA76EA' X'00000000' X'00000000' SQL
DIAGNOSTIC INFORMATION
DSNU1197I DSNUGSQL - SQLWARN0-5 = W,,,,W, SQL WARNINGS
DSNU1197I DSNUGSQL - SQLWARN6-A = ,,,, SQL WARNINGS
DSNU050I DSNUGUTC - TEMPLATE TSYSREC DSN(PAOLOR3.&DB..&TS..U&PART.) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSDISC DSN(PAOLOR3.&DB..&TS..D&PART.) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSERR DSN(PAOLOR3.&DB..&TS..SYSERR) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSYSUT1 DSN(PAOLOR3.&DB..&TS..SYSUT1) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE TSORTOUT DSN(PAOLOR3.&DB..&TS..SORTOUT) SPACE(10,10) CYL
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LOAD DATA SHRLEVEL CHANGE RESUME(YES) WORKDDN(TSYSUT1,TSORTOUT) ERRDDN(TSYSERR)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMP9 PART 1 INDDN(TSYSREC) DISCARDDN(TSYSDISC) WHEN(1:2=X'0018')
DSNU650I -DB2G DSNURWI - (PS_PARTKEY POSITION(3:6) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPKEY POSITION(7:10) INTEGER,
DSNU650I -DB2G DSNURWI - PS_AVAILQTY POSITION(11:14) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPLYCOST POSITION(15:18) FLOAT(21),
DSNU650I -DB2G DSNURWI - PS_COMMENT POSITION(19:219) VARCHAR)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMP9 PART 2 INDDN(TSYSREC) DISCARDDN(TSYSDISC) WHEN(1:2=X'0018')
DSNU650I -DB2G DSNURWI - (PS_PARTKEY POSITION(3:6) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPKEY POSITION(7:10) INTEGER,
DSNU650I -DB2G DSNURWI - PS_AVAILQTY POSITION(11:14) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPLYCOST POSITION(15:18) FLOAT(21),
DSNU650I -DB2G DSNURWI - PS_COMMENT POSITION(19:219) VARCHAR)
DSNU650I -DB2G DSNURWI - INTO TABLE PAOLOR3.EXAMP9 PART 3 INDDN(TSYSREC) DISCARDDN(TSYSDISC) WHEN(1:2=X'0018')
DSNU650I -DB2G DSNURWI - (PS_PARTKEY POSITION(3:6) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPKEY POSITION(7:10) INTEGER,
DSNU650I -DB2G DSNURWI - PS_AVAILQTY POSITION(11:14) INTEGER,
DSNU650I -DB2G DSNURWI - PS_SUPPLYCOST POSITION(15:18) FLOAT(21),
DSNU650I -DB2G DSNURWI - PS_COMMENT POSITION(19:219) VARCHAR)
DSNU320I -DB2G DSNURWI - RESUME(YES) WAS SPECIFIED FOR EMPTY TABLESPACE
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSREC
DDNAME=SYS00001
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.U00001
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSDISC
DDNAME=SYS00002
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.D00001
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSREC
DDNAME=SYS00003
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.U00002
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSDISC
DDNAME=SYS00004
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.D00002
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSREC
DDNAME=SYS00006
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.D00003
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSYSUT1
DDNAME=SYS00007
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.SYSUT1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=TSORTOUT
DDNAME=SYS00008
DSN=PAOLOR3.PAOLOR3L.TSEXAMP9.SORTOUT
160 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7
The REORG utility unloads the contents of a table space, or index space, and rebuilds the
object in optimum order, as specified by the clustering index. This will ensure that optimum
performance can be achieved by SQL referencing the data via the clustering index. REORG
and RUNSTATS allow the DB2 optimizer to choose better access paths.
Until Version 5 of DB2 reorganizing an object meant that the object was unavailable to all
processes while the REORG was executing. As applications and data grew and the drive for
24x7 processing increased, this unavailability became more and more unacceptable. In DB2
Version 5 the concept of Online REORG was introduced. This utility has been improved
further in Versions 6 and Version 7, specifically to drive down any outage caused by the
REORG.
In DB2 Version 6 there are new triggers added to the REORG syntax that allows REORGs to
be conditionally executed dependent upon the values in the catalog. Also introduced is the
ability to run a REPORTONLY parameter that will indicate whether a REORG will be run or
not without actually running it.
162 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
INDEX SPACES
– LEAFDISTLIMIT value (default 200)
When specified, the following SQL is executed by the REORG:
SELECT LEAFDIST
FROM SYSIBM.SYSINDEXPART
WHERE LEAFDIST > :leafdistlimit value
This query selects any indexes whose average number of pages between successive
leaf pages is greater than the specified value. Any rows returned are selected for
REORG. See 4.1.1, “REORG INDEX” on page 70 for further details.
The return code set by the REORG indicates whether a limit has been met:
RC1: No limits have been met.
RC2 or greater: Limits have been met and action has been taken.
If REPORTONLY is specified, the return codes are set as above. If the REORG is executed,
then the return code may be higher, dependent upon the REORG options specified — for
example, LOG NO without an Inline COPY gives RC4.
To ensure that correct information is given by the queries, RUNSTATS must be kept current.
This type of REORG is suitable when there is a batch window available for dedicated
housekeeping. It is also suitable if there is insufficient disk available to hold “shadow” data
sets during the REORG, and if the REUSE parameter is coded, then the “old” data sets are
not deleted, but are reused. When using REUSE, ensure that the data sets are big enough to
hold the data plus any free space that may be reinstated by the REORG. See 7.8.7, “REUSE”
on page 184 for more information on the REUSE parameter.
SHRLEVEL NONE must be used for the catalog and directory objects and for LOB table
spaces.
Intervention is required if the REORG is unsuccessful. This may involve restarting the utility or
recovering of the objects.
By using the DRAIN_WAIT parameter carefully, in conjunction with RETRY, applications need
not be given a resource unavailable message and should only notice a slight degradation of
response time during this phase.
If the DRAIN WAIT parameter is set to less than IRLMRWT, the application timeout
parameter, REORG will give up its drains and return the objects to UTRO before any
application processes have timed out, if draining a reader in the SWITCH phase. If REORG
cannot drain the writers at the start of the UNLOAD phase, access is set to UTRW. If this
happens, REORG will wait the time specified by RETRY_DELAY and will retry the drains,
assuming that RETRY is specified. If unsuccessful again, the process is repeated until either
the REORG is successful or the maximum number of retries is reached.
If the DRAIN WAIT parameter is set to more than the IRLMRWT, then the REORG has a
better chance of completing, but applications are more likely to suffer unavailable resources
or timeout.
In the event of the REORG being unsuccessful, the utility will return the original table/index
space to its original state and give up all drains. The utility will then act on the TIMEOUT
parameter. If it is TERM, the utility will delete all extra data sets, including shadow data sets,
and terminate the utility, and no further intervention is required as the original object is still
fully available.
To use SHRLEVEL REFERENCE more space is required to hold the “shadow” data sets.
These will be the same size as defined in the catalog, assuming DB2 managed data sets.
SHRLEVEL REFERENCE is ideal for objects that are read-only for long periods of time but
have time available for the SWITCH phase to complete — for example, QMF users table
spaces or tables with LOAD RESUME YES.
SHRLEVEL CHANGE works by allowing both readers and writers access to the data until the
last log iteration and the SWITCH phase. During the RELOAD phase, applications can
access the data in UTRW mode, with any updates being written to the log as normal. Once
the RELOAD is finished, REORG starts processing the log records and applying them to the
new “shadow” data sets. The log is processed in cycles, called iterations, until DB2 decides
that there is little enough log to apply that it can meet the time scales specified in MAXRO
parameter.
When this occurs, DB2 will switch the objects into one of two statuses:
UTRO status, if DRAIN WRITERS is specified
UTUT status, if DRAIN ALL is specified
164 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If successful, the last iteration of the log will be processed, the object status will be set to
UTUT (if not already done), the readers will be drained (if not already done), and the SWITCH
phase will commence (see 7.6.5, “SWITCH” on page 171). When complete, the drains are
released, status is set to RW, and REORG deletes unwanted data sets, including the original
object data sets. REORG is recorded in SYSCOPY along with any inline copies.
If the drains are unsuccessful, REORG releases any drains it has acquired, returns the status
to UTRW, and returns to processing the log. It will wait for the time specified by the
RETRY_DELAY parameter (see 7.7.4, “RETRY_DELAY” on page 178) and then restarts the
process of changing status and getting drains again. This process is repeated until either the
REORG is successful or the maximum number of retries has been reached. Once this has
been reached, the utility will then act on the TIMEOUT parameter. If it is TERM, the utility will
delete all extra data sets, remove the utility, and place all objects back in RW status.
As per SHRLEVEL REFERENCE, if the DRAIN WAIT is specified as less than the IRLMRWT
value, then applications should not be timed out by the REORG.
This method of REORG allows for continuous availability. It should be used in a situation
where there is constant access to the DB2 object by updating transactions, and where a high
availability is demanded by users.
NONE R UN UN UN NA NA NA
REFERENCE R R R R NA UN NA
The last log iteration is not shown in SHRLEVEL CHANGE where the availability is either R or UN,
depending upon the DRAIN option.
7.3.5 Recommendations
Following are some recommendations for the use of the various types of REORGs:
SHRLEVEL NONE must be used for the catalog and directory and for LOBs.
SHRLEVEL NONE is for use when disk space is not available for shadow data sets.
SHRLEVEL NONE should be used to reset REORGP status.
SHRLEVEL REFERENCE is for use where a dedicated batch window is available.
This option is preferred above SHRLEVEL NONE for ease of operations.
SHRLEVEL REFERENCE is for use on (mostly) read-only table spaces.
SHRLEVEL CHANGE is for use when high availability is demanded.
A separate mapping table is needed for each REORG running concurrently, even if the
parallel jobs are operating on different objects or partition(s). If the REORGs are running
serially, then the same mapping table can be used. When sizing the mapping table and index,
it is the size of the index that is important, as this is used by the Online REORG. The table
space that the mapping table resides in can be sized as small as one track, irrespective of the
size of the table space being reorganized.
Tip: When running multiple Online REORGs in parallel, and creating the mapping table in
the job(s), create the mapping tables in separate databases. This reduces contention on
the DBD when the mapping table is dropped after completion of the REORG.
Note: Better performance of the mapping table can be achieved by removing the NULLS
attribute from the mapping table. An example of CREATE TABLE is included in job
DSNTEJ1,.contained in the DB2 sample libraries. This is a change from when DB2 V5
initially shipped this function.
Ensure that there is sufficient disk available for the shadow data sets, as these are created
first by the REORG. If a different pool of disk is to be used for the shadow, then issue an
ALTER STOGROUP prior to running the REORG.
If DB2 data sets are user managed, then the target data sets have to be created before
starting the Online REORG. Be aware that these data sets should not be deleted afterwards,
as DB2 will now be using these as the “live” data sets, assuming that FASTSWITCH is being
used (see “FASTSWITCH” on page 172). Also, ensure that the correct naming standards are
used. If reorganizing individual partitions, no NPIs will be renamed, but a shadow data set
should still be created.
166 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7.6 Phases of Online REORG
Online REORG is, by definition, a REORG that is run with SHRLEVEL REFERENCE or
SHRLEVEL CHANGE. In this section the different phases of the REORG are described, see
Figure 7-1. For the different types of REORG, different phases are executed. In this section,
all phases are described/ See DB2 UDB for OS/390 and z/OS Version 7 Utility Guide and
Reference, SC26-9945, regarding which phases are executed by which type of REORG.
Only if
REORG ....
PART +
NPIs
7.6.1 UNLOAD
During the UNLOAD phase of Online REORG the data is UNLOADed in preparation for
RELOADing back into the table space, or partition of a table space. If SHRLEVEL CHANGE
is specified, the table space is fully available for all access. If SHRLEVEL REFERENCE is
specified, then the objects are placed in UTRO status.
SORTDATA, NOSYSREC, and SORTKEYS are implicitly set for an Online REORG
SHARLEVEL CHANGE, assuming that all conditions are met — for example, a clustering
index is available. If the table space being reorganized does not have a clustering index
available, then NOSYSREC and SORTKEYS will not be specified, and an UNLOAD data set
must be provided.
7.6.2 RELOAD
During the RELOAD phase, data is RELOADed into a shadow copy of that table space or
partition, while applications can read and write to the original copy of the table space.
It is recommended that if a large NPI exists on a partitioned table space, and individual
partitions are being reorganized, then the data set for the logical partition should be manually
created. If it is created by DB2, then DB2 will create a full size shadow of the NPI, when only
a portion of it is required.
For user-managed data sets, the shadow data sets need to be created using IDCAMS
DEFINE CLUSTER before starting the Online REORG. The number of data sets must match
the number of objects or partitions being reorganized. Additionally, you need to create data
sets for every logical partition of an NPI.
Note: Ensure that shadow data sets are correctly named with either the ‘I’ or ‘J’. The
current name can be retrieved from the catalog using the following SQL:
SELECT DBNAME,TSNAME,IPREFIX
FROM SYSIBM.SYSTABLEPART
WHERE DBNAME ='dbname' AND TSNAME ='psname'|
and
SELECT DBNAME,IXNAME,IPREFIX
FROM SYSIBM.SYSINDEXES X,SYSIBM.SYSINDEXPART Y
WHERE X.NAME =Y.IXNAME AND X.CREATOR =Y.IXCREATOR
AND X.DBNAME ='dbname'AND X.INDEXSPACE ='psname';
The original data sets will still remain, and have to be deleted manually using IDCAMS
DELETE CLUSTER
LOG YES is not allowed during the RELOAD phase of an Online REORG, although this does
not have the same implications as it does with a SHRLEVEL NONE REORG.
Online REORG forces an image copy to be taken during the process, if COPYDDN is not
specified COPYDDN(SYSCOPY) is assumed and the appropriate DD card or Template must
exist. The first part of the Inline COPY is taken during the RELOAD process (see Figure 7-2),
and during the LOG phase, incremental image copies are added to the data set.
168 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
SORTBLD
SW01WKnn Index1
Original Tablespace
SW02WKnn Index2
Shadow Copy of
orig. TS
SYSREC
Imagecopy
DB2 Log
Insert Update Delete Delete Insert Insert Update Delete Delete Insert Insert Update Delete Delete Insert
The number of incremental copies added will depend upon the number of log iterations
undertaken, and will be finished during the LOG Phase (see Figure 7-3). Two is the minimal
number of incremental copies that will be added, one prior to the final log iteration and one
after the final log iteration.
Note: There is only one copy data set produced. There are no incremental image copy
data sets produced.
The SYSCOPY record produced by an Inline COPY during an Online REORG contains:
ICTYPE = ‘F’
SHRLEVEL = ‘R’
STYPE = ‘W’
See 3.1.1, “Inline COPY” on page 40 for further details on inline copies.
4. Final Incremental
imagecopy is taken
Imagecopy
taken during
IIC
RELOAD + + IIC
Phase
(minimum)
After running an Online REORG, an informational pending state (ICOPY) will be placed on
index spaces of related indexes that have the COPY YES attribute. Since ICOPY is not a
restrictive state, inline image copies of indexes are not performed during REORG, although it
is recommended to take an image copy after the REORG.
SORTKEYS is always used, because it will lead to performance improvements when more
than one index needs to be created due to the parallelism. Since the mapping table is built at
the same time, Online REORG almost always has to build more than one index. This is valid
since Version 6, because in Version 5, the mapping index is built, for the most part, during
RELOAD phase, but in insert mode, not load mode.
170 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7.6.4 LOG phase
The LOG phases processes the log records for any updates that have occurred since the
UNLOAD process began. Any changes that have occurred are now applied to the shadow
data sets. The log is read and applied in an iterative manner. While doing this, DB2 estimates
how long it will take to process the next iteration of the log and before starting on the next
iteration will estimate the amount of time it will take to process this iteration. At incremental
image copy is taken prior to the last iteration of the log.
When DB2 estimates that the amount of time required to process the next iteration of the log
is less than the parameter specified by the MAXRO parameter (see 7.7.5, “MAXRO” on
page 179), DB2 makes a final pass through the log, and the next action by REORG is
dependent upon the value of DRAIN (see 7.7.1, “DRAIN” on page 176). Up to this point, the
original table space is still available to applications.
If DRAIN ALL is specified, DB2 will place all the objects into UTUT status and try and get a
drain on the objects, via the Buffer Manager. If successful, DB2 will apply the last of the log
and enter the SWITCH phase. If unsuccessful, DB2 will return to the LOG phase, release all
drains, and set the status back to its original state.
If DRAIN WRITERS is specified, DB2 will place all the objects into UTRO and try to drain all
writers. If successful, DB2 will apply the last of the log and, before entering the SWITCH
phase, it will change the statuses to UTUT and drain all readers. If DB2 is unsuccessful at any
time, it will reset the status to the original, release all drains, and re-enter the LOG phase.
DB2 will now continue processing the log until the MAXRO value is met again and the
RETRY_DELAY period has passed (see 7.7.4, “RETRY_DELAY” on page 178). Once these
conditions are met DB2 re-enters the final log iteration process again. This process continues
until the limit for retries has been met, see 7.7.3, “RETRY” on page 178. At this point REORG
takes action dependent upon the TIMEOUT parameter, see 7.7.8, “TIMEOUT” on page 180.
The final incremental image copy is taken at the end of the final log iteration process. At this
point the table space is in either UTRO or UTUT status, this means that the image copy is
equivalent to a SHRLEVEL REFERENCE copy. This means that the whole image copy is a
SHRLEVEL REFERENCE copy and is recorded in SYSCOPY as such.
Important: There is only ONE entry in SYSCOPY for the image copy. This is because the
incremental image copies are appended to the FULL image copy so there is only one data
sets. There is no need to run MERGECOPY to merge the incremental to the FULL copy.
7.6.5 SWITCH
After applying the log records successfully, DB2 drains the remaining readers, if DRAIN
WRITERS was specified, and places the objects in UTUT status. It is now ready to start the
SWITCH phase and rename the data sets.
Until DB2 Version 7, DB2 would rename the ‘I’ data sets to a ‘T’ version and then the ‘S’ data
sets to the original ‘I’ version after deleting the original data sets. DB2 calls AMS to undertake
the renaming, and the time taken by this method depends upon the amount of data sets to be
renamed and the access to the MVS catalog.
With DB2 Version 7, a new option is introduced that removes the need for the renaming of
data sets and leads to vast improvements in the SWITCH process. This option,
FASTSWITCH, is discussed in the next section.
With DB2 Version 7, the first character in the instance node can alternate between a value of
‘I’ or ‘J’. In the SWITCH phase, now called FASTSWITCH, all data sets involved in the Online
REORG are no longer renamed, as this can be very time-consuming. Instead, the catalog
and the directory are updated to point to the shadow data sets rather than to the original data
sets, and the original data sets are deleted; see Figure 7-4.
FASTSWITCH phase
BUILD2
last LOG
iteration
Before UTIL UTIL After
SWITCH
REORG INIT TERM REORG
SQL Renames
Access I I
V5, V6 S T
V7 Fast SWITCH
or viceversa:
SQL
J ==> I
Access I Catalog
Update
I ==> J
J
Some ERP applications, such as SAP and PeopleSoft, design DB2 table spaces with several
hundred indexes. Others use partitioned table spaces with some hundred partitions. In both
cases, several hundred data sets must be renamed in the SWITCH phase. For the renaming,
DB2 invokes Access Method Services (AMS), AMS in turn invokes MVS supervisor calls
(SVCs) which result in further cascading SVCs, for example, for checking whether the new
name exists already, and whether the rename was successful.
Therefore, the elapsed time of the SWITCH phase can become too long, which can result in
applications timing out as the objects are in UTUT status during this time. To avoid this
scenario FASTSWITCH has been introduced.
FASTSWITCH is a revised alternative to the processing that speeds up the SWITCH phase
by removing invocation of AMS to rename the data sets. The alternative process, invoked by
FASTSWITCH, in the various phases, is as follows:
In the UTILINIT phase, DB2 creates shadow data sets. The fifth qualifier of these data sets
is now ‘J0001’, if it is currently ‘I0001’.
172 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
In the SWITCH phase, DB2 updates the catalog and the object descriptor (OBD) from ’I’ to
’J’ to indicate that the shadow object has become the active or valid database object.
During that time, SQL applications cannot access the table space.
After the SWITCH phase, the SQL applications can resume their processing, now on the
new ’J0001’ data sets.
In the UTILTERM phase, DB2 deletes the obsolete original data sets with the instance
node ’I0001’, if the data sets are DB2 managed.
Note:
If DB2 data sets are SMS-controlled, ensure that the ACS routines can accommodate
the new DB2 naming standard.
During the last log iteration and during the BUILD2 phase, the SQL access is limited,
too; but we ignore this here.
Ensure that all in-house written code recognizes the new naming standards.
As a result of the removal of AMS calls and associated SVC calls, the SWITCH phase is
much shorter, therefore applications are less likely to time out.
When reorganizing individual partitions, it is important to remember that not all partitions will
have the same fifth node identifier. Some may be ‘I’ or ‘J’, while any NPIs on the table space
will be the original node, that is, the same as before the REORG started.
The only restriction with FASTSWITCH is that it cannot be used with the catalog or directory
objects, including their indexes. If specified, the option is ignored, and normal switch
processing is used.
If TERM UTIL is issued for the REORG during the SWITCH phase, the objects being
REORGed will be returned to their original states and original data sets.
When using concurrent copy, the RECOVER utility has extra work to do to ensure that the
correct name is restored for the data sets. Consider the following example:
I (J)
FASTSWITCH should be used for all REORGs, where allowed, because it reduces the
elapsed time of the SWITCH phases, thereby improving the availability of the objects.
If RECOVER recovers the table space to a point before the FASTSWITCH occurred, the
RECOVER utility will still rename the data set to the instance currently in the DB2 catalog. It
will not update the catalog.
174 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7.6.6 BUILD2
The BUILD2 phase is only executed for a REORG TABLESPACE PART SHRLEVEL
CHANGE or REFERENCE utility when non-partitioning indexes (NPIs) exist for the table
space. After completion of the SWITCH phase, all reorganized table space partitions and
index space partitions are renamed, and the I or J data sets are now reorganized. Since
reorganizing one or more partitions of a table space should not lead to any data unavailability
for the unaffected partitions, the data sets of non-partitioning indexes cannot be renamed
during the SWITCH phase.
In this case, the REORG utility creates shadow data sets for the NPI, one for each NPI, in
which it creates index entries for the logical partitions corresponding to the partitions being
reorganized. In Figure 7-6, only the entries of the partitions 2 and 3 are being reorganized.
Accordingly, the shadow data sets only contain a subset of the index entries for the NPIs.
Instead of renaming the data sets, DB2 initiates the BUILD2 phase, during which Online
REORG locates the next key in the logical partition of the NPI which matches the REORG
PART statement and updates the key RIDs with the new value from the shadow index (which
has been built during the SORTBLD phase).
With DB2 V7, DB2 introduces BUILD2 Parallelism, that is, DB2 dispatches several subtasks,
one for each NPI, to perform the updates of the entries in the original NPI data sets. Each
subtask is assigned an original NPI and the shadow copy of the logical partition.
This implies improvements for all cases with more than one NPI, because the elapsed time of
the BUILD2 phase is equivalent to the time it takes to process the logical partition with the
most RIDS. For further consideration on parallelism within utilities, see 3.7, “Considerations
for running parallel subtasks” on page 67.
3 2 1 Update entries
3 2 2 rather than
Shadow dataset for NPI
Instance node: NY 2 2 1 replace data set
S0nnn (V5,V6) or J0nnn/I0nnnn (V7),
where nnn first partition in the range,
2 3 1
here: 002 3 3 1
b
Unlike COPY, there is no keyword to control the number of parallel subtasks for the BUILD2
phase of REORG. Information about the number of subtasks is provided by the -DIS UTIL
command and by message DSNU114I in the job output.
Tip: If reorganizing individual partitions of a partitioned table space and large NPIs exist,
create the shadow data sets manually. If DB2 creates the data sets, a full-size NPI is
created when only a portion is required.
7.7.1 DRAIN
DRAIN specifies the action that REORG will take when entering the final log iteration. The
two options are:
ALL
REORG will place the object in UTUT status and drain all writers and readers. This should
be used when there is a lot of SQL update activity and a large number of deadlocks occur.
WRITERS
REORG will set the status to UTRO and only drain the writers when entering the last log
iteration. When entering the SWITCH phase, DB2 will set the status to UTUT, then drain
the readers.
176 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The option selected will directly affect the availability of the object to the users. Specifying
WRITERS, the default, will provide for the greatest availability, but will increase the REORG
time, the number of deadlocks, and the possibility of failure at the SWITCH phase.
With Version 7, the new options RETRY, DRAIN_WAIT and RETRY_DELAY are available.
Either DRAIN ALL or DRAIN WRITERS can be used with RETRY. The status is always
returned to UTRW. Use the option most suitable to the expected amount of update
processing.
7.7.2 DRAIN_WAIT
This option specifies the number of seconds that REORG will wait for DML activity to finish
when issuing a drain before timing out. The value specified can range between 0 and 1800. If
omitted, then the IRLMRWT and UTIMOUT from ZPARMS values are used. DRAIN_WAIT
provides a more consistent wait time than the ZPARM options, since it refers to the aggregate
time for a set of objects. Example 7-1 shows an example of DRAIN_WAIT being set to 50
seconds, while Example 7-2 shows the job output for the utility. The utility waited 50 seconds
in the SWITCH phase before abending.
Example 7-1 DRAIN_WAIT parameter
//DSNUPROC.SYSIN DD *
LISTDEF LISTA1 INCLUDE TABLESPACE DB246129.TS612901
INCLUDE TABLESPACE DB246129.TS612902
Therefore, examine the requirement for the REORG to finish successfully and quickly. If it is
less than the availability requirements, set this limit lower than IRLMRWT. Similarly, when
used with RETRY parameters, it is recommended to set this value lower than IRLMRWT.
7.7.3 RETRY
RETRY specifies the number of times that REORG will attempt to drain all objects being
REORGed. When REORG fails to get a drain, the RETRY count is decremented by 1, when
the count reaches 0, the REORG terminates in accordance with the TIMEOUT parameter.
Before retrying the drains, REORG will make the objects fully available again and process
more log, for the time specified in the RETRY_DELAY parameter. Before retrying the drain,
the MAXRO condition must be met. If the RETRY parameter is not specified, there is no retry
processing and the REORG ends unsuccessfully.
7.7.4 RETRY_DELAY
This parameter specifies the amount of time, in seconds, that REORG will wait before
attempting to drain the objects in the case of RETRY being specified. The default is 300
seconds, although in most occasions this is probably too high and can be safely reduced.
Example 7-3 Online REORG with DRAIN_WAIT, RETRY and RETRY_DELAY
//DSNUPROC.SYSIN DD *
LISTDEF LISTA1 INCLUDE TABLESPACE DB246129.TS612901
INCLUDE TABLESPACE DB246129.TS612902
178 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 7-4 Online REORG DRAIN_WAIT, RETRY and RETRY_DELAY output
DSNU349I -DB2G DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=160 FOR INDEX PAOLOR7.XCUSTNO
DSNU349I -DB2G DSNURBXA - BUILD PHASE STATISTICS - NUMBER OF KEYS=160 FOR INDEX DSN8710.XMAP_TBL
DSNU258I DSNURBXD - BUILD PHASE STATISTICS - NUMBER OF INDEXES=2
DSNU259I DSNURBXD - BUILD PHASE COMPLETE, ELAPSED TIME=00:00:00
DSNU1122I -DB2G DSNURLOG - JOB PAOLOR4A PERFORMING REORG
WITH UTILID TEMP UNABLE TO DRAIN DB246129.TS612901.
RETRY 1 OF 2 WILL BE ATTEMPTED IN 10 SECONDS
DSNU1122I -DB2G DSNURLOG - JOB PAOLOR4A PERFORMING REORG
WITH UTILID TEMP UNABLE TO DRAIN DB246129.TS612901.
RETRY 2 OF 2 WILL BE ATTEMPTED IN 10 SECONDS
DSNU590I -DB2G DSNURDRN - RESOURCE NOT AVAILABLE, REASON=X'00C9008E', ON DB246129.TS612901 PROHIBITS PROCESSING
DSNT500I DSNUGBAC - RESOURCE UNAVAILABLE
REASON 00C200EA
TYPE 00000200
NAME DB246129.TS612901
DSNU017I DSNUGBAC - UTILITY DATA BASE SERVICES MEMORY EXECUTION ABENDED, REASON=X'00E40009'
Example 7-3 and Example 7-4 have shown the JCL and the output from an Online REORG
using the DRAIN_WAIT, RETRY and RETRY_WAIT parameters. In total, the job waited for 2
minutes and 50 seconds, which can be calculated using as follows:
TOTAL TIME = (DRAIN_WAIT + RETRY_DELAY) * RETRY + DRAIN_WAIT
7.7.5 MAXRO
MAXRO specifies goal for the maximum amount of time that REORG is allowed to keep the
object in UTRO, for DRAIN WRITERS, or UTUT, for DRAIN ALL, while applying the last
iteration of log records. During log processing, REORG calculates the amount of time that will
be taken to apply the log records. If this value is less than the MAXRO value, then DB2 enters
the last log iteration, sets the object status (UTUT or UTRO), and attempts to drain the
objects.
The value for MAXRO can be an integer value or the word DEFER. The value DEFER
indicates to REORG that it is not to enter the final log iteration until told. REORG will keep
processing the log until the MAXRO value is changed to an integer value. The MAXRO is
changed to an integer via the command -ALTER UTILITY(utilid) MAXRO(integer). Once
changed, the REORG begins processing the log as normal, looking for a time to enter the last
log iteration.
The benefits of MAXRO DEFER is that it enables the REORG to be finished at a time that is
convenient to the application. An Online REORG could be left processing in the background
and finished by the ALTER command when low activity is detected. Be aware that the image
copy data sets could get large if there is a lot of update activity during the DEFER period.
The DEFER option is recommended for controlling the finishing time of a REORG when the
time that is available for SWITCH phase is variable. This option should be used in conjunction
with LONGLOG CONTINUE.
If DEFER is specified, and DB2 determines that the actual time for an iteration and the
estimated time for the next iteration are both less than 5 seconds, DB2 adds a 5-second
pause to the next iteration to reduce resource consumption. The first time this situation occurs
for a given execution of REORG, DB2 sends the message DSNU362I to the console. The
message states that the number of log records that must be processed is small and that the
pause will occur.
When this message is issued, this is an appropriate time to execute ALTER UTILITY to
change the MAXRO value, and thus cause REORG to finish. DB2 adds the pause whenever
the situation occurs. However, DB2 sends the message only if 30 minutes have elapsed since
the last message was sent for a given execution of REORG. DB2 will issue an operator
message every 30 minutes.
A value of CONTINUE, the default, along with MAXRO DEFER, means that the REORG
never switches to the shadow data sets and allows continuous read and write access to the
original objects. Processing will continue when the MAXRO has been altered to an integer
value.
This parameter affects the size of the first incremental image copy. If LONGLOG is
CONTINUE and the update activity is significant, then the incremental size will be larger.
It is recommended that you use LONGLOG CONTINUE when specifying MAXRO DEFER.
7.7.7 DELAY
This specifies the time, in seconds, between the LONGLOG condition being met and the
console message being issued, and is the time that REORG performs the action specified in
the LONGLOG parameter.
7.7.8 TIMEOUT
TIMEOUT specifies the action to be taken if the REORG is timed out during the LOG or the
SWITCH phase.
Online REORG is only restartable in the BUILD2 and SWITCH phase, and therefore, ABEND
is the default.
The recommendation is that ABEND is the correct parameter for long running REORGs to
allow for restartability and to utilize machine resources wisely. Using ABEND will have an
affect on the availability of the object, because restart will be needed or a manual terminate.
For smaller table spaces and shorter running REORGs, TERM should be used where
restartability is not an issue.
180 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7.7.9 DEADLINE
DEADLINE is either a timestamp value or a timestamp expression that indicates a time when
the SWITCH phase must finish by. If it is not specified, then there is no deadline.
If REORG estimates that it cannot finish processing by the deadline, then it will issue the
DISPLAY UTILITY command, terminate the REORG, and issue the message DSNU374I with
reason code 2. All objects are left in a non-restrictive status.
The output from the DISPLAY UTILITY should be viewed and decisions made as to whether
the deadline time is suitable for the object and the workload at the time the REORG was
running.
7.7.10 Recommendations
The chances of Online REORGs completing successfully is greatly increased if there are
good application standards regarding commit frequency. Long units of work can inhibit the
SWITCH phase because drains cannot be obtained. Therefore it is recommended that Online
REORGs should not be run while long running units of work are executing.
When moving to DB2 Version 7, it is strongly recommended that the following options should
be used for Online REORG:
DRAIN
DRAIN_WAIT
RETRY
RETRY_DELAY
These options will increase the chances of REORG finishing successfully without impacting
the applications.
If an Online REORG fails, the implications are not as severe as for a SHRLEVEL NONE
REORG. The Online REORG can be terminated, dependent upon which phase, and all
objects revert to their original state. This potentially can lead to fewer out-of-hours support
being necessary.
It is also recommended that inline copies and statistics (at least for space) are gathered
during the REORG to minimize impact on resources and to increase availability.
7.8.1 LISTS
With DB2 Version 7, the concept of LISTDEF and TEMPLATES was introduced (see
Chapter 2, “Wildcarding and Templates” on page 17), which enables lists of objects to be built
and acted upon by a utilities. All objects in a given list are processed, potentially in parallel, by
the same utility with the same options. Templates allow for dynamic data set allocations and
automatic intelligent data set sizing.
REORG can fully exploit this functionality, and the use of the LISTDEF and TEMPLATE
functionality is highly recommended to reduce administration.
It is recommended that the lists built should contain table spaces/index spaces of a similar
nature to which “global” parameters can be applied.
For examples of LISTDEF, see Example 7-1 on page 177 and Example 7-3 on page 178.
7.8.2 SORTKEYS
SORTKEYS eliminates the need for index keys to be written to data sets for sorting and then
read from the data set for building of the index. All keys are instead passed in memory to a
combined SORT and BUILD phase named SORTBLD. SORTKEYS can now also work in
parallel using multiple subtasks to reduce the elapsed time of the REORG.
In Version 7, REORG can now exploit parallel index build features which allows
non-partitioning indexes to be built in parallel with a separate subtask per build; see 7.6.6,
“BUILD2” on page 175 for further discussions. This reduces the contention on the NPI build
that was a problem in DB2 Version 6.
Further information about REORGs and use of SORTKEYS can be found in Chapter 3, “Inline
executions and parallelism” on page 39.
7.8.3 SORTDATA
SORTDATA specifies that the data in the table space is to be UNLOADed via a table space
scan. Once UNLOADed the data is then sorted into clustering sequence using DFSORT or
similar program.
This option is recommended unless any of the following conditions are true:
The data is in near-perfect clustered order and REORG is being done to reclaim space.
There is insufficient SORTWORK space available.
The record length for the composite record is greater than 32760.
REORG is running against catalog or directory table spaces that are prohibited from using
this option.
An explicitly defined clustering index does not exist.
SORTDATA is strongly recommended unless the data set being reorganized is very large and
the SORTWORK space is at a premium.
7.8.4 NOSYSREC
NOSYSREC specifies that the output from the SORTDATA is to be the input for reloading
without externalizing the data to the SYSREC data set. This option is only available if
SORTKEYS is specified and the SHRLEVEL of the REORG is REFERENCE or NONE.
The advantage of using NOSYSREC is the reduction in elapsed time due to the removal of
I/O to the SYSREC data set.
182 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Using this option has restart implications in the event of a failure:
If the utility fails during UNLOAD of a SHRLEVEL REFERENCE REORG, then the
REORG has to be restarted from the start of this phase.
If the utility fails during RELOAD of a SHRLEVEL NONE REORG, then the REORG has to
be terminated and a RECOVER TABLESPACE utility. It is therefore recommended that
image copies should be taken immediately prior to the execution of the REORG.
SHRLEVEL CHANGE does not use a SYSREC file, and therefore this option is automatically
used along with SORTDATA and SORTKEYS.
7.8.5 DFSORT
When specifying SORTDATA or SORTKEYS with REORG against a large table, it may be
necessary to override the default DFSORT parameters. To achieve this, add a DFSPARM DD
card in the JCL; see Example 7-5. In this example, the storage available to the sort process is
set to 24M, with work data sets being spread across 8 volumes.
Example 7-5 Changing DFSORT defaults
//DFSPARM DD *
OPTION MAINSIZE=24000K,NOWRKREL,DYNALLOC=(WRKDSK,8),DSPSIZE=60
This method can also be used to override the FILESZ parameter, which may be necessary
when dealing with compressed rows.
7.8.6 KEEPDICTIONARY
The option KEEPDICTIONARY tells REORG TABLESPACE not to rebuild the compression
dictionary during RELOAD, but to keep the dictionary currently in the table space. This option
is only available for compressed table spaces and where a compression dictionary exists. If a
dictionary does not exist, then one is built, and warning messages are issued.
The building of a dictionary is an overhead to the REORG utility and is an overhead that
needs not be incurred each time a table space is REORGed. It does not guarantee a better
functioning dictionary. It is recommended that the effectiveness of the existing dictionary is
monitored by using the column PAGESAVE in the new Version 7 catalog table
SYSTABLEPART_HIST.
By monitoring the PAGESAVE value, it can be determined when the degradation of the
dictionary from the original compression is greater than 10%. At this point, a dictionary rebuild
should be scheduled by omitting the KEEPDICTIONARY keyword. See Figure 7-7, for a “rule
of thumb“ to help decide when to rebuild a dictionary.
if (max(pagesave)-min(pagesave))*100
/max(pagesave) > 10
= (73-64)*100/73
= 12
The logical reset time of a VSAM data set is much shorter than the physical delete and
redefine time. REORG REUSE will not reclaim any secondary extents because it is only
logically resetting the VSAM data set.
The REUSE option can operate on the entire table space and its index spaces, or on a
partition of a partitioned table space and its corresponding partition index space. In the latter
case, you should specify the PART integer as illustrated in Example 7-6.
Example 7-6 Specifying the REUSE option during REORG
Logically reset and reuse of an entire table space and associated index spaces:
REORG TABLESPACE DB1.TS1 REUSE ........
Logically reset and reuse of a table space partition and associated index partition:
REORG TABLESPACE DB1.TS1 REUSE PART 1
7.8.8 PREFORMAT
The PREFORMAT option of the REORG utility was introduced in DB2 V5. It will preformat a
table space and its associated index spaces during the REORG and prepare it for INSERT
processing. This eliminates the need to preformat new pages in a table space during INSERT
processing and reduce execution time delays. However, when the preformatted space is
utilized and DB2 has to extend the table space, normal data set extending and preformatting
will occur.
Preformatting causes all free pages in the VSAM cluster (between the high used RBA and
high allocated RBA) to be preformatted. This includes secondary extents that may have been
allocated. Preformatting occurs after the data has been loaded and the indexes are built.
184 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Preformatting can operate on the entire table space and its index spaces, or, it can operate on
a partition of a partitioned table space and its corresponding partition index space.
Preformatting an entire partitioned table space at the table space level can inhibit concurrent
processing of separate partitions. In that case, you should specify PART integer
PREFORMAT to serialize on the partition level, as illustrated in Example 7-7.
Example 7-7 Preformatting a table space and its index spaces during REORG
Preformatting an entire table space and associated index spaces:
REORG TABLESPACE DB1.TS1 PREFORMAT
Preformatting a table space partition and associated index partition:
REORG TABLESPACE DB1.TS1 PREFORMAT PART 1
Preformatting is only recommended for table spaces and index spaces that are part of high
INSERT applications with high performance requirements where normal formatting can cause
unacceptable delays or inconsistent elapsed times.
It is not recommended for tables that are mainly used for query processing. The additional
preformatted pages can cause table space scans to read additional empty pages, extending
the elapsed time for these queries.
So, you should use the PREFORMAT option with care. General use of this option in all your
REORG jobs is not recommended. Best results may be seen when the final size of the table
space is known and when there is a high ratio of inserts to read operations.
Note: DB2 V7 introduced asynchronous INSERT preformatting. This new function will
asynchronously preformat the next range of new pages during INSERT processing if the
current page is within an internally predefined range from the end of the formatted pages.
7.9 DISCARD
Prior to DB2 Version 6, the only method of removing data from a table space was using one of
the following methods:
SQL DELETE from the table and then REORG afterwards
DSNTIAUL to select the rows to be KEPT and then run LOAD REPLACE
DSNTIAUL to UNLOAD all rows and then manipulate the UNLOAD file before running a
LOAD REPLACE of the table space.
The last two options can only be used for single table spaces. For table spaces containing
multiple tables, all tables would have to be UNLOADed and RELOADed.
DB2 Version 6 introduced the DISCARD parameter to the REORG. This parameter allows to
delete selected records during a full REORG process (UNLOAD CONTINUE or UNLOAD
PAUSE) and also allows data to be written to a discard file, specified by DISCARDDN
keyword. Only rows that did not meet the selection criteria are RELOADed back into the table
space. The selection process is specified by using the syntax FROM TABLE table WHEN
condition. If DISCARD is specified without a WHEN clause, no rows are selected for
discarding.
The WHEN clause conditions convert directly to stage 1 type predicates, and therefore the
following restrictions apply:
Column-to-column comparisons are not allowed.
Arithmetic operations are not allowed.
LIKE on columns with FIELDPROCs are not allowed.
An example of using the discard process is given in Example 7-8, which shows all rows in
table POALOR2.LINEITEM with L_PARTKEY = 107038 being discarded into the file DISOUT.
NOSYSREC, SORTDATA, and SORTKEYS have been specified to avoid allocating a
SYSREC data set. The job output from the example is shown in Example 7-9.
Example 7-8 REORG using DISCARD processing
TEMPLATE DISOUT
DSN(DB2V710G.&DB..&TS..D&DATE..DISOUT)
VOLUMES(SBOX58,SBOX59, SBOX57, SBOX60)
TEMPLATE PUNCH
DSN(DB2V710G.&DB..&TS..D&DATE..SYSPUNCH)
VOLUMES(SBOX58,SBOX59, SBOX57, SBOX60)
TEMPLATE LOCALDDN
DSN(DB2V710G.&DB..&TS..D&DATE..P&PA..L)
VOLUMES(SBOX58,SBOX59, SBOX57, SBOX60)
REORG TABLESPACE U7G01T11.TESTNPIS
SORTKEYS NOSYSREC SORTDATA
SORTNUM 4
SHRLEVEL REFERENCE
COPYDDN(LOCALDDN)
PUNCHDDN(PUNCH)
DISCARDDN(DISOUT)
DISCARD FROM TABLE POALOR2.LINEITEM
WHEN (L_PARTKEY = 107038)
186 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=00:01:32
DSNU244I -DB2G DSNURWT - COMPRESSION REPORT FOR TABLE SPACE U7G01T11.TESTNPIS, PARTITION 1
22598 KB WITHOUT COMPRESSION
12624 KB WITH COMPRESSION
44 PERCENT OF THE BYTES SAVED FROM COMPRESSED DATA ROWS
100 PERCENT OF THE LOADED ROWS WERE COMPRESSED
118 BYTES FOR AVERAGE UNCOMPRESSED ROW LENGTH
67 BYTES FOR AVERAGE COMPRESSED ROW LENGTH
5985 PAGES REQUIRED WITHOUT COMPRESSION
3414 PAGES REQUIRED WITH COMPRESSION
42 PERCENT OF THE DB2 DATA PAGES SAVED USING COMPRESSED DATA
DSNU244I -DB2G DSNURWT - COMPRESSION REPORT FOR TABLE SPACE U7G01T11.TESTNPIS, PARTITION 2
22508 KB WITHOUT COMPRESSION
12694 KB WITH COMPRESSION
43 PERCENT OF THE BYTES SAVED FROM COMPRESSED DATA ROWS
100 PERCENT OF THE LOADED ROWS WERE COMPRESSED
118 BYTES FOR AVERAGE UNCOMPRESSED ROW LENGTH
68 BYTES FOR AVERAGE COMPRESSED ROW LENGTH
5961 PAGES REQUIRED WITHOUT COMPRESSION
3451 PAGES REQUIRED WITH COMPRESSION
42 PERCENT OF THE DB2 DATA PAGES SAVED USING COMPRESSED DATA
DSNU244I -DB2G DSNURWT - COMPRESSION REPORT FOR TABLE SPACE U7G01T11.TESTNPIS, PARTITION 3
174993 KB WITHOUT COMPRESSION
98307 KB WITH COMPRESSION
43 PERCENT OF THE BYTES SAVED FROM COMPRESSED DATA ROWS
100 PERCENT OF THE LOADED ROWS WERE COMPRESSED
118 BYTES FOR AVERAGE UNCOMPRESSED ROW LENGTH
67 BYTES FOR AVERAGE COMPRESSED ROW LENGTH
46345 PAGES REQUIRED WITHOUT COMPRESSION
26329 PAGES REQUIRED WITH COMPRESSION
43 PERCENT OF THE DB2 DATA PAGES SAVED USING COMPRESSED DATA
DSNU304I -DB2G DSNURWT - (RE)LOAD PHASE STATISTICS - NUMBER OF RECORDS=1951540 FOR TABLE POALOR2.LINEITEM
DSNU302I DSNURILD - (RE)LOAD PHASE STATISTICS - NUMBER OF INPUT RECORDS PROCESSED=1951540
DSNU300I DSNURILD - (RE)LOAD PHASE COMPLETE, ELAPSED TIME=00:01:33
DSNU393I -DB2G DSNURBXA - SORTBLD PHASE STATISTICS - NUMBER OF KEYS=200364 FOR INDEX POALOR2.PXL#OKSDRFSKEPDC PART 1
DSNU393I -DB2G DSNURBXA - SORTBLD PHASE STATISTICS - NUMBER OF KEYS=199549 FOR INDEX POALOR2.PXL#OKSDRFSKEPDC PART 2
DSNU393I -DB2G DSNURBXA - SORTBLD PHASE STATISTICS - NUMBER OF KEYS=1551627 FOR INDEX POALOR2.PXL#OKSDRFSKEPDC PART 3
DSNU394I -DB2G DSNURBXA - SORTBLD PHASE STATISTICS - NUMBER OF KEYS=1951540 FOR INDEX POALOR2.TESTNPI
DSNU391I DSNURPTB - SORTBLD PHASE STATISTICS. NUMBER OF INDEXES = 2
DSNU392I DSNURPTB - SORTBLD PHASE COMPLETE, ELAPSED TIME = 00:00:07
DSNU387I DSNURSWT - SWITCH PHASE COMPLETE, ELAPSED TIME = 00:00:01
DSNU428I DSNURSWT - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE U7G01T11.TESTNPIS
DSNU010I DSNUGBAC - UTILITY EXECUTION COMPLETE, HIGHEST RETURN CODE=0
Note: The enhancement described in this section has also been made available for DB2
Version 5 with APAR PQ19897.
TEMPLATE PUNCH
DSN(DB2V710G.&DB..&TS..D&DATE..PUNCH)
VOLUMES(SBOX58,SBOX59, SBOX57, SBOX60)
TEMPLATE UNLOAD
DSN(DB2V710G.&DB..&TS..D&DATE..UNLD)
VOLUMES(SBOX58,SBOX59, SBOX57, SBOX60)
REORG TABLESPACE U7G01T11.TESTNPIS
SORTKEYS
SORTNUM 4
SHRLEVEL NONE
PUNCHDDN(PUNCH)
UNLDDN(UNLOAD)
UNLOAD EXTERNAL has largely been replaced by the new utility UNLOAD, which is covered
in Chapter 8, “Unloading data” on page 191.
Catalog and directory table spaces need only be reorganized infrequently and mainly for the
reclamation of space or for the performance of queries that use table space scans against the
catalog.
Note: PTF UQ54731 for APAR PQ49114 should be applied to disable FASTSWITCH on
catalog index REORGs.
188 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
7.13.1 Inline COPY
As well as taking RUNSTATS inline, a full image copy data set can also be taken inline during
the reorganization of a table space. Again, the advantages are a reduction in the total elapsed
time when compared to the running of two separate utilities. Also, the table space is not left in
copy pending status at the end of a REORG LOG NO, thus increasing data availability. The
copy is recorded in SYSCOPY as a FULL SHRLEVEL REFERENCE copy.
The content of the image copy is different for a REORG SHRLEVEL CHANGE, as it can
contain between 1 and ‘n’ incremental image copies in addition to the full copy. The number of
incremental image copies will depend upon the number of log iterations activated by the
REORG utility. They are still valid for the RECOVER utility.
Important: The size of the image copy data set taken by REORG SHRLEVEL CHANGE
will be larger than a FULL image copy due to the incremental copies appended to the data
set. The size is proportional to the number of log iterations and the amount of changed
pages during the iterations.
See 3.1.1, “Inline COPY” on page 40 for more information about taking inline copies.
See 10.1.1, “Inline COPY with LOAD and REORG” on page 232 for more information on
copies taken together with the REORG utility.
See Example 7-12 for the use of Statistics command. Example 7-13 gives the output from a
-DISPLAY UTIL command, which shows RUNSTATS running as a subphase of the RELOAD
step.
190 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
8
DB2 V7 has introduced the new UNLOAD utility. UNLOAD can be used to unload data from
one or more source objects to one or more BSAM sequential data sets in external formats.
The source can be:
DB2 table spaces
DB2 Image Copy data sets
The online functions of UNLOAD allow the unload of data from DB2 tables with SHRLEVEL
CHANGE, which enhances the continuous availability of DB2 subsystems.
The UNLOAD utility requires SELECT authority on the table or tables in the table space. This
is similar to the DSNTIAUL unload programs, but differs from REORG UNLOAD EXTERNAL,
which requires REORG utility authority.
The DISCARD option of REORG, which can be used to discard selected data with the WHEN
option during UNLOAD PAUSE and UNLOAD CONTINUE, is not supported in the UNLOAD
utility and the REORG UNLOAD EXTERNAL.
Authorization
To execute the UNLOAD utility, the user needs one of the following privileges:
Ownership of the tables
SELECT privilege on the tables
DBADM authority on the databases
SYSADM authority
SYSCNTL authority for catalog tables only
UNLOAD Unloading records to sequential data sets. One pass through the input data
set is made. If UNLOAD is processing a table or partition, DB2 takes
internal commits to provide commit points at which to restart in case of
operation should halt in this phase.
UTILTERM Cleanup
192 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
8.2 UNLOADing data prior to V7
Prior to DB2 V7, data was unloaded from DB2 tables using these methods:
The DB2 supplied assembler program DSNTIAUL
DSNTIAUL evolved over various versions to finally accept the selection criterion with SQL
statements in DB2 V4 onwards. Although this program was useful and widely used by DB2
end users, it was slow and lacked functions like data conversion to other formats. An SQL
statement was required for each table that was unloaded which was advantages when
selective data was unloaded with the WHERE clause. DSNTIAUL cannot unload data from
image copy.
REORG UNLOAD EXTERNAL
This has been the other popular choice of DBAs to unload data from the source and load it
at the destination. REORG UNLOAD EXTERNAL can only unload data from a table
space. It cannot unload data from the image copy.
DSN1COPY of full image copies with XLAT translation
All tables in a table space were copied to the destination table space. The source and
destination table DDL must be identical, although the OBID can be converted to the
destination OBID via the XLAT translation. RI constraints are not imposed. DSN1COPY of
image copy from SHRLEVEL CHANGE is not encouraged, although it can be done with
care.
Independent Software Vendor (ISV) products
These tools are commonly used at many DB2 sites to transfer data mainly from production
to development for development or stress testing. The ISV products also provide the
capabilities to unload data from the image copy and to load into the destination using
either ISV LOAD utility or the DB2 LOAD utility.
Programs written in-house.
These mainly use imbedded SQL in the host language to SELECT from source tables and
INSERT into destination tables.
Enhanced functionality in V7
Figure 8-1 summarizes all the functions of the UNLOAD utility. The diagram emphasizes that
REORG UNLOAD EXTERNAL utility is a subset of the UNLOAD utility.
In addition to the functions provided by REORG UNLOAD external, the UNLOAD utility
provides:
Unload from image copy data sets created by COPY utility, both full and incremental
Unload from Inline COPY data sets created by REORG and LOAD utility
Unload from data sets created by stand-alone DSN1COPY utility
Encoding scheme, ASCII and UNICODE
SHRLEVEL REFERENCE and SHRLEVEL CHANGE
Sampling, limiting of rows
Partition parallelism
In this chapter we discuss each function in detail and show examples of JCL and job outputs.
Note: The table space of the image copy MUST exist in the host DB2 subsystem for
unloading from copy data sets. Copies of dropped table spaces are not supported.
The syntax for the UNLOAD utility from image copy is shown in Figure 8-2. Here are some
minor considerations when using the FROMCOPY option:
Only a single copy data set name can be specified using the FROMCOPY option. Use
FROMCOPYDDN if multiple image copies are concatenated to a DD statement.
The table space must exist when UNLOAD is run.
The table space should not have been dropped and recreated since the image copy was
taken.
194 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If the table was altered with ALTER ADD COLUMN after the image copy was taken,
UNLOAD will set system or user defined defaults for the added column when the data is
unloaded from such an image copy.
Image copy created by Inline COPY operation (LOAD or REORG TABLESPACE) can
contain duplicate pages. If duplicate pages exist, the UNLOAD utility issues a warning
message, and all the qualified rows in the duplicate pages will be unloaded into the output
data set.
If incremental copy is specified as the source, only the rows in the data set are unloaded.
Tip: TEMPLATE can be used for the SYSPUNCH and SYSREC data set definitions
identified as PUNCHDDN and UNLDDN options in the UNLOAD syntax respectively.
LISTDEF cannot be used to pass a list to the LIST option to specify image copy data sets.
source-spec:
TABLESPACE db-name.ts-name
ts-name PART integer
int1:int2
FROMCOPY data-set-name
FROMVOLUME CATALOG
vol-ser
FROMCOPYDDN dd-name
unload-spec:
PUNCHDDN SYSPUNCH UNLDDN SYSREC
The contents of the data set associated with PUNCHDDN (SYSPUNCH) is displayed in
Example 8-2. If the space allocation is not specified in the TEMPLATE statement (refer back
to Example 8-1), then DB2 calculates the space requirements using the formulas:
SYSPUNCH = ((#tables * 10) + (#cols * 2)) * 80 bytes
SYSREC = ((high used RBA) + (#records * (12 + length of longest clustering key)) bytes
196 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
A further filter on the volume of unloaded data can be achieved with the SAMPLE option.
SAMPLE indicates the percentage of data to be unloaded. If the WHEN clause is used to
unload selective rows, then SAMPLE is applied only on rows qualified by the WHEN selection
condition.
.
Note: The sampling is applied per individual table. If the rows from multiple tables are
unloaded with sampling enabled, the referential integrity between the tables might be lost
If the image copy is an incremental copy, or a copy of a partition or partitions, then the
compression dictionary pages must exist in the same data set, otherwise UNLOAD will fail
and DB2 will issue an error message.
Not supported:
Separate output data sets per partition
Concurrent copies
Copies of dropped tables
Copy data sets of multiple table spaces
Unload of LOB columns
198 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Unloading from table spaces
A list of table spaces
LISTDEF Granularity
When unloading multiple table spaces with a LISTDEF, you must also define a data set
TEMPLATE that corresponds to all the table spaces and specify the template-name in the
UNLDDN option.
Restrictions
Index spaces, LOB table spaces and directory objects must not be included in the LISTDEF
definition. The FROM TABLE-spec of UNLOAD is not supported with the LIST option.
Note: Unloading from a list of table spaces is fully supported by REORG UNLOAD
EXTERNAL
UNLOAD does not activate parallel unloads if only a single output data set is allocated to
each table space even though the PART or PARTLEVEL option is coded in the UNLOAD
utility statement or LISTDEF command respectively. TEMPLATE can be used to dynamically
allocate an output data set per partition by using the &PA key word.
Example 8-5 Sample UNLOAD job for partition table space and parallelism
TEMPLATE ULDDDN
DSN(DB2V710G.RAMA.&TS..P&PA..UNLOAD)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
TEMPLATE PNHDDN
DSN(DB2V710G.RAMA.&TS..PUNCH)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
LISTDEF UNLIST
INCLUDE TABLESPACE U7G01T11.TS* PARTLEVEL
UNLOAD LIST UNLIST
PUNCHDDN PNHDDN UNLDDN ULDDDN
In Example 8-5 we unloaded a list of table spaces using the LISTDEF Wildcard specification
and PARTLEVEL. We also allocated the output data sets using TEMPLATE with &PA in the
DSN definitions. It can be seen in the output of Example 8-6 that DB2 has used two parallel
tasks to unload the data by partition. UNLOAD has also created three data sets via
TEMPLATE definitions as output data sets, one for each partition. If the &PA key word was not
allocated to the DSN of TEMPLATE ULDDDN, then DB2 would have allocated only a single
output data set, and the unload of data from partitions would be done in sequence.
200 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 8-6 Sample UNLOAD output by partition and parallelism
DSNU000I DSNUGUTC - OUTPUT START FOR UTILITY, UTILID = UNLOAD
DSNU050I DSNUGUTC - TEMPLATE ULDDDN DSN(DB2V710G.RAMA.&TS..P&PA..UNLOAD) UNIT(SYSDA)
CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - TEMPLATE PNHDDN DSN(DB2V710G.RAMA.&TS..PUNCH) UNIT(SYSDA) CYL
DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
DSNU1035I DSNUJTDR - TEMPLATE STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - LISTDEF UNLIST INCLUDE TABLESPACE U7G01T11.TS* PARTLEVEL
DSNU1035I DSNUILDR - LISTDEF STATEMENT PROCESSED SUCCESSFULLY
DSNU050I DSNUGUTC - UNLOAD LIST UNLIST PUNCHDDN PNHDDN UNLDDN ULDDDN
DSNU1039I DSNUGULM - PROCESSING LIST ITEM: TABLESPACE U7G01T11.TSPSUPP1 PARTITION 1
DSNU1039I DSNUGULM - PROCESSING LIST ITEM: TABLESPACE U7G01T11.TSPSUPP1 PARTITION 2
DSNU1039I DSNUGULM - PROCESSING LIST ITEM: TABLESPACE U7G01T11.TSPSUPP1 PARTITION 3
DSNU1201I DSNUUNLD - PARTITIONS WILL BE UNLOADED IN PARALLEL, NUMBER OF TASKS = 2
DSNU397I DSNUUNLD - NUMBER OF TASKS CONSTRAINED BY CPUS
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=PNHDDN
DDNAME=SYS00003
DSN=DB2V710G.RAMA.TSPSUPP1.PUNCH
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=ULDDDN
DDNAME=SYS00004
DSN=DB2V710G.RAMA.TSPSUPP1.P00001.UNLOAD
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=ULDDDN
DDNAME=SYS00005
DSN=DB2V710G.RAMA.TSPSUPP1.P00002.UNLOAD
DSNU251I DSNUULQB - UNLOAD PHASE STATISTICS - NUMBER OF RECORDS UNLOADED=108000 FOR
TABLESPACE U7G01T11.TSPSUPP1
PART 1
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=ULDDDN
DDNAME=SYS00006
DSN=DB2V710G.RAMA.TSPSUPP1.P00003.UNLOAD
DSNU251I DSNUULQB - UNLOAD PHASE STATISTICS - NUMBER OF RECORDS UNLOADED=108000 FOR
TABLESPACE U7G01T11.TSPSUPP1
PART 2
DSNU251I DSNUULQB - UNLOAD PHASE STATISTICS - NUMBER OF RECORDS UNLOADED=446421 FOR
TABLESPACE U7G01T11.TSPSUPP1
PART 3
DSNU253I DSNUUNLD - UNLOAD PHASE STATISTICS - NUMBER OF RECORDS UNLOADED=662421 FOR
TABLE PAOLOR4.PARTSUPP1
DSNU252I DSNUUNLD - UNLOAD PHASE STATISTICS - NUMBER OF RECORDS UNLOADED=662421 FOR
TABLESPACE U7G01T11.TSPSUPP1
DSNU250I DSNUUNLD - UNLOAD PHASE COMPLETE, ELAPSED TIME=00:00:18
SHRLEVEL CHANGE
SHRLEVEL CHANGE allows users to update the tables in the table space while data is being
unloaded. When data is fetched from the table space with ISOLATION CS, the UNLOAD
utility assumes CURRENTDATA(NO). This ensures that uncommitted data is not unloaded
and retains data currency. Data can also be unloaded with ISOLATION UR, where any
uncommitted data will be unloaded. No locks are taken on the objects; this allows other DB2
operations to continue on the objects from which the data is being unloaded.
SHRLEVEL REFERENCE
This operation allows users to read the data during the unload. All writers are drained from
the table space before commencement of the unload. When data is unloaded from multiple
partitions, the drain lock will be obtained for all of the selected partitions in the UTILINIT
phase.
8.4.4 Unload from table space using the FROM TABLE option
When data is unloaded from a single table space or partitioned table space, the FROM
TABLE can be used to selectively unload data for a particular table from the table space. This
is particularly useful when a table space contains multiple tables and the user is interested in
one or more tables only. In Example 8-7 we unload three of the fourteen tables defined in
SYSDBASE table space. Only a single set of SYSPUNCH and SYSREC data sets are
created by the UNLOAD. The first two bytes of each record in the output data set identifies the
OBID of the table. The LOAD utility uses the WHEN option to identify the output data related
to each table. Please refer back to Example 8-2 on page 196 for sample contents of the
SYSPUNCH data set and the WHEN option of the LOAD utility.
In Example 8-7 we also show the usage of the WHEN option with selection criteria.
Each WHEN option applies to the FROM TABLE option only. Here we unload rows from
SYSTABLEPART, SYSTABLES and SYSTABLESPACE where the WHEN option explicitly
qualifies the data selection criteria for each table respectively.
202 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Note: If the rows from multiple tables are unloaded with sampling enabled, the referential
integrity between the tables may be lost.
When partition parallelism is activated, the LIMIT option is applied to each partition instead of
the entire table.
Note: If multiple tables are unloaded from with the LIMIT option, the referential integrity
between the tables may be lost.
Example 8-7 Unload selective tables from SYSDBASE using FROM TABLE
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=UNLOAD,
// UTSTATS=''
//SYSIN DD *
TEMPLATE PNHDDN
DSN(DB2V710G.RAMA.&TS..PUNCH)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
TEMPLATE ULDDDN
DSN(DB2V710G.RAMA.&TS..UNLOAD)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
UNLOAD TABLESPACE DSNDB06.SYSDBASE
PUNCHDDN PNHDDN UNLDDN ULDDDN
SHRLEVEL CHANGE ISOLATION CS
FROM TABLE SYSIBM.SYSTABLEPART SAMPLE 50.0
(PARTITION,
TSNAME,
DBNAME,
IXNAME VARCHAR 10 STRIP BOTH TRUNCATE) WHEN
(PARTITION=0)
FROM TABLE SYSIBM.SYSTABLES LIMIT 100 WHEN
(CREATOR='SYSIBM')
FROM TABLE SYSIBM.SYSTABLESPACE WHEN
(DBNAME='U7G01T11')
The job outputs two dataset that contain the SYSPUNCH and SYSREC information.
DB2V710G.RAMA.SYSDBASE.PUNCH
DB2V710G.RAMA.SYSDBASE.UNLOAD
If the length parameter is omitted, the default is the smaller of 255 and the maximum length
defined on the source table column.
For a complete list and description of column functions on data types, please refer to Chapter
29 of DB2 UDB for OS/390 and z/OS V7 Utility Guide and Reference, SC26-9945-00.
Tables can also be created with the European English character set of 1140 (Euro support) by
changing the CCSID EBCDIC to CCSID 1140.
Example 8-8 Sample database, table space, table, and subset of DSNHDECP
CREATE DATABASE DSN8D71A
STOGROUP DSN8G710
BUFFERPOOL BP0
CCSID EBCDIC;
CREATE TABLESPACE DSN8S71E
IN DSN8D71A
USING STOGROUP DSN8G710
PRIQTY 20
SECQTY 20
ERASE NO
NUMPARTS 4
(PART 1 USING STOGROUP DSN8G710
PRIQTY 12
SECQTY 12,
PART 3 USING STOGROUP DSN8G710
PRIQTY 12
SECQTY 12)
LOCKSIZE PAGE LOCKMAX SYSTEM
BUFFERPOOL BP0
CLOSE NO
COMPRESS YES
CCSID EBCDIC;
CREATE TABLE DSN8710.DEPT
(DEPTNO CHAR(3) NOT NULL,
DEPTNAME VARCHAR(36) NOT NULL,
MGRNO CHAR(6) ,
ADMRDEPT CHAR(3) NOT NULL,
LOCATION CHAR(16) ,
PRIMARY KEY(DEPTNO))
IN DSN8D71A.DSN8S71D
CCSID EBCDIC;
COMMIT;
204 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DSNHDECP sample definition
DSNHDECM CHARSET=ALPHANUM, X
ASCCSID=1252, X
AMCCSID=65534, X
AGCCSID=65534, X
SCCSID=37, X
MCCSID=65534, X
GCCSID=65534, X
USCCSID=367, X
UMCCSID=1208, X
UGCCSID=1200, X
ENSCHEME=EBCDIC, X
APPENSCH=EBCDIC, X
All character type data can be converted from the host internal code, predominantly from
EBCDIC to other data types, such as ASCII and UNICODE on the S/390 DB2 databases. The
UNLOAD statement in Example 8-9 converts all the character fields on table REGION into
ASCII.
The alternate way to convert to ASCII is to use the CCSID option in the UNLOAD, as follows:
UNLOAD TABLESPACE U7G01T11.TSREGION PUNCHDDN PNHDDN UNLDDN ULDDDN
CCSID(01252,00000,00000)
In Example 8-10, table PART in table space TSPART was created with CCSID 1140, Euro
English code page. The table space TSPART was unloaded and CCSID was converted to
UNICODE. UNLOAD converted all character data from CCSID 1140 to 367.
Example 8-10 UNLOAD with character conversion to UNICODE
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=UNLOAD,
// UTSTATS=''
//SYSIN DD *
TEMPLATE PNHDDN
DSN(DB2V710G.RAMA.&TS..PUNCH)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
TEMPLATE ULDDDN
DSN(DB2V710G.RAMA.&TS..UNLOAD)
UNIT(SYSDA) CYL DISP(NEW,CATLG,DELETE)
VOLUMES(SBOX57,SBOX60)
UNLOAD TABLESPACE U7G01T11.TSPART
PUNCHDDN PNHDDN UNLDDN ULDDDN UNICODE
Restrictions
UNLOAD is not supported for table spaces in DSNDB01.
The FROM TABLE option cannot be used when the LIST option is already specified.
The following table spaces and image copy source objects are not supported:
– Auxiliary (LOB) table spaces
– Index spaces
– Dropped tables
– Views
– Global temporary tables
Performance measurements
We performed the measurements in an uncontrolled environment, using the LINEITEM table
that has 1951548 rows and 33832 active pages. The table has 16 columns with a mixture of
INTEGER, CHAR, VARCHAR, DATE and FLOAD data types. We made a full copy of the table
space with SHRLEVEL REFERENCE. We obtained the measurements for the three jobs:
UNLOAD TABLESPACE U7G01T11.TSLINEI.....
UNLOAD TABLESPACE U7G01T11.TSLINEI FROMCOPY image copy
REORG TABLSPACE U7G01T11.TSLINEI UNLOAD EXTERNAL.....
The top half of Table 8-2 has both the elapsed time and CPU times in seconds. The second
half of the table contains the percentage differences when compared to the measurements
taken for first job.
There is a 36.9% increase in elapsed time when the unload is done from an image copy. Most
of this elapse time can be attributed to the I/O from the input sequential data where the I/O
buffer size is limited by the number of buffers (DCB BUFNO) allocated to the data set. In this
job there were 5659 read I/Os to the data set as compared to 533 prefetches for the other two
jobs. The elapsed time of REORG UNLOAD EXTERNAL is similar in value to UNLOAD from
table space.
206 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Table 8-2 Performance measurements and comparison
UNLOAD TABLESPACE REORG UNLOAD EXTERNAL
Elapsed (sec) CPU (sec) Elapsed (sec) CPU (sec) Elapsed (sec) CPU (sec)
The percentage changes (delta %) are calculated with respective to measurement made with
UNLOAD Utility from active table space.
Delta % Delta %
(B-A)*100/A (C-A)*100/A
The CPU measurements show an increase of 23% when data is unloaded using the REORG
utility. There is an insignificant amount of CPU time differences between unload from table
space and unload from image copy using the UNLOAD utility. On the other hand, there is a
37% increase in elapsed time for unload from image copy.
UNLOAD from image copies only reads the catalog and does not access the table space. We
recommend unload from image copy if elapsed time is not a concern and high availability is
more important to your environment.
Terminating UNLOAD
The UNLOAD utility can be terminated with TERM UTILITY command during the UNLOAD
phase. The output records in SYSREC are not erased, but the data set remains incomplete
until the data set is deleted or the UNLOAD utility is restarted.
Restarting UNLOAD
When restarting a terminated UNLOAD utility, the following occurs:
Processing starts with the table spaces or partitions that had not yet been completed.
For a table space or partitions that were being processed at termination, UNLOAD resets
the output data sets and processes those table spaces or partitions again.
When the source is one or more image copy data sets (FROMCOPY or
FROMCOPYDDN), UNLOAD always starts processing from the beginning.
Note: Verify that the PTF for APAR PQ50223 is applied to avoid a problem identified by the
message:
DSNU1036I DSNURELD - UNABLE TO ESTIMATE SPACE REQUIREMENTS FOR INDDN/UNLDDN
This message is generated when a partitioned table space with PARTLEVEL in LISTDEF
is unloaded with the UNLOAD utility and loaded with the LOAD utility.
The DDL for the shadow database DSHADOW and its objects was generated using the DB2
Administration Tool via the GEN command on database DSNDB06. The output was edited to
define the table spaces with STOGROUP SYSDEFLT, and the tables with the LIKE SQL
syntax. A copy of the DDL is displayed in Example C-1 on page 289.
The UNLOAD utility was used to unload data from all table spaces in DSNDB06 except
SYSJAUXA and SYSJAUXB, which are related to LOB auxiliary tables.
The SYSPUNCH data sets were edited to change all SYSIBM to SHADOW, the new table
owner in database DSHADOW.
The LOAD statement was modified to change RESUME YES to RESUME NO REPLACE.
The edited SYSPUNCH data sets may be saved for future loads, since the catalog table
definitions do not change within a single DB2 SM level unless by an APAR. Example C-3 on
page 291 contains a sample of the SYSVIEWS load statements. Please note that the
TEMPLATE command is used to define the input data set to LOAD utility. Thus a consistent
naming standard which complies with the TEMPLATE DSN definition for the output data set
from the UNLOAD utility will be beneficial.
The source for these jobs is available for download as described in Appendix E., “Additional
material” on page 297.
208 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
9
The RECOVER utility supports the list generated by the previous LISTDEF command.
Example 9-1 shows a sample job with the OPTIONS PREVIEW command that lists the table
spaces which will participate in the RECOVER utility.
Example 9-1 RECOVER using LISTDEF command with Wildcard
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=PAOLOR1,
// UTSTATS=''
//SYSIN DD *
OPTIONS PREVIEW
LISTDEF DB01A
INCLUDE TABLESPACE U7G01T11.*
INCLUDE TABLESPACE U7G01T11.TSLINEI PARTLEVEL
EXCLUDE TABLESPACE U7G01T11.TSLINEI
EXCLUDE TABLESPACE U7G01T11.TESTNPIS
EXCLUDE TABLESPACE U7G01T11.TSP*
RECOVER LIST DB01A PARALLEL 10
Example 9-2 is the associated output from the JCL in Example 9-1. Please note that the
LISTDEF expansion does not include the table spaces that are explicitly excluded in the
LISTDEF command.
210 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
INCLUDE TABLESPACE U7G01T11.TSLINEI PARTLEVEL(00001)
INCLUDE TABLESPACE U7G01T11.TSLINEI PARTLEVEL(00002)
INCLUDE TABLESPACE U7G01T11.TSLINEI PARTLEVEL(00003)
DSNU050I DSNUGUTC - RECOVER LIST DB01A PARALLEL 5
DSNU010I DSNUGBAC - UTILITY EXECUTION COMPLETE, HIGHEST RETURN CODE=0
The utility list manager will invoke RECOVER once for the entire list generated by the
LISTDEF. Therefore, LIST cannot be used with these commands:
PAGE
ERROR RANGE
TOCOPY
DSNUM. Instead use PARTLEVEL as in Example 9-1
0 or N
Not specified
>N N
<N num-objects
The job in Example 9-1 was rerun without the OPTIONS PREVIEW command and
PARALLEL 10. The output in Example 9-3 indicates that RECOVER processed seven objects
in parallel.
Note: If the number of parallel tasks are limited, the following message can be issued:
‘DSNU397I DSNUCBMT - NUMBER OF TASKS CONSTRAINED BY CONNECTIONS‘
Check the values of IDBACK and CTHREAD in ZPARM. Increase the value of IDBACK.
Restriction: This is only true when all the image copies reside on disk. If RECOVER
encounters a tape volume in the list, processing of the remaining objects pauses until the
tape object has completed, and then parallel processing resumes.
Parallelism is only achieved in the RESTORE phase. The Log Apply processing does not
begin until all objects have been restored.
212 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DB2 may reduce the number of objects processed in parallel if there is a virtual storage
constraint in the utility batch address space.
DB2 disables parallelism if the PARALLEL keyword is omitted.
The keywords, PARALLEL and LOGONLY, are mutually exclusive for the RECOVER utility.
In order to benefit from parallelism, the RECOVER utility must be run with the PARALLEL
keyword and an object list must be provided. RECOVER supports two forms of object lists:
The object-list explicitly defined to RECOVER utility as in Example 9-4.
LISTDEF command to build the object-list and pass the list to RECOVER using the LIST
keyword, see 9.1, “RECOVER using the LISTDEF command” on page 210.
In both cases, RECOVER will dispatch the optimum number of subtasks to recover the
objects in parallel.
We recommend using the PARALLEL option with the value 0 or without a value, and let
RECOVER determine the optimum number of parallel subtasks.
Example 9-4 RECOVER with object-list
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=PAOLOR1,
// UTSTATS=''
//SYSIN DD *
RECOVER
TABLESPACE U7G01T11.TSORDER
TABLESPACE U7G01T11.TSCUST
TABLESPACE U7G01T11.TSSUPLY
TABLESPACE U7G01T11.TSNATION
TABLESPACE U7G01T11.TSREGION
PARALLEL
9.3 LOGONLY
A DB2 object is recovered with LOGONLY option in the following instances:.
A backup of the object is restored outside DB2.
A Concurrent COPY of the object is restored outside of DB2’s control.
An image copy was never taken and RECOVER will use LOGONLY.
The external backup of the object must have been done when one or more of the following
conditions is true:
The DB2 subsystem is inactive.
The object is closed using the STOP DATABASE(db) SPACE(space-name).
The object was QUIESCEd just prior to the offline backup with WRITE YES.
Important: Please be aware of the instance node of the VSAM data set during the restore
(refer to 9.3.2, “DB2 object names with REORG and FASTSWITCH” on page 215). If the
DB2 object is reorganized with FASTSWITCH, the node may have changed, either I or J
as indicated by the IPREFIX column (see Table 9-2). If the object is backed up with
DFSMShsm, then the data set will be restored with the same name as the VSAM cluster at
the time of the backup. The DFSMShsm restored data set needs to be manually renamed
to the correct VSAM cluster name as indicated by the IPREFIX value.
214 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
9.3.2 DB2 object names with REORG and FASTSWITCH
Prior to V7, all physical data sets had the naming convention hlq.DSNDBC.db.ts.I0001.Annn,
where the I0001 indicates the node name. DB2 V7 introduced new columns in the catalog to
record the node name of the physical VSAM data set for table spaces and index spaces, as
listed in Table 9-2.
Table 9-2 DB2 V7 catalog table entries
DB2 table Column name Valid Values
SYSINDEXPART IPREFIX I or J
SYSTABLEPART IPREFIX I or J
If the DB2 object is reorganized using the REORG SHRLEVEL CHANGE or SHRLEVEL
REFERENCE with FASTSWITCH option, the catalog records the node name of the DB2
object in SYSTABLEPART or SYSINDEXPART. When the DB2 object is image copied using
the COPY utility and CONCURRENT keyword, the STYPE field in SYSCOPY is recorded with
the value C (I0001) or J (J0001).
The DB2 object can be recovered from Concurrent COPY in a similar manner to any recovery
with ICTYPE=F. The object can be recovered TOCOPY, TOLOGRBA, or current point-in-time.
DB2 will apply the logs after the object is restored from the copy and roll forward to the current
point-in-time.
RECOVER handles the VSAM data set node names automatically (see Table 9-2 on
page 215) if the object is reorganized with FASTSWITCH after the Concurrent COPY. During
the recovery, the TOCOPY utility redefines the VSAM cluster to the correct IPREFIX node
name.
The RECOVER keywords PAGE or ERRORRANGE can not be used with DFSMS
Concurrent COPY. If these keywords are specified, RECOVER will bypass any Concurrent
COPY records when searching the SYSCOPY table for recoverable point.
Note: Backing up a single piece of a multi-linear page set is not recommended. It can
cause a data integrity problem if the backup is used to restore only a single data set at a
later time.
In DB2 V7, the HPGRBRBA value is updated every nth checkpoint, where n is controlled by
the value of the DSNZPARM parameter DLDFREQ (down-level detection.)
216 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
9.5 Fast Log Apply
Fast Log Apply was introduced in DB2 V6 to improve the restart of DB2 subsystems after a
failure and the recovery of DB2 objects. It reduced both the elapsed time and the CPU time by
utilizing an intermediate buffer storage area which reduces the I/O to the table spaces.
The Fast Log Apply (FLA) design consists of one log read task per recovery job and one or
more Log Apply tasks employing the use of multi-tasking whenever possible. It also takes
advantage of a list prefetch technique to nearly eliminate duplicate synchronous read
operations for the same page.
It achieves these efficiencies by sorting groups of log records together that pertain to the
same page set before applying those changes; it then applies those records in parallel using
several Log Apply tasks.
Fast Log Apply is activated during the installation process by setting any non-zero value to
Log Apply Storage in the DSNTIPL panel; see Figure 9-2. This sets a value to the ZPARM
variable LOGAPSTG in the DSN6SYSP macro.
If the storage size specified is not available when the FLA feature attempts to allocate it, DB2
tries to allocate a smaller amount. It is possible that insufficient space is available for FLA to
continue; in this case, DB2 reverts to normal log recovery. A minimum of 64 KB is necessary
for the FLA feature.
FLA is a non-optional feature for DB2 restart even if you specify zero for the Fast Log Apply
storage. A minimum buffer size of 10 MB is allocated during the Forward Log Recovery phase
and is freed when this phase completes. The rationale behind this decision is that restart is
the only process running and there should be no problem obtaining 10 MB of storage for the
FLA feature at this time. Of course, if you choose a Log Apply storage value larger than 10
MB, DB2 uses that value during restart.
Each RECOVER utility job uses a size of 10 MB, if available, for the Fast Log Apply buffer. You
need to consider the number of concurrent RECOVER utility jobs when determining the size
of the FLA buffer. If you specify 30 MB for the total size of the Log Apply buffer, then only 3
concurrent jobs can use 10 MB each.
DB2 acquires FLA buffer storage at the start of the Log Apply phase and releases it at the end
of the Log Apply phase of the RECOVER utility. If there are four concurrent RECOVER utility
jobs running, the first three jobs get 10 MB each. If the fourth job arrives at the Log Apply
phase while the other three jobs are also in the Log Apply phase, then the fourth job is not be
able to use the FLA feature. However, recovery of the fourth job is not be delayed; its recovery
simply occurs without FLA.
If the fourth job arrives, at the Log Apply phase, at the same time that one of these three jobs
completes the Log Apply phase, then the fourth job may be able to get the 10 MB storage,
and can then also take advantage of FLA.
Each sorted FLA buffer is divided into several smaller groups organized by the object level
(DBID.PSID.PART).
One task is dispatched to apply those log records associated with each group. Each task
opens the data set, if necessary, builds and initiates the list prefetch. If the process requires
the use of multiple tasks, they are run in parallel.
218 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
A maximum of 98 parallel tasks can be dispatched, one for each page set. If log records for
more than 98 objects are found, then 98 tasks are scheduled. As a task completes, a new
task is dispatched to apply the log records to the next object.
In addition, FLA takes advantage of the ability to perform up to 20 data set opens in parallel
during restart.
This sets the COPY field in SYSIBM.SYSINDEXES to Y for YES. Another catalog field
COPYLRSN in the same table records the RBA or the LRSN (in data sharing) of the index
when it was altered to COPY YES or created with COPY YES. Please refer to section 10.1.2,
“Copying indexes” on page 232.
One of the main advantages of the image copy index is the ability to recover the index
independently from the table space or in parallel to the recovery of the table space. The latter
saves an extra step to rebuild the index after the table space is recovered. In Example 9-5 we
take an image copy of the table space U7G01T11.TSLINEI and all the indexes of this table
space. The table space has three partitioned indexes and two NPIs. COPY utility dispatched
six parallel tasks to copy the objects. The maximum number of tasks is constraint by virtual
storage and by available threads specified by CTHREAD and IDBACK in ZPARM.
An index can also be recovered individually either from image copy or by the REBUILD utility.
The decision to image copy and recover an index will depend on the size of the index space.
Refer to 10.1.2, “Copying indexes” on page 232.
Note: Indexes with COPY NO will not have any image copies. These indexes will not
participate in the recovery. Their status will be set to RBDP. These indexes must be rebuilt
using the REBUILD utility. Use OPTIONS EVENT(ITEMERROR,SKIP) to bypass indexes
with COPY NO when passing list from LISTDEF.
220 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DSNU532I DSNUCBMT - RECOVER INDEXSPACE U7G01T11.PXL#OKSD DSNUM 1 START
DSNU515I DSNUCBAL - THE IMAGE COPY DATA SET PAOLOR1.U7G01T11.PXL#OKSD.P00001.T213329L WITH DATE=20010625
AND TIME=173347
IS PARTICIPATING IN RECOVERY OF INDEXSPACE U7G01T11.PXL#OKSD DSNUM 1
DSNU532I DSNUCBMT - RECOVER INDEXSPACE U7G01T11.PXL#OKSD DSNUM 2 START
DSNU515I DSNUCBAL - THE IMAGE COPY DATA SET PAOLOR1.U7G01T11.PXL#OKSD.P00002.T213329L WITH DATE=20010625
AND TIME=173400
IS PARTICIPATING IN RECOVERY OF INDEXSPACE U7G01T11.PXL#OKSD DSNUM 2
DSNU504I DSNUCBRT - MERGE STATISTICS FOR INDEXSPACE U7G01T11.PXL#OKSD DSNUM 1 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=2318
ELAPSED TIME=00:00:08
DSNU532I DSNUCBMT - RECOVER INDEXSPACE U7G01T11.PXL#OKSD DSNUM 3 START
DSNU515I DSNUCBAL - THE IMAGE COPY DATA SET PAOLOR1.U7G01T11.PXL#OKSD.P00003.T213329L WITH DATE=20010625
AND TIME=173425
IS PARTICIPATING IN RECOVERY OF INDEXSPACE U7G01T11.PXL#OKSD DSNUM 3
DSNU504I DSNUCBRT - MERGE STATISTICS FOR INDEXSPACE U7G01T11.PXL#OKSD DSNUM 2 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=2308
ELAPSED TIME=00:00:07
DSNU504I DSNUCBRT - MERGE STATISTICS FOR TABLESPACE U7G01T11.TSLINEI DSNUM 1 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=13595
ELAPSED TIME=00:00:24
DSNU504I DSNUCBRT - MERGE STATISTICS FOR INDEXSPACE U7G01T11.TESTNPI2 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=9751
ELAPSED TIME=00:00:27
DSNU504I DSNUCBRT - MERGE STATISTICS FOR INDEXSPACE U7G01T11.TESTNPI -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=10128
ELAPSED TIME=00:00:28
DSNU504I DSNUCBRT - MERGE STATISTICS FOR TABLESPACE U7G01T11.TSLINEI DSNUM 2 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=13831
ELAPSED TIME=00:00:35
DSNU504I DSNUCBRT - MERGE STATISTICS FOR INDEXSPACE U7G01T11.PXL#OKSD DSNUM 3 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=17912
ELAPSED TIME=00:00:29
DSNU504I DSNUCBRT - MERGE STATISTICS FOR TABLESPACE U7G01T11.TSLINEI DSNUM 3 -
NUMBER OF COPIES=1
NUMBER OF PAGES MERGED=107667
ELAPSED TIME=00:01:25
DSNU500I DSNUCBDR - RECOVERY COMPLETE, ELAPSED TIME=00:01:28
DSNU010I DSNUGBAC - UTILITY EXECUTION COMPLETE, HIGHEST RETURN CODE=0
Note: Apply the PTF for APAR PQ45184 to benefit from this enhancement.
The improvements are over 90% in both CPU and elapsed time.
TABLESPACESET
The TABLESPACESET option can be used to find names of all table spaces and tables in a
referential structure, including LOB table spaces.
222 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 9-9 is the associated output of the sample REPORT RECOVERY job. The output
shows that a full image copy exist for the table space DSN8S71D at START LRSN
00010DEA81F9. The associated entry in SYSLGRNX shows that the table space DSN8S71D
was open at UCDATE and UCTIME and at START LRSN 00010DF08B11. The
000000000000 value for STOP LRSN indicates that the table space is still open.
The SYSCOPY and SYSLGRNX information can be used to determine the recovery details of
the table space. It can also be used to decide if an image copy is required. If the is an entry
START LRSN in SYSLGRNX and its value is higher than the START LRSN of the last image
copy entry in SYSCOPY then the table space is open for update. This table space is a
candidate for image copy using the COPY utility.
One advantage of using REPORT RECOVERY to determine image copy status is that the
base table space is not accessed. All the required information is obtained form SYSCOPY
and SYSLGRNX. Therefore, if the base table space is migrated by DFSMShsm, it will not be
recalled.
DSNU584I -DB2G DSNUPPBS - REPORT RECOVERY TABLESPACE DSN8D71A.DSN8S71D ARCHLOG1 BSDS VOLUMES
DSNU588I -DB2G DSNUPPBS - NO DATA TO BE REPORTED
You need to specify the CURRENT option to avoid unnecessary access of archive logs. If
archive logs are on tape, then there will be unnecessary mounting of archive tapes. If the
CURRENT option is specified and the last recoverable point does not exist on the active log,
DB2 will access archive logs until the point is found.
The space you want to make a copy of must have a complete, valid full image copy registered
in the SYSCOPY catalog table. The DB2 Change Accumulation Tool takes the full image
copy, applies any incremental image copies, and then applies log data up to the specified
recovery point. The resulting file is logged in SYSIBM.SYSCOPY as a primary local or
backup full SHRLEVEL REFRENCE image copy that can be used to recover the object.
This process is similar to MERGECOPY, except that the DB2 Change Accumulation Tool
allows you to select specific points up to which you want the image copy created. For
instance, you can select the TOCURRENT recovery point, or a specific RBA from the log.
The DB2 Change Accumulation Tool also allows you to customize processing by specifying
parameters via control controls in the JCL; see Example 9-10. For example, you can specify
the recovery point or choose from two processing methods.
224 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The DB2 Change Accumulation Tool provides DB2 administrators with powerful tool for
backing up data base objects in the most precise and lease destructive manner by:
Making precise ‘point-in-time’ recovery of data base objects simple and reliable
Allowing recovery routines to focus on single objects and previous states
Producing SHRLEVEL REFERENCE image copies without the associated overhead and
data locking
Controlling the scope and specificity of image copy creation precisely through control
cards
Maintaining data integrity without recovery to RBA
Reducing recovery session times significantly in many cases
Providing low overhead and minimizing downtimes for high-volume, complex data bases
with large number of tables and dependencies
For more details on the DB2 Change Accumulation Tool, please refer to DB2 Change
Accumulation Tool for z/OS V1R1 User's Guide, SC27-1191-00, or see the Web site:
http://www.ibm.com/software/data/db2imstools/details/#rr
9.10 QUIESCE
The QUIESCE is an online utility that establishes consistent recovery point (the current log
RBA or LRSN) for a table space, partition, table space set and index with COPY YES set.
QUESCE supports a list of table spaces as in Example 9-11 or building of a list using the
LISTDEF command as in Example 9-12.
A new TABLESPACESET option for QUIESCE was introduced in V6. This option specifies
that all of the referentially related table spaces in the table space set are to be quiesced. A
table space set is either:
Note: TABLESPACESET is not supported by LISTDEF. The LIST option cannot be used in
conjunction with TABLESPACE or TABLESPACESET.
SYSCOPY is updated with ICTYPE=Q for each table space quiesced. Other enhancements
and restrictions for the QUIESCE utility are:
Remove the restriction of 1,165 maximum table spaces in the list.
Duplicate table spaces or table space sets are quiesced only once.
For a table space being quiesced with WRITE YES, a SYSCOPY record with ICTYPE=Q
is also inserted for each of its indexes defined with COPY YES.
PART and TABLESPACE are mutually exclusive keywords.
An index or index space cannot be specified on a QUIESCE statement
The following Example 9-13 shows the quiesce of partition indexes of a partition table space
TSLINEI when only the table space is specified in the QUIESCE list.
9.11 CHECK
There are three online utilities, CHECK DATA, CHECK INDEX, CHECK LOB which are used
to check the consistency of table spaces, index spaces and LOB table spaces respectively.
226 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Point-in-time recovery on all table spaces whose indexes may not be consistent with the
data.
Before CHECK DATA on table space to ensure indexes used by CHECK DATA are valid.
Before CHECK DATA with DELETE YES
To verify the auxiliary table index of LOB table space
Example 9-14 is a sample job to check the indexes of table space TSLINEI in data base
U7G01T11. We used the TEMPLATE for work data set and LISTDEF to generate the list of
objects for CHECK INDEX. Example 9-15 shows the output of the CHECK INDEX job. The
LISTDEF command created a list of five indexes, three partition indexes and two NPI.
Example 9-14 Sample CHECK INDEX JCL
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=CHECKIX
//SYSIN DD *
TEMPLATE WORKDDN
DSN(DB2V710G.RAMA.CHECK.INDEX)
UNIT(SYSDA) DISP(NEW,DELETE,CATLG)
VOLUMES(SBOX60,SBOX59)
LISTDEF DB01A
INCLUDE INDEXSPACES TABLESPACE U7G01T11.TSLINEI PARTLEVEL
CHECK INDEX
LIST DB01A WORKDDN WORKDDN
UTILINIT Initialization
SCANTAB Extract foreign keys; use foreign key index if it exist, else scan table
SORT Sort foreign keys if not extracted from foreign key index
CHECKDAT Look in primary indexes for foreign key parents, and issue messages to
report errors detected
REPORTCK Copy error rows into exception tables, and delete them from source table
if DELETE YES is specified
UTILTERM Cleanup
228 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Sample CHECK DATA jobs
Example 9-16 is a sample CHECK DATA job that checks two table spaces. The CHECK DATA
utility supports list of table spaces, but it does not support the LIST option. Therefore, lists
generated from the LISTDEF command cannot be input to the CHECK DATA utility. The utility
supports TEMPLATE commands for the definition of work and error data sets.
Example 9-16 Sample CHECK DATA JCL
//STEP1 EXEC DSNUPROC,SYSTEM=DB2G,UID=CHECKDA
//SYSIN DD *
TEMPLATE WORKDDN
DSN(DB2V710G.RAMA.CHECK.INDEX)
UNIT(SYSDA) DISP(NEW,DELETE,CATLG)
VOLUMES(SBOX60,SBOX59)
CHECK DATA
TABLESPACE U7G01T11.TSLINEI
TABLESPACE U7G01T11.TSPSUPP
WORKDDN WORKDDN
Note: The table space to be checked needs to be in CHECK pending (CHKP), else the
utility will complete with RC=4 and message:
DSNU727I -DB2G DSNUKINP - TABLESPACE 'U7G01T11.TSLINEI' IS NOT CHECK PENDING
If no errors are found, CHECK LOB utility resets the CHKP and auxiliary warning (AUXW)
statuses.
UTILINIT Initialization
UTILTERM Cleanup
230 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
10
We then discuss the new COPYTOCOPY utility, which allows you to make asynchronous
extra image copies, and we briefly examine the Concurrent COPY option.
Previously, after the execution of one of these utilities, assuming LOG NO is specified for
speed, an image copy had to be taken to ensure the integrity of the data. The time taken to
run the COPY added to the outage for the object. This new functionality addressed this issue.
When a LOAD REPLACE/LOAD RESUME NO is executed with an Inline COPY, the copy is
taken during the LOAD phase. This means that if there are any rows on the table that are later
discarded, these will still remain on the image copy. For this reason it is recommended NOT to
use inline image copies in a RECOVERY TOCOPY operation. For more information; see
“Using inline copies with the RECOVER utility” on page 43.
When an Inline COPY is taken by the REORG SHRLEVEL CHANGE utility, the contents of
the image copy data set is different from the copy taken by LOAD. Online REORG takes an
initial image copy during the RELOAD phase, then while it is applying changes from the log, it
takes an incremental image copy and appends it onto the full image copy. This process is
repeated during each log iteration as many times as needed to complete the online REORG.
The end result is an image copy that has incremental copies added to the end of it, and
therefore will contain duplicate pages.
This is not the same as an image copy produced using the MERGECOPY utility, which does
not contain duplicates. The recover utility has been modified to handle these image copies, as
it recognizes that the last version of the page is the valid one. The UNLOAD utility does not
recognize these extra pages as duplicates and unloads all pages. This may lead to duplicate
rows in the output data set; for more information; see 8.3, “UNLOAD from image copy” on
page 194. DSN1COPY and DSN1PRNT will produce error messages when inline copies are
used with them if the new keyword INLCOPY is not specified.
The use of Inline COPY is also discussed in 3.1.1, “Inline COPY” on page 40.
The Version 5 RECOVER INDEX utility was renamed to REBUILD INDEX to more accurate
reflect the purpose of the utility, that is, to rebuild the index from the data in the table. The
REBUILD INDEX utility is dependent upon the contents of the table space and therefore
indexes cannot be built in parallel while the table space is being recovered.
232 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DB2 Version 6 addressed this issue by introducing the ability to take full image copy indexes.
These copies can then be used by the RECOVER INDEX utility. The RECOVER utility
recovers an index in the same manner as a table space, that is, restoring an image copy and
then applying the log. This method allows for faster recovering of an index, assuming timely
image copies are taken, when compared to rebuilding the index, and also it allows for the
recovery of table spaces and indexes in parallel, thus reducing the outage required.
To allow indexes to be copied, a new option is available on the CREATE/ALTER index DDL
statements. COPY YES can be specified to indicate that the index can be copied by the
COPY utility.
Altering the index to COPY NO prohibits and disables the above and also removes the
recovery information from SYSCOPY and SYSLGRNX for the index.
The first new index state, ICOPY (informational copy pending) indicates that a copy of the
index should be taken. This is set when an unrecoverable from the log event has occurred on
the index or the underlying table space, this means that one of the following has happened:
REBUILD INDEX
REORG INDEX
REORG TABLESPACE (LOG YES or LOG NO)
LOAD TABLE (LOG YES or LOG NO)
After the ICOPY state has been set, the index must be copied if the RECOVER utility is to be
used against the index. If the copy is not taken, then in the event of a recovery a REBUILD
INDEX must be used to build the index from the underlying table. The ICOPY state does NOT
prevent any processing against the table or the index. This state is also set on initial creation
of the index with COPY YES and if the index is altered to be COPY YES.
The second new index state is CHKP (check pending). It indicates that the index might be out
of step with the underlying table and that the CHECK INDEX utility should be run. This state
can be set by an index point-in-time recovery. During the running of the check the index is
unavailable for any read or write activity.
The catalog SYSCOPY has a new column added, OTYPE, which has the values for ‘T’ for
table space and ‘I’ for index. The index name is recorded under the column TSNAME and
there is a new value of ‘B’ for ICTYPE which indicates REBUILD INDEX
Recommendations
The following are recommendations for the image copying of indexes:
Use image copy on large indexes with low update activity that require high availability.
Copy indexes in conjunction with the underlying table space.
Recover table spaces and their indexes to the same point-in-time to avoid CHKP being set
234 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
10.1.3 Lists
DB2 Version 6 introduced the ability to define lists to a COPY utility, that enabled a group of
objects to be copied by the one COPY utility. This is different from the LISTDEF command,
described in 2.2, “Wildcards” on page 18, in that the objects have to be explicitly coded in the
COPY statement.
The utility processes the objects in the order that they are specified in the list. An example is
shown in Example 10-1
Example 10-1 Specifying lists in the COPY utility
COPY
TABLESPACE DB1.TS1 COPYDDN (IC1,IC2) RECOVERYDDN(IC3,IC4)
INDEX SYSADM1.IND1 COPYDDN(IC5,IC6) RECOVERYDDN(IC7,IC8)
The phases of COPY can now switch between REPORT and COPY because each table
space in the list with a CHANGELIMIT specification; see 10.1.5, “The CHANGELIMIT option”
on page 238, will have a REPORT phase.
If SHRLEVEL REFERENCE is specified, then DB2 drains the write claim class on ALL
table/index spaces during the UTILINIT phase, and holds the claim until the last object in the
list has been copied. The SYSCOPY row for all the objects in the list are written at the same
time when the last object has completed, and the START_RBA is the same for all rows, that
being the RBA/LRSN after the last object has completed.
With SHRLEVEL CHANGE, DB2 establishes the read class claim on each object individually
and releases the claim when it completes the object; it does not wait until every object has
been completed. The SYSCOPY row for each object is inserted when the individual copy has
completed and contains the START_RBA pertinent to the RBA/LRSN at the start of the
processing for that object.
When processing a list, if COPY encounters an error with one of the objects it continues
processing the next object in the list after issuing an error message. The return code at the
end of the COPY utility will reflect the highest error set by the utility. To determine which
objects caused the return code the utility output will have to be used.
With DB2 Version 7, the new functions of LISTDEF provide a dynamic method of building the
LISTs and removes much of the LIST maintenance when new objects are created. See
Chapter 2, “Wildcarding and Templates” on page 17 for details about LISTDEF.
The objective of the introduction of parallelism is to reduce the elapsed time of the COPY
utility when run against a list of objects. To use parallelism with the COPY, and RECOVER,
utility the new keyword PARALLEL n has to be specified; this tells DB2 the optimum number
of parallel threads that are to be processed. If the keyword is omitted, then parallelism is not
used. DB2 will decide the optimum number of parallel threads if the PARALLEL keyword is
specified without a value. DB2 will also reduce the specified number of parallel tasks if there
is a shortage of resources. If this is done, then message DSNU397I is issued, and the
message DSNU427I will also be issued to identify the number of parallel tasks being used.
The value n is the number of objects that should be processed in parallel, regardless of the
total number of objects in the list.
Parallelism is only supported for copies written to disk. If tape is used, then the parallelism is
stopped until the copy to tape is finished. For example, if there are 6 objects in a list to be
processed with PARALLEL(4) and the fourth object is written to tape, then objects 5 and 6 will
not start until object 4 is finished, even if the other 3 threads have completed.
Restart is only supported from the last commit point of each object processed in parallel.
C o p y P a ra lle l (3 )
O b je c t 1
O b je c t 2
O b je c t 3
O b je c t 4
O b je c t 5
P ip e P ip e
P ip e
W rite W r ite
Copy Copy Copy
W rite
S u b ta s k 1 S u b ta s k 2
D a ta s e ts D a ta s e ts D a ta s e ts S u b ta s k 3
Example 10-2 show the statements required to build a COPY list using the LISTDEF
functionality.
236 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
It should be noted that the partitioning index does not have the COPY YES parameter. It is
still included in the list and produces an error message stating that the copy parameter is not
set. In this case the OPTIONS EVENT(ITEMERROR,SKIP) bypasses the error and COPY
continues processing. The output from the job is shown in Example 10-3.
DSNU425I -DB2G DSNUBAII - INDEXSPACE U7G01T11.PXL#OKSD DOES NOT HAVE THE COPY YES ATTRIBUTE
DSNU1027I -DB2G DSNUBAII - PROCESSING CONTINUES DUE TO OPTIONS ITEMERROR SKIP
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=LCOPY
DDNAME=SYS00001
DSN=DB2V710G.U7G01T11.TSLINEI.T002257.P00001L
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=LCOPY
DDNAME=SYS00002
DSN=DB2V710G.U7G01T11.TSLINEI.T002257.P00002L
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=LCOPY
DDNAME=SYS00003
DSN=DB2V710G.U7G01T11.TSLINEI.T002257.P00003L
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=LCOPY
DDNAME=SYS00004
DSN=DB2V710G.U7G01T11.TESTNPI2.T002257.P00000L
DSNU400I DSNUBBID - COPY PROCESSED FOR INDEXSPACE U7G01T11.TESTNPI2
NUMBER OF PAGES=9751
AVERAGE PERCENT FREE SPACE PER PAGE = 0.00
PERCENT OF CHANGED PAGES = 0.00
ELAPSED TIME=00:00:22
DSNU1038I DSNUGDYN - DATASET ALLOCATED. TEMPLATE=LCOPY
DDNAME=SYS00005
DSN=DB2V710G.U7G01T11.TESTNPI.T002257.P00000L
DSNU400I DSNUBBID - COPY PROCESSED FOR TABLESPACE U7G01T11.TSLINEI DSNUM 1
NUMBER OF PAGES=13595
AVERAGE PERCENT FREE SPACE PER PAGE = 3.34
PERCENT OF CHANGED PAGES = 0.00
ELAPSED TIME=00:00:27
DSNU400I DSNUBBID - COPY PROCESSED FOR TABLESPACE U7G01T11.TSLINEI DSNUM 2
NUMBER OF PAGES=13831
AVERAGE PERCENT FREE SPACE PER PAGE = 3.35
PERCENT OF CHANGED PAGES = 0.00
ELAPSED TIME=00:00:28
DSNU400I DSNUBBID - COPY PROCESSED FOR INDEXSPACE U7G01T11.TESTNPI
NUMBER OF PAGES=10128
AVERAGE PERCENT FREE SPACE PER PAGE = 0.00
PERCENT OF CHANGED PAGES = 0.00
ELAPSED TIME=00:00:13
DSNU400I DSNUBBID - COPY PROCESSED FOR TABLESPACE U7G01T11.TSLINEI DSNUM 3
NUMBER OF PAGES=107667
AVERAGE PERCENT FREE SPACE PER PAGE = 3.36
PERCENT OF CHANGED PAGES = 0.00
ELAPSED TIME=00:01:22
DSNU428I DSNUBBID - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE U7G01T11.TSLINEI DSNUM 1
DSNU428I DSNUBBID - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE U7G01T11.TSLINEI DSNUM 2
DSNU428I DSNUBBID - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE U7G01T11.TSLINEI DSNUM 3
DSNU428I DSNUBBID - DB2 IMAGE COPY SUCCESSFUL FOR INDEXSPACE U7G01T11.TESTNPI2
DSNU428I DSNUBBID - DB2 IMAGE COPY SUCCESSFUL FOR INDEXSPACE U7G01T11.TESTNPI
DSNU012I DSNUGBAC - UTILITY EXECUTION TERMINATED, HIGHEST RETURN CODE=8
The number of parallel tasks is set to 4 and the number of objects available for copying is 5
(ignoring the rejected partitioning index). The output from a -DISPLAY UTIL command, in
Example 10-4, shows that there were 4 pairs of subtasks running in parallel.
Also worth noting is that the number of objects in the list is 6 (consisting of 3 table space
partitions, 1 partitioning index, and 2 NPIs), and that the partitioning index is included and
later rejected by the COPY utility.
There are two modes to the CHANGELIMIT, a report-only mode and a run mode. If
REPORTONLY is specified, DB2 will only execute the REPORT phase of the utility and will
report on the following values:
The total number of pages. This is the number of pages in the table space and may
include preformatted pages. It is the number of pages that a full copy will copy.
Empty pages in a segmented table space.
Number of changed pages, that is, the number of pages an incremental copy would copy.
Percentage of changed pages.
Recommendation of the type of image copy required.
The report for table spaces will be displayed at the following levels:
Partition level, if the table space is partitioned.
Data set level, if the table space is not partitioned.
The entire table space.
The recommendation given by the REPORT phase is based on values provided by the
CHANGELIMIT parameter values. These values specify the range of changed pages
expressed as a percentage of total pages in the table space.
238 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
One or two values can be supplied by the CHANGELIMIT parameter:
One value:
– An incremental image copy is recommended if the percentage of changed pages is
between zero and the supplied value.
– A full image copy is recommended if the percentage of changed pages is greater than
the supplied value.
– No image copy is recommended if no pages have been changed.
Two values (v1,v2):
– An incremental is recommended if the percentage of changed pages is greater than v1
and less than v2.
– A full copy is recommended if the percentage of changed pages is greater than v2.
– No image copy is recommended if the percentage of changed pages is less than v1.
The default values are 1,10. Example 10-5 illustrates the utility stream input to check for
image copies using the range 0 to 25, while Example 10-6 shows the report from the COPY
command.
The report shows that the COPY takes an incremental copy, at DSNUM ALL level.
NOTE: Image copy data sets are always allocated by COPY, even if REPORTONLY is
specified. Care should be exercised when using GDGs to ensure that valid image copies
are not rolled out. This is the case whether specifying a DD statement or using Templates.
This enables the job stream to automate the processing of image copies, that is, a
REPORTONLY STEP is run and subsequent step process a full copy, or an incremental copy
dependent upon the return code. If using this method ensure that the SYSCOPY DDNAME in
the REPORTONLY step is set to DUMMY to ensure that extra data sets are not cataloged.
Now additional checking can be undertaken using the CHECKPAGE option of the COPY
utility, introduced in DB2 Version 6. This option is equivalent to the checking undertaken by
DSN1COPY with CHECK. By specifying this option, DB2 will validate each page of the table
space or index page as it is processed.
The benefit of using this option is that any errors in the pages are reported at the time that the
backup is taken and not when the image copy is required by the RECOVER utility. The risk of
there being any errors when taking the image is low, but the impact of finding out at recover
time is high.
N ew C H E C K P A G E o p tio n fo r
COPY
COPY
TA B L E S P A C E P A O L O R 8.C H C K P A G E
F U LL YE S
C o py che cks ev e ry in de x an d
C O P YD D N (C O P Y L 1 ) tab le sp ace d ata and sp ace
IN D E X S P A C E P A O L O R 8 .C H C K 1 V $ I
C O P YD D N (IX C P L 1) m a p pa g e
C H E C K P AG E
P A R A LL E L (2 ) p ag e 1 p ag e 2 p ag e 3 p age 4
SHR LEVEL R EFEREN CE
A B C D
rep air o r
tim e
im age copy im age copy co ntin gency re co ve r
check ed
reco ver
n ot checked co py re tu rn c o d e = 0
re turn code =0 return code =8
Figure 10-2 shows how CHECKPAGE can be used to validate images copies. Suppose a
defect is introduced by an undetected hardware error (these are rare events) — it is then
possible to create a defective image copy (time A). If this went undetected, the situation could
arise where all image copies were defective and were not valid for recovery. A periodic COPY
with the CHECKPAGE option would report the error (time B) which provides an opportunity to
rectify the error, either using REPAIR or recovery back to a valid image copy. After repairing
the error, a new valid image copy should then be taken.
240 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If COPY identifies an invalid page, it will:
Identify the page as broken.
Issue message DSNU518I if the page is a data page.
Issue message DSNU441I if the page is a space map page.
Complete RC8.
Set the copy pending flag on (COPY or ICOPY).
Remedial action should be initiated immediately by using either RECOVER PAGE (page
numbers are given by DSNT518I and DSNU441I), RECOVER, or REPAIR.
The performance impact of using checking shows that the elapsed time is not impacted by
this option and that the CPU time for the option shows a rise of 5% for indexes and 15% for
table spaces, which is negligible for an I/O bound utility.
It is recommended that CHECKPAGE be incorporated into the normal image copy cycle to
ensure the integrity of image copies. Be aware that CHECKPAGE will leave any object in copy
pending, which may lead to a service outage, something that does not happen with the
DSN1COPY.
This enhancement meant that the ability to take incremental copies was more realistic.
Previously, the recommended limit for incremental was 10% of the pages being dirty; with this
change, that limit is raised to 50%. To identify the percentage of changed pages that exist in a
data set, use the CHANGELIMIT REPORTONLY option of COPY; see 10.1.5, “The
CHANGELIMIT option” on page 238.
You can take incremental image copies if the following conditions apply:
A full image copy of the table space exists
The COPY-pending status is not on for that table space
The last copy was taken without the CONCURRENT option
TRACKMOD YES is specified
When using incremental copy, one point to remember is that to improve recovery time,
MERGECOPY should be run to incorporate the incremental copies and the full copy into a
new full copy, this will improve the time taken by the RECOVER utility. When running
MERGECOPY, DB2 will request that all the incremental copies and the full copy be available
at the same time. This may cause a problem if the copies are on tape drives and there are
more copies than drives available. Therefore, MERGECOPY should be run frequently,
especially for copies taken to tape, but also for all media to speed up the recovery process.
IF SHRLEVEL REFERENCE is specified, applications can only read the read the data and
not update the data. This may be contrary to business requirements that demand higher
availability. To help meet this need, the use of SHRLEVEL CHANGE should be used; this will
allow full read and write access during the copy process.
Image copy data sets taken with SHRLEVEL CHANGE should NOT be used in a RECOVER
TOCOPY utility, as it may contain data that is not committed. These copies should only be
used in a point-in-time recovery to a previously defined point of consistency. Therefore, it is
important to establish a point of consistency to be used a recovery point, this could be a
QUIESCE RBA, an -ARCHIVE LOG MODE(QUIESCE) point, or the current time.
Some utilities, for example, REORG, demand that a SHRLEVEL REFERENCE copy be taken
after the utility has completed, which led to the outage from REORGs being extended.
REORG can now take inline image copies, which are SHRLEVEL REFERENCE copies,
during execution which eliminates the delay in availability; see 10.1.1, “Inline COPY with
LOAD and REORG” on page 232.
The advantage of image copy to disk is that the elapsed time of the copy is less, due to the
speed of the media and because DB2 can initiate parallelism when using lists with the copy;
see 10.1.4, “COPY parallelism” on page 235. The number of tape units may also restrict the
running of multiple copy jobs and parallelism. The preferred choice is therefore writing to disk.
The secondary, or recovery, image copies have traditionally been sent to tape, or any remote
device, while sending the primary image copy to disk, by specifying the appropriate unit in the
RECOVDDN option. This can drastically reduce the benefits of writing the primary copy to
disk as the elapsed time of the utility is determined by the slowest device or narrowest
bandwidth is case of transmission. With the new utility, COPYTOCOPY (refer to 10.2,
“COPYTOCOPY” on page 243), this delay can be eliminated and the secondary or recovery
copy initiated asynchronously, thus allowing higher availability of the application.
With the cost of new disk coming down and the new disk technology, it is recommended that
image copies be taken to disk and the new utility COPYTOCOPY be used to take recovery
backups, or all should be taken to disk and then migrated with HSM type of function.
Note: Apply the PTF for PQ45835 for problems with COPY when using LISTDEF and
TEMPLATE.
242 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
10.2 COPYTOCOPY
The COPY utility and the LOAD and REORG utilities with Inline COPY can produce local
primary and backup copies and recovery site primary and backup copies. Taking two local
and two recovery sites copies all at once can tie up four tape drives per utility, and if multiple
copies are running concurrently, then there may be a need for more tape drives than are
available to the site. In addition, some customers use remote attached tape drives for their
recovery site copies, thus causing the copy to take longer and decreasing data availability
because of bandwidth constraints. The requirement, therefore, is to be able to take a single
copy and later make additional copies, asynchronously from the original, and record the
copies in SYSCOPY; see Figure 10-3. This will increase the data availability of objects, due to
reducing the elapsed time, and will also reduce the costs of running the COPY utility.
CopyToCopy
Partitioned
Simple table space table space
Segmented
table
spaces
COPYTOCOPY is a new utility that addresses this issue, it can be used to make copies from
an image copy that was taken by the COPY utility, including inline copies taken by REORG
and LOAD utilities. COPYTOCOPY will leave the target object in read write access (UTRW),
which allows for most other utilities and SQL statements to run concurrently on the same
target objects. The utilities that are not allowed to run concurrently with COPYTOCOPY on
the same target object are utilities that insert or delete records into SYSCOPY, these are:
COPY
LOAD
MERGECOPY
MODIFY
RECOVER
QUIESCE
REORG
The only other utilities not allowed concurrently are utilities that operate with SYSCOPY as its
target.
The copies created by COPYTOCOPY can be used by RECOVER when recovering a table
space and are also available to MERGECOPY, UNLOAD, and also another COPYTOCOPY.
Output from the utility is up to three “new” image copies and rows in SYSCOPY that describe
the image copies. All entries in SYSCOPY are the same as the original copy, apart from
ICBACKUP, which donates the type of image copy, for example, remote primary, and so on.
Note: The TIMESTAMP field in SYSCOPY is the timestamp of the original image copy. If
date and time are used in the data set name, as in Example 10-7, these values will contain
the date and time that the COPYTOCOPY was run and not the date and time in the input
copy data set.
244 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
COPYTOCOPY LIST DB01A FROMLASTCOPY
COPYDDN(,LOCALDDN)
RECOVERYDDN(RECOVDDN,RECOVDD2)
FROMLASTCOPY:
Specifies the most recent image copy that was taken for the table space or index space to
be the input to the COPYTOCOPY utility. This could be a full image copy or incremental
copy retrieved from SYSIBM.SYSCOPY
FROMLASTFULLCOPY
Specifies the most recent full image copy that was taken for the object to be the input to
COPYTOCOPY job
FROMLASTINCRCOPY
Specifies the most recent incremental image copy that was taken for the object to be the
input to COPYTOCOPY job. If specified with an INDEX or INDEX SPACE this option will
revert to FROMLASTFULLCOPY
FROMCOPY dsn
Specifies a particular image copy data set (dsn) as the input to COPYTOCOPY job. This
option is not valid for LIST. The GDG names must be specified fully.
The measurements shows that you can save up to 4.9% in elapsed time per copy by using
COPYTOCOPY utility instead of using the COPY utility.
The main benefit from COPYTOCOPY is the ability to make additional image copies
asynchronously and it mostly beneficial when making remote copies to slower devices.
To use COPY to take DFSMS concurrent copies, you must have the following hardware and
software installed:
OS/390 Release 3.
3990 Model 3 or 3990 Model 6 controller at the extended platform attached to the disk. A
COPY job fails if one or more of the table spaces are on a disk that does not have the right
storage controller.
In Figure 10-9 you see the error messages you will get if you are using this option without the
appropriated hardware and software support.
246 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 10-9 Output job for Concurrent COPY with error
Restrictions:
You cannot use a copy made with DFSMS Concurrent COPY with the PAGE or
ERRORRANGE options of the RECOVER utility.
You cannot run the following DB2 stand-alone utilities on copies made by DFSMS
Concurrent COPY:
– DSN1COMP, DSN1COPY, DSN1PRNT
– You can manually restore the DFSMS copy data set under a different name and run
these utilities against the restored data set.
When a recoverable point that indicates that a DFSMS Concurrent COPY is found in
SYSCOPY, the RECOVER utility invokes a DFSMSdss RESTORE command to restore the
copy. Refer to Example 10-11.
PAGE 0001 5695-DF175 DFSMSDSS V2R10.0 DATA SET SERVICES 2001.177 16:34
ADR030I (SCH)-PRIME(01), DCB VALUES HAVE BEEN MODIFIED FOR SYSPRINT
RESTORE DATASET(INCL(PAOLOR1.DSNDBC.U7G01T11.TSPSUPP2.J0001.A001)) -
INDD(SYS00001) CANCE DYNALLOC REPLACE SHARE TOL(ENQF) WAIT(0,0) OUTDY(+
(SBOX58), (SBOX59), (SBOX60), (SBOX57))
ADR101I (R/I)-RI01 (01), TASKID 001 HAS BEEN ASSIGNED TO COMMAND 'RESTORE '
ADR109I (R/I)-RI01 (01), 2001.177 16:34:17 INITIAL SCAN OF USER CONTROL STATEMENTS COMPLETED.
ADR050I (001)-PRIME(01), DFSMSDSS INVOKED VIA APPLICATION INTERFACE
ADR016I (001)-PRIME(01), RACF LOGGING OPTION IN EFFECT FOR THIS TASK
ADR006I (001)-STEND(01), 2001.177 16:34:17 EXECUTION BEGINS
ADR780I (001)-TDDS (01), THE INPUT DUMP DATA SET BEING PROCESSED IS IN LOGICAL DATA SET FORMAT AND
WAS CREATED BY
DFSMSDSS VERSION 2 RELEASE 10 MODIFICATION LEVEL 0
ADR442I (001)-FRLBO(01), DATA SET PAOLOR1.DSNDBC.U7G01T11.TSPSUPP2.J0001.A001 PREALLOCATED, IN
CATALOG UCAT.VSBOX01,
ON VOLUME(S): SBOX58
ADR489I (001)-TDLOG(02), CLUSTER PAOLOR1.DSNDBC.U7G01T11.TSPSUPP2.J0001.A001 WAS RESTORED
CATALOG UCAT.VSBOX01
COMPONENT PAOLOR1.DSNDBD.U7G01T11.TSPSUPP2.J0001.A001
248 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
ADR454I (001)-TDLOG(01), THE FOLLOWING DATA SETS WERE SUCCESSFULLY PROCESSED
PAOLOR1.DSNDBC.U7G01T11.TSPSUPP2.J0001.A001
ADR006I (001)-STEND(02), 2001.177 16:34:21 EXECUTION ENDS
ADR013I (001)-CLTSK(01), 2001.177 16:34:21 TASK COMPLETED WITH RETURN CODE 0000
ADR012I (SCH)-DSSU (01), 2001.177 16:34:21 DFSMSDSS PROCESSING COMPLETE. HIGHEST RETURN CODE IS 0000
With SHRLEVEL REFERENCE the objects are not available for update until the copy
operation is logically completed:
All writes are drained, and the objects being copied have a restrictive state of UTRO.
The objects are quiesced to ensure that all updated buffers are written to disk before
DFSMS Concurrent COPY is invoked. The SYSCOPY records with ICTYPE=Q are
inserted to indicate that the objects are successfully quiesced.
As soon as the DFSMS Concurrent COPY is logically complete, the objects are
dedrained, and the restrictive state is changed from UTRO to UTRW.
When you use the DFSMS Concurrent COPY, you improve the availability of the application
data. However, if your table space data sets have more empty space, the physical copy
operation may take longer than the DB2 image copy. DFSMS Concurrent COPY does not
process the space page map the way DB2 image copy processes it, so the output data sets of
DFSMS Concurrent COPY may require more disk space than the output data sets created by
the DB2 COPY utility.
Ideally, the logical completions of all the table spaces are done together, and all the actual
copying can be done later. However, if you specify a list of table spaces to be copied by
Concurrent COPY with SHRLEVEL REFERENCE, with each of the copies going to the same
output device, all table spaces are put into UTRO and they each remain in UTRO until their
respective copies are logically completed, one by one. This means that each table space on
the list is not logically completed until the previous one is both logically and physically
completed.
This is a big impact to customers who want to cut down the amount of time that their table
spaces are unavailable. This is only a problem when each copy output data set is going to the
same volume.
To get around this restriction with the existing COPY and DFSMS/DSS infrastructure, you can
use the FILTERDDN keyword and copy a list of table spaces using Concurrent COPY to a
single image copy output data set (or up to four identical data sets in the case of Local
Backup, Recovery site Primary, and Recovery site Backup data sets), instead of the current
method of specifying one image copy data set (or up to four in the case of LB, RP, and RB
data sets) for each table space in the list. This way, all of the table spaces are processed with
a single DFDSS DUMP statement.
Users who do not need relief from this availability problem can use the syntax without the
FILTERDDN keyword.
Example 10-12 shows the syntax of a COPY statement without the FILTERDDN, whereas
Example 10-13 is using the FILTERDDN option.
The usage of the FILTERDDN keyword allows for only one COPYDDN parameter for the
entire list of table spaces. Additionally, the use of a new parameter, FILTERDDN, is required.
FILTERDDN points to a DD card for a data set that is passed to DFSMS/DSS that contains a
list of all the table spaces to be processed. The contents of the data set will be written by
DB2, and will be transparent to the user. The user only has to provide a DD card for the
FILTERDDN and make sure that the image copy data set is large enough to contain all the
table spaces in the list.
When you use the FILTERDDN option, one row will be inserted in SYSIBM.SYSCOPY for
each table space or partition (if DSNUM is specified) in the table space list. This is identical as
when you are not using the FILTERDDN option. However, the DSNAME column in SYSCOPY
will have the same data set name for each table space which is copied at one time using
FILTERDDN.
The FILTERDDN keyword is supported when using the DSNUTILS stored procedure to start
DB2 utilities.
A sample JCL using FILTERDDN and TEMPLATEs could look like Example 10-14, and the
sample job output can be found in Example 10-15. As you can see in the job output, there is
only one call to DFSMSdss for both partitions that are being copied.There is only one set of
‘ADR006I execution begins - execution ends’ messages, as well as message ADR801I,
indicating that two data sets were selected when using FILTERDDN, instead of one for every
call to DFSMSdss, if FILTERDDN is not used.
250 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
INCLUDE TABLESPACE U7G01T11.TSPSUPP PARTLEVEL(1)
When you use a data set that was produced with the FILTERDDN option (and contains the
copy data of multiple data sets) in a RECOVER operation, DFDSS will still ask for the data set
multiple times instead of restoring all table spaces in a single RESTORE operation.
252 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
11
We show:
Why collect statistics?
When to use RUNSTATS
History tables
SAMPLE option
KEYCARD
FREQVAL
LOB table space
Performance
Inline execution with LOAD/REORG/REBUILD INDEX
Both sets of statistics can also be used by DBAs to reevaluate physical database design
Two utilities now have the ability to collect statistics inline as they are executing, for example,
REORG and LOAD REPLACE. By collecting inline statistics, the total elapsed time of running
the two utilities, for example REORG followed by RUNSTATS, is greatly reduced due to the
elimination of the I/O required to perform RUNSTATS. This is done at the cost of a slight
increase in CPU time. It is highly recommended that inline statistics be collected where a
utility allows the option. For more details on inline statistics see 3.1.2, “Inline RUNSTATS” on
page 46.
11.3.1 LISTDEF
Like most other utilities, RUNSTATS supports the use of the LISTDEF to pass a list of objects
to a utility statement for further processing. The benefits of using LISTDEF are defined in
Chapter 2, “Wildcarding and Templates” on page 17.
254 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
The list generated by the LISTDEF must only contain objects that can be used in the
particular RUNSTATS command. Therefore the LISTDEF for RUNSTATS TABLESPACE must
only contain table spaces and for RUNSTATS INDEX only indexes are allowed. If this is not
the case the utility finishes with a return code of 8 and states that an object was invalid for the
utility.
Example 11-1 illustrates the use of LISTDEF, while Example 11-2 shows the output.
Example 11-1 Example of LISTDEF
//SYSIN DD *
LISTDEF LISTA1 INCLUDE TABLESPACE DB246129.TS612901
INCLUDE TABLESPACE DB246129.TS612902
RUNSTATS TABLESPACE LIST LISTA1
11.3.2 UPDATE
The UPDATE option controls the scope of statistics that are collected by the RUNSTATS
utility. The options are:
ALL
– Indicates that all collected statistics are to be updated in the catalog
ACCESSPATH
– Indicates that only statistics relevant to access paths are to be updated in the catalog
SPACE
– Indicates that only statistics relevant to helping DBA’s monitor the status of objects are
to be updated in the catalog
NONE
– Indicates that no statistics are to be updated. This is valid only with REPORT.
It is important that statistics are collected regularly when using templates and dynamic data
set sizing. If the statistics are not up-to-date, the wrong sizes will be allocated.
The options for HISTORY are the same as for UPDATE, and the scope of the UPDATE affects
the options allowed for HISTORY. Table 11-1 illustrates the affect of UPDATE parameter on
HISTORY.
Table 11-1 Allowable HISTORY/UPDATE combinations
UPDATE HISTORY
NONE NONE
By using the historical statistics, it is easy to use trend analysis to plan capacity, to identify if
utilities are being run too often or not often enough, and to identify objects that may benefit
from changes to its physical design. Figure 11-1 shows an example of monitoring space
growth over time.
Assuming that the number of rows is constantly increasing so that the highest number is the
latest, the query shows the percentage of rows added over specific time period. This could be
extrapolated to provide an annual growth figure.
256 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Another example is the monitoring of the effectiveness of a compression dictionary. To
determine how often a compression dictionary should be rebuilt, monitor the PAGESAVE
value. Determine when the degradation of the dictionary from the original compression is
greater than 10%. At this point, a dictionary rebuild should be scheduled by omitting the
KEEPDICTIONARY keyword from the REORG; see Figure 11-2 for a rule of thumb to help
decide when to rebuild a dictionary.
i f ( m a x ( p a g e s a v e ) - m in ( p a g e s a v e ) ) * 1 0 0
/m a x (p a g e s a v e ) > 1 0
= (7 3 -6 4 )* 1 0 0 /7 3
= 12
It is recommended that the effectiveness of the existing dictionary be monitored by using the
column PAGESAVE in the new Version 7 catalog table SYSTABLEPART_HIST.
11.3.4 KEYCARD
KEYCARD indicates that RUNSTATS is to collect all of the distinct values in all of the 1 to n
key column combinations for the specified indexes, where n is the number of columns in the
index. For example, if KEYCARD is specified for a three column index (columns IXC1, IXC2,
IXC3) then extra statistics are collected showing the combination of IXC1+IXC2. The values
of IXC1+IXC2+IXC3 are collected in FULLKEYCARF and IXC1 statistics are held in
FIRSTKEYCARF.
Example 11-3 shows KEYCARD being used with RUNSTATS, while Example 11-3 shows the
output job.
It is recommended that KEYCARD be used to collect index statistics when there is correlation
in a multicolumn index keys and queries have less than fully matching equal predicates.
Example 11-5 shows the FREQVAL syntax to collect the 5 most frequent values for the first
two columns of the indexes on tables within table space U7G01T11.TSLINEI. A sample of the
output is shown in Example 11-6.
It is recommended to start with the default COUNT of 10 and adjust upward, as required. It is
not recommended to use a high value for COUNT in an attempt to collect statistics for 100%
of the data; this is not necessary and will increase CPU consumption.
If the frequent values are required for multiple columns, then the FREQVAL keyword should
be repeated as shown in Figure 11-7.
Example 11-7 Multiple FREQVAL statements
//SYSIN DD *
LISTDEF LISTA1 INCLUDE TABLESPACE U7G01T11.TSLINEI
RUNSTATS TABLESPACE LIST LISTA1 TABLE(ALL)
INDEX(ALL KEYCARD FREQVAL NUMCOLS 1 COUNT 5
FREQVAL NUMCOLS 2 COUNT 5
FREQVAL NUMCOLS 3 COUNT 5
FREQVAL NUMCOLS 4 COUNT 5
FREQVAL NUMCOLS 5 COUNT 5 ) REPORT YES
258 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
It is recommended that FREQVAL be used to collect frequent values statistics if there are
literals within partially matching equal predicates. The same recommendation applies if there
are input host variables and the query is reoptimized at run time.
11.3.6 SAMPLE
SAMPLE indicates the percentage of rows to sample when collecting column statistics. Any
value from 1 through 100 can be specified. The default is 25, if SAMPLE is specified. If not
specified, then sampling is not used, equivalent to SAMPLE 100.
When sampling, DB2 still reads all the pages because sampling is at the row level. This
enhancement was introduced to minimize the vast amount of CPU required when hashing
column values to estimate cardinality.
The default value of 25 provides a saving on CPU without compromising access path
selection. In tests smaller sampling values were also used without noticing degradation of
query performance.
Example 11-8, Example 11-9 and Example 11-10 shows the difference in cardinality when
RUNSTATS are taken using SAMPLE 10, 25, and 100 respectively. It can be seen from the
figures that the difference in the cardinality is marginal in all cases.
Example 11-8 RUNSTATS SAMPLE 10
Sel Name Owner T DB Name TS Name Cols Rows Checks
* * * * * * * *
----- ------------------ -------- - -------- -------- ------ ----------- ------
* LINEITEM PAOLOR4 T U7G01T11 TSLINEI 16 1951548 0
Columns of table
Select Column Name Col No Col Type Length Scale Null Def FP Col Card
* * * * * * * * *
------ ------------------ ------ -------- ------ ------ ---- --- -- -----------
L_ORDERKEY 1 INTEGER 4 0 N N N 487775
L_PARTKEY 2 INTEGER 4 0 N N N 165747
L_SUPPKEY 3 INTEGER 4 0 N N N 10112
L_LINENUMBER 4 INTEGER 4 0 N N N 7
L_QUANTITY 5 INTEGER 4 0 N N N 50
L_EXTENDEDPRICE 6 FLOAT 4 0 N N N 934275
L_DISCOUNT 7 FLOAT 4 0 N N N 11
L_TAX 8 FLOAT 4 0 N N N 9
L_RETURNFLAG 9 CHAR 1 0 N N N 3
L_LINESTATUS 10 CHAR 1 0 N N N 2
L_SHIPDATE 11 DATE 4 0 N N N 2304
L_COMMITDATE 12 DATE 4 0 N N N 2240
L_RECEIPTDATE 13 DATE 4 0 N N N 2368
L_SHIPINSTRUCT 14 CHAR 25 0 N N N 4
L_SHIPMODE 15 CHAR 10 0 N N N 7
Columns of index
Sel Column Name Seq No O Col Type Length Scale Null Def FP Col Card
* * * * * * * * * *
--- ------------------ ------ - -------- ------ ------ ---- --- -- -----------
L_ORDERKEY 1 A INTEGER 4 0 N N N 487775
L_SHIPDATE 2 A DATE 4 0 N N N 2304
L_RETURNFLAG 3 A CHAR 1 0 N N N 3
L_SUPPKEY 4 A INTEGER 4 0 N N N 10112
L_EXTENDEDPRICE 5 A FLOAT 4 0 N N N 934275
L_DISCOUNT 6 A FLOAT 4 0 N N N 11
Columns of index
Sel Column Name Seq No O Col Type Length Scale Null Def FP Col Card
* * * * * * * * * *
--- ------------------ ------ - -------- ------ ------ ---- --- -- -----------
L_ORDERKEY 1 A INTEGER 4 0 N N N 487775
L_SHIPDATE 2 A DATE 4 0 N N N 2304
L_RETURNFLAG 3 A CHAR 1 0 N N N 3
L_SUPPKEY 4 A INTEGER 4 0 N N N 10112
L_EXTENDEDPRICE 5 A FLOAT 4 0 N N N 863540
L_DISCOUNT 6 A FLOAT 4 0 N N N 11
260 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Columns of index
Sel Column Name Seq No O Col Type Length Scale Null Def FP Col Card
* * * * * * * * * *
--- ------------------ ------ - -------- ------ ------ ---- --- -- -----------
L_ORDERKEY 1 A INTEGER 4 0 N N N 487775
L_SHIPDATE 2 A DATE 4 0 N N N 2304
L_RETURNFLAG 3 A CHAR 1 0 N N N 3
L_SUPPKEY 4 A INTEGER 4 0 N N N 10112
L_EXTENDEDPRICE 5 A FLOAT 4 0 N N N 835584
L_DISCOUNT 6 A FLOAT 4 0 N N N 11
It is recommended that SAMPLE 25 be used initially and testing undertaken to find the break
even point that balances the savings in CPU against query performance.
.
Note:
If the sample keyword is not supplied, SAMPLE 100 is the RUNSTATS default.
If the sample keyword is supplied, SAMPLE 25 is the default.
DB2 extrapolates values, based on the percentage supplied by the SAMPLE parameter
If 100% accuracy in required, do not use sampling.
11.3.7 FORCEROLLUP
This parameter controls the population of statistics for an empty partition of a partitioned table
space. Prior to DB2 Version 7 if a partition was empty then the statistics reflected this fact.
This could affect access path selection especially for packages bound prior to data being
loaded into the partition. By using FORCEROLLUP, RUNSTATS will now populate the
statistics for the partition with values reflecting the values in the other partitions. This resolves
the access path selection issue and means that rebinding of packages need not be
undertaken when empty partitions start being populated.
262 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
LOB
– Specifies that only LOB table spaces and related index spaces containing indexes on
auxiliary tables are to be included in this element of the list. If the initial search for the
object results in a LOB object, auxiliary relationships are not followed. If the initial
search for the object results in a base object, the auxiliary relationship is applied to the
LOB table space or index space, and only those objects become part of the resulting
list.
If these options are not specified, then the statistics in the SYSLOBS and SYSLOBS_HIST
will not updated. Example 11-12 shows the LISTDEF required to generate the correct lists for
RUNSTATS to use when collecting statistics for a LOB table space. Specifying ALL
automatically includes all related LOB objects, that is, BASE, AUX and LOB. Example 11-13
shows the statements required if not using LISTDEF, while Example 11-14 provides a sample
output from the RUNSTATS utility.
It is recommended that LISTDEF with option ALL be used for simplicity when building lists for
LOB table spaces, because it automatically includes all objects related to the LOB.
Also, LOB columns statistics are not used for access path selection. For indexes on auxiliary
tables, only the NLEVELS and FIRSTKEYCARDF columns in SYSIBM.SYSINDEXES have
an effect on the access path.
Figure 11-3 shows details collected by RUNSTATS for LOB table spaces.
LOB Management
264 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
If using LOBs (Large Objects), the table space holding the actual LOB data is by managed
statistics in SYSTABLEPART in the same way as shown for regular table spaces. However,
there are also additional statistics saved in SYSLOBSTATS. These are:
AVGSIZE
– This is the average size of all LOBs in the table space, that is, the total number of LOB
bytes divided by the number of LOBs.
FREESPACE
– This gives an indication of how many more LOBs can be added in the existing extents
already allocated.
ORGRATIO
– This is a measure of fragmentation or non-optimal organization of the LOBs stored in
the table space. A value of 1.0 is optimal.
See 4.2.5, “LOB space management” on page 81 for more details.
Restriction:
The TABLE option is invalid for LOB table space.
SHRLEVEL REFERENCE or CHANGE, REPORT YES or NO, UPDATE ALL or NONE
are the only options allowed or LOB table spaces.
History tables contain the statistics that were previously contained in their counterpart tables
in SYSDBASE; see Figure 11-4. The HISTORY statistics tables are not identical, but do
contain the same statistics columns. SYSDBASE contains only one row for each object/part
which is updated when collecting statistics. For the history tables in SYSHIST, rows with
statistics for each object/part are inserted when HISTORY is specified during RUNSTATS
utility along with a STATSTIME value. The history statistics remain until the new MODIFY
STATISTICS utility in Version 7 removes them; see 11.6, “Removing HISTORY statistics” on
page 267.
SYSLOBSTATS SYSLOBSTATS_HIST
SYSTABLES SYSTABLES_HIST
SYSINDEXPART SYSINDEXPART_HIST
SYSTABLEPART SYSTABLEPART_HIST
SYSDBASE SYSHIST
And these tables contain both SPACE and ACCESSPATH related statistics:
SYSTABLES_HIST
SYSINDEXES_HIST
These tables are populated by using the HISTORY option of the RUNSTATS utility; see
11.3.3, “HISTORY” on page 256.
Table 11-3 illustrates which tables are updated when HISTORY option is used.
SYSCOLDIST_HIST X X
SYSCOLUMNS_HIST X X X X X
SYSINDEXES_HIST X X X X
266 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
SPACE ACCESSPATH ALL LOB table space
SYSINDEXPART_HIST X X X
SYSINDEXSTATS_HIST X X
SYSTABLEPART_HIST X X X X X X
SYSTABSTATS_HIST X X X X
SYSTABLES_HIST X X X X X X X X
SYSLOBSTATS_HIST NP NP NP NP NP X X X
NP - Not Permitted
There are also new fields added to the “current” statistics fields in DSNDB06 which can be
used in determining when utilities are to be executed, these are shown in more detail in 4.2,
“New values with DB2 V7” on page 74.
Two of the new fields, DSNUM and EXTENTS, are shown in the Example 11-15.
A listing of the data sets underlying the table space shows the extents; see Example 11-16.
The SPACE field can also be shown to match the tracks listed. DB2 on a DASD MODEL
3390-3 allocates 12x4K pages per track, resulting in 48K per track. Therefore the relationship
between SPACE(F) and tracks can be demonstrated as SPACE(F)/48 = TRACKS. In this
example 72000/48 = 1500.
The syntax for the MODIFY STATISTICS utility is given in Figure 11-5.
As shown in Figure 11-5, you can use LIST to specify the name of a previously-defined
LISTDEF list name, or TABLESPACE, INDEXSPACE or INDEX can be explicitly coded.
There are three options to determine which statistics to delete. The delete options are:
ALL
– Deletes all statistics history rows that are related to the specified object from all catalog
history tables.
ACCESSPATH
– Deletes all access-path statistics history rows that are related to the specified object
from the following history tables:
SYSIBM.SYSCOLDIST_HIST
SYSIBM.SYSCOLUMNS_HIST
SPACE
– Deletes all space related statistics history rows related to the specified object from the
following history tables:
SYSIBM.SYSINDEXPART_HIST
SYSIBM.SYSTABLEPART_HIST
SYSIBM.SYSLOBSTATS_HIST
Example 11-17 shows an example of deleting SPACE table space statistics based upon a
date, while Example 11-18 shows the job output.
268 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Example 11-17 MODIFY STATISTICS by date using SPACE
//DSNUPROC.SYSIN DD *
MODIFY STATISTICS TABLESPACE U7G01T11.TSLINEI DELETE SPACE
DATE(20010614)
Table 11-4 shows the behavior of MODIFY STATISTICS and RUNSTATS SPACE
Table 11-4 Deleting statistics gathered by HISTORY SPACE
Runstats MODIFY MODIFY MODIFY MODIFY
Table space Index space Table space Index space
SPACE
Table Index Delete Delete Delete (ALL) Delete (ALL)
(ALL) (ALL) Space Space
SYSCOLDIST_HIST
SYSCOLUMNS_HIST
SYSINDEXES_HIST X ND ERR
SYSINDEXPART_HIST X X X
SYSINDEXSTATS_HIST
SYSTABLEPART_HIST X X X ND X ND
SYSTABSTATS_HIST
SYSTABLES_HIST X X ND ND X ND
SYSLOBSTATS_HIST NP NP X NP X NP
Note: Apply PTF UQ56653 for APAR PQ50666 to eliminate some inconsistencies in
deleting RUNSTATS HISTORY entries related to indexes.
When running RUNSTATS TABLESPACE UPDATE SPACE HISTORY SPACE TABLE (ALL)
(column 1), DB2 update tables SYSTABLEPART_HIST and SYSTABLES_HIST (column 1).
In Table 11-5 you can see the behavior of MODIFY STATISTICS after RUNSTATS
ACCESSPATH.
Table 11-5 Deleting statistics gathered by HISTORY ACCESSPATH
Runstats MODIFY MODIFY MODIFY MODIFY
Table space Index space Table space Index space
ACCESSPATH
Table Index Delete Delete Delete (ALL) Delete (ALL)
(ALL) (ALL) Accesspath Accesspath
SYSCOLDIST_HIST X X X
SYSCOLUMNS_HIST X X X ND X X
SYSINDEXES_HIST X ND ERR
SYSINDEXPART_HIST
SYSINDEXSTATS_HIST X ND X
SYSTABLEPART_HIST
SYSTABSTATS_HIST X X ND ND X ND
SYSTABLES_HIST X X ND ND X ND
SYSLOBSTATS_HIST NP NP NP NP X NP
In Table 11-6 you can see the behavior of MODIFY STATISTICS after RUNSTATS ALL
Table 11-6 Deleting statistics gathered by HISTORY ALL
Runstats MODIFY MODIFY
Table space Index space
ALL
Table Index Delete Delete
(ALL) (ALL) ALL ALL
SYSCOLDIST_HIST X X
SYSCOLUMNS_HIST X X X X
SYSINDEXES_HIST X ERR
SYSINDEXPART_HIST X X
SYSINDEXSTATS_HIST X X
SYSTABLEPART_HIST X X X ND
SYSTABSTATS_HIST X X X ND
SYSTABLES_HIST X X X ND
SYSLOBSTATS_HIST X X X X
270 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
11.7 RUNSTATS performance
Various RUNSTATS options were run to gather performance data on a partitioned table space
with 7,900,000 rows. There were 3 partitions and no NPIs defined. Table 11-7 shows the
comparison based using RUNSTATS SAMPLE 25 as a base figure.
Table 11-7 RUNSTATS performance
Runstats Utility Delta(%)
The HISTORY options in the RUNSTATS utility did not increase either CPU time or ELAPSED
time significantly
The SAMPLE 100 is recommended when undertaking performance tuning and the cardinality
of all non-index columns is required.
Part 4 Appendixes
This part includes reference documentation distributed as follows:
Appendix A, “Tables for Real-Time Statistics” on page 275:
Details on the tables that represent the data repository for the Real Time Statistics and the
way that the execution of DB2 Utilities impacts their contents.
Appendix B, “Sample JCL for copying Catalog and Directory objects” on page 287:
A sample job stream for use when securing the catalog and directory using the COPY
utility. It illustrates a method of using LISTDEF and Template when processing the Catalog
and Directory.
Appendix C, “Creating a shadow catalog” on page 289:
Four jobs that can be used to define a shadow catalog, UNLOAD data from DSNDB06,
LOAD into shadow catalog, and RUNSTATS using LISTDEF wildcard.
Appendix D, “UNICODE support” on page 293:
Procedures to install and activate the 1140 (Euro English) to UNICODE 367 conversion for
usage with the UNLOAD Utility.
Appendix E, “Additional material” on page 297:
A description of how to download a job containing DDL to create and populate a shadow
DB2 catalog.
Sample DDL
Use the following SQL data definition statements to create the statistics database, table
space, tables, and indexes. The names and declared attributes of the objects must not be
changed. However, other attributes can be changed. For example, row locking can be
specified. This sample DDL is also shipped in data set DSN710.SDSNSAMP, member name
DSNTESS.
The amount of primary and secondary space to allocate can be calculated by using the
formulas in the DB2 UDB for OS/390 and z/OS Version 7 Administration Guide, SC26-9931.
The approximate number of rows in the two statistics tables can be determined by the
following SQL statement. The sample DDL used 20,000 objects as an estimate to determine
the amount of space to request.
SELECT C1 + C2 FROM
(SELECT COUNT(*) AS C1 FROM SYSIBM.SYSTABLEPART) AS T1,
(SELECT COUNT(*) AS C2 FROM SYSIBM.SYSINDEXPART) AS T2;
CREATE DATABASE DSNRTSDB CCSID EBCDIC;
CREATE TABLESPACE DSNRTSTS IN DSNRTSDB
CCSID EBCDIC
CLOSE NO
LOCKMAX 0
SEGSIZE 32
USING STOGROUP SYSDEFLT
PRIQTY 1600
SECQTY 160;
CREATE TABLE SYSIBM.TABLESPACESTATS
(DBNAME CHAR(8) NOT NULL,
NAME CHAR(8) NOT NULL,
PARTITION INTEGER NOT NULL,
DBID SMALLINT NOT NULL,
PSID SMALLINT NOT NULL,
276 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
CopyChanges INTEGER ,
CopyUpdateLRSN CHAR(6) FOR BIT DATA,
CopyUpdateTime TIMESTAMP )
IN DSNRTSDB.DSNRTSTS CCSID EBCDIC;
CREATE UNIQUE INDEX SYSIBM.INDEXSPACESTATS_IX
ON SYSIBM.INDEXSPACESTATS
(DBID, ISOBID, PSID, PARTITION)
CLUSTER CLOSE NO;
PARTITION Data set number within the table space. For partitioned table spaces, this value
corresponds to the partition number for a single partition, or 0 for an entire
non-partitioned table space.
UPDATESTATSTIME The timestamp of when the row was inserted or last updated.
TOTALROWS Number of rows or LOBs in the table space or partition if the table space is
partitioned. The total is the sum of rows in each table if the table space contains
more than one table. NULL indicates that the value is unknown.
NACTIVE The number of active pages in the table space or partition. This value is equivalent
to the number of preformatted pages. For multi-piece table spaces this is the sum
of preformatted pages in all data sets. NULL indicates that the value is unknown.
SPACE The amount of space allocated to the table space or partition. The value is the total
number of kilobytes allocated. For multi-piece linear page sets, this is the sum of
space in all data sets. NULL indicates that the value is unknown.
EXTENTS The number of physical extents in the table space or partition. For multi-piece table
spaces this is the number of extents of the last data set. NULL indicates that the
value is unknown.
LOADRLASTTIME The timestamp of the last LOAD REPLACE on the table space or table space
partition. NULL indicates that LOAD has never been run or the timestamp is
unknown.
REORGLASTTIME The timestamp of the last REORG on the table space or table space partition. NULL
indicates that REORG has never been run or the timestamp is unknown.
REORGINSERTS The number of records or LOBs inserted since the last REORG or LOAD
REPLACE. NULL indicates that the value is unknown.
REORGDELETES The number of records or LOBs deleted since the last REORG or LOAD REPLACE.
NULL indicates that the value is unknown.
REORGUPDATES The number of records updated since the last REORG or LOAD REPLACE. The
number of LOB updates is accumulated in Reorganizers and ReorgDeletes since
LOBs are deleted and inserted to accomplish a LOB update. NULL indicates that
the value is unknown.
REORGDISORGLOB The number of LOBs inserted that were not considered perfectly chunked since the
last REORG or LOAD REPLACE. A LOB is considered perfectly chunked if the
pages allocated are contained within the minimum number of chunks possible. That
is, pages allocated/chunk size = minimum chunks needed. NULL indicates that the
value is unknown.
REORGUNCLUSTINS The number of records inserted that were not considered well clustered with respect
to the clustering index since the last REORG or LOAD REPLACE. A record is
considered well clustered if the page is inserted on a page within plus or minus 16
pages of the ideal candidate page. The ideal candidate page is determined by the
clustering index. NULL indicates that the value is unknown.
REORGMASSDELETES The number of mass deletes from a segmented or LOB table space, or the number
of dropped tables from a non-segmented table space, since the last REORG or
LOAD REPLACE. NULL indicates that the value is unknown.
REORGNEARINDREF The number of overflow records created since the last REORG or LOAD REPLACE
that were relocated “near” the pointer record. For nonsegregated table spaces, a
page is considered "near" the present page if the two page numbers differ by 16 or
less. For segmented table spaces, a page is considered "near" the present page if
the two page numbers differ by (SEGSIZE * 2) or less. NULL indicates that the value
is unknown.
REORGFARINDREF The number of overflow records created since the last REORG or LOAD REPLACE
that were relocated “far” from the pointer record. For nonsegmented table spaces,
a page is considered "far" from the present page if the two page numbers differ by
17 or more. For segmented table spaces, a page is considered "far" from the
present page if the two page numbers differ by (SEGSIZE * 2) +1 or more. NULL
indicates that the value is unknown.
STATSLASTTIME The timestamp of the last RUNSTATS on the table space or table space partition.
NULL indicates that RUNSTATS has never been run or the timestamp is unknown.
STATSINSERTS The number of records or LOBs inserted since the last RUNSTATS. NULL indicates
that the value is unknown.
STATSDELETES The number of records or LOBs deleted since the last RUNSTATS. NULL indicates
that the value is unknown.
STATSUPDATES The number of records updated since the last RUNSTATS. The number of LOB
updates is accumulated in ReorgInserts and ReorgDeletes since LOBs are deleted
and inserted to accomplish a LOB update. NULL indicates that the value is
unknown.
STATSMASSDELETES The number of mass deletes from a segmented or LOB table space, or the number
of dropped tables from a non-segmented table space, since the last RUNSTATS.
NULL indicates that the value is unknown.
COPYLASTTIME The timestamp of the last full image copy on the table space or table space partition.
NULL indicates that COPY has never been run or the timestamp is unknown.
COPYUPDATEDPAGES The number of distinct pages updated since the last COPY. NULL indicates that the
value is unknown.
COPYCHANGES The number of inserts, deletes and updates since the last COPY. This number
approximates the number of log records to be processed to recover to currency.
NULL indicates that the value is unknown.
COPYUPDATELRSN The LRSN or RBA of the first update after the last COPY. NULL indicates that the
value is unknown.
278 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Column Content
COPYUPDATETIME The timestamp of the first update after the last COPY. NULL indicates that the value
is unknown.
PARTITION Data set number within the index space. For partitioned index spaces, this value
corresponds to the partition number for a single partition, or 0 for an entire
non-partitioned index space.
PSID Internal identifier of the table space page set descriptor in which the table, that the
index is created on, exists.
UPDATESTATSTIME The timestamp of when the row was inserted or last updated.
TOTALENTRIES Number of index entries (including duplicates) in the index space or partition if the
index is partitioned. NULL indicates that the value is unknown.
NLEVELS The number of levels in the index tree. NULL indicates that the value is unknown.
NACTIVE The number of active pages in the index space or partition. This value is equivalent
to the number of preformatted pages. NULL indicates that the value is unknown.
SPACE The amount of space allocated to the index space or partition. The value is the total
number of kilobytes allocated. NULL indicates that the value is unknown.
EXTENTS The number of physical extents in the index space or partition. For multi-piece index
spaces this is the number of extents of the last data set. NULL indicates that the
value is unknown.
LOADRLASTTIME The timestamp of the last LOAD REPLACE on the index space or partition. NULL
indicates that LOAD has never been run or the timestamp is unknown.
REBUILDLASTTIME The timestamp of the last REBUILD INDEX on the index space or partition. NULL
indicates that REBUILD INDEX has never been run or the timestamp is unknown.
REORGLASTTIME The timestamp of the last REORG on the index space or partition. NULL indicates
that REORG has never been run or the timestamp is unknown.
REORGINSERTS The number of index entries inserted since the last REORG, REBUILD INDEX or
LOAD REPLACE. NULL indicates that the value is unknown.
REORGDELETES The number of index entries deleted since the last REORG, REBUILD INDEX or
LOAD REPLACE. NULL indicates that the value is unknown.
REORGAPPENDINSERT The number of index entries inserted since the last REORG, REBUILD INDEX or
LOAD REPLACE that has a key value greater than a maximum key value in the
index or partition of the index. NULL indicates that the value is unknown.
REORGPSEUDODELETES The number of index entries that are currently pseudo deleted since the last
REORG, REBUILD INDEX or LOAD REPLACE. NULL indicates that the value is
unknown.
REORGMASSDELETES The number of times the index or index partition was mass deleted since the last
REORG, REBUILD INDEX or LOAD REPLACE. NULL indicates that the value is
unknown.
REORGLEAFNEAR Since the last REORG, REBUILD INDEX, or LOAD REPLACE, the number of index
page splits that were near-off. A page is considered near-off the present page if the
difference between the two page numbers is greater than 1 and less than 16. NULL
indicates that the value is unknown.
REORGLEAFFAR Since the last REORG, REBUILD INDEX, or LOAD REPLACE, the number of index
page splits that were far-off. A page is considered far-off the present page if the
difference between the two page numbers is 16 or greater. NULL indicates that the
value is unknown.
REORGNUMLEVELS The number of levels in the index tree added or removed since the last REORG,
REBUILD INDEX, or LOAD REPLACE. NULL indicates that the value is unknown.
STATSLASTTIME The timestamp of the last RUNSTATS on the index space or index space partition.
NULL indicates that RUNSTATS has never been run or the timestamp is unknown.
STATSINSERTS The number of index entries inserted since the last RUNSTATS. NULL indicates that
the value is unknown.
STATSDELETES The number of index entries deleted since the last RUNSTATS. NULL indicates that
the value is unknown.
STATSMASSDELETES The number of times the index or index space partition was mass deleted since the
last RUNSTATS. NULL indicates that the value is unknown.
COPYLASTTIME The timestamp of the last full image copy on the index space or index space
partition. NULL indicates that COPY has never been run or the timestamp is
unknown. NULL indicates that the value is unknown.
COPYUPDATEDPAGES The number of distinct pages updated since the last COPY for COPY YES indexes.
NULL indicates that the value is unknown.
COPYCHANGES The number of inserts and deletes since the last COPY for COPY YES indexes.
This number approximates the number of log records to be processed to recover to
currency. NULL indicates that the value is unknown.
COPYUPDATELRSN The LRSN or RBA of the first update after the last COPY for COPY YES indexes.
NULL indicates that the value is unknown.
COPYUPDATETIME The timestamp of the first update after the last COPY for COPY YES indexes. NULL
indicates that the value is unknown.
Impact of utilities
Next, we look at the values changed by the execution of DB2 utilities.
280 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Also, in the following sections, Reorg*, Stats*, and Copy* are a shorthand notation to describe
the entire set of interval counters for the general category (Reorg, Stats, or Copy). The set
does not include the timestamp column values.
LOAD REPLACE
282 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
TotalRows is set to the number of rows or LOBs loaded. Note that under some conditions
the REORG utility may not have an accurate count of loaded records (for example utility
restart). In this case, and in cases where the total rows are unknown, the TotalRows value
is set to NULL.
StatsLastTime is set to the current timestamp. The timestamp may be different for every
partition involved in the REORG. (1)
CopyLastTime is set to the current timestamp. The timestamp may be different for every
partition involved in the REORG. (2)
REBUILD INDEX
284 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
RUNSTATS UPDATE ALL
The StatsLastTime is only updated and the STATS* statistics are only reset when the utility
was run with UPDATE ALL. If UPDATE NONE, SPACE or ACCESSPATH was specified, not
all statistics are collected by RUNSTATS.
After the COPY or MERGE phase, the following statistics are changed:
CopyLastTime is set to the timestamp at the beginning of the COPY or MERGE phase.
The timestamp may be different for every partition involved in the Copy.
Copy* are accumulated during Copy and are updated in the Real-Time Statistics tables
the next time statistics are externalized for the object. Any changes that are accumulated
are relative to the beginning of the Copy or Merge.
RECOVER TO CURRENCY
After recovery to currency, the in-memory counter fields are still valid. Nactive, Space and
Extents column values are updated to reflect the new values.
After doing a recovery to a point-in-time, the statistics are potentially invalid. This is true for
table space, index space, partition, or data set level recovery. For data set level recovery, the
entire space row is affected. If the object is partitioned, only the corresponding partition row is
affected. Therefore, the following statistics are set to NULL:
Reorg*
Stats*
Copy*
Nlevels, Nactive, Space, and Extents column values are updated to reflect the new values.
286 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
B
288 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
C
The source for these jobs is available for download as described in Appendix E., “Additional
material” on page 297.
You will need to edit the SYSPUNCH output from the UNLOAD utility by replacing RESUME
YES with RESUME NO REPLACE, if you plan to refresh your shadow data base frequently
with fresh data from DSNDB06. Example C-3 contains the LOAD statements for the LOAD
utility produced by the UNLOAD utility.
290 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Tip: Save the SYSPUNCH data sets to avoid overwrite by UNLOAD utility. These data sets
can be reused. You can also avoid unnecessary editing. Use a data set naming standard
for the output data set from UNLOAD.
292 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
D
/********************************************
* INPUT STATEMENTS FOR THE IMAGE GENERATOR *
********************************************/
/*
The output of the job is shown in Example D-2. Please note that the job creates the
conversion table indirectly, since no direct conversion from 1140 to 367 is currently available.
294 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Activate the UNICODE table
These are the steps to activate the UNICODE translation table:
Copy the conversion table (the highlighted module in Example D-1 on page 293) into
SYS1.PARMLIB as CUNIMG01.
Create a new member in SYS1.PARMLIB, member name CUNUNI01. The contents of
CUNUNI01 are shown in Example D-3.
Example: D-3 Contents of CUNUNI01 in SYS1.PARMLIB
/**********************************************************/
/* */
/* CUNUNIXX - UNICODE CONVERSION CONTROL PARAMETERS */
/* */
/**********************************************************/
/* ESTABLISH A NEW ENVIRONMENT */
/**********************************************************/
/* REQUIRED KEYWORD REALSTORAGE */
/* MAXIMAL USED PAGES OF REAL STORAGE, MIN=0 MAX=524287 */
/* WHERE 0 MEANS NO EXPLICITE LIMIT (=524287) */
/**********************************************************/
REALSTORAGE 51200; /* E.G. 200 MB */
/**********************************************************/
/* REQUIRED KEYWORD IMAGE WITH */
/* REQUIRED PARAMETER: MEMBER NAME */
/* THIS MEMBER MUST BE PLACED IN A DATA SET FROM THE */
/* LOGICAL PARMLIB CONCATENATION (DEF'D IN LOADXX) */
/**********************************************************/
IMAGE CUNIMG01;
Activate the new UNICODE table with command T UNI=01. The SYSLOG output is
reported in Example D-4.
296 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
E
Select the Additional materials and open the directory that corresponds with the redbook
form number, SG246289.
298 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Special notices
References in this publication to IBM products, programs or services do not imply that IBM
intends to make these available in all countries in which IBM operates. Any reference to an
IBM product, program, or service is not intended to state or imply that only IBM's product,
program, or service may be used. Any functionally equivalent program that does not infringe
any of IBM's intellectual property rights may be used instead of the IBM product, program or
service.
Information in this book was developed in conjunction with use of the equipment specified,
and is limited in application to those specific hardware and software products and levels.
IBM may have patents or pending patent applications covering subject matter in this
document. The furnishing of this document does not give you any license to these patents.
You can send license inquiries, in writing, to the IBM Director of Licensing, IBM Corporation,
North Castle Drive, Armonk, NY 10504-1785.
Licensees of this program who wish to have information about it for the purpose of enabling:
(i) the exchange of information between independently created programs and other programs
(including this one) and (ii) the mutual use of the information which has been exchanged,
should contact IBM Corporation, Dept. 600A, Mail Drop 1329, Somers, NY 10589 USA.
Such information may be available, subject to appropriate terms and conditions, including in
some cases, payment of a fee.
The information contained in this document has not been submitted to any formal IBM test
and is distributed AS IS. The use of this information or the implementation of any of these
techniques is a customer responsibility and depends on the customer's ability to evaluate and
integrate them into the customer's operational environment. While each item may have been
reviewed by IBM for accuracy in a specific situation, there is no guarantee that the same or
similar results will be obtained elsewhere. Customers attempting to adapt these techniques to
their own environments do so at their own risk.
Any pointers in this publication to external Web sites are provided for convenience only and
do not in any manner serve as an endorsement of these Web sites.
C-bus is a trademark of Corollary, Inc. in the United States and/or other countries.
Java and all Java-based trademarks and logos are trademarks or registered trademarks of
Sun Microsystems, Inc. in the United States and/or other countries.
Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft
Corporation in the United States and/or other countries.
UNIX is a registered trademark in the United States and other countries licensed exclusively
through The Open Group.
SET, SET Secure Electronic Transaction, and the SET Logo are trademarks owned by SET
Secure Electronic Transaction LLC.
Other company, product, and service names may be trademarks or service marks of others.
300 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Related publications
The publications listed in this section are considered particularly suitable for a more detailed
discussion of the topics covered in this redbook.
IBM Redbooks
For information on ordering these publications, see “How to get IBM Redbooks” on page 302.
DB2 for z/OS and OS/390 Version 7 Performance Topics, SG24-5129
DB2 UDB Server for OS/390 and z/OS Version 7 Presentation Guide, SG24-6121
DB2 UDB Server for OS/390 Version 6 Technical Update, SG24-6108
DB2 Java Stored Procedures - Learning by Example, SG24-5945
DB2 UDB for OS/390 Version 6 Performance Topics, SG24-5351
DB2 for OS/390 Version 5 Performance Topics, SG24-2213
DB2 for MVS/ESA Version 4 Non-Data-Sharing Performance Topics, SG24-4562
DB2 UDB for OS/390 Version 6 Management Tools Package, SG24-5759
DB2 Server for OS/390 Version 5 Recent Enhancements - Reference Guide, SG24-5421
DB2 for OS/390 Capacity Planning, SG24-2244
Developing Cross-Platform DB2 Stored Procedures: SQL Procedures and the DB2 Stored
Procedure Builder, SG24-5485
DB2 for OS/390 and Continuous Availability, SG24-5486
Parallel Sysplex Configuration: Cookbook, SG24-2076-00
DB2 for OS390 Application Design for High Performance, GG24-2233
Using RVA and SnapShot for Business Intelligence Applications with OS/390 and DB2,
SG24-5333
IBM Enterprise Storage Server Performance Monitoring and Tuning Guide, SG24-5656
DFSMS Release 10 Technical Update, SG24-6120
Storage Management with DB2 for OS/390, SG24-5462
Implementing ESS Copy Services on S/390, SG24-5680
Other resources
These publications are also relevant as further information sources:
DB2 UDB for OS/390 and z/OS Version 7 What’s New, GC26-9946
DB2 UDB for OS/390 and z/OS Version 7 Installation Guide, GC26-9936
DB2 UDB for OS/390 and z/OS Version 7 Command Reference, SC26-9934
DB2 UDB for OS/390 and z/OS Version 7 Messages and Codes, GC26-9940
DB2 UDB for OS/390 and z/OS Version 7 Utility Guide and Reference, SC26-9945
DB2 UDB for OS/390 and z/OS Version 7 Programming Guide and Reference for Java,
SC26-9932
Also download additional materials (code samples or diskette/CD-ROM images) from this
Redbooks site.
Redpieces are Redbooks in progress; not all Redbooks become redpieces and sometimes
just a few chapters will be published this way. The intent is to get the information out much
quicker than the formal publishing process allows.
302 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Abbreviations and acronyms
AIX Advanced Interactive eXecutive EBCDIC extended binary coded decimal
from IBM interchange code
APAR authorized program analysis report ECS enhanced catalog sharing
ARM automatic restart manager ECSA extended common storage area
ASCII American National Standard Code EDM environment descriptor
for Information Interchange management
BLOB binary large objects ERP enterprise resource planning
CCSID coded character set identifier ESA Enterprise Systems Architecture
CCA client configuration assistant ETR external throughput rate, an
CFCC coupling facility control code elapsed time measure, focuses on
system capacity
CTT created temporary table
FDT functional track directory
CEC central electronics complex
FTP File Transfer Program
CD compact disk
GB gigabyte (1,073,741,824 bytes)
CF coupling facility
GBP group buffer pool
CFRM coupling facility resource
management GRS global resource serialization
304 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
Index
D
B D UNI,ALL 296
BUILD2 12, 175 data set extents 80
data set sizing 28
DB2 Administration Tool 94
C DB2 Automation Tool 96
CARDF 71, 84
DB2 Change Accumulation Tool 224
CCSID 204
DEADLINE 181
CHANGELIMIT 70, 73, 238
degree of parallelism 118
CHECK 226
DELAY 180
CHECK DATA 228
DFSMS 102
phases 228
DFSMSdss 248
CHECK INDEX 226, 233
DFSORT 183
CHECK LOB 229
DISCARD 156, 185, 191
phases 229
DISPLAY UTIL 237
check page 240
DISPLAY UTILITY command 181
CHECKP 150
DRAIN 177
compression dictionary 84
DRAIN_WAIT 177
Concurrent Copy 246
DSN1CHKR 8
CONTINUE 180
DSN1COMP 8
Control Center 89
DSN1COPY 8, 193, 234
COPY 73, 232
DSN1LOGP 8
CHECKPAGE 240
DSN1PRNT 9
copying to disk 242
DSN1SDMP 9
copying to tape 242
DSNACCOR 88
FULL 241
DSNJLOGF 7
INCREMENTAL 241
DSNJU003 7
Lists 235
DSNJU004 8
parallelism 235
DSNRTSDB 86
SHRLEVEL CHANGE 242
DSNRTSTS 86
SHRLEVEL REFERENCE 242
DSNTESS 86
COPY PARALLEL 49
DSNTIPB 104
COPY YES 234
DSNTIPL 217
COPYDDN 110
DSNU070I 138
copying Catalog and Directory objects 287
DSNU331I 138
copying indexes 232
DSNU364I 63
recommendations 234
DSNU374I 181
COPYP 110
DSNU397I 63, 235
COPYPAGESF 74
DSNU427I 235
COPYTOCOPY 12, 243
DSNUM 75
FROMCOPY 245
DSNUM, 75
K P
KEEPDICTIONARY 183 PAGESAVE 84
Parallel Index Build 111
Partition 123
L Partition parallelism with Parallel Index Build 123
LARGE 102 partitioning 102
LEAFDIST 70 impact on SQL 104
LEAFDISTLIMIT 70 impact on utilities 105
LEAFFAR 71, 75 reasons for 102
LEAFNEAR 71, 75 partitions
LIMIT option 203 altering 103
LISTDEF 18 partitions and parallelism 105
306 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
PARTKEYU 104 TIMEOUT 180
PCTFREE 80 UNLOAD EXTERNAL 187
PERCDROP 81 REORG and Inline COPY 189
PIB 111 REORG and Inline RUNSTATS 189
pieces 11 REORG of index 188
PIECESIZE 106 REORG of the DB2 Catalog 188
PQ19897 187 REORG UNLOAD EXTERNA 193
PQ42864 82 REORGP 103
PQ45184 221 REPORT RECOVERY 222
PQ45268 129 REPORTONLY 74, 162
PQ45835 30, 242 restartability 32
PQ46759 129 restricted status 32
PQ46859 85 RESUME YES 148, 149
PQ48447 85 RETRY 177
PQ48448 85 RETRY_DELAY 177
PQ49114 188 REUSE 80, 128, 184
PQ50223 30, 207 RORG
PQ50666 269 SHRLEVEL NONE 163
PREFORMAT 127, 184 RUNSTATS 77, 253
PREVIEW 24, 35 RUNSTATS and LOBs 262
PRIQTY 80 RUNSTATS option
PSEUDO_DEL_ENTRIES 75 FORCEROLLUP 261
FREQVAL 258
HISTORY 256
Q KEYCARD 257
QUIESCE 225 LISTDEF 254
REPORT 262
R SAMPLE 259
Real Time Statistics 85 UPDATE 255
impact of utilities 280 RUNSTATS performance 271
tables 275 RUNSTATS statistics history 12
REBUILD INDEX 232 RVA 80
RECOVER
from CONCURRENT COPY 215 S
without an image copy 215 SECQTY 80
RECOVER PARALLEL 49, 211 shadow data sets 166
restrictions 212 SORTBLD 112, 170
RECOVERYDDN 110 SORTDATA 182
redbook contents 4 SORTKEYS 111, 170, 182
Redbooks Web site 302 SPACE 266
Contact us xix space growth 84
REORG 161 SPACEF 75, 84
BUILD2 175 STACK 29
DEADLINE 181 stand-alone utilities 7
DISCARD 185 star join 145
DRAIN 176 STATISTICS option 110
FASTSWITCH 172 STATSINT 86
KEEPDICTIONARY 183 STYPE 246
LISTS 181 SYSCOLDIST_HIST 265
LOG phase 171 SYSCOLUMNS_HIST 265
LONGLOG 180 SYSCOPY 74
MAXRO 179 SYSDBASE 76
NOSYSREC 182 SYSHIST 76
PREFORMAT 184 SYSIBM.SYSINDEXSPACESTATS 86
REUSE 184 SYSIBM.SYSTABLESPACESTATS 86
SHRLEVEL CHANGE 164 SYSINDEXES 75
SHRLEVEL REFERENCE 164 SYSINDEXES_HIST 265
SORTDATA 182 SYSINDEXPART 75, 80
SORTKEYS 170, 182 SYSINDEXPART_HIST 265
SWITCH 171 SYSLOBSTATS 81
Index 307
SYSLOBSTATS_HIST 265
SYSSTATS 76
SYSTABLEPART 80
SYSTABLEPART_HIST 265
SYSTABLES 75
SYSTABLES_HIST 265
SYSTABSTATS_HIST 265
T
TABLESPACESTATS 277
Templates 25
benefits 25
how to use 26
naming standards 27
recommendations 29
restrictions 30
Templates and partitions 34
Templates and Wildcards 30
U
UDT 141
UNICODE 367 293
UNICODE support 293
UNLOAD 12, 191
converting data 204
FROM TABLE 196
FROMCOPY 196
output data sets 192
phases 192
restarting 207
SAMPLE option 202
WHEN option 202
UNLOAD and image copy 194
UNLOAD and SHRLEVEL 202
UNLOAD EXTERNAL 187
unloading compressed table space 197
unloading from a table space 198
unloading in parallel by partition 200
UQ54731 188
UQ55541 129
UQ55542 129
UQ56653 269
utilities
changes across versions 9
enhancements with V7 11
packaging 12
UTRO 164
UTRW 164
UTUT 164
W
WHEN option 197
Wildcarding
benefits 18
how to use 18
Wildcards 18
restrictions 25
Wildcards and Templates 30
308 DB2 for z/OS and OS/390 Version 7: Using the Utilities Suite
DB2 for z/OS and OS/390 Version 7 Using the Utilities Suite
(0.5” spine)
0.475”<->0.873”
250 <-> 459 pages
Back cover ®