When all business requirements are met in a development, customer complaints always come down to performance. “Can’t this run any faster?” we always hear. Performance, in a technical point of view, is a broader concept that not just includes execution time, but also optimal use of system resources like memory and network load. But powerful hardware of our age and requirement of end user satisfaction leaves us with a single option to have execution time (or speed) as the top priority performance aspect. So I will experiment on some known techniques for database access and internal table processing, and share the results here.
Disclaimer
This article shares result of an experiment and does not include any official SAP recomendation.
I will use the table DD03L (data dictionary table for table fields) for our experiment. It is available on every SAP system and it has over 6.000.000 (six million) records even in a freshly installed ECC system.
Execution time comparison of SELECT SINGLE in LOOP versus SELECT … FOR ALL ENTRIES and READ TABLE in LOOP
Answer might be obvious for some of you but let’s see the real impact on execution time. Here is our scenario:
Data set quantity is 100.000 (hundred thousand) records. I think this is a fair and realistic amount.
We can access search table with primary key.
We know we won’t come across same search key second time. This might be possible according to business requirement.
SELECT SINGLE in LOOP
Below program simply selects some data set from table DD03L, loops through it and searches database table with
SELECTSINGLE . I will run this program three times, for data sets from the beginning, from the middle and near the end of database table.
SELECT SINGLE in LOOP
ABAP
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
CLASSlcl_controllerDEFINITION.
PUBLICSECTION.
DATAt_driverTYPETABLEOFdd03l.
METHODS:retrieve_test_data
IMPORTING
ip_firstTYPEi
ip_lastTYPEi,
search.
ENDCLASS.
CLASSlcl_controllerIMPLEMENTATION.
METHODretrieve_test_data.
DATAls_dd03lTYPEdd03l.
SELECT*FROMdd03lINTOls_dd03l.
IFsy-dbcntBETWEENip_firstANDip_last.
APPENDls_dd03lTOme->t_driver.
ENDIF.
IFsy-dbcnt=ip_last.
EXIT.
ENDIF.
ENDSELECT.
ENDMETHOD.
METHODsearch.
DATA:ls_driverLIKELINEOFme->t_driver,
ls_dd03lTYPEdd03l.
LOOP ATme->t_driverINTOls_driver.
SELECTSINGLE*FROMdd03lINTOls_dd03l
WHEREtabname=ls_driver-tabname
ANDfieldname=ls_driver-fieldname
ANDas4local=ls_driver-as4local
ANDas4vers=ls_driver-as4vers
ANDposition=ls_driver-position.
ENDLOOP.
ENDMETHOD.
ENDCLASS.
DATAgo_experimentTYPEREF TOlcl_controller.
START-OF-SELECTION.
CREATE OBJECTgo_experiment.
*-Get 100.000 records as a test set
go_experiment->retrieve_test_data(
ip_first=1
ip_last=100000).
*-Loop through driver table and search in database
go_experiment->search().
Let’s run this program with transaction SAT and observe the gross runtime of our search method.
Record interval: 1-100.000 / Execution time for SEARCH method: 27.9 secondsRecord interval: 3.000.001-3.100.000 / Execution time for SEARCH method: 28.3 secondsRecord interval: 6.000.001-6.100.000 / Execution time for SEARCH method: 28.6 seconds
We can see that placement of data in database table does not impact much. This might be because of the search algorithm of database system.
SELECT … FOR ALL ENTRIES and READ TABLE in LOOP
Below program is pretty much like the previous one but this time we select the result set with
FORALLENTRIES and search on the internal table with
READ TABLE in
LOOP . Note that we are using a hashed table for our search table because according to official performance tips and tricks (see transaction SAT):
Hashed tables are optimized for single entry access
*-Loop through driver table and search in hashed table
go_experiment->search().
After running three times:
Record interval: 1-100.000 / Execution time for SEARCH method: 6.9 secondsRecord interval: 3.000.001-3.100.000 / Execution time for SEARCH method: 7.4 secondsRecord interval: 6.000.001-6.100.000 / Execution time for SEARCH method: 4.9 seconds
Conclusion
We can see that difference is significant. If we are accessing search table with primary key, we can safely prefer
FORALLENTRIES and
READ TABLE over
SELECTSINGLE in
LOOP although the latter is more convenient to implement. But we must beware two common pitfalls:
Using FOR ALL ENTRIES with empty driver table, which will result in fetching all records from database table.
Using sequential search on large internal tables, which will result in significantly slower search compared to binary search and hash algorithm.