Overview

Introduction

This chapter begins by describing two ways in which the idea of fielded records or fielded buffers can be handled: through structured records and through FML records. It goes on to tell you about the features of the Field Manipulation Language, and to describe the circumstances under which you might want to make use of them.

A comparison of FML records with traditional structured records clearly shows the advantages of using fielded buffers throughout an application.

Dividing Records Into Fields

Except under unusual conditions where a data record is a complete and indivisible entity, you need to be able to break records into fields to be able to use or change the information the record contains. In the BEA TUXEDO System there are two ways to divide records into fields:

through C language data structures or COBOL records
through fielded buffers

Structures

One common way of subdividing records is with a structure that divides a contiguous area of storage into fields. The fields are given names for identification; the kind of data carried in the field is shown by the data type declaration.

For example, if a data item in a C language program is to contain information about an employee's identification number, name, address, and sex, it could be done with a structure like the following:

struct S {
        long empid;
        char name[20];
        char addr[40];
        char sex;
};

where the data type of the field named empid is declared to be a long integer, name and addr are declared to be character arrays of 20 and 40 characters respectively, and sex is declared to be a single character, presumably with a range of m or f.

If, in your C program, the variable p points to a structure of type struct S, the references p->empid, p->name, p->addr and p->sex can be used to address the fields.

The COBOL COPY file for the same data structure would be as follows (the application would supply the 01 line).

           05 EMPID                           PIC S9(9) USAGE IS COMP-5.
           05 NAME                            PIC X(20).
           05 ADDR                            PIC X(40).
           05 SEX                             PIC X(01).
           05 FILLER                          PIC X(03).

If, in your COBOL program, the 01 line is named MYREC, the references EMPID IN MYREC, NAME IN MYREC, ADDR IN MYREC and SEX IN MYREC can be used to access the fields.

Possible Disadvantages of Structures

While this way of representing data is widely used and often appropriate, it has two major potential disadvantages:

any time the data structure is changed, all programs using the structure have to be recompiled
the size of the structure and the offsets of the component fields are all fixed; this most often results in wasted space, since not all fields will always contain a value and since fields tend to be sized to hold the largest likely entry

Fielded Buffers

Fielded buffers provide another way of subdividing a record into fields.

A fielded buffer is a data structure that provides associative access to the fields of a record; that is, the name of a field is associated with an identifier that includes the storage location as well as the data type of the field.

The main advantage of the fielded buffer is data independence. Fields can be added to the buffer, deleted from it, or changed in length without forcing programs that reference the fields to be recompiled. To achieve this data independence fields are referenced by an identifier rather than the fixed offset prescribed by record structures, and all access to fields is through function calls.

Fielded buffers can be used throughout the BEA TUXEDO System as the standard method of representing data sent between cooperating processes.

Implementing Fielded Buffers with FML

Fielded buffers are created, updated, accessed, input and output via the Field Manipulation Language (FML). FML has two main objectives:

to provide a convenient and standard discipline for creating and manipulating fielded buffers
to provide data independence to programs making use of fielded buffers

FML is implemented as a library of functions and macros that are callable from C programs. There are three major groups of FML functions:

a set of functions for creating, updating, accessing and manipulating fielded buffers;
a set of functions for converting data from one type to another upon input to (or output from) a fielded buffer structure;
a set of functions for transferring data between fielded buffers and C structures or COBOL records.

The last set of functions listed above constitutes the FML VIEWS software. VIEWS is a set of functions that exchange data between FML fielded buffers and structures in C or COBOL language applications programs. When a program receives a fielded buffer from another process, the program has the choice of:

operating on the buffer data directly in the buffer using FML function calls (this is not available in COBOL), or
transferring the data from the fielded buffer to a structure using VIEWS functions, and then operating on the data in the structure using normal C or COBOL statements

If you need to perform lengthy manipulations on buffer data, the performance of your program can be improved by transferring fielded buffer data to structures or records, and operating on the data using normal C or COBOL statements. Then, you can put the data back into a fielded buffer (again using VIEWS functions), and send the buffer off to another process.

To use VIEWS, your program must know the format of incoming fielded buffer data. This is done through a set of view descriptions kept in a cache on your system.

A view description is created and stored in a source viewfile. The view description maps fields in fielded buffers to members in C structures or COBOL records. The source view descriptions are compiled, and can then be used to map data transferred between fielded buffers and C structures or COBOL records in a program.

By keeping view descriptions cached in a central file, you can increase the data independence of your programs; you only need to change the view description(s) and recompile them to effect changes in data format throughout an application that uses VIEWS.

FML Features

This section describes the features of FML and VIEWS, and gives you some preliminary information about how you can use them in applications programs.

Fielded Buffer Structure

A fielded buffer, as mentioned earlier, is a data structure that provides associative access to the fields of a record.

Each field in an FML fielded buffer is labeled with an integer that combines information about the data type of the accompanying field with a unique identifying number. The label is called the field identifier, or fldid. For variable-length items, fldid is followed by a length indicator. The buffer can be represented as a sequence of fldid/data pairs, with fldid/length/data triples for variable-length items. Figure 1 illustrates this.

Fig. 1: A fielded buffer

In the header file that is #include'd whenever FML functions are used (fml.h or fml32.h), field identifiers are typedef'ed as FLDID (or FLDID32 for FML32), field value lengths as FLDLEN (FLDLEN32 for FML32), and field occurrence numbers as FLDOCC (FLDOCC32 for FML32).

Supported Field Types

The supported field types are short, long, float, double, character, string, and carray (character array). These types are #define'd in fml.h (or fml32.h) as shown in Figure 2.

Fig. 2: FML field types as defined in `fml.h` and `fml32.h`

#define FLD_SHORT	0	/* short int */
#define FLD_LONG	1	/* long int */
#define FLD_CHAR	2	/* character */
#define FLD_FLOAT	3	/* single-precision float */
#define FLD_DOUBLE	4	/* double-precision float */
#define FLD_STRING	5	/* string - null terminated */
#define FLD_CARRAY	6	/* character array */

FLD_STRING and FLD_CARRAY are both arrays, but differ in the following way:

a FLD_STRING is a variable-length array of non-NULL characters terminated by a NULL
a FLD_CARRAY is a variable-length array of bytes, any of which may be NULL.

Functions that add or change a field have a FLDLEN argument that must be filled in when you are dealing with FLD_CARRAY fields. The size of a string or carray is limited to 65,535 characters in FML, and 2 billion bytes for FML32.

It is not a good idea to store unsigned data types in fielded buffers. You should either convert all unsigned short data to long or cast the data into the proper unsigned data type whenever you retrieve data from fielded buffers (using the FML conversion functions).

Most FML functions do not perform type checking; they expect that the value you update or retrieve from a fielded buffer matches its native type. For example, if a buffer field is defined to be a FLD_LONG, you should always pass the address of a long value. The FML conversion functions convert data from a user specified type to the native field type (and from the field type to a user specified type) in addition to placing the data in (or retrieving the data from) the fielded buffer.

Type int in VIEWS

In addition to the data types supported by most FML functions, VIEWS indirectly supports type int in source view descriptions. When the view description is compiled (see "VIEWS Features," below), the view compiler automatically converts any int types to either short or long types, depending on your machine.

Type dec_t in VIEWS

VIEWS also supports the dec_t packed decimal type in source view descriptions. This data type is useful for transferring VIEW structures to COBOL programs. In a C program using the dec_t type, the field must be initialized and accessed using the functions described in the decimal(3c) manual page. Within the COBOL program, the field can be accessed directly using a packed decimal (COMP-3) definition. Since FML does not support a dec_t field, this field is automatically converted to the data type of the corresponding FML field in the fielded buffer (for example, a string field) when converting from a VIEW to FML.

Field Name to Identifier Mappings

In the BEA TUXEDO System, fields are usually referred to by their field identifier (fldid), an integer (see "Field Identifiers," in Chapter 4, for a detailed description of field identifiers). This allows you to reference fields in a program without using the field name, which may change.

There are two ways in which identifiers are assigned (mapped) to field names:

through field table files (which are ordinary UNIX files)
through C language header (#include) files

A typical application might use one, or both of the above methods to map field identifiers to field names.

In order for FML to access the data in fielded records, there must be some way for FML to access the field name/identifier mappings. FML gets this information in one of two ways:

at run-time, through UNIX field table files, and FML mapping functions
at compile-time, through C header files

Field name/identifier mapping is not available in COBOL.

Run-Time: Field Table Files

Field name/identifier mappings can be made available to FML programs at run-time through field table files. It is the responsibility of the programmer to set two environment variables that tell FML where the field name/identifier mapping table files are located.

The environment variable FLDTBLDIR contains a list of directories where field tables can be found. The environment variable FIELDTBLS contains a list of the files in the table directories that are to be used. For FML32, the environment variable names are FLDTBLDIR32 and FIELDTBLS32.

Within applications programs, the FML function Fldid provides for a run-time translation of a field name to its field identifier. Fname translates a field identifier to its field name (see Fldid(3fml) and Fname(3fml)). (The function names for FML32 are Fldid32 and Fname32.) The first invocation of either function causes space in memory to be dynamically allocated for the field tables and the tables to be loaded into the address space of the process. The space can be recovered when the tables are no longer needed (see "Loading the Field Tables," in Chapter 4).

This method should be used when field name/identifier mappings are likely to change throughout the life of the application. This topic is covered in more detail in Chapter 4.

Compile-Time: Header Files

mkfldhdr(1) (or mkfldhdr32) is provided to make header files out of field table files. These header files are #include'd in C programs, and provide another way to map field names to field identifiers: at compile-time. mkfldhdr32(1)

Using field header files, the C preprocessor converts all field name references to field identifiers at compile-time; thus, you do not need to use the Fldid or Fname functions as you would with the field table files described in the previous section.

If you always know what field names your program needs, #include-ing your field table header file(s) saves some data space and means your program can get to the task at hand more quickly.

However, since this method resolves mappings at compile-time, it should not be used if the field name/identifier mappings in the application are likely to change. This topic is covered in more detail in Chapter 4.

Fielded Buffer Indexes

When a fielded buffer has many fields, access is expedited in FML by the use of an internal index. The user is normally unaware of the existence of this index.

Fielded buffer indexes do, however, take up space in memory and on disk. When you store a fielded buffer on disk, or transmit a fielded buffer between processes or between computers, you can save disk space and/or transmittal time by first discarding the index.

A function, Funindex, is provided to do that. When the fielded buffer is read from disk (or received from a sending process), the index can be explicitly reconstructed with the function Findex.

Note that these space savings do not apply to memory. The function Funindex does not recover in-core memory used by a the index of a fielded buffer.

Multiply Occurring Fields

Any field in a fielded buffer can occur more than once. Many FML functions take an argument that specifies which occurrence of a field is to be retrieved or modified. If a field occurs more than once, the first occurrence is numbered 0, and additional occurrences are numbered sequentially. The set of all occurrences make up a logical sequence, but no overhead is associated with the occurrence number (that is, it is not stored in the fielded buffer).

If another occurrence of a field is added, it is added at the end of the set and is referred to as the next highest occurrence. When an occurrence other than the highest is deleted, all higher occurrences of the field are shifted down by one (for example, occurrence 6 becomes occurrence 5, 5 becomes 4, etc.).

Boolean Expressions and Fielded Buffers

Often, applications programs receive a fielded buffer from another source (from a user's terminal, from a database record, etc.) and the values of one or more fields will determine the next action taken by the application program. FML provides several functions that create boolean expressions on fielded buffers or VIEWs and determine if a given buffer or VIEW meets the criteria specified by the expression.

Once you create a boolean expression, it is compiled into an evaluation tree. The evaluation tree is then used to determine if a fielded buffer or VIEW matches the specified boolean conditions.

For instance, a program may read a data record into a fielded buffer (Buffer A), and apply a boolean expression to the buffer. If Buffer A meets the conditions specified by the boolean expression, then an FML function is used to update another buffer, Buffer B, with data from Buffer A.

VIEWS Features

VIEWS is particularly useful when a program does a lot of processing on the data in a fielded buffer, either after the program has received the buffer or before the program sends the buffer to another program.

Under such conditions, you may gain in processing efficiency by using the VIEWS functions to transfer fielded buffer data from the buffer to a C structure before you manipulate it. This is because the FML functions for manipulating fields in a buffer require more processing time than C functions. Then, when you finish processing the data in the C structure, you can transfer it back to the fielded buffer and send it on to another program.

The VIEWS software has the following features:

you can create source view descriptions that specify C structure-to-fielded buffer mappings or COBOL record-to-fielded buffer mappings, and make possible the transfer of data between structures and buffers
a view compiler, viewc(1) (or viewc32), is used to generate object view descriptions (stored in binary files) that are interpreted by your applications programs at run time; the compiler also generates header files that can be used in C programs to define the structures used in view descriptions, and optionally generates COPY files that can be used in COBOL programs to define the records used in the view descriptions.
a view disassembler is provided to translate object view descriptions into readable form (that is, back into source view descriptions); the output of the disassembler can be re-input to the view compiler
data transfers from C structures or COBOL records to fielded buffers can be done in any one of four modes: FUPDATE, FJOIN, FOJOIN, and FCONCAT; these modes are similar to the ones supported by the following FML functions: Fupdate, Fjoin, Fojoin, and Fconcat
at runtime, object view descriptions are read into a viewfile cache on demand, and remain there until the cache is full; when the cache is full and an object view description that is not in the cache is needed, the least recently accessed object view description is removed from the cache to make room for the new one
all FML supported types can be used in view descriptions, and in addition, integer and packed decimal are supported
when transferring data between fielded buffers and structures, the source data is automatically converted to the type of the destination data; for instance, if a string field is mapped to an integer member, the string is converted to an integer using Ftypcvt automatically
multiple field occurrences are supported
user-specified and default null values in view descriptions are supported
As described earlier for fielded buffers, functions exist to compile and evaluate boolean expressions against application data in a VIEW.

A source viewfile is an ordinary UNIX text file that contains one or more source view descriptions. Source viewfiles are used as input to the view compiler, viewc(1) (or viewc32), which compiles the source view descriptions and stores them in object viewfiles.

The view compiler also creates C header files for object viewfiles. These header files can be included in applications programs to define the structures used in object view descriptions.

The view compiler optionally creates COBOL COPY files for object viewfiles. These COPY files can be included in COPY programs to define the records formats used in object view descriptions.

Null values are used to indicate empty members in a structure, and can be specified by the user for each structure member in a viewfile. If the user does not specify a null value for a member, default null values are used.

Note that a structure member containing the null value for that member is not transferred during a structure-to-fielded buffer transfer.

It is also possible to inhibit the transfer of data between a C or COBOL structure member and a field in a fielded buffer, even though a mapping exists between them. This is specified in the source viewfile.

The FML VIEWS functions are Fvstof, Fvftos, Fvnull, Fvopt, Fvselinit, and Fvsinit. For COBOL, the VIEWS procedures provided are FVSTOF and FVFTOS. Upon calling any view function, the named object viewfile, if found, is loaded into the viewfile cache automatically. Each file specified in the environment variable VIEWFILES is searched in order (see Chapter 3). The first object viewfile with the specified name will be loaded. Subsequent object viewfiles with the same name, if any, are ignored.

Note that arrays of structures, pointers, unions, and typedefs are not supported in VIEWS.

Multiply Occurring Fields in VIEWS

Since VIEWS is concerned with moving fields between fielded buffers and C structures or COBOL records, it has to deal with the possibility of multiply occurring fields in the buffer.

To store multiple occurrences of a field in a structure, a member is declared as an array in C or with the OCCURS clause in COBOL; each occurrence of a field occupies one element of the array. The size of the array reflects the maximum number of field occurrences in the buffer.

When transferring data from fielded buffers to C structures or COBOL records, if the receiving array has more elements than there are occurrences in the fielded buffer, the extra elements are assigned the (default or user-specified) null value. If there are more occurrences in the buffer than there are elements in the array, the extra occurrences in the buffer are ignored.

When data is transferred from C structures or COBOL records to fielded buffers, array members with the value equal to the (default or user-specified) null values are ignored.

Error Handling

When an FML function detects an error, one of the following values is returned:

NULL is returned for functions that return a pointer
BADFLDID is returned for functions that return a FLDID
-1 is returned for all others

All FML function call returns should be checked against the appropriate value above to detect errors.

In all error cases, the external integer Ferror is set to the error number as defined in fml.h. Ferror32 is set to the error number for FML32 as defined in fml32.h.

The F_error (or F_error32) function is provided to produce a message on the standard error output. It takes one parameter, a string; prints the argument string appended with a colon and a blank; and then prints an error message followed by a newline character. The error message displayed is the one defined for the error number currently in Ferror, which is set when errors occur.

To be of most use, the argument string to the F_error (or F_error32) function should include the name of the program that incurred the error.

Fstrerror(3fml) can be used to retrieve from a message catalog the text of an error message; it returns a pointer that can be used to as an argument to userlog(3c) or to F_error(3fml) or F_error32(3).

The error codes that can be produced by an FML function are described on each FML manual page in the BEA TUXEDO Reference Manual: Section 3fml.