This chapter discusses the porting of programs from other dialects of Fortran to Sun compilers. VAX VMS Fortran programs compile almost exactly as is with Sun f77; this is discussed further in the chapter on VMS extensions in the FORTRAN 77 Language Reference Manual.
Porting issues bear mostly upon FORTRAN 77 programs. The Sun Fortran 90 compiler, f90, incorporates few nonstandard extensions, and these are described in the Fortran User's Guide.
Library functions that return the time of day or elapsed CPU time vary from system to system.
The following time functions are not supported directly in the Sun Fortran libraries, but you can write subroutines to duplicate their functions:
Time-of-day in 10h format
Date in A10 format
Milliseconds of job CPU time
Julian date in ASCII
The time functions supported in the Sun Fortran library are listed in the following table:
Table 7-1 Sun Fortran Time Functions
Name |
Function |
Man Page |
---|---|---|
time |
Returns the number of seconds elapsed since January, 1, 1970 |
time(3F) |
date | Returns date as a character string | date(3F) |
fdate |
Returns the current time and date as a character string |
fdate(3F) |
idate |
Returns the current month, day, and year in an integer array |
idate(3F) |
itime |
Returns the current hour, minute, and second in an integer array |
itime(3F) |
ctime |
Converts the time returned by the time function to a character string |
ctime(3F) |
ltime |
Converts the time returned by the time function to the local time |
ltime(3F) |
gmtime |
Converts the time returned by the time function to Greenwich time |
gmtime(3F) |
etime |
Single processor: Returns elapsed user and system time for program execution Multiple processors: Returns the wall clock time |
etime(3F) |
dtime |
Returns the elapsed user and system time since last call to dtime |
dtime(3F) |
date_and_time |
Returns date and time in character and numeric form |
date_and_time(3F) |
For details, see Fortran Library Reference Manual or the individual man pages for these functions. The routines listed in the following table provide compatibility with VMS Fortran system routines idate and time. To use these routines, you must include the -lV77 option on the f77 command line, in which case you also get these VMS versions instead of the standard f77 versions.
Table 7-2 Summary: Nonstandard VMS Fortran System Routines
Name |
Definition |
Calling Sequence |
Argument Type |
---|---|---|---|
idate |
Date as day, month, year |
call idate( d, m, y ) |
integer |
time |
Current time as hhmmss |
call time( t ) |
character*8 |
The date(3F) routine and the VMS version of idate(3F) cannot be Year 2000 safe because they return 2-digit values for the year. Programs that compute time duration by subtracting dates returned by these routines will compute erroneous results after December 31, 1999. The Fortran 90 routine date_and_time(3F) is available for both FORTRAN 77 and Fortran 90 programs, and should be used instead. See the Fortran Library Reference Manual for details.
The error condition subroutine errsns is not provided, because it is totally specific to the VMS operating system.
Here is a simple example of the use of these time functions (TestTim.f):
subroutine startclock common / myclock / mytime integer mytime, time mytime = time() return end function wallclock integer wallclock common / myclock / mytime integer mytime, time, newtime newtime = time() wallclock = newtime - mytime mytime = newtime return end integer wallclock, elapsed character*24 greeting real dtime, timediff, timearray(2) c print a heading call fdate( greeting ) print*, " Hello, Time Now Is: ", greeting print*, "See how long 'sleep 4' takes, in seconds" call startclock call system( 'sleep 4' ) elapsed = wallclock() print*, "Elapsed time for sleep 4 was: ", elapsed," seconds" c now test the cpu time for some trivial computing timediff = dtime( timearray ) q = 0.01 do 30 i = 1, 1000 q = atan( q ) 30 continue timediff = dtime( timearray ) print*, "atan(q) 1000 times took: ", timediff ," seconds" end
Running this program produces the following results:
demo% TimeTest Hello, Time Now Is: Mon Feb 12 11:53:54 1996 See how long 'sleep 4' takes, in seconds Elapsed time for sleep 4 was: 5 seconds atan(q) 1000 times took: 2.26550E-03 seconds demo%
Some f77 and f90 format edit descriptors can behave differently on other systems. Here are some format specifiers that f77 treats differently than some other implementations:
A - Alphanumeric conversion. Used with character type data elements. In FORTRAN 77, this specifier worked with any variable type. f77 supports the older usage, up to four characters to a word.
$ - Suppresses newline character output.
R - Sets an arbitrary radix for the I formats that follow in the descriptor.
SU - Selects unsigned output for following I formats. For example, you can convert output to either hexadecimal or octal with the following formats, instead of using the Z or O edit descriptors:
10 FORMAT( SU, 16R, I4 ) 20 FORMAT( SU, 8R, I4 )
Fortran carriage-control grew out of the capabilities of the equipment used when Fortran was originally developed. For similar historical reasons, an operating system derived from the UNIX operating system, does not have Fortran carriage control, but you can simulate it in two ways.
Use the asa filter to transform Fortran carriage-control conventions into the UNIX carriage-control format (see the asa (1) man page) before printing files with the lpr command.
f77: For simple jobs, use OPEN(N, FORM='PRINT') to enable single or double spacing, formfeed, and stripping off of column one. It is legal to reopen unit 6 to change the form parameter to PRINT. For example:
OPEN( 6, FORM='PRINT')
You can use lp(1) to print a file that is opened in this manner.
Early Fortran systems did not use named files, but did provide a command line mechanism to equate actual file names with internal unit numbers. This facility can be emulated in a number of ways, including standard UNIX redirection.
Example: Redirecting stdin to redir.data (using csh(1)):
demo% cat redir.data The data file 9 9.9 demo% cat redir.f The source file read(*,*) i, z The program reads standard input print *, i, z stop end demo% f77 -silent -o redir redir.f The compilation step demo% redir < redir.data Run with redirection reads data file 9 9.90000 demo%
If the application code was originally developed for 64-bit (or 60-bit) mainframes such as CRAY or CDC, you might want to compile these codes with the f77 -xtypemap option:"-xtypemap=real:64,double:128,integer:64 "
to preserve the expected precision of the original. This option automatically promotes all default REAL variables to REAL*8, default DOUBLE to REAL*16, and COMPLEX to COMPLEX*16. Only undeclared variables or variables declared as simply REAL or DOUBLE or DOUBLE PRECISION or COMPLEX are promoted; variables declared explicitly (for example, REAL*4) are not promoted.
On x86, or for better performance on SPARC, use:"-xtypemap=real:64,double:64,integer:64 "
which does not promote default DOUBLE PRECISION.
The -xtypemap option, is preferred over -dbl and -r8 and -i2. See the Fortran User's Guide and the f77(1) man pages for details.
To further recreate the original mainframe environment, it is probably preferable to stop on overflows, division by zero, and invalid operations. Compile the main program with -ftrap=common to ensure this.
The FORTRAN 77 Language Reference Manual, Fortran User's Guide, and the Sun Numerical Computation Guide discuss in detail the hardware representation of data objects in Fortran. Differences between data representations across systems and hardware platforms usually generate the most significant portability problems.
The following issues should be noted:
Sun adheres to the IEEE Standard 754 for floating-point arithmetic. Therefore, the first four bytes in a REAL*8 are not the same as in a REAL*4.
The default sizes for reals, integers, and logicals are described in the FORTRAN 77 standard, except when these default sizes are changed by the -xtypemap= option (or by -i2, -dbl, or -r8).
Character variables can be freely mixed and equivalenced to variables of other types, but be careful of potential alignment problems.
f77 IEEE floating-point arithmetic does raise exceptions on overflow or divide by zero but does not signal SIGFPE or trap by default. It does deliver IEEE indeterminate forms in cases where exceptions would otherwise be signaled. This is explained in the Floating Point Arithmetic chapter of this Guide.
The extreme finite, normalized values can be determined. See libm_single(3F) and libm_double(3F). The indeterminate forms can be written and read, using formatted and list-directed I/O statements.
Many "dusty-deck" Fortran applications store Hollerith ASCII data into numerical data objects. With the 1977 Fortran standard (and Fortran 90), the CHARACTER data type was provided for this purpose and its use is recommended. You can still initialize variables with the older Fortran Hollerith (nH) feature, but this is not standard practice. The following table indicates the maximum number of characters that will fit into certain data types. (In this table, boldfaced data types indicate default types subject to promotion by the f77 command-line flags -dbl, -r8, or -xtypemap= ).
Table 7-3 f77: Maximum Characters in Data Types
|
Maximum Number of Standard ASCII Characters |
||||
---|---|---|---|---|---|
Data Type |
No -i2, -i4, -r8, -dbl |
-i2 |
-i4 |
-r8 |
-dbl |
BYTE |
1 |
1 |
1 |
1 |
1 |
COMPLEX |
8 |
8 |
8 |
16 |
16 |
COMPLEX*16 |
16 |
16 |
16 |
16 |
16 |
COMPLEX*32 |
32 |
32 |
32 |
32 |
32 |
DOUBLE COMPLEX |
16 |
16 |
16 |
32 |
32 |
DOUBLE PRECISION |
8 |
8 |
8 |
16 |
16 |
INTEGER |
4 |
2 |
4 |
4 |
8 |
INTEGER*2 |
2 |
2 |
2 |
2 |
2 |
INTEGER*4 |
4 |
4 |
4 |
4 |
4 |
INTEGER*8 |
8 |
8 |
8 |
8 |
8 |
LOGICAL |
4 |
2 |
4 |
4 |
8 |
LOGICAL*1 |
1 |
1 |
1 |
1 |
1 |
LOGICAL*2 |
2 |
2 |
2 |
2 |
2 |
LOGICAL*4 |
4 |
4 |
4 |
4 |
4 |
LOGICAL*8 |
8 |
8 |
8 |
8 |
8 |
REAL |
4 |
4 |
4 |
8 |
8 |
REAL*4 |
4 |
4 |
4 |
4 |
4 |
REAL*8 |
8 |
8 |
8 |
8 |
8 |
REAL*16 |
16 |
16 |
16 |
16 |
16 |
When storing standard ASCII characters with normal Fortran:
With -r8, unspecified size INTEGER and LOGICAL do not hold double.
With -dbl, unspecified size INTEGER and LOGICAL do hold double.
The storage is allocated with both options, but it is unavailable in normal Fortran with -r8.
Options -i2, -r8, and -dbl are now considered obsolete; use -xtypemap instead.
Example: Initialize variables with Hollerith:
demo% cat FourA8.f double complex x(2) data x /16Habcdefghijklmnop, 16Hqrstuvwxyz012345/ write( 6, '(4A8, "!")' ) x end demo% f77 -silent -o FourA8 FourA8.f demo% FourA8 abcdefghijklmnopqrstuvwxyz012345! demo%
If you pass Hollerith constants as arguments, or if you use them in expressions or comparisons, they are interpreted as character-type expressions.
If needed, you can initialize a data item of a compatible type with a Hollerith and then pass it to other routines.
program respond integer yes, no integer ask data yes, no / 3hyes, 2hno / if ( ask() .eq. yes ) then print *, 'You may proceed!' else print *, 'Request Rejected!' endif end integer function ask() double precision solaris, response integer yes, no data yes, no / 3hyes, 2hno / data solaris/ 7hSOLARIS/ 10 format( "What system? ", $ ) 20 format( a8 ) write( 6, 10 ) read ( 5, 20 ) response ask = no if ( response .eq. solaris ) ask = yes return end
As a general rule, porting an application program from one system and compiler to another can be made easier by eliminating any nonstandard coding. Optimizations or work-arounds that were successful on one system might only obscure and confuse compilers on other systems. In particular, optimized hand-tuning for one particular architecture can cause degradations in performance elsewhere. This is discussed later in the chapters on performance and tuning. However, the following issues are worth considering with regards to porting in general.
Some systems automatically initialize local and COMMON variables to zero or some "not-a-number" (NaN) value. However, there is no standard practice, and programs should not make assumptions regarding the initial value of any variable. To assure maximum portability, a program should initialize all variables.
Aliasing occurs when the same storage address is referenced by more than one name. This happens when actual arguments to a subprogram overlap between themselves or between COMMON variables within the subprogram. For example, arguments X and Z refer to the same storage locations, as do B and H:
COMMON /INS/B(100) REAL S(100), T(100) ... CALL SUB(S,T,S,B,100) ... SUBROUTINE SUB(X,Y,Z,H,N) REAL X(N),Y(N),Z(N),H(N) COMMON /INS/B(100) ...
Avoid aliasing in this manner in all portable code. The results on some systems and with higher optimization levels could be unpredictable.
Legacy codes may contain source-code restructurings of ordinary computational DO loops intended to cause older vectorizing compilers to generate optimal code for a particular architecture. In most cases, these restructurings are no longer needed and may degrade the portability of a program. Two common restructurings are strip-mining and loop unrolling.
Fixed-length vector registers on some architectures led programmers to manually "strip-mine" the array computations in a loop into segments:
REAL TX(0:63) ... DO IOUTER = 1,NX,64 DO IINNER = 0,63 TX(IINNER) = AX(IOUTER+IINNER) * BX(IOUTER+IINNER)/2. QX(IOUTER+IINNER) = TX(IINNER)**2 END DO END DO
Strip-mining is no longer appropriate with modern compilers; the loop can be written much less obscurely as:
DO IX = 1,N TX = AX(I)*BX(I)/2. QX(I) = TX**2 END DO
Unrolling loops by hand was a typical source-code optimization technique before compilers were available that could perform this restructuring automatically. A loop written as:
DO K = 1, N-5, 6 DO J = 1, N DO I = 1,N A(I,J) = A(I,J) + B(I,K ) * C(K ,J) * + B(I,K+1) * C(K+1,J) * + B(I,K+2) * C(K+2,J) * + B(I,K+3) * C(K+3,J) * + B(I,K+4) * C(K+4,J) * + B(I,K+5) * C(K+5,J) END DO END DO END DO DO KK = K,N DO J =1,N DO I =1,N A(I,J) = A(I,J) + B(I,KK) * C(KK,J) END DO END DO END DO
should be rewritten the way it was originally intended:
DO K = 1,N DO J = 1,N DO I = 1,N A(I,J) = A(I,J) + B(I,K) * C(K,J) END DO END DO END DO
Here are a few suggestions for what to try when programs ported to Sun Fortran do not run as expected.
Try the following:
Pay attention to the size and the engineering units. Numbers very close to zero can appear to be different, but the difference is not significant, especially if this number is the difference between two large numbers, such as the distance across the continent in feet, as calculated on two different computers. For example, 1.9999999e-30 is very near -9.9992112e-33, even though they differ in sign.
VAX math is not as good as IEEE math, and even different IEEE processors may differ. This is especially true if the mathematics involves many trigonometric functions. These functions are much more complicated than one might think, and the standard defines only the basic arithmetic functions. There can be subtle differences, even between IEEE machines. Review the Floating-Point Arithmetic chapter in this Guide.
Try running with a call nonstandard_arithmetic(). Doing so can also improve performance considerably, and make your Sun workstation behave more like a VAX system. If you have access to a VAX or some other system, run it there also. It is quite common for many numerical applications to produce slightly different results on each floating-point implementation.
Check for NaN, +Inf, and other signs of probable errors. See the Floating-Point Arithmetic chapter in this Guide, or the man page ieee_handler(3m) for instructions on how to trap the various exceptions. On most machines, these exceptions simply abort the run.
Two numbers can differ by 6 x 1029 and still have the same floating-point form. Here is an example of different numbers, with the same representation:
real*4 x,y x=99999990e+29 y=99999996e+29 write (*,10), x, x 10 format('99,999,990 x 10^29 = ', e14.8, ' = ', z8) write(*,20) y, y 20 format('99,999,996 x 10^29 = ', e14.8, ' = ', z8) end
99,999,990 x 10^29 = 0.99999993E+37 = 7cf0bdc1 99,999,996 x 10^29 = 0.99999993E+37 = 7cf0bdc1
In this example, the difference is 6 x 1029. The reason for this indistinguishable, wide gap is that in IEEE single-precision arithmetic, you are guaranteed only six decimal digits for any one decimal-to-binary conversion. You may be able to convert seven or eight digits correctly, but it depends on the number.
If the program fails without warning and runs different lengths of time between failures, then:
Compile with minimal optimization (-O1). If the program then works, compile only selective routines with higher optimization levels.
Understand that optimizers must make assumptions about the program. Nonstandard coding or constructs can cause problems. Almost no optimizer handles all programs at all levels of optimization.