The functions listed in Table 2-1are used to control user processes:
Table 2-1 Process Functions
Function Name |
Purpose |
---|---|
Create a new process |
|
Execute a program |
|
Terminate a process |
|
Wait for a child process to stop or terminate |
|
Translate address to symbolic information |
|
Close a shared object |
|
Get diagnostic information |
|
Open a shared object |
|
Get the address of a symbol in a shared object |
|
Set user and group IDs |
|
Set process group ID |
|
Change working directory |
|
Change root directory |
|
Change priority of a process |
|
Get and set current user context |
|
Get or set supplementary group access list IDs |
|
Get process, process group, and parent process IDs |
|
Get real user, effective user, real group, and effective group IDs |
|
Suspend process until signal |
|
Control process scheduler |
|
Set process group ID |
|
Set session ID |
|
Wait for a child process to change state |
fork(2) creates a new process that is an exact copy of the calling process. The new process is the child process; the old process is the parent process. The child gets a new, unique process ID. fork(2) returns a 0 to the child process and the child's process ID to the parent. The returned value is how a forked program determines whether it is the parent process or the child process.
The new process created by fork(2) or exec(2) function inherits all open file descriptors from the parent including the three standard files: stdin, stdout, and stderr. When the parent has buffered output that should appear before output from the child, the buffers must be flushed before the fork(2).
The following code is an example of a call to fork(2) and the subsequent actions:
pid_t pid; pid = fork(); switch (pid) { case -1: /* fork failed */ perror ("fork"); exit (1); case 0: /* in new child process */ printf ("In child, my pid is: %d\n", getpid(); ); do_child_stuff(); exit (0); default: /* in parent, pid contains PID of child */ printf ("In parent, my pid is %d, my child is %d\n", getpid(), pid); break; } /* Parent process code */ ...
If the parent and the child process both read input from a stream, whatever is read by one process is lost to the other. So, once something has been delivered from the input buffer to a process, the buffer pointer has moved on.
An obsolete practice is to use fork(2) and exec(2) to start another executable, then wait for the new process to die. In effect, a second process is created to perform a subroutine call. It is much more efficient to use dlopen(3DL), dlsym(3DL), and dlclose(3DL) as described in "Runtime Linking" to make a subroutine temporarily resident in memory.
exec(2) is the name of a family of functions that includes execl(2), execv(2), execle(2), execve(2), execlp(2), and execvp(2). All load a new process over the calling process, but with different ways of pulling together and presenting the arguments of the function. For example, execl(2) could be used like this
execl("/usr/bin/prog2", "prog2", progarg1, progarg2, (char (*)0));
The execl argument list is:
/usr/bin/prog2 |
The path name of the new program file |
prog2 |
The name the new process gets in its argv[0] |
progarg1, progarg2 |
The arguments to prog2 as char (*)s |
(char (*)0) |
A null char pointer to mark the end of the arguments |
There is no return from a successful execution of any variation of exec(2); the new process overlays the process that calls exec(2). The new process also takes over the process ID and other attributes of the old process. If a call to exec(2) fails, control is returned to the calling program with a return value of -1. You can check errno to learn why it failed.
An application can extend its address space during execution by binding to additional shared objects. There are several advantages in this delayed binding of shared objects:
Processing a shared object when it is required, rather than during the initialization of an application, may greatly reduce start-up time. Also, the shared object may not be required during a particular run of the application, for example, objects containing help or debugging information.
The application may choose between a number of different shared objects depending on the exact services required; for example, networking protocols.
Any shared objects added to the process address space during execution may be freed after use.
The following is a typical scenario that an application may perform to access an additional shared object:
A shared object is located and added to the address space of a running application using dlopen(3DL). Any dependencies of shared object are also located and added at this time. For example:
#include <stdio.h> #include <dlfcn.h> main(int argc, char ** argv) { void * handle; ..... if ((handle = dlopen("foo.so.1", RTLD_LAZY)) == NULL) { (void) printf("dlopen: %s\n", dlerror()); exit (1); } .....
The added shared objects are relocated and any initialization sections in the new shared objects are called.
The application locates symbols in the added shared objects using dlsym(3DL). The application can then reference the data or call the functions defined by these new symbols. Continuing the preceding example:
if (((fptr = (int (*)())dlsym(handle, "foo")) == NULL) || ((dptr = (int *)dlsym(handle, "bar")) == NULL)) { (void) printf("dlsym: %s\n", dlerror()); exit (1); }
After the application has finished with the shared objects, the address space is freed using dlclose(3DL). Any termination sections within the shared objects being freed are called at this time. For example:
if (dlcose (handle) != 0) { (void) printf("dlclose: %s\n", dlerror()); exit (1); }
Any error conditions that occur as a result of using these runtime linker interface routines can be displayed using dlerror(3DL).
The services of the runtime linker are defined in the header file <dlfcn.h> and are made available to an application by the shared library libdl.so.1. For example:
$ cc -o prog main.c -ldl |
Here the file main.c can refer to any of the dlopen(3DL) family of routines, and the application prog will be bound to these routines at runtime.
For a thorough discussion of application directed runtime linking, see the Linker and Libraries Guide. See dladdr(3DL), dlclose(3DL), dlerror(3DL), dlopen(3DL), and dlsym(3DL) for use details.
The UNIX system scheduler determines when processes run. It maintains process priorities based on configuration parameters, process behavior, and user requests. It uses these priorities to assign processes to the CPU.
Scheduler functions give users varying degrees of control over the order in which certain processes run and the amount of time each process can use the CPU before another process gets a chance.
By default, the scheduler uses a time-sharing policy. A time-sharing policy adjusts process priorities dynamically in an attempt to give good response time to interactive processes and good throughput to CPU-intensive processes.
The scheduler also provides an alternate real-time scheduling policy. Real-time scheduling allows users to set fixed priorities--priorities that the system does not change. The highest priority real-time user process always gets the CPU as soon as it can be run, even if other system processes are also eligible to be run. A program can therefore specify the exact order in which processes run. You can also write a program so that its real-time processes have a guaranteed response time from the system.
For most versions of SunOS 5.0 through 5.8, the default scheduler configuration works well and no real-time processes are needed: administrators need not change configuration parameters and users need not change scheduler properties of their processes. However, for some programs with strict timing constraints, real-time processes are the only way to guarantee that the timing requirements are met.
For more information, see priocntl(1), priocntl(2), and dispadmin(1M). For a fuller discussion of this subject, see Chapter 3, Process Scheduler .