Processes

Intro:

Processes are probably the most basic and powerful abstraction provided by operating systems. In this document, we will follow the book's discussion of processes, sometimes briefly because much of this topic has already been covered in another systems document. Our discussion will be divided into 3 large sections:
- The first section will discuss the general idea of what a process is and how an OS uses it.
- The second section will dive deep into some of the most important UNIX system calls designed specifically for creating and managing processes.
- The third section studies the concept of limited direct execution and how the OS uses it to run processes efficiently while still controlling and supervising them.

Processes:

How do we get the illusion of having so many CPUs in a computer? This is done mainly through what's often called one of the greatest ideas (abstractions) in all of computer science, namely processes. A process is the live running version of a program.
The OS "wraps" programs in processes and manages these processes, which makes running multiple (many many) programs at the same time an easy task. You don't have to worry about the availability of the CPU or any of that nonsense.
The CPU is virtualized through the use of processes. The CPU runs a process for some time, stops running it and jumps to another process, then jumps back to the first one, then to yet another, continuously and tirelessly. This jumping around is called time sharing. Time sharing is one of those nerdy words OS designers like to throw around.
CPU virtualization can be achieved through the use of:
- Low-level mechanisms such as the so-called context switch, which stops one process and starts running another.
- High-level intelligence, usually referred to as policies, which guides CPU and other kinds of virtualization. For example, scheduling policies based on the past behavior of programs, their performance and other metrics help the OS decide scheduling priorities, which process to run next and how much time to give it.

A Process:

A process can be thought of in terms of the system components it affects or accesses during its lifetime, which are collectively called its machine state. The machine state of a process includes the following:
- Memory, which contains the program's instructions and the data the process reads/writes. This memory is part of the process and is called its address space.
- Registers, including the program counter (PC), which tells us which instruction will execute next, and the stack pointer and frame pointer, which manage the stack holding function parameters, local variables, and return addresses.

Process API:

This is not a real process API but a list of generic features a typical process API might or should have:
- Create: An OS needs to offer a way for the user or other programs to start a process. When you double-click a program you're instructing the OS to start a new process to run that program.
- Destroy: A way to kill a process. Some processes might terminate on their own upon the completion of their tasks.
- Wait: A way to wait for a process to stop running. This sounds odd at first 😕, but a parent often needs to know when a child it started has finished.
- Control: A process can be suspended and then resumed.
- Status: The API might need to tell us if the process is running, suspended, etc. General information about the process's current status.

Creating a Process:

Turning a program into a process, in other words getting the program running, involves:
- Loading the executable, which contains the program's code and static data (such as initialized variables), from disk into the process's address space.
- Once the program is loaded, the OS allocates a runtime stack for the process which will be used for return addresses, local variables, function parameters, etc.
- Memory is also allocated for the heap, which is used for dynamically-allocated data for data structures like linked lists and whatnot. Memory is allocated on the heap by calling malloc() and freed with free(). These are library calls; the OS gets involved when the heap itself needs to grow.
- Initialization tasks related mainly to I/O devices. Each Unix process, for example, has three open file descriptors by default: standard input, standard output, and standard error.
After all the steps listed above take place, the OS can jump to the program's entry point, main(), and transfer control of the CPU to the new process.

Process States:

A process can be in one of three states:
- Running.
- Ready: The process is ready and can be run if the OS chooses to run it.
- Blocked: The process cannot run currently because it is waiting for some other event, for example an I/O operation to finish or return something.
These states have to do directly with OS scheduling. A ready process gets scheduled, meaning it moves to running, and a running process gets de-scheduled, meaning it moves back to ready until the scheduler gives it another slice of time. A blocked process stays off the CPU until the event it is waiting for occurs, at which point it becomes ready again.
There can be other states we didn't mention earlier such as:
- Initial state for when the process has just started.
- Final or zombie state before the process is torn down. This, for example, allows a parent process to examine the return value of a process that has finished.

Data Structures:

The OS keeps several data structures to track information about processes, such as lists of running, ready, and blocked processes. This might be called a process list.
Another important piece of information the OS holds is the so-called register context of each de-scheduled process. It contains a snapshot of the process's registers, so that when the process is scheduled again its register state is restored. This is used for context switching, which we will see later.

Unix Process API:

This will be a basic overview of process creation in UNIX systems. It studies three very common Unix system calls: fork() and exec(), which are used for creating new processes, and wait(), which allows a process to wait for its child process(es) to finish executing.

`fork()`:

fork() is a weird system call and can be confusing. Check the following example. It shows a program calling fork(). It also uses the getpid() function, which returns the process identifier (PID) of the calling process. A PID lets you identify a specific process in order to, for example, stop it from running:

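#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>

int main(int argc, char const *argv[]) {
	/* fork() returns twice: the child's PID in the parent, 0 in the child. */
	pid_t rc = fork();
	if (rc < 0) {
		perror("fork");
		return 1;
	}
	/* Both processes run everything from here on. */
	printf("Hello, world! I am %d\n", (int) getpid());
	printf("%d\n===========\n", (int) rc);
	return 0;
}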

The output of the program is as follows:

Hello, world! I am 86057
86058
===========
Hello, world! I am 86058
0
===========

Notice that both print statements are executed twice, once by each process, and that the first one echoes two different PIDs. It looks like there are two similar, but not identical, copies of the same program. Every statement after the call to fork() is executed by the child process as well.
Notice, though, that although the same instructions are executed by both the parent and child processes (from the point where fork() returns), the values are different, as each of the two processes has its own address space, registers, and whatnot. In the parent, rc holds the PID of the newly created child process. In the child, rc is 0: fork() returns 0 to the child so that the code can tell which of the two processes it is running in.
Another interesting observation about this program is that the order of execution between child and parent processes is not deterministic. Print statements within the parent process might execute before those of the child or vice versa. This non-deterministic behavior is mainly due to the CPU scheduler and how it works. The following code illustrates the use of wait():

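#include <stdio.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

int main(int argc, char const *argv[]) {
	pid_t rc = fork();
	/* Both processes call wait(). The parent blocks until the child exits;
	   the child has no children, so its call fails right away with -1. */
	pid_t rc_wait = wait(NULL);

	printf("Hello, world! I am %d\n", (int) getpid());
	printf("rc: %d, rc_wait: %d\n===========\n", (int) rc, (int) rc_wait);

	return 0;
}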

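The output is as follows (the child always prints first, because the parent is blocked in wait() until the child exits; the child's own wait() returns -1 right away since it has no children):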

Hello, world! I am 86216
rc: 0, rc_wait: -1
===========
Hello, world! I am 86215
rc: 86216, rc_wait: 86216
===========

`wait()`:

The wait() system call (alternatively waitpid(), which is more complete) allows you to make the order of execution between child and parent deterministic. It basically lets the child finish executing before the parent continues.

`exec()`:

fork() allows you to run copies of the same program as different processes, but you can also run a different program in a process by calling exec(), as in the following example:

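#include <stdio.h>
#include <string.h>
#include <unistd.h>

int main(int argc, char const *argv[]) {
	/* Argument vector for the new program; it must end with NULL. */
	char *argz[3];
	argz[0] = strdup("vim");
	argz[1] = strdup("os.c");
	argz[2] = NULL;

	/* On success execvp() never returns: this process now runs vim. */
	execvp(argz[0], argz);

	/* Reaching this line means execvp() failed. */
	perror("execvp");
	return 1;
}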

The example allows you to run other programs (here vim) from your C code. Notice that we are actually calling execvp(). There are multiple variants of exec().
exec() and its variants don't create new processes. Instead, "it loads code (and static data) from that executable and overwrites its current code segment (and current static data) with it." The heap and stack of the program calling exec() get reinitialized. exec() basically "transforms the currently running program ... into a different running program".

Why an API?

The book claims that these three seemingly convoluted routines exist, instead of one simple call for creating a new process, because they make it easier to build shell programs. I am not really convinced! Anyway, what happens is that the shell is just another program running as a process. When you invoke a program such as ls or grep, the shell process forks a child process of its own. The child calls exec(), which replaces its code and static data with those of the program being run, while the shell calls wait() to let the child finish before it prints the next prompt.
The book claims that easy redirection of input/output is one great example of why these routines exist: because fork() and exec() are separate, the shell can change the child's file descriptors after fork() but before exec().

Limited Direct Execution:

The authors came up with the term limited direct execution to describe how processes are controlled by a modern regular OS. Processes need to be both performant and controlled by the OS. Performance means a process's instructions run natively on the CPU, and control refers to the requirement that a process shouldn't access resources it is not supposed to access or run longer than the time allotted to it.

Direct but Limited Execution:

Direct execution means the process runs directly on the CPU. This is part of why C, for example, is often faster than Java (or why x86 instructions are faster than Java bytecode: the former run directly on the metal, while the latter run on an intermediate virtual machine which interprets them or translates them to x86 instructions). The problem with direct execution is that process code can do anything it wishes. Process instructions are identical to the instructions the OS itself is made of, so a process might access and change data it's not meant to touch, or might run forever and take over the system's resources.
We know that an OS manages computer resources, so it allows certain operations and restricts others. It gives users different levels of control over the system, etc. If processes are not limited, they can do anything, and the OS can no longer enforce any of this.
To make the instructions of a process less dangerous, we limit their capabilities. Certain operations such as accessing I/O devices, creating and destroying other processes, etc. cannot be done directly by processes, but are done indirectly, through system calls and whatnot!
But how can processes be limited? This is done by having code run in one of two processor modes:
- User mode in which the process is limited in what it can do. It cannot, for example, issue an I/O request directly.
- Kernel mode in which the kernel code runs. It can do anything it wants.
If the user process wishes to execute certain restricted operations, it needs to do so through a system call. These system calls are carried out by the OS itself on behalf of the user process. When the user process wants to perform a restricted operation, it executes a special trap instruction. This instruction jumps into the kernel and raises the privilege level to kernel mode. The OS performs the restricted operation and then executes a return-from-trap instruction, which returns to the calling user program and lowers the privilege level back to user mode.

Exception!!

It turned out that much of what I've been reading so far in this book has already been detailed in Computer Systems: A Programmer's Perspective!! That book didn't discuss "policies" and the "whys" of things, but did thoroughly describe much of OS functionality, especially Unix. I will continue reading this book but will mostly focus on stuff not yet covered in the systems book. I will also spend more time on the whys and design aspects of OSs and why certain things were done in certain ways in Unix or whatever OS. At least, I am glad that spending a few months reading the systems book was so far really really worth it!
