Processes

This lecture is about the first process in xv6.

Processes

Recall the goals of processes:
- Give each process a private memory area for code, data, stack.
- Prevent one process from reading/writing outside its address space.
- Allow sharing when needed.
- Bring a program to live
Usually the implementation is split between the O/S and the hardware.
The O/S manages processes:
- Allocate physical memory for them (for creation, growth, deletion).
- Keep track of them when they are not executing.
- Switch between them (to switch processes).
- Configure the hardware.
The hardware performs address translation and protection:
- Translate user addresses to physical addresses.
- Detect and prevent attempts to use memory outside the address space.
- Allow cross-space transfers (system calls, interrupts).
Also:
- O/S has its own address space.
- O/S must be able to conveniently read/write user memory.
Hardware support may or may not correspond well to what the O/S wants.
Two main approaches: segments and page tables. Paging has won: most O/S are designed for paging, most modern CPU designs support only paging. BUT x86 provides many features only via segmentation h/w (interrupts, protection), so we must learn about x86 segments, a little bit. Also xv6 uses segments, not paging.

Example hardware for address spaces: x86 segments

The operating system can switch the x86 to protected mode, which supports virtual and physical addresses, and allows the O/S to set up address spaces so that user processes can't change them. Translation in protected mode is as follows:

selector:offset (virtual / logical addr)
==SEGMENTATION==>
linear address
==PAGING ==>
physical address

Next lecture covers paging; now we focus on segmentation.

Protected-mode segmentation works as follows (see handout):

segment register holds segment selector
selector: 13 bits of index, local vs global flag, 2-bit RPL
selector indexes into global descriptor table (GDT)
segment descriptor holds 32-bit base, limit, type, protection
la = va + base ; assert(va < limit);
choice of seg register usually implicit in instruction
- ESP uses SS, EIP uses CS, others (mostly) use DS
- some instructions can take far addresses:
  - ljmp $selector, $offset
GDT lives in memory, CPU's GDTR register points to base of GDT
LGDT instruction loads GDTR
you turn on protected mode by setting PE bit in CR0 register
What about protection?
- instructions can only r/w/x memory reachable through seg regs
- not before base, not after limit
- can my program change a segment register? yes, but...
- can my program re-load GDTR? no!
- how does h/w know if user or kernel?
- Current privilege level (CPL) is in the low 2 bits of CS
- CPL=0 is privileged O/S, CPL=3 is user
- why can't app modify the descriptors in the GDT? it's in memory...
- what about system calls? how do they transfer to kernel?
- app cannot just lower the CPL

Case study (xv6)

xv6 is a reimplementation of Unix 6th edition.

v6 is an early Unix operating system for DEC PDP11
- Thompson and Ritchie, 1976
- PDP11: 16 bit data and addresses, 18 bit physical addresses
- ancestor of Linux &c but much smaller
- recognizable: shell, multi-user, directories
- written in C
- 6.828 used to use it instead of xv6
- Unix papers.
xv6 written for 6.828:
- even smaller than v6, maybe not useable as is
- preserves basic structure (processes, files, pipes, &c)
- you don't have to learn PDP11 and x86
- runs on multi-processor PCs.

Newer Unixs have inherited many of the conceptual ideas even though they added paging, networking, graphics, improve performance, etc.

You will need to read most of the source code multiple times. Your goal is to explain every line to yourself. The chapters published on the schedule page may be helpful.

Overview of processes in xv6

In today's lecture we see how xv6 creates the kernel address spaces, and the first user process. A process consists of an address space and one thread of control (to run the program) in xv6. The kernel address space is the only address space with multiple threads of control. We will study context switching and process management in detail next weeks; creation of the first user process (init) will get you a first flavor.

The process chapter covers the material below in more detail.

xv6 uses only the segmentation hardware on the x86; it doesn't use paging. (In JOS you will use page-table hardware too, which we cover in next lecture.)

The kernel address space:

  the code segment runs from 0 to 2^32 and is mapped X and R
  the data segment runs from 0 to 2^32 but is mapped W (read and write).

Each process has an address space, laid out as follows starting at virtual address zero:
```
  text
  original data and bss
  fixed-size stack
  expandable heap
```
A process's code, data, and stack segments all map this virtual address space to the same range of linear addresses. That is, all three segments are the same.

The x86 designers probably had in mind more interesting uses of segments. What might they have been?

xv6 process structure

we're about to look at how the first XV6 process starts up
  it will run initcode.S, which does exec("/init")
  /init is a program that starts up a shell we can type to

what's the important state of an xv6 process?
  kernel proc[] table has an entry for each process
    p->mem points to user mem phys address
    p->kstack points to kern stack phys address
    struct context holds saved kernel registers
      EIP, ESP, EAX, &c
      for when a system call is waiting for input
  user half: user memory
    user process sees memory as starting at zero
    instructions, data, stack, expandable heap
  kernel half: kernel executing a system call for a process
    on the process's kernel stack

xv6 has two kinds of transitions
  trap + return: user->kernel, kernel->user
    system calls, interrupts, divide-by-zero, &c
    save user process state ... run in kernel ... restore state
  process switch: between kernel halves
    one process is waiting for input, run another
      or time-slicing between compute-bound processes
    save p1's kernel-half state ... restore p2's kernel-half state
  setting up first process involves manually initializing this state

saved state for trap
  during trap, the CPU:
    switches to process's kernel stack
    pushes SS, ESP, EFLAGS, CS, EIP onto kernel stack
    jumps into kernel
  kernel then pushes the other user registers
  this is struct trapframe
  trap return reverses this, resuming at saved user EIP
  for first process:
    manually set up these "saved" registers on the kernel stack
    EIP 0, ESP top of user memory, &c

saved state for process switch
  save registers (EIP, ESP, EAX, &c) in oldp->context
  restore registers from newp->context
  now we are at the EIP of newp, and using its kernel stack
  this is the only way xv6 switches among processes
    there is no direct user->user process switch
    instead, user TRAP kernel PROCESS-SWITCH kernel TRAP-RETURN user
  for first process:
    manually set up EIP and ESP to run forkret, which returns from trap

Since an xv6 process's address space is essentially a single segment, a process's physical memory must be contiguous. So xv6 may run into fragmentation if process sizes are a significant fraction of physical memory.

xv6 kernel address space

Let's see how xv6 creates the kernel address space by tracing xv6 from when it boots, focusing on address space management.

Start with ksegment(), which is called from main()
look at GDT after lgdt with info gdt
How are text, data, and stack mapped?
What do the protection bits mean?
How does address translation change right after lgdt completes? (what happened during boot?)

main() calls userinit() to create first process

then scheduler() to start running processes

creating the first process

userinit()
What is in binary_initcode?
How does the user stack look like?
allocproc()
how does the kernel stack look for the new process? what is on it?

running the first process

remember that main calls scheduler() after userinit()
we'll see how scheduler() works in a few lectures, overview now
usegment---how is the address for the user space setup? protection?
swtch
forkret
Which stack is the kernel using now?
What is on this stack?
trapret?
What is on the stack?
In which address space is processor after executing iret?
What is on the stack in that address space?

Managing physical memory

To create an address space we must allocate physical memory, which will be freed when an address space is deleted (e.g., when a user program terminates). xv6 implements a first-fit memory allocator (see kalloc.c).

kalloc() maintains a list of ranges of free memory. The allocator finds the first range that is larger than the amount of requested memory. It splits that range in two: one range of the size requested and one of the remainder. It returns the first range. When memory is freed, kfree will merge ranges that are adjacent in memory.

Under what scenarios is a first-fit memory allocator undesirable?

Growing an address space

How can a user process grow its address space? growproc.

allocate a new segment of old size plus n
copy the old segment into the new (ouch!)
why bother zeroing the rest?
free the old physical memory

We could do a lot better if segments didn't have to be contiguous in physical memory. How could we arrange that? Using page tables, which is our next topic. This is one place where page tables would be useful, but there are others too (e.g., in fork).