Def. The live data are all of the tuples that might be accessed by the program in the future. We can overapproximate this as all of the tuples that are reachable, transitively, from the registers or procedure call stack. We refer to the registers and stack collectively as the root set.
The goal of a garbage collector is to reclaim the data that is not live.
We shall use a 2-space copying collector, using Cheney's algorithm (BFS) for the copy.
Alternative garbage collection techniques:
- generational copy collectors
- mark and sweep
- reference counting + mark and sweep
Overview of how GC fits into a running program.:
- Ask the OS for 2 big chunks of memory. Call them FromSpace and ToSpace.
- Run the program, allocating tuples into the FromSpace.
- When the FromSpace is full, copy the live data into the ToSpace.
- Swap the roles of the ToSpace and FromSpace and go back to step 1.
Draw Fig. 5.6. (just the FromSpace)
- breadth-first search (quick reminder what that is) uses a queue
- Cheney: use the ToSpace as the queue, use two pointers to keep
track of the front (scan pointer) and back (free pointer) of the queue.
- Copy tuples pointed to by the root set into the ToSpace to form the initial queue.
- While copying a tuple, mark the old one and store the address of the new tuple inside the old tuple. This is called a forwarding pointer.
- Start processing tuples from the front of the queue. For each tuple, copy the tuples that are directly reachable from it to the back of the queue in the ToSpace, unless the tuple has already been copied. Update the pointers in the processed tuple to the copies or the forwarding pointer.
- Draw Fig. 5.7
An implementation of a garbage collector is in runtime.c
.
- Problems:
- how to differentiate pointers from other things on the procedure call stack?
- how can the GC access the pointers that are in registers?
- how to differentiate poitners from other things inside tuples?
- Solutions
- Use a root stack (aka. shadow stack), i.e., place all tuples in a separate stack that works in parallel to the normal stack. Draw Fig. 5.7.
- Spill vector-typed variables to the root stack if they are live during a call to the collector.
- Add a 64-bit header or "tag" to each tuple. (Fig. 5.8)
The header includes
- 1 bit to indicate forwarding (0) or not (1). If 0, then the header is the forwarding pointer.
- 6 bits to store the length of the tuple (max of 50)
- 50 bits for the pointer mask to indicate which elements of the tuple are pointers.
Add cases for the new expressions. (Fig 5.1)
type ::= ... | (Vector type+) | Void
exp ::= ... | (vector exp+) | (vector-ref exp int)
| (vector-set! exp int exp) | (void)
To help the GC identify vectors (pointers to the heap), wrap every sub-expression with its type. This information will be propagated in the flatten pass to all variables.
(HasType exp type)
Add HasType to the generated code.
Lower vector creation into a call to collect, a call to allocate, and then initialize the memory (see 5.3.1).
Make sure to place the code for sub-expressions prior to the sequence collect-allocate-initialize. Sub-expressions may also call collect, and we can't have partially constructed vectors during collect!
New forms in the output language:
exp ::=
(Collect int) call the GC and you're going to need `int` bytes
| (Allocate int type) allocate `int` bytes
| (GlobalValue name) access global variables
free_ptr
: the next empty spot in the FromSpacefromspace_end
: the end of the FromSpace
The new forms Collect, Allocate, GlobalValue should be treated as complex operands.
Add case for HasType.
Adapt case for Prim to make sure the enclosing HasType does not get separated from it.