This question already has an answer here:
- How do malloc() and free() work? 14 answers
Can anyone explain how malloc()
works internally?
I have sometimes done strace program
and I see a lot of sbrk
system calls, doing man sbrk
talks about it being used in malloc()
but not much more.
Simplistically malloc and free work like this:
malloc provides access to a process's heap. The heap is a construct in the C core library (commonly libc) that allows objects to obtain exclusive access to some space on the process's heap.
Each allocation on the heap is called a heap cell. This typically consists of a header that hold information on the size of the cell as well as a pointer to the next heap cell. This makes a heap effectively a linked list.
When one starts a process, the heap contains a single cell that contains all the heap space assigned on startup. This cell exists on the heap's free list.
When one calls malloc, memory is taken from the large heap cell, which is returned by malloc. The rest is formed into a new heap cell that consists of all the rest of the memory.
When one frees memory, the heap cell is added to the end of the heap's free list. Subsequent mallocs walk the free list looking for a cell of suitable size.
As can be expected the heap can get fragmented and the heap manager may from time to time, try to merge adjacent heap cells.
When there is no memory left on the free list for a desired allocation, malloc calls brk or sbrk which are the system calls requesting more memory pages from the operating system.
Now there are a few modification to optimize heap operations.
It's also important to realize that simply moving the program break pointer around with
brk
andsbrk
doesn't actually allocate the memory, it just sets up the address space. On Linux, for example, the memory will be "backed" by actual physical pages when that address range is accessed, which will result in a page fault, and will eventually lead to the kernel calling into the page allocator to get a backing page.The
sbrk
system call moves the "border" of the data segment. This means it moves a border of an area in which a program may read/write data (letting it grow or shrink, although AFAIK nomalloc
really gives memory segments back to the kernel with that method). Aside from that, there's alsommap
which is used to map files into memory but is also used to allocate memory (if you need to allocate shared memory,mmap
is how you do it).So you have two methods of getting more memory from the kernel:
sbrk
andmmap
. There are various strategies on how to organize the memory that you've got from the kernel.One naive way is to partition it into zones, often called "buckets", which are dedicated to certain structure sizes. For example, a
malloc
implementation could create buckets for 16, 64, 256 and 1024 byte structures. If you askmalloc
to give you memory of a given size it rounds that number up to the next bucket size and then gives you an element from that bucket. If you need a bigger areamalloc
could usemmap
to allocate directly with the kernel. If the bucket of a certain size is emptymalloc
could usesbrk
to get more space for a new bucket.There are various
malloc
designs and there is propably no one true way of implementingmalloc
as you need to make a compromise between speed, overhead and avoiding fragmentation/space effectiveness. For example, if a bucket runs out of elements an implementation might get an element from a bigger bucket, split it up and add it to the bucket that ran out of elements. This would be quite space efficient but would not be possible with every design. If you just get another bucket viasbrk
/mmap
that might be faster and even easier, but not as space efficient. Also, the design must of course take into account that "free" needs to make space available tomalloc
again somehow. You don't just hand out memory without reusing it.If you're interested, the OpenSER/Kamailio SIP proxy has two
malloc
implementations (they need their own because they make heavy use of shared memory and the systemmalloc
doesn't support shared memory). See: https://github.com/OpenSIPS/opensips/tree/master/memThen you could also have a look at the GNU libc
malloc
implementation, but that one is very complicated, IIRC.