Notes
main
main
  • Introduction
  • linuxKernel
    • tips
    • make_help
    • old linux
      • compile_linux0.11
      • TestEnvironment
      • load_setup
      • get_hard_data
    • list
    • plist
    • fifo
    • idr
    • xarray
    • rbtree
    • maple_tree
    • syscall
    • bitmap
    • page
    • page_flags
    • page_size
    • page mapcount
    • page refcount
    • folio
    • slub
      • proc_slabinfo
      • slub_theory
      • kmalloc_kfree
      • kmem_cache
      • slab_alloc
      • slab_free
      • proc_meminfo_SReclaimable_SReclaimable
    • vmalloc
    • brk
    • mmap
    • mremap
    • mprotect
    • madvise
    • read
    • write
    • shmem
    • huge_page
    • page_fault
    • rmap
    • lru
    • multi-gen-LRU
    • page_reclaim
    • page_cache
    • page_table
    • rcu
    • kvm
    • aarch64_boot
    • tracing_system
    • cache_coherence_and_memory_consistency
    • cpu_speculates
    • mmap_lock
    • per-vma_lock
    • cgroup
    • symbol
    • campact
    • page_ext
    • mempool
    • kernelstack
    • filesystem
    • io_stack
    • workingset
    • ioremap
    • sched_period
  • linuxDebug
    • openocd_openjtag
    • i2c_tools
    • objdump
    • addr2line
    • gdb_useage
    • debug_linux_kernel_via_gdb
    • debug_linux_module_via_gdb
    • early_boot
    • sequentially_execute
    • dynamic_debug
    • research_linuxKernel_by_patch
    • tracefs
    • ebpf
    • bpftrace
    • perf
    • flame_graph
    • crash
    • ASAN_HWASAN_MTE_check_mem_bug
    • page_owner
    • vmtouch
    • fio
    • benchmark
  • linuxSystem
    • common
      • system_version
      • procfs
      • proc_sys_vm
      • cmd_ps
      • makefile
      • file_descriptor
      • psi
      • ulimit
      • top
      • delay_accounting
    • ubuntu
      • custom_kernel
      • get_cmd_src
      • record_ssh_info
      • log
      • run_custom_script
      • repo
      • cockpit
      • nfs
      • tftp
      • misc
    • fedora
      • system_upgrade
      • custom_kernel
      • lvextend
      • yt-dlp
      • jellyfin
  • linuxDriver
    • i2c_peripherals_driver
    • spi_peripherals_driver
    • gpio_subsystem
    • IRQ_driver
    • blockIO_unblockIO_async
    • linux_own_driver
    • misc_device
    • input_device
    • timer
    • atomic_spinlock_semaphore_mutex
    • lcd
    • touch_screen
    • debugfs
    • v4l2
    • mmap
  • hardware
    • paging_mmu_pt
    • iommu
  • process_thread_scheduler
    • scheduler01
    • scheduler02
    • scheduler03
    • scheduler04
    • scheduler05
    • scheduler06
  • memory_management
    • mm1
    • mm2
    • mm3
    • mm4
    • mm5
  • input_output_filesystem
    • io_fs_01
    • io_fs_02
    • io_fs_03
    • io_fs_04
  • lock_and_lockup_detector
    • general_lock
    • hung_task
    • softLockup_hardLockup
    • crash_experiment
  • MIT_6.S081
    • 6.S081_Operating_System_Engineering
    • Schedule.md
    • Class
      • Overview
      • Administrivia
    • Labs
      • Tools
      • Guidance
      • startup
      • syscall
      • page_table
      • Calling_Convention
      • traps
    • xv6
      • xv6
    • References.md
  • qemu
    • qemu_buildroot
    • qemu_busybox.md
    • Serial.md
    • demo_mini2440
      • 0_compilation_error_summary
      • 1_compilation_steps
      • 2_operation_mode
      • 3_transplant_tools_libraries
      • 4_tools_use
      • reference_website
  • tools
    • getKernelSourceCodeList
    • nat
    • shell
    • translating
    • YouCompleteMe
    • cscope
    • global
    • vscode
    • vim
    • binary
    • markdown
    • draw
    • git
    • tig
    • tmux
    • mail_client
    • download_patchset_from_LKML
    • minicom
    • clash
  • other
    • interview
    • interview_c_base
    • know_dontknow
    • Stop-Ask-Questions-The-Stupid-Ways
    • How-To-Ask-Questions-The-Smart-Way
    • docker
    • buildroot
    • rv32_to_rv64
Powered by GitBook
On this page
  • 0. 简介
  • 1. 分析源码
  • 1.1 创建kmem_cache
  • 1.2 删除kmem_cache
  • 1.3 从kmem_cache中分配object
  • 1.4 从kmem_cache中释放object

Was this helpful?

  1. linuxKernel
  2. slub

kmem_cache

0. 简介

kmem_cache_xxx()是linux kernel提供的申请小内存(8B, 16B, 32B...4096B, 8192B)的API,一般用法:

#include <linux/slab.h>

cachep = kmem_cache_create(name, size, align, flags, ctor_func);
objp = kmem_cache_alloc(cachep, GFP_KERNEL);
...
kmem_cache_free(cachep, objp);
kmem_cache_destroy(cachep);

通过kmem_cache_create()申请一个名字为name、object大小为size、以align对齐的kmem_cache类型的变量,然后调用kmem_cache_alloc()从kmem_cache中申请一个object;

通过kmem_cache_free()释放之前申请到的object,然后调用kmem_cache_destroy()释放kmem_cache类型的变量

我们平时通过kmalloc()/kfree()申请/释放小内存时,就是通过kmem_cache实现的

1. 分析源码

我们使用kmem_cache_xxx()时会通过#include <linux/slab.h>导入函数的原型,但是在导入函数原型前,需要通过配置CONFIG_SLUB、CONFIG_SLOB来选择小内存分配算法,如下:

/* include/linux/slab.h */
#ifdef CONFIG_SLUB
#include <linux/slub_def.h>
#elif defined(CONFIG_SLOB)
#include <linux/slob_def.h>
#else
#include <linux/slab_def.h>
#endif

通过查找.config或include/generated/autoconf.h,可知目前使用slub分配器进行小内存分配。

1.1 创建kmem_cache

kmem_cache_create()定义,如下:

/* mm/slub.c */
struct kmem_cache *kmem_cache_create(const char *name, size_t size,
        size_t align, unsigned long flags, void (*ctor)(void *))
{
    struct kmem_cache *s;

    s = kmalloc(kmem_size, GFP_KERNEL);
    kmem_cache_open(s, GFP_KERNEL, name, size, align, flags, ctor);
    list_add(&s->list, &slab_caches);
    sysfs_slab_add(s);
    return s;
}

static int kmem_cache_open(struct kmem_cache *s, gfp_t gfpflags,
        const char *name, size_t size,
        size_t align, unsigned long flags,
        void (*ctor)(void *))
{
    memset(s, 0, kmem_size);
    s->name = name;
    s->ctor = ctor;
    s->objsize = size;
    s->align = align;
    s->flags = kmem_cache_flags(size, flags, name, ctor);

    if (!calculate_sizes(s, -1))        // 初始化size变量
        goto error;

    set_min_partial(s, ilog2(s->size)); // 初始化min_partial变量
    s->refcount = 1;
#ifdef CONFIG_NUMA
    s->remote_node_defrag_ratio = 1000;
#endif
    if (!init_kmem_cache_nodes(s, gfpflags & ~SLUB_DMA)) // 初始化node变量
        goto error;

    if (alloc_kmem_cache_cpus(s, gfpflags & ~SLUB_DMA))  // 初始化cpu_slab变量
        return 1;
}

首先通过kmalloc()申请kmem_cache,再调用kmem_cache_open()进行初始化,然后将kmem_cache放入slab_caches链表,并且调用sysfs_slab_add()创建/proc/slabinfo相关条目,最后返回kmem_cache

1.2 删除kmem_cache

kmem_cache_destroy()定义,如下:

/* mm/slub.c */
void kmem_cache_destroy(struct kmem_cache *s)
{
    list_del(&s->list);
    kmem_cache_close(s);
    sysfs_slab_remove(s);
}

static inline int kmem_cache_close(struct kmem_cache *s)
{
    int node;

    flush_all(s);             // 释放kmem_cache_cpu page指向的slab
    free_percpu(s->cpu_slab); // 释放cpu_slab变量
    /* Attempt to free all objects */
    for_each_node_state(node, N_NORMAL_MEMORY) {
        struct kmem_cache_node *n = get_node(s, node);

        free_partial(s, n);   // 释放kmem_cache_node partial指向的slab
        if (n->nr_partial || slabs_node(s, node))
            return 1;
    }
    free_kmem_cache_nodes(s); // 释放node变量
    return 0;
}

首先将kmem_cache从slab_caches链表中删除,然后调用kmem_cache_close()释放kmem_cache相关资源,最后调用sysfs_slab_remove()删除/proc/slabinfo相关条目

1.3 从kmem_cache中分配object

kmem_cache_alloc()定义,如下:

/* mm/slub.c */
void *kmem_cache_alloc(struct kmem_cache *s, gfp_t gfpflags)
{
    void *ret = slab_alloc(s, gfpflags, -1, _RET_IP_);

    return ret;
}

1.4 从kmem_cache中释放object

kmem_cache_free()定义,如下:

/* mm/slub.c */
void kmem_cache_free(struct kmem_cache *s, void *x)
{
    struct page *page;

    page = virt_to_head_page(x);

    slab_free(s, page, x, _RET_IP_);
}
Previouskmalloc_kfreeNextslab_alloc

Last updated 4 years ago

Was this helpful?

调用从slab中分配一个object并且返回。如果没有可用的slab,使用gfpflags标志分配新slab

通过virt_to_head_page()从 x 获得slab的首页地址page,然后调用

slab_alloc()
slab_free()