当前位置：首页>python>Python ctypes.addressof() 详解:获取内存地址的利器

Python ctypes.addressof() 详解:获取内存地址的利器

2026-07-03 14:18:42

ctypes 是 Python 的一个标准库模块，它提供了与 C 语言兼容的数据类型和函数调用接口。其中 `ctypes.addressof()` 是一个非常重要的函数，它允许我们获取一个 ctypes 对象的内存地址。这个功能在需要与 C 语言代码交互、进行底层内存操作或实现特定数据结构时非常有用。

ctypes.addressof() 基本概念

`ctypes.addressof()` 函数用于获取一个 ctypes 对象的内存地址。它的基本语法如下：

ctypes.addressof(obj)

其中 obj 必须是一个 ctypes 实例（如 `c_int`, 或自定义的 `Structure` 等）。

1.1 与 id() 函数的区别

在 Python 中，`id()` 函数也返回对象的内存地址，但 `id()` 返回的是 Python 对象的地址，而 `ctypes.addressof()` 返回的是 ctypes 对象底层 C 数据的地址。对于简单的 ctypes 对象，两者可能相同，但对于复杂结构（如 Structure 的子对象），它们可能不同。

基本使用示例

让我们从一些简单的例子开始，了解 `ctypes.addressof()` 的基本用法。

2.1 基本数据类型

import ctypes# 创建一个 c_int 对象num = ctypes.c_int(42)print(f"ctypes.addressof(num): {hex(ctypes.addressof(num))}")print(f"id(num): {hex(id(num))}")  # 通常与 addressof 结果相同

2.2 指针类型

# 创建一个指针指向 numptr = ctypes.pointer(num)print(f"ctypes.addressof(ptr): {hex(ctypes.addressof(ptr))}")  # 指针对象本身的地址print(f"ptr.contents address: {hex(ctypes.addressof(ptr.contents))}")  # 指针指向的地址

2.3 数组类型

# 创建一个 c_int 数组arr = (ctypes.c_int * 3)(1, 2, 3)print(f"ctypes.addressof(arr): {hex(ctypes.addressof(arr))}")  # 数组首地址print(f"arr[0] address: {hex(ctypes.addressof(arr[0]))}")  # 第一个元素的地址print(f"arr[1] address: {hex(ctypes.addressof(arr[1]))}")  # 第二个元素的地址

自定义结构体示例

ctypes 允许我们定义自己的 C 结构体，这是 `addressof()` 特别有用的场景。

3.1 定义结构体

classPoint(ctypes.Structure):    _fields_ = [        ("x", ctypes.c_int),        ("y", ctypes.c_int)    ]# 创建结构体实例p = Point(10, 20)

3.2 获取结构体及其成员的地址

print(f"Point structure address: {hex(ctypes.addressof(p))}")print(f"x member address: {hex(ctypes.addressof(p.x))}")  # 错误！不能直接这样获取print(f"y member address: {hex(ctypes.addressof(p.y))}")  # 错误！不能直接这样获取

注意：上面的代码中直接对 `p.x` 和 `p.y` 使用 `addressof()` 会引发错误，因为 `p.x` 和 `p.y` 不是独立的 ctypes 对象。正确的方法是：

# 正确获取成员地址的方法import sys# 对于简单结构体，可以使用偏移量计算# 或者使用 byref() 获取成员地址（但不直接返回整数地址）print(f"Point structure address: {hex(ctypes.addressof(p))}")# 获取成员地址的替代方法（不推荐直接使用，仅作演示）# 更安全的方法是使用 ctypes.byref() 或计算偏移量x_addr = ctypes.addressof(p) + Point.x.offset  # 需要知道成员偏移量print(f"Calculated x address: {hex(x_addr)}")

更实用的方法是使用 `ctypes.byref()` 来获取成员的引用：

# 使用 byref 获取成员引用（不直接返回地址，但可用于指针操作）x_ref = ctypes.byref(p.x)  # 这实际上等同于 ctypes.byref(p) + Point.x.offset# 但 byref() 不直接返回整数地址，要获取地址需要结合 addressof 和偏移量

3.3 更安全的结构体成员地址访问

对于结构体成员地址，更安全的方法是：

# 方法1：使用指针和偏移量（需要知道结构体布局）classPoint(ctypes.Structure):    _fields_ = [        ("x", ctypes.c_int),        ("y", ctypes.c_int)    ]defget_member_address(self, member_name):# 获取成员偏移量for field inself._fields_:if field[0] == member_name:                offset = self.__class__.__dict__[member_name].offsetreturn ctypes.addressof(self) + offsetraise ValueError(f"Member {member_name} not found")p = Point(10, 20)print(f"x address: {hex(p.get_member_address('x'))}")print(f"y address: {hex(p.get_member_address('y'))}")

3.4 嵌套结构体示例

classVector(ctypes.Structure):    _fields_ = [        ("dx", ctypes.c_float),        ("dy", ctypes.c_float)    ]classParticle(ctypes.Structure):    _fields_ = [        ("position", Point),        ("velocity", Vector),        ("mass", ctypes.c_float)    ]# 创建粒子实例particle = Particle(Point(1, 2), Vector(0.5, -0.5), 1.0)# 获取各部分地址print(f"Particle address: {hex(ctypes.addressof(particle))}")print(f"Position address: {hex(ctypes.addressof(particle.position))}")print(f"Velocity address: {hex(ctypes.addressof(particle.velocity))}")print(f"Mass address: {hex(ctypes.addressof(particle.mass))}")  # 错误！不能直接这样获取# 正确获取 mass 地址的方法mass_addr = ctypes.addressof(particle) + Particle.mass.offsetprint(f"Calculated mass address: {hex(mass_addr)}")

与 C 代码交互示例

`ctypes.addressof()` 在与 C 代码交互时特别有用。让我们看一个完整的例子。

4.1 C 代码 (example.c)

#include<stdio.h>typedefstruct {int x;int y;} Point;voidprint_point_address(Point *p) {printf("Point address from C: %p\n", (void*)p);printf("Point x value: %d\n", p->x);printf("Point y value: %d\n", p->y);}voidmodify_point(Point *p, int new_x, int new_y) {    p->x = new_x;    p->y = new_y;}

编译为共享库：

gcc -shared -o libexample.so -fPIC example.c

4.2 Python 代码

import ctypes# 加载共享库lib = ctypes.CDLL('./libexample.so')# 定义 Point 结构体classPoint(ctypes.Structure):    _fields_ = [        ("x", ctypes.c_int),        ("y", ctypes.c_int)    ]# 设置函数原型lib.print_point_address.argtypes = [ctypes.POINTER(Point)]lib.print_point_address.restype = Nonelib.modify_point.argtypes = [ctypes.POINTER(Point), ctypes.c_int, ctypes.c_int]lib.modify_point.restype = None# 创建 Point 实例p = Point(10, 20)# 获取地址并传递给 C 函数p_addr = ctypes.addressof(p)print(f"Point address from Python: {hex(p_addr)}")# 将 Python 的 Point 指针传递给 C 函数# 方法1：使用 ctypes.byref()print("\nUsing ctypes.byref():")lib.print_point_address(ctypes.byref(p))# 方法2：创建指针对象print("\nUsing ctypes.pointer():")p_ptr = ctypes.pointer(p)print(f"Pointer address in Python: {hex(ctypes.addressof(p_ptr))}")print(f"Pointer value (points to): {hex(p_ptr.contents.x)}")  # 不直接显示地址lib.print_point_address(p_ptr)# 修改 Point 的值print("\nModifying Point from C:")lib.modify_point(ctypes.byref(p), 100, 200)print(f"After modification - x: {p.x}, y: {p.y}")

高级应用：实现自定义内存管理

`ctypes.addressof()` 可以用于实现更复杂的内存管理场景，比如直接操作内存缓冲区。

5.1 直接操作内存缓冲区

import ctypes# 创建一个大的缓冲区buffer_size = 1024buffer = (ctypes.c_ubyte * buffer_size)()# 获取缓冲区地址buffer_addr = ctypes.addressof(buffer)print(f"Buffer address: {hex(buffer_addr)}")# 直接操作内存（不推荐，仅作演示）# 将前4字节设置为 0x12345678 (小端序)# 注意：这需要知道系统字节序value = 0x12345678ctypes.memmove(buffer_addr, ctypes.byref(ctypes.c_int32(value)), 4)# 读取验证read_value = ctypes.c_int32.from_buffer(buffer, 0).valueprint(f"Read value: {hex(read_value)}")

5.2 与 numpy 数组交互

import numpy as np# 创建 numpy 数组arr = np.array([1, 2, 3, 4, 5], dtype=np.int32)# 获取 numpy 数组的数据缓冲区地址# 方法1：使用 ctypes.addressof() (需要先转换为 ctypes 数组)# 注意：这不是直接获取 numpy 数组地址的标准方法# 更推荐使用 numpy.ctypeslib 或 __array_interface__# 更正确的方法：arr_ptr = arr.ctypes.data_as(ctypes.POINTER(ctypes.c_int))print(f"NumPy array address: {hex(arr.ctypes.data)}")# 通过 ctypes 访问 numpy 数组元素for i inrange(len(arr)):print(f"Element {i}: {arr_ptr[i]}")

安全注意事项

使用 `ctypes.addressof()` 时需要注意以下几点：

不要修改非指针类型的数据：直接通过地址修改数据可能导致内存损坏或段错误。
生命周期管理：确保获取地址的对象在其被使用期间保持有效。
类型安全：确保通过地址访问的数据类型与实际存储的数据类型匹配。
平台兼容性：不同平台可能有不同的内存布局和对齐要求。

6.1 错误示例

import ctypes# 创建一个局部变量num = ctypes.c_int(42)# 获取地址addr = ctypes.addressof(num)# 函数结束后 num 可能不再有效defget_address():    local_num = ctypes.c_int(100)return ctypes.addressof(local_num)  # 危险！返回局部变量的地址# bad_addr = get_address()  # 不要这样做！

替代方案比较

除了 `ctypes.addressof()`，还有其他几种获取内存地址的方法：

id() 函数：

返回 Python 对象的地址

不适用于需要 C 数据地址的场景

ctypes.byref()：

ctypes.byref()

返回一个轻量级指针对象

不直接给出整数地址

更安全，推荐用于函数参数传递

ctypes.pointer()：

ctypes.pointer()

创建完整的指针对象

可以通过 `ctypes.addressof()` 获取指针本身的地址

要获取指向的数据地址需要访问 `.contents` 属性

7.1 比较示例

import ctypesnum = ctypes.c_int(42)# 方法1: addressofaddr1 = ctypes.addressof(num)print(f"addressof: {hex(addr1)}")# 方法2: byref (返回指针对象，不直接是地址)ref = ctypes.byref(num)# 要获取地址需要间接方式（不推荐）print(f"byref type: {type(ref)}")  # <class 'ctypes.byref_object'># 方法3: pointerptr = ctypes.pointer(num)print(f"pointer object address: {hex(ctypes.addressof(ptr))}")  # 指针对象本身的地址print(f"pointer value (points to): {hex(ctypes.addressof(ptr.contents))}")  # 指向的地址

完整案例：实现一个简单的 C 风格字符串操作

import ctypes# 定义 C 函数原型 (假设在 libstring.so 中)# extern "C" {#     void reverse_string(char *str);#     int string_length(const char *str);# }# 加载库lib = ctypes.CDLL('./libstring.so')# 设置函数原型lib.reverse_string.argtypes = [ctypes.c_char_p]lib.reverse_string.restype = Nonelib.string_length.argtypes = [ctypes.c_char_p]lib.string_length.restype = ctypes.c_int# 创建可修改的 C 字符串# 方法1: 使用 create_string_buffers = ctypes.create_string_buffer(b"Hello, World!")print(f"Original string: {s.raw}")print(f"String address: {hex(ctypes.addressof(s))}")# 调用 C 函数反转字符串lib.reverse_string(s)print(f"Reversed string: {s.raw}")# 方法2: 使用固定大小的数组char_array = (ctypes.c_char * 20)()char_array.value = b"Python ctypes"print(f"\nOriginal array: {char_array.raw}")print(f"Array address: {hex(ctypes.addressof(char_array))}")lib.reverse_string(char_array)print(f"Reversed array: {char_array.raw}")# 计算字符串长度length = lib.string_length(char_array)print(f"String length: {length}")

性能考虑

`ctypes.addressof()` 本身是一个简单的操作，性能开销很小。但在高性能场景中，需要注意：

频繁的地址获取可能影响性能
通过地址直接访问数据绕过了 Python 的类型检查
考虑使用 `ctypes.byref()` 或 `ctypes.pointer()` 作为替代，它们在某些情况下可能更高效`ctypes.addressof()` 是 ctypes 模块中一个强大但需要谨慎使用的功能，它允许我们获取 ctypes 对象的底层内存地址。主要用途包括：
与 C 代码交互时传递对象地址
实现自定义内存管理
直接操作内存缓冲区（高级用法）
调试和内存分析

使用时需要特别注意：