深入解析C语言中结构体的内存对齐机制及其优化方法

2019独角兽企业重金招聘Python工程师标准>>>

为了让CPU能够更舒服地访问到变量&＃xff0c;struct中的各成员变量的存储地址有一套对齐的机制。这个机制概括起来有两点&＃xff1a;第一&＃xff0c;每个成员变量的首地址&＃xff0c;必须是它的类型的对齐值的整数倍&＃xff0c;如果不满足&＃xff0c;它与前一个成员变量之间要填充(padding)一些无意义的字节来满足&＃xff1b;第二&＃xff0c;整个struct的大小&＃xff0c;必须是该struct中所有成员的类型中对齐值最大者的整数倍&＃xff0c;如果不满足&＃xff0c;在最后一个成员后面填充。

各种类型的变量的align值如下&＃xff0c;参考的是wikipedia的页面&＃xff1a;Data structure alignment

The following typical alignments are valid for compilers from Microsoft, Borland, and GNUwhen compiling for 32-bit x86:

A char (one byte) will be 1-byte aligned.
A short (two bytes) will be 2-byte aligned.
An int (four bytes) will be 4-byte aligned.
A float (four bytes) will be 4-byte aligned.
A double (eight bytes) will be 8-byte aligned on Windows and 4-byte aligned on Linux.
A long double (twelve bytes) will be 4-byte aligned on Linux.
Any pointer (four bytes) will be 4-byte aligned on Linux. (eg: char*, int*)

The only notable difference in alignment for a 64-bit linux system when compared to a 32 bit is:

A double (eight bytes) will be 8-byte aligned.
A long double (Sixteen bytes) will be 16-byte aligned.
Any pointer (eight bytes) will be 8-byte aligned.

这里写了个程序来验证这些事&＃xff1a;

#include struct s {char a;short b;char c;double d;char e; };int main() {struct s s1;printf("%d, %d, %d, %d, %d\n",(char*)(&s1.a) - (char*)(&s1),(char*)(&s1.b) - (char*)(&s1),(char*)(&s1.c) - (char*)(&s1),(char*)(&s1.d) - (char*)(&s1),(char*)(&s1.e) - (char*)(&s1));printf("%d\n", sizeof(struct s));return 0; }

在64位linux下面运行这段代码的结果是&＃xff1a;

0, 2, 4, 8, 16

由于对齐机制的存在&＃xff0c;实际上上面的struct在内存中是长这个样子的&＃xff0c;共计24个字节&＃xff1a;

struct s {char a; //在地址为0的位置char padding1[1]; //由于下面一个元素是short&＃xff0c;对齐字节数为2的位数&＃xff0c;需要补1字节short b; //对齐到了地址为2的位置char c; //在地址为4的位置char padding2[3]; //由于下面一个元素是double&＃xff0c;对齐字节数为8的倍数&＃xff0c;需要补3字节double d; //对齐到了地址为8的位置char e; //在地址为16的位置char padding3[7]; //整个struct的大小需要是对齐数最大者&＃xff0c;也就是double的8字节的整数倍 };

如果是在32位的linux下跑上面的程序&＃xff0c;由于double的长度还是8字节&＃xff0c;但是对齐是4字节的了&＃xff0c;所以前面几个成员的位置不变&＃xff0c;而最后的padding只需要补3个字节就可以了&＃xff0c;所以输出的结果是0, 2, 4, 8, 16及20.

对于windows&＃xff0c;其32位和64位下double都是8字节对齐的&＃xff0c;所以在32位和64位下跑这个程序结果都是0, 2, 4, 8, 16及24.

最后&＃xff0c;整个struct的大小的要求是对齐值最大者的整数倍&＃xff0c;没有什么默认的4或者8的倍数一说。如果把上面程序中的a,b,c,d,e的类型全变成char&＃xff0c;那么最后的他们的地址会是0,1,2,3,4&＃xff0c;整个struct的大小 sizeof(struct s)的值是5&＃xff0c;没有任何padding发生。

以上程序实验的环境在64位centos x64上的gcc 4.1.2&＃xff08;32位结果加-m32参数&＃xff09;及Visual Studio 2008上得出。