You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This articles will introduce type alignment and size guarantees in Go.
It is essential to know the guarantees to estimate the sizes of struct types
and properly use the 64-bit functions in sync/atomic standard package.
Go is a C family language, so many concepts talked in this article are shared with C language.
Type Alignment Guarantees in Go
Type alignment guarantees are also called value address alignment guarantees.
If the alignment guarantee of a type T is N,
then the address of every value of type T must be a multiple of N at run time.
We can also say the addresses of addressable values of type T are guaranteed to be N-byte aligned.
In fact, each type has two alignment guarantees, one is for when it is used as field types of other (struct) types,
the other is for other cases (when it is used for a variable declaration, array element type, etc).
We call the former one the field alignment guarantee of that type,
and call the latter one the general alignment guarantee of that type.
For a type T, we can call unsafe.Alignof(t) to
get its general alignment guarantee, where t is a non-field value of type T,
and call unsafe.Alignof(x.t) to
get its field alignment guarantee, where x is a struct value
and t is a field value of type T.
Calls to the functions in the unsafe standard code packages are always evaluated at compile time.
At run time, for a value t of type T,
we can call reflect.TypeOf(t).Align() to get the general alignment guarantee
of type T, and call reflect.TypeOf(t).FieldAlign() to get
the field alignment guarantee of type T.
For the current standard Go compiler (1.12), the field alignment guarantee and the general alignment guarantee
of a type are always equal. For gccgo compiler, the statement is false.
The following minimal alignment properties are guaranteed:
For a variable x of any type: unsafe.Alignof(x) is at least 1.
For a variable x of struct type: unsafe.Alignof(x) is the largest of all the values unsafe.Alignof(x.f) for each field f of x, but at least 1.
For a variable x of array type: unsafe.Alignof(x) is the same as the alignment of a variable of the array's element type.
So Go specification doesn't specify the exact alignment guarantees for any types.
It just specifies some minimal requirements.
For the same compiler, the exact type alignment guarantees may be different
between different architectures and between different compiler versions.
For the current version (1.12) of the standard Go compiler,
the alignment guarantees are listed here.
type alignment guarantee
bool, byte, uint8, int8 1
uint16, int16 2
uint32, int32, float32, complex64 4
arrays and structs depend on element types and field types
other types size of a native word
Here, the size of a native word (or machine word) is 4-byte
on 32-bit architectures and 8-byte on 64-bit architectures.
This means, for the current version of the standard Go compiler,
the alignment guarantees of other types may be either 4 or 8,
depends on different build target architectures.
This is also true for gccgo.
Generally, we don't need to care about the value address alignments in Go programming,
except that we want to optimizing memory consumption, or write portable programs which
using the 64-bit functions from sync/atomic package.
Please read the following two sections for details.
byte, uint8, int8 1
uint16, int16 2
uint32, int32, float32 4
uint64, int64, float64, complex64 8
complex128 16
uint, int 4 or 8, depend on architectures
uintptr large enough to store the uninterpreted
bits of a pointer value
Go specification doesn't make value size guarantees for other kinds of types.
The full list of sizes of different types settled
by the standard Go compiler are listed in
value copy costs.
The standard Go compiler (and gccgo) will ensure the size of values of a type
is a multiple of the alignment guarantee of the type.
To satisfy type alignment guarantees mentioned in the previous section,
Go compilers may pad some bytes between fields of struct values.
This makes the value size of a struct type may be not a simple
sum of the sizes of all fields of the type.
The following is an example showing how bytes are padded between struct fields.
We have already learned that
the alignment guarantee and size of the built-in type int8 are both one byte.
the alignment guarantee and size of the built-in type int16 are both two bytes.
the size of the built-in type int64 is 8 bytes, the alignment guarantee
of type int64 is 4 bytes on 32-bit architectures and 8 bytes on 64-bit architectures.
the alignment guarantees of the types T1 and T2
are their respective largest field alignment guarantees, a.k.a., the alignment guarantee of the int64 field.
So their alignment guarantees are both 4 bytes on 32-bit architectures and 8 bytes on 64-bit architectures.
the sizes of the types T1 and T2 must be multiples of their respective alignment guarantees,
a.k.a., 4N bytes on 32-bit architectures and 8N bytes on 64-bit architectures.
type T1 struct {
a int8
// On 64-bit architectures, to make field b 8-byte aligned,
// 7 bytes need to be padded here. On 32-bit architectures, to
// make field b 4-byte aligned, 3 bytes need to be padded here.
b int64
c int16
// To make the size of type T1 be a multiple of the alignment
// guarantee of T1, on 64-bit architectures, 6 bytes need to be
// padded here, and on 32-bit architectures, 2 bytes need to be
// padded here.
}
// The size of T1 is 24 (= 1 + 7 + 8 + 2 + 6) bytes on 64-bit
// architectures and is 16 (= 1 + 3 + 8 + 2 + 2) on 32-bit architectures.
type T2 struct {
a int8
// To make field c 2-byte aligned, one byte needs to be padded
// here on both 64-bit and 32-bit architectures.
c int16
// On 64-bit architectures, to make field b 8-byte aligned, 4
// bytes need to be padded here. On 32-bit architectures, field b
// is already 4-byte aligned, so no bytes need to be padded here.
b int64
}
// The size of T2 is 16 (= 1 + 1 + 2 + 4 + 8) bytes on 64-bit
// architectures and is 12 (= 1 + 1 + 2 + 8) on 32-bit architectures.
Although T1 and T2 have the same field set, their sizes are different.
One interesting fact for the standard Go compiler is that sometimes zero sized fields may affect structure padding.
Please read this question in the unofficial Go FAQ for details.
The Alignment Requirement for 64-bit Word Atomic Operations
64-bit words mean values of types whose underlying types are int64 or uint64.
The article atomic operations mentions a fact that
64-bit atomic operations on a 64-bit word require the address of the 64-bit word must be 8-byte aligned.
This is not a problem for the current 64-bit architectures supported by the standard Go compiler,
because 64-bit words on these 64-bit architectures are always 8-byte aligned.
However, on 32-bit architectures, the alignment guarantee made by the standard Go compiler for 64-bit words is only 4 bytes.
64-bit atomic operations on a 64-bit word which is not 8-byte aligned will panic at runtime.
Worse, on very old CPU architectures, 64-bit atomic functions are not supported.
On x86-32, the 64-bit functions use instructions unavailable before the Pentium MMX.
On non-Linux ARM, the 64-bit functions use instructions unavailable before the ARMv6k core.
On both ARM and x86-32, it is the caller's responsibility to
arrange for 64-bit alignment of 64-bit words accessed atomically.
The first word in a variable or in an allocated struct,
array, or slice can be relied upon to be 64-bit aligned.
So, things are not very bad for two reasons.
The very old CPU architectures are not mainstream architectures nowadays.
If a program needs to do synchronization for 64-bit words on these architectures,
there are other synchronization techniques to rescue.
On other not-very-old 32-bit architectures,
there are some ways to ensure some 64-bit words are relied upon to be 64-bit aligned.
The ways are described as
the first (64-bit) word in a variable or in an allocated struct,
array, or slice can be relied upon to be 64-bit aligned.
What does the word allocated mean?
We can think an allocated value as a declared variable,
a value returned the built-in make function,
or the value referenced by a value returned by the built-in new function.
If a slice value derives from an allocated array and
the first element of the slice is the first element of the array,
then the slice value can also be viewed as an allocated value.
The description of which 64-bit words can be relied upon to be 64-bit aligned
on 32-bit architectures is some conservative.
There are more 64-bit words which can be relied upon to be 8-byte aligned.
In fact, if the first element of an array or slice which element type
is a 64-bit word type can be relied upon to be 64-bit aligned,
then all elements in the array/slice can also be accessed atomically.
It is just some subtle and verbose to make a simple clear description to include
all the 64-bit words can be relied upon to be 64-bit aligned on 32-bit architectures,
so the official documentation just makes a conservative description.
Here is an example which lists some 64-bit words which are safe or unsafe
to be accessed atomically on both 64-bit and 32-bit architectures.
type (
T1 struct {
v uint64
}
T2 struct {
_ int16
x T1
y *T1
}
T3 struct {
_ int16
x [6]int64
y *[6]int64
}
)
var a int64 // a is safe
var b T1 // b.v is safe
var c [6]int64 // c[0] is safe
var d T2 // d.x.v is unsafe
var e T3 // e.x[0] is unsafe
func f() {
var f int64 // f is safe
var g = []int64{5: 0} // g[0] is safe
var h = e.x[:] // h[0] is unsafe
// Here, d.y.v and e.y[0] are both safe,
// for *d.y and *e.y are both allocated.
d.y = new(T1)
e.y = &[6]int64{}
_, _, _ = f, g, h
}
// In fact, all elements in c, g and e.y.v are
// safe to be accessed atomically, though no
// official documentations make the guarantees.
If a 64-bit word field (generally the first one) of a struct type
will be accessed atomically in code, we should always use allocated values of the struct type
to guarantee the atomically accessed fields always can
be relied upon to be 8-byte aligned on 32-bit architectures.
When this struct type is used as the type of a field of another struct type,
we should arrange the field as the first field of the other struct type,
and always use allocated values of the other struct type.
Sometimes, if we can't make sure whether or not a 64-bit word can be accessed atomically,
we can use a value of type
[15]byte
to determine
the address for the 64-bit word at run time. For example,
package mylib
import (
"unsafe"
"sync/atomic"
)
type Counter struct {
x [15]byte // instead of "x uint64"
}
func (c *Counter) xAddr() *uint64 {
// The return must be 8-byte aligned.
return (*uint64)(unsafe.Pointer(
uintptr(unsafe.Pointer(&c.x)) + 8 -
uintptr(unsafe.Pointer(&c.x))%8))
}
By using this solution, the Counter type can be embedded in other user types freely and safely,
even on 32-bit architectures.
The drawback of this solution is there are seven bytes being wasted for every value of Counter type
and it uses unsafe pointers.
The sync standard package uses
a [3]uint32 value to do this trick instead.
This trick assumes that the alignment guarantee of the uint32 type is a multiple of 4 bytes.
The assumption is true for both the standard Go compiler and gccgo compiler.
However, it might be false for another third-party Go compiler.
The Go 101 project is hosted on both Github and Gitlab. Welcome to improve Go 101 articles by submitting corrections for all kinds of mistakes, such as typos, grammar errors, wording inaccuracies, description flaws, code bugs and broken links.
Memory Layouts - Go 101 (Golang Knowledgebase)
https://ift.tt/2I0lWZJ
About 0 Minutes
Go Practice 101
Go Links 101
Go Tools 101
Go Quizzes 101
Memory Layouts
This articles will introduce type alignment and size guarantees in Go. It is essential to know the guarantees to estimate the sizes of struct types and properly use the 64-bit functions in
sync/atomic
standard package.Go is a C family language, so many concepts talked in this article are shared with C language.
Type Alignment Guarantees in Go
Type alignment guarantees are also called value address alignment guarantees. If the alignment guarantee of a type
T
isN
, then the address of every value of typeT
must be a multiple ofN
at run time. We can also say the addresses of addressable values of typeT
are guaranteed to be N-byte aligned.In fact, each type has two alignment guarantees, one is for when it is used as field types of other (struct) types, the other is for other cases (when it is used for a variable declaration, array element type, etc). We call the former one the field alignment guarantee of that type, and call the latter one the general alignment guarantee of that type.
For a type
T
, we can callunsafe.Alignof(t)
to get its general alignment guarantee, wheret
is a non-field value of typeT
, and callunsafe.Alignof(x.t)
to get its field alignment guarantee, wherex
is a struct value andt
is a field value of typeT
.Calls to the functions in the
unsafe
standard code packages are always evaluated at compile time.At run time, for a value
t
of typeT
, we can callreflect.TypeOf(t).Align()
to get the general alignment guarantee of typeT
, and callreflect.TypeOf(t).FieldAlign()
to get the field alignment guarantee of typeT
.For the current standard Go compiler (1.12), the field alignment guarantee and the general alignment guarantee of a type are always equal. For gccgo compiler, the statement is false.
Go specification only mentions
a little on type alignment guarantees:
So Go specification doesn't specify the exact alignment guarantees for any types. It just specifies some minimal requirements.
For the same compiler, the exact type alignment guarantees may be different between different architectures and between different compiler versions. For the current version (1.12) of the standard Go compiler, the alignment guarantees are listed here.
Here, the size of a native word (or machine word) is 4-byte on 32-bit architectures and 8-byte on 64-bit architectures.
This means, for the current version of the standard Go compiler, the alignment guarantees of other types may be either
4
or8
, depends on different build target architectures. This is also true for gccgo.Generally, we don't need to care about the value address alignments in Go programming, except that we want to optimizing memory consumption, or write portable programs which using the 64-bit functions from
sync/atomic
package. Please read the following two sections for details.Type Sizes and Structure Padding
Go specification only makes following
type size guarantees:
Go specification doesn't make value size guarantees for other kinds of types. The full list of sizes of different types settled by the standard Go compiler are listed in value copy costs.
The standard Go compiler (and gccgo) will ensure the size of values of a type is a multiple of the alignment guarantee of the type.
To satisfy type alignment guarantees mentioned in the previous section, Go compilers may pad some bytes between fields of struct values. This makes the value size of a struct type may be not a simple sum of the sizes of all fields of the type.
The following is an example showing how bytes are padded between struct fields. We have already learned that
int8
are both one byte.int16
are both two bytes.int64
is 8 bytes, the alignment guarantee of typeint64
is 4 bytes on 32-bit architectures and 8 bytes on 64-bit architectures.T1
andT2
are their respective largest field alignment guarantees, a.k.a., the alignment guarantee of theint64
field. So their alignment guarantees are both 4 bytes on 32-bit architectures and 8 bytes on 64-bit architectures.T1
andT2
must be multiples of their respective alignment guarantees, a.k.a., 4N bytes on 32-bit architectures and 8N bytes on 64-bit architectures.Although
T1
andT2
have the same field set, their sizes are different.One interesting fact for the standard Go compiler is that sometimes zero sized fields may affect structure padding. Please read this question in the unofficial Go FAQ for details.
The Alignment Requirement for 64-bit Word Atomic Operations
64-bit words mean values of types whose underlying types are
int64
oruint64
.The article atomic operations mentions a fact that 64-bit atomic operations on a 64-bit word require the address of the 64-bit word must be 8-byte aligned. This is not a problem for the current 64-bit architectures supported by the standard Go compiler, because 64-bit words on these 64-bit architectures are always 8-byte aligned.
However, on 32-bit architectures, the alignment guarantee made by the standard Go compiler for 64-bit words is only 4 bytes. 64-bit atomic operations on a 64-bit word which is not 8-byte aligned will panic at runtime. Worse, on very old CPU architectures, 64-bit atomic functions are not supported.
At the end of the
sync/atomic
documentation, it mentions:
So, things are not very bad for two reasons.
The ways are described as the first (64-bit) word in a variable or in an allocated struct, array, or slice can be relied upon to be 64-bit aligned. What does the word allocated mean? We can think an allocated value as a declared variable, a value returned the built-in
make
function, or the value referenced by a value returned by the built-innew
function. If a slice value derives from an allocated array and the first element of the slice is the first element of the array, then the slice value can also be viewed as an allocated value.The description of which 64-bit words can be relied upon to be 64-bit aligned on 32-bit architectures is some conservative. There are more 64-bit words which can be relied upon to be 8-byte aligned. In fact, if the first element of an array or slice which element type is a 64-bit word type can be relied upon to be 64-bit aligned, then all elements in the array/slice can also be accessed atomically. It is just some subtle and verbose to make a simple clear description to include all the 64-bit words can be relied upon to be 64-bit aligned on 32-bit architectures, so the official documentation just makes a conservative description.
Here is an example which lists some 64-bit words which are safe or unsafe to be accessed atomically on both 64-bit and 32-bit architectures.
If a 64-bit word field (generally the first one) of a struct type will be accessed atomically in code, we should always use allocated values of the struct type to guarantee the atomically accessed fields always can be relied upon to be 8-byte aligned on 32-bit architectures. When this struct type is used as the type of a field of another struct type, we should arrange the field as the first field of the other struct type, and always use allocated values of the other struct type.
Sometimes, if we can't make sure whether or not a 64-bit word can be accessed atomically, we can use a value of type
[15]byte
to determine the address for the 64-bit word at run time. For example,
By using this solution, the
Counter
type can be embedded in other user types freely and safely, even on 32-bit architectures. The drawback of this solution is there are seven bytes being wasted for every value ofCounter
type and it uses unsafe pointers. Thesync
standard package uses a[3]uint32
value to do this trick instead. This trick assumes that the alignment guarantee of theuint32
type is a multiple of 4 bytes. The assumption is true for both the standard Go compiler and gccgo compiler. However, it might be false for another third-party Go compiler.Russ Cox has proposed that the addresses of 64-bit words should always be 8-byte aligned, on either 64-bit or 32-bit architectures, to make Go programming simpler. Currently (Go 1.12), this proposal hasn't been adopted yet.
The Go 101 project is hosted on both Github and Gitlab. Welcome to improve Go 101 articles by submitting corrections for all kinds of mistakes, such as typos, grammar errors, wording inaccuracies, description flaws, code bugs and broken links.
Support Go 101 by playing
Tapir's games.
Cryptocurrency donations are also welcome:
Bitcoin: 1xucQbv5jujFPPwhyg395ri5yV71hx9g9
Ethereum: 0x5dc4aa2c2bbfaadae373dadcfca11b3358912212
Go Practice 101
Go Links 101
Go Tools 101
Go Quizzes 101
via go101.org https://go101.org
March 26, 2019 at 11:22PM
The text was updated successfully, but these errors were encountered: