Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

runtime: leftover defers #9610

Closed
bradfitz opened this issue Jan 16, 2015 · 5 comments
Closed

runtime: leftover defers #9610

bradfitz opened this issue Jan 16, 2015 · 5 comments
Milestone

Comments

@bradfitz
Copy link
Contributor

The linux-amd64-clang builder has been complaining about leftover defers.

I have no clue why it's only the clang builder, but there are three of them recently.

runtime: g1832: leftover defer sp=0xdeaddeaddeaddead pc=0x405a9c
    defer 0xc2086a45f0 sp=0xdeaddeaddeaddead pc=0x405a9c
    defer 0xc2086c2060 sp=0xdeaddeaddeaddead pc=0x40500e
fatal error: traceback has leftover defers

In:

http://build.golang.org/log/ca786e4aaa24e7be4e66e1a0e5f81a8572dbf8fc
http://build.golang.org/log/39ff914cd33ad824bb874853592ea3a2f3fd7221
http://build.golang.org/log/aad8bdc0026b36def50c43a426212ed546d80b70

@bradfitz bradfitz added this to the Go1.5 milestone Jan 16, 2015
@bradfitz
Copy link
Contributor Author

I also see these on the Windows builders.

/cc @rsc @minux

@egonelbre
Copy link
Contributor

Got one recently as well https://gist.github.com/egonelbre/e6d0d22e73953300c9a8

go version devel +4e03bbb Sat Jan 10 00:23:04 2015 +0000 windows/amd64

@alexbrainman
Copy link
Member

I cannot reproduce it here. But I didn't tried hard enough.

Alex

@mattyw
Copy link
Contributor

mattyw commented Feb 3, 2015

I just had this error as well: https://gist.github.com/mattyw/31e701977f213ac07279
I've taken off parts of the traceback to hide some proprietary stuff - if you need more let me know and I'll see what I can do.

go version devel +1757b5c Thu Dec 11 07:31:32 2014 +0000 linux/amd64

The process had been running for >3 days when this occurred

RLH added a commit that referenced this issue Mar 17, 2015
The GC assumes that there will be no asynchronous write barriers when
the world is stopped. This keeps the synchronization between write
barriers and the GC simple. However, currently, there are a few places
in runtime code where this assumption does not hold.
The GC stops the world by collecting all Ps, which stops all user Go
code, but small parts of the runtime can run without a P. For example,
the code that releases a P must still deschedule its G onto a runnable
queue before stopping. Similarly, when a G returns from a long-running
syscall, it must run code to reacquire a P.
Currently, this code can contain write barriers. This can lead to the
GC collecting reachable objects if something like the following
sequence of events happens:
1. GC stops the world by collecting all Ps.
2. G #1 returns from a syscall (for example), tries to install a
pointer to object X, and calls greyobject on X.
3. greyobject on G #1 marks X, but does not yet add it to a write
buffer. At this point, X is effectively black, not grey, even though
it may point to white objects.
4. GC reaches X through some other path and calls greyobject on X, but
greyobject does nothing because X is already marked.
5. GC completes.
6. greyobject on G #1 adds X to a work buffer, but it's too late.
7. Objects that were reachable only through X are incorrectly collected.
To fix this, we check the invariant that no asynchronous write
barriers happen when the world is stopped by checking that write
barriers always have a P, and modify all currently known sources of
these writes to disable the write barrier. In all modified cases this
is safe because the object in question will always be reachable via
some other path.

Some of the trace code was turned off, in particular the
code that traces returning from a syscall. The GC assumes
that as far as the heap is concerned the thread is stopped
when it is in a syscall. Upon returning the trace code
must not do any heap writes for the same reasons discussed
above.

Fixes #10098
Fixes #9953
Fixes #9951
Fixes #9884

May relate to #9610 #9771

Change-Id: Ic2e70b7caffa053e56156838eb8d89503e3c0c8a
Reviewed-on: https://go-review.googlesource.com/7504
Reviewed-by: Austin Clements <[email protected]>
@randall77
Copy link
Contributor

I haven't seen one of these in a while. Please reopen if you're still seeing it.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

6 participants