WIP: RFC: Create type SecureString #24738

omus · 2017-11-24T02:09:51Z

Part of the problem in #24731 is that when working with structures which try to securely wipe themselves when finalized you end up wiping the underlying data once the first struct has been finalized. In an ideal world we would only securely wipe the data once the last reference has been removed.

SecureString is a new type which allows other structures to reference secure data which will only be securely wiped once the SecureString is no longer referenced or explicitly wiped. The new string type also allows me to stop using deepcopy to work around unwanted securezero! calls which avoids having to duplicating sensitive information.

omus · 2017-11-24T14:45:29Z

Note that this PR also addresses the same issue that was addressed in #24731. The solution proposed here is definitely more developer friendly and contains less gotchas when developing new code around secure strings.

stevengj · 2017-11-24T15:10:51Z

base/strings/types.jl

+
+    function SecureString(str::AbstractString)
+        s = new(str)
+        finalizer(securezero!, s)


I don't understand a design based on finalizers, since finalizers may take a long time to call. Don't you need to zero the data as soon as it falls out of scope?

Best practise would be to securely zero the data as soon it is no longer in use. The finalizer is used mostly as a fail-safe to ensure that at the very least the data is zeroed when the the instance is garbage collected.

I'll make sure to mention this in the docstring for SecureString

stevengj · 2017-11-24T15:11:05Z

base/strings/types.jl

+```
+"""
+mutable struct SecureString <: AbstractString
+    string::String


Vector{UInt8}?

Yes, I need to switch this to use non-immutable memory.

One nice thing I have noticed about using String is that when doing SecureString("password") the string literal memory will also be wiped out when running securezero!(::SecureString).

The above is also true when using Vector{UInt8} internally as vector can access the memory of the original String.

The above is also true when using Vector{UInt8} internally as vector can access the memory of the original String

Yes, but (1) you're not supposed to use such a vector to mutate a string, (2) we plan to fix this to least make it much harder to mutate a string via a vector.

Wiping string literals is definitely not a good idea. It basically breaks your program. If secure data is put in a string literal there's nothing we can do about it.

omus · 2017-11-24T17:41:02Z

I don't understand a design based on finalizers, since finalizers may take a long time to call. Don't you need to zero the data as soon as it falls out of scope?

After giving this some more thought I think there are two approaches we can take when dealing with secure data:

Explicitly copy and wipe data: whenever we allocate secure memory we explicitly wipe the data soon as would fall out of scope. If multiple data structures need to reference the same secure data they would each need to have duplicate copies of the secure data to ensure there copy is not wiped out accidentally.
Rely on finalizers: secure memory is only wiped when there are no references to the secure memory and the finalizer is called. If multiple data structures need to reference the same secure data they can all reference the same secure memory without duplication.

The main issue with explicitly copying and wiping data is that it is error prone it is easy to forget to wipe the data. Additionally making duplicate copies of secure data seems like a bad solution as it increases the chances for exposure.

Disadvantages of relying on finalizers is the finalizers may take a long time to call. This could be mitigated with some kind of reference counting approach.

omus · 2017-11-24T17:42:11Z

My end goal here is to find an approach where we can have secure string data without having it be harder to use than any other String type.

stevengj · 2017-11-24T18:27:26Z

I just think that if you're worried about secure data being in memory for too long, then a finalizer-based approach is inherently insufficient. See also #11207.

omus · 2017-12-06T04:18:36Z

I did some experimentation with trying to implement reference count and some nice results with using convert(::Type{SecureString}, s::SecureString) = ... but unfortunately without a way to count dereferences this methodology is dead without lower level support.

omus · 2017-12-06T04:32:17Z

The current implementation is not dissimilar to just using String with securezero! but the new type introduces three advantages:

Container types that use SecureString do not need to implement their own finalizer to ensure that sensitive data is wiped. Additionally, keeping the shredding finalizer in one place means that we don't have to worry about garbage collection accidentally wiping data early (Fix LibGit2 securezero! issue #24731).
Internally the zeroed memory is overwriting data in a mutable Vector{UInt8} container instead of overwriting immutable string data. (Since passed in Strings are still currently zeroed this point isn't very strong)
Using the SecureString type in another container clearly indicates that the container will hold secure information and you may need to use deepcopy to maintain your own copy.

I've also introduced a new exported function shred! (named after the command line program) for SecureString rather than use securezero! as the new function name is clear and state any implementation details in the function name.

stevengj · 2017-12-06T04:41:53Z

Again, I'm skeptical that anyone who needs this functionality should be relying on finalizers.

omus · 2017-12-06T05:05:04Z

Again, I'm skeptical that anyone who needs this functionality should be relying on finalizers.

The finalizer is just used as a failsafe. Best practise is to use shred! when the sensitive data is no longer required.

StefanKarpinski · 2017-12-06T14:54:29Z

The name SecureString is a bit vague – it makes one think that it's extra careful to prevent buffer overflows or something. How about calling this SecretString since it's for holding secrets?

omus · 2017-12-06T15:03:24Z

The name SecureString is used in C#. I'm okay with SecretString but I think ConfidentialString might be a better name if we're moving in that direction

StefanKarpinski · 2017-12-06T15:11:59Z

I can't tell from a quick perusal if this is the case, but SecretString/SecureString probably shouldn't have any convert methods to other string types so that the chance of accidentally copying the data somewhere else is minimized.

I do like the approach here: having a separate type allows us to limit the behaviors that one can do with this type – such as conversion and copying. Just using String and relying on the user not to copy and remember to securezero! a secret can't give nearly as strong guarantees. With a type you can be sure that the only things that are done with a string are creating it, passing it around in places that are designed to hold secrets and treated accordingly, and that ultimately it is cleaned up – ideally explicitly, but one way or the other.

To that end, I wonder if this shouldn't be a completely opaque data type instead of a subtype of AbstractString – after all any usage of a secret in a place that's not explicitly designed to handle secret data carefully is potentially a leak. So the type would be something like this:

struct Secret
    data::Vector{UInt8}
end

It would have no methods except constructors and shred!. If you want the data, you have to reach in and grab it explicitly. Methods for safely getting secret data like prompting the user and saving the result straight into a Secret object or reading the contents of a file straight into one could be provided as well.

omus · 2017-12-06T15:40:45Z

It would have no methods except constructors and shred!

Overall I like this idea. I think we may want a few additional methods like write(io::IO, ::Secret). I'll experiment with this later and see how it works out in practise.

StefanKarpinski · 2017-12-06T16:05:25Z

I think that doing write(io, secret.data) explicitly might be better – that way it's explicit everywhere code might be using / exposing secret data.

yurivish · 2017-12-07T05:44:15Z

Given the importance of properly handling confidential data, another idea might be to require the user to explicitly shred the secret when they're done with it, and to complain if one is ever left un-shredded.

From this perspective, cleaning up the memory "at some future point" with a finalizer feels like a convenience that could mask a bug – better to throw an error instead.

(This also feels like a place where Rust-like ownership or Rc semantics would help clarify who owns the data and/or to deterministically ensure that it gets dropped. Maybe in Julia 3.0!)

StefanKarpinski · 2017-12-07T16:13:40Z

I like that idea but I'm not sure if we can actually raise an error in a finalizer. Which task gets the error in that case? I also vaguely recall that printing in a finalizer is a problem.

nalimilan · 2017-12-07T16:52:13Z

Would it make sense/be possible to use mlock to ensure the secret data is never swapped to disk?

stevengj · 2017-12-07T20:43:27Z

Currently, exceptions caught during finalization print something to stderr rather than throwing "normally" to the caller.

StefanKarpinski · 2017-12-07T20:56:45Z

Printing a warning seems like it might be good. Of course, if only the GC has a reference to a secret then it's pretty safe – after all, malicious code doesn't have a reference to it either. Sure, proactively "shredding" the data is better, but there's nothing that wrong with GC shredding it either.

omus · 2017-12-11T19:07:28Z

My current approach is to print a warning if the data has not been shredded and then proceed to shred the data.

StefanKarpinski · 2017-12-11T19:26:40Z

That sounds like the best option.

StefanKarpinski · 2017-12-11T19:26:55Z

Sorry, didn't mean to close and reopen.

omus · 2017-12-13T16:52:54Z

I've done a bunch of work on this but I'm not sure I can have it ready for the feature freeze. The good news is that while implementing this I've uncovered and fixed some issues I'll make some PRs for yet.

omus · 2017-12-13T16:53:25Z

How would people feel if at the very least I try to get the rename of securezero! to shred! in before the freeze?

StefanKarpinski · 2017-12-13T17:04:58Z

This is not a public API, right? If it's not public, you can always change it after the feature freeze.

Keno · 2018-05-30T23:54:49Z

Bump on this, it would be nice to stop all the CI failures due to the runtime incorrectly overwriting immutable memory.

omus · 2018-05-31T05:46:46Z

Did most of the rebase. Will hopefully push something for tomorrow morning.

Originally used `isequal` to deal with `Nullable`

omus · 2018-05-31T18:00:26Z

Finished the rebase. SecureString itself needs a bunch of work and currently doesn't have the limited method set as suggested by @StefanKarpinski.

I've tested the LibGit2 usage of SecureString which is currently working but some additional effort will be needed to properly secure the SecureString implementation. I'll try to do this work yet but my time is limited so if others want to help out I'm all for it.

staticfloat · 2018-06-04T19:56:20Z

base/strings/secure.jl

+    return s
+end
+
+isshredded(s::SecureString) = sum(s.data) == 0


I think we want all(s.data .== 0); as there's a (minute) possibility that this can overflow to precisely zero.

Or rather all(iszero, s.data).

staticfloat · 2018-06-04T20:04:07Z

I like the direction of this PR, but I think we basically want to get rid of lines 38-52 in strings/secure.jl.

With regards to the "right API" for dealing with these objects, I don't think having ss.data be the right way to get data out of a SecureString object is what we want; I think it would be better to have a read(::SecureString) function that returns a String instead, (named so as to mirror the write(::SecureString, ::String) method we use to put data within it. In either case, what we don't want is for users to be able to just generically treat a SecureString as if it were a String; we want the ergonomics to explicitly require the user to deal with the fact that it's a SecureString, so as long as the functions defined don't overlap with String methods, we should be good to go.

mbauman · 2018-06-04T22:53:46Z

I'm in agreement with the calls for read and write APIs. In fact, I think this would be much easier to use safely as an IO object — perhaps SecureBuffer a la IOBuffer? I've already started work on transitioning the APIs; I'll have a full report tomorrow about what this could look like.

omus · 2018-06-20T14:54:04Z

Succeeded by: #27565

omus added security System security concerns and vulnerabilities strings "Strings!" labels Nov 24, 2017

This was referenced Nov 24, 2017

libgit2 credentials objects are dangerous #23232

Open

Fix LibGit2 securezero! issue #24731

Merged

nalimilan mentioned this pull request Nov 24, 2017

Replace Nullable{T} with Union{Some{T}, Void} #23642

Merged

stevengj reviewed Nov 24, 2017

View reviewed changes

omus changed the title ~~Create type SecureString~~ WIP: Create type SecureString Nov 24, 2017

omus force-pushed the cv/securestring branch from 75faf97 to e8fa186 Compare December 6, 2017 04:15

omus changed the title ~~WIP: Create type SecureString~~ RFC: Create type SecureString Dec 6, 2017

omus force-pushed the cv/securestring branch from e8fa186 to eb79d97 Compare December 6, 2017 05:09

StefanKarpinski closed this Dec 11, 2017

StefanKarpinski reopened this Dec 11, 2017

nalimilan mentioned this pull request May 26, 2018

Error in LibGit2 tests on FreeBSD #27109

Closed

omus added 2 commits May 31, 2018 12:55

Create type SecureString

02bff65

Switch GitCredential equality to use ==

3818ace

Originally used `isequal` to deal with `Nullable`

omus changed the title ~~RFC: Create type SecureString~~ WIP: RFC: Create type SecureString May 31, 2018

omus force-pushed the cv/securestring branch from eb79d97 to 3818ace Compare May 31, 2018 17:57

Keno added this to the 0.7 milestone Jun 1, 2018

staticfloat reviewed Jun 4, 2018

View reviewed changes

Keno assigned mbauman Jun 5, 2018

staticfloat mentioned this pull request Jun 12, 2018

Upgrade libgit2 to v0.27.2 #27525

Merged

mbauman mentioned this pull request Jun 13, 2018

RFC: Create SecretBuffer and use it to help keep LibGit2's secrets #27565

Merged

omus closed this Jun 20, 2018

omus deleted the cv/securestring branch June 20, 2018 14:54

WIP: RFC: Create type SecureString #24738

WIP: RFC: Create type SecureString #24738

Conversation

omus commented Nov 24, 2017

omus commented Nov 24, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

omus commented Nov 24, 2017

omus commented Nov 24, 2017 • edited Loading

stevengj commented Nov 24, 2017

omus commented Dec 6, 2017

omus commented Dec 6, 2017 • edited Loading

stevengj commented Dec 6, 2017

omus commented Dec 6, 2017

StefanKarpinski commented Dec 6, 2017

omus commented Dec 6, 2017

StefanKarpinski commented Dec 6, 2017 • edited Loading

omus commented Dec 6, 2017

StefanKarpinski commented Dec 6, 2017

yurivish commented Dec 7, 2017 • edited Loading

StefanKarpinski commented Dec 7, 2017

nalimilan commented Dec 7, 2017

stevengj commented Dec 7, 2017

StefanKarpinski commented Dec 7, 2017

omus commented Dec 11, 2017

StefanKarpinski commented Dec 11, 2017

StefanKarpinski commented Dec 11, 2017

omus commented Dec 13, 2017

omus commented Dec 13, 2017

StefanKarpinski commented Dec 13, 2017

Keno commented May 30, 2018

omus commented May 31, 2018

omus commented May 31, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

staticfloat commented Jun 4, 2018

mbauman commented Jun 4, 2018 • edited Loading

omus commented Jun 20, 2018

omus commented Nov 24, 2017 •

edited

Loading

omus commented Dec 6, 2017 •

edited

Loading

StefanKarpinski commented Dec 6, 2017 •

edited

Loading

yurivish commented Dec 7, 2017 •

edited

Loading

mbauman commented Jun 4, 2018 •

edited

Loading