Add [Foreign.dynamic_funptr] #595

tiash · 2019-05-08T11:53:57Z

[Foreign.dynamic_funptr] is a safer alternative for [Foreign.funptr], in
particular this new version requires explicit life cycle management making it
easier to avoid difficult to debug segmentation faults where the ocaml
closure is released before it is used from C.

We discovered one of these in Async_ssl which motivated us to propose this addition.

[Foreign.dynamic_funptr] is a safer alternative for [Foreign.funptr], in particular this version uses explicit life cycle management making it easier to avoid dificult to debug segmentation faults where the ocaml closure is released before it is used from C.

yallop · 2019-05-08T11:58:16Z

Thanks for the proposal!
There might be a delay before I can review it in detail, but I think this approach is a good direction to go overall.

fdopen · 2019-05-13T16:25:18Z

The OCaml type systems only enforces the types on the OCaml side, not the corresponding types at the C side, that are also part of the contract. It's not a problem with the current interface, because you define the function and its callbacks together.

With your new interface there can be a mismatch that is not longer captured by the type system, e.g with:

let foo = foreign "foo"  (int @-> dynamic_funptr (int @-> int @-> returning int) @-> returning void)

I could allocate the corresponding dnymaic_funptr for foo in multiple ways that are incompatible with the definition above:

let dynptr1 = dynamic_funptr_of_fun (int8_t @-> int8_t @-> returning int8_t) (+)
let dynptr2 = dynamic_funptr_of_fun (camlint @-> camlint @-> returning camlint) (+)
let no_int_at_all = Ctypes.view ~read:(fun _ -> 1) ~write:(fun _ -> "aaa") Ctypes.string
let dynptr3 = dynamic_funptr_of_fun (no_int_at_all @-> no_int_at_all @-> returning no_int_at_all) (+)

tiash · 2019-05-13T17:04:21Z

Tanks for flagging this!

We have a couple of ideas to fix this, I'll follow up once we have this fleshed out a bit more.

tiash · 2019-05-14T13:44:05Z

I have just pushed what I think should fix this nicely, here's a slightly contrived example of how to use this

module Progress_callback = Foreign.Make_funptr(val (
  Foreign.dynamic_funptr (int @-> int @-> ptr void @-> returning void))

let keygen =
  foreign "RSA_generate_key" (int @-> int @-> Progress_callback.t @-> ptr void @-> returning rsa_key)

let secret_key =
  Progress_callback.with_fun
    (fun a b _ -> printf "progress: a:%d, b:%d\n" a b) 
    (fun progress -> keygen 2048 65537 progress null)

The Foreign.Make_funptr functor is necessary to ensure that the types remain non-generative so they are safe to use in cstubs.

the (val (Foreign.dynamic_funptr ...)) is a helper fill in defaults for the calling convention and to avoid having to repeat the types (the 4.07 compiler infers everything in the example)

In particular this renames `module type Dynamic_funptr` to `module type Funptr` and drops a redundant debug arg.

andrewray · 2019-07-27T01:41:41Z

FWIW I have been using this new API to bind to C with maybe 50 callbacks. I basically allow them to leak because it is test code - but otherwise, the project would not have been possible because of the crashes that the GC would (and did) cause.

I am not sure I like the way the new functions integrate with the old ones - some distinction between safe and "raw" would be nice I think.

yallop · 2019-09-06T12:12:48Z

Thanks again for this PR, and apologies for my long delay in responding. As I said above, I think adding explicit allocation/deallocation for function pointers is a good direction. Looking at the code, I see several other things I like: using roots, catching double frees, and catching memory leaks by attaching a finaliser.

One thing I'm not so convinced about is the proposed interface, so I'm going to suggest an alternative that seems simpler to me. If I'm not mistaken, we can do everything we need with a pair of functions that construct and deallocate static_funptr values:

val make_funptr : ('a -> 'b) static_funptr typ -> ('a -> 'b) -> ('a -> 'b) static_funptr
val release_funptr : ('a -> 'b) static_funptr -> unit

For example, your RSA_generate_key example could be written using this interface as follows:

let progress_callback_t = static_funptr (int @-> int @-> ptr void @-> returning void)
let keygen = foreign "RSA_generate_key"
       (int @-> int @-> progress_callback_t @-> ptr void @-> returning rsa_key)
let secret_key = 
  let progress = make_funptr progress_callback_t
                   (fun a b _ -> printf "progress: a:%d, b:%d\n" a b) in
  let key = keygen 2048 65537 progress null in
  let () = release_funptr progress in
  key

(If there's something that you think can't be expressed with this interface, please do say!)

Then the semantics follow what you have already: the valid pattern usage pattern is

p = make_funptr ...
use p: pass it to C, write it to memory, convert it to an OCaml function and call it.
release_funptr p

Everything else is invalid, including

using p after the call to release_funptr
calling release_funptr twice
not calling release_funptr at all

Regarding the implementation, I was going to suggest extending static_ptr to store the root. But static_ptr already has a place to store associated OCaml objects, namely the object passed as the optional managed argument to Ctypes_ptr.make. I think storing the root there could work well, and avoid the need to change the type.

I hope to merge this change when everything is addressed, so I'm going to mention a couple more things now. First, the change needs some tests --- ideally, tests that exercise the various failure cases (use-after-free, double-free, leak, etc.). Second, the documentation is currently not to my taste: it's too implementation-specific (e.g. talking about segmentation faults), too opinionated and sometimes too vague. I'm happy to make some more concrete suggestions for documentation once we have consensus on the interface.

tiash · 2019-09-10T21:33:50Z

Thanks for the feedback!

I originally attempted to do something similar to what you're suggesting but this had soundness issues that @fdopen pointed out.

We could get something similar to your proposal by changing the interface to use a phantom type (similar to [Core.Map]), but I think we still need to retain some type witness to ensure that the ctype of the funptr agrees with the argument its being used in.

Something like the following maybe?

type ('ocaml_type, 'witness) safe_funptr
val make_funptr : ('ocaml_type, 'witness) safe_funptr typ -> 'ocaml_type -> ('ocaml_type,'witness) safe_funptr
val release_funptr : ('ocaml_type,'witness) safe_funptr -> unit

module Progress_callback = Foreign.Make_funptr(val (
  Foreign.safe_funptr (int @-> int @-> ptr void @-> returning void))

let keygen =
  foreign "RSA_generate_key" (int @-> int @-> Progress_callback.t @-> ptr void @-> returning rsa_key)

let secret_key = 
  let progress = make_funptr progress_callback_t
                   (fun a b _ -> printf "progress: a:%d, b:%d\n" a b) in
  let key = keygen 2048 65537 progress null in
  let () = release_funptr progress in
  key

I don't think its possible to avoid the use of a functor to get a unique (witness) type. The nesting of first class module and functor while a bit obscure is a trick to get the compiler to infer all the necessary types from the ctype without having to spell out everything.

It is also possible to create something more like to old interface where the FFI closure is dynamically allocated, but with a more explicit free-step. This would allow an interface very similar to your proposal and also address the safety issues when dealing with closures from ocaml, however there remains some non-obvious issues when dealing with funptrs returned from C.

Please let me know which direction you think is a better fit for ctypes?

Re documentation & testing - point taken

yminsky · 2019-10-24T12:06:17Z

Anyone know what this is stuck on? This is current blocking our next stable release, all it would be lovely to get it unstuck...

yallop · 2019-10-25T16:48:16Z

Thanks for the prompt, and apologies (again!) for the slow response. I'd also like to have this resolved soon, and will follow up properly later today.

yallop · 2019-10-25T23:52:08Z

Okay, I've read through the commit history and I see @fdopen's point about type safety and your neat functor-based fix for the problem he points out.

I'm happy to have the extra type safety of the functor approach, but I'd like to keep the interface as simple as possible. Do you think the following interface (which removes Funptr_spec, Make_funptr and funptr_spec and adds a single function static_funptr) is sufficient?

module type Funptr = sig (* what you have already *) end

val static_funptr :  ?abi:Libffi_abi.abi -> ?runtime_lock:bool -> ?thread_registration:bool -> ('a -> 'b) Ctypes.fn ->
   (module Funptr with type fn = 'a -> 'b)

For example, with this interface your RSA_generate_key code might be written as follows:

module Progress_callback = (val static_funptr (int @-> int @-> ptr void @-> returning void))

let keygen =
  foreign "RSA_generate_key" (int @-> int @-> Progress_callback.t @-> ptr void @-> returning rsa_key)

let secret_key =
  Progress_callback.with_fun
    (fun a b _ -> Printf.printf "progress: a:%d, b:%d\n" a b) 
    (fun progress -> keygen 2048 65537 progress null)

tiash · 2019-10-28T14:35:56Z

Unfortunately Make_funpstr(val funptr_spec ...) is necessary due to type system restrictions when using the applicative functor the cstubs generator requires.
static_funptr introduces a fresh type (not allowed in applicative functors), while Make_funptr(val funptr_spec ...) exposes to the compiler that the type isn't fresh but depends on the ctype signature (and other parameters) so that its allowed by the compiler.

I have no objections to adding static_funptr, but I'm not sure if having two ways to create a funptr type would make this more confusing?

yallop · 2019-10-28T19:17:59Z

Unfortunately Make_funpstr(val funptr_spec ...) is necessary due to type system restrictions when using the applicative functor the cstubs generator requires.

Could you expand on this a bit? At the moment, I'm interested in exploring the space of solutions, rather than fixing on a particular approach. Perhaps a concrete example would help, if you have one in mind.

tiash · 2019-10-29T10:00:37Z

Here's the specific use case where static_funptrcauses compiler errors:

module Bindings (F : Cstubs.FOREIGN) = struct
  (* ... *)
  module Progress_callback = Foreign.Make_funptr(
    (val Foreign.funptr_spec Ctypes_static.(int @-> int @-> ptr void @-> returning void)))
  (* ... *)
  let generate_parameters = foreign "DH_generate_parameters"
      Ctypes.(int @-> int @-> Progress_callback.t_opt @-> ptr void @-> returning Dh.t)
  (* ... *)
end

(https://github.com/janestreet/async_ssl/blob/0e8c5561c1b82d0f5cd29092eb1e301a09416f24/bindings/ffi_bindings.ml#L313)

This Bindings functor is then used with the cstubs generator to generate supporting C + Ocaml (https://github.com/janestreet/async_ssl/blob/0e8c5561c1b82d0f5cd29092eb1e301a09416f24/stubgen/ffi_stubgen.ml#L24) and instantiated using the generated types for use in Async_ssl (https://github.com/janestreet/async_ssl/blob/0e8c5561c1b82d0f5cd29092eb1e301a09416f24/src/import.ml#L2)

The workarounds I can see are

Define module Progress_callback outside of the the Bindings functor (this splits up the definition and use unnecessarily)
Modify the code generator to work using generative functors (breaks existing uses of the cstubs generator)
Use the proposed Make_funptr((val funptr_spec ...)) boilerplate to help the compiler

yallop · 2019-10-29T11:28:13Z

Thanks for the example.

It feels that there's something not quite right here, somehow. I don't see how we can have something that simultaneously:

introduces new types (to avoid the problem @fdopen describes)
can be used in the body of an applicative functor

Doesn't your proposed interface still have @fdopen's type safety problem? Suppose we have two Funptr_spec modules returned from funptr_spec that share OCaml types but represent different C types:

module Spec1 = (val Foreign.funptr_spec Ctypes_static.(int @-> int @-> ptr void @-> returning void))
module Spec2 = (val Foreign.funptr_spec Ctypes_static.(int8_t @-> int8_t @-> ptr void @-> returning void))

Then we might define a functor which builds another Funptr_spec, selecting the fn member at functor application time according to the value of a mutable flag:

let flag = ref false
module F(S: sig end) = struct include Spec1 let fn = if !flag then Spec1.fn else Spec2.fn end

Now calling F twice builds modules with compatible OCaml types but (if the flag is changed between the calls) representing incompatible C types:

module U = struct end
let () = flag := false
module Progress_callback1 = Foreign.Make_funptr(F(U))
let () = flag := true
module Progress_callback2 = Foreign.Make_funptr(F(U))

and so we can build a function f that is specified to take a value of one of the types but that also accepts a value of the other type:

let f = Foreign.foreign "foo" Ctypes.(Progress_callback1.t @-> returning void)
f (Progress_callback2.of_fun (fun _ _ _ -> ()))

The workarounds I can see are

Define module Progress_callback outside of the the Bindings functor (this splits up the definition and use unnecessarily)

This seems like a fairly reasonable approach. Is moving the type definition away outside of the functor really such a problem? It doesn't seem to depend on the functor argument, so in some respects moving it outside the functor makes the code structure clearer.

Modify the code generator to work using generative functors (breaks existing uses of the cstubs generator)

Yes, I'd also like to avoid this. (Perhaps one day OCaml will have generativity polymorphism and we can update the cstubs generator to support generative functors in a backwards-compatible way.)

… `(val dynamic_funptr ..)`.

tiash · 2019-10-30T13:43:55Z

Thanks for the feedback, I agree that writing a custom Funptr_spec does allow subverting type safety.

Making the changes to split out the type definition has proven to be less messy then I expected so lets go with the simpler API, I have pushed the changes.

I think dynamic_funptr might be a better name for the function to avoid confusion with the existing static_funptr type in ctypes (I did this also in this patch).

yallop · 2019-10-31T09:40:50Z

It's good to see that we've reached consensus on the interface. I'll aim to provide a proper code review in the next few days.

tiash · 2019-11-14T10:17:53Z

@yallop ping

yallop

Please also add some tests, including

using Funptr.t values with the dynamic (libffi) interface
using Funptr.t values with the static (cstubs) interface
using Funptr.t values in a struct or array
lifetimes that extend beyond a single call, e.g.:
(a) allocate a function pointer that toggles a bool ref in OCaml
(b) pass the function pointer to a C function, which stores it and returns
(c) call Gc.compact (perhaps twice)
(d) check from OCaml that the OCaml closure is still alive (e.g. using another flag set in the finalizer)
(e) call a second C function which invokes the stored function pointer
(f) check that the call took place by examining the bool ref
auxiliary function such as with_fun, t_opt, etc.

src/ctypes-foreign-base/ctypes_ffi.ml

src/ctypes-foreign-threaded/foreign.mli

tiash

Thanks for all the feedback!
I've updated the API and documentation as suggested.
I'm still working on the tests

tiash · 2019-11-29T12:39:14Z

I've added unit tests that should give basic coverage of everything.

* use 'fun (type a) (type b) ...' instead of 'fun (type a b)' * use Gc.finalise instead of Gc.finalise_last

yallop · 2019-12-04T15:21:22Z

Thanks, @tiash. I had a few final tweaks related to style/layout and compatibility with OCaml 4.02. I've added them as a pull request against this PR: tiash#1.

Thank you very much!

yallop · 2019-12-06T22:44:07Z

I've pushed one more commit (760f638) to prevent flambda from turning the closure (+)1 into a top-level function.

The 32-bit Windows build is still failing, but that seems to be unrelated to this PR, so I've opened a separate issue in the mingw opam-repository.

yallop · 2019-12-07T00:57:10Z

Thanks again for all your work on this very useful addition, @tiash. (And thanks, too, to @fdopen, @andrewray and @avsm for comments and review.)

This is now merged in master (285f119, a41dd9d), and included in the 0.16.0 release, which should be available via OPAM shortly (ocaml/opam-repository#15473).

tiash · 2019-12-07T14:32:11Z

Many thanks @yallop for taking the time to review this and for all the feedback you provided to improve the pull request!

tiash added 4 commits May 8, 2019 12:18

minimize diff

27499c3

more diff minification

05564b1

more diff minification

b14ca31

Ensure type safety

d3d4524

tiash added 3 commits May 15, 2019 18:10

Some minor refactoring based on internal feedback.

310dc16

In particular this renames `module type Dynamic_funptr` to `module type Funptr` and drops a redundant debug arg.

correct/clarify a couple of comments

41b3525

Fix syntax to compile under older ocaml versions

6508a62

replace Make_funptr((val funptr_spec ...)) with the simpler & safer…

8ed53e0

… `(val dynamic_funptr ..)`.

yallop requested changes Nov 21, 2019

View reviewed changes

tiash added 3 commits November 26, 2019 17:43

address the documentation and api feedback

c6af9ac

fix typo

8969015

more fixes

318782c

One more fix

aa41913

tiash commented Nov 26, 2019

View reviewed changes

tiash and others added 3 commits November 26, 2019 18:13

Oops - changed in error.

ebe593a

Add unit tests!

88343ad

Merge minor fixes

23802e6

tiash requested a review from yallop November 29, 2019 12:39

yallop added 2 commits December 4, 2019 15:17

Documentation and layout tweaks.

7373ea1

Changes for OCaml 4.02 compatibility:

a928781

* use 'fun (type a) (type b) ...' instead of 'fun (type a b)' * use Gc.finalise instead of Gc.finalise_last

tiash and others added 2 commits December 6, 2019 19:12

Merge pull request #1 from yallop/add-dynamic-funptr-type-review

d0cd550

Thank you very much!

Tests: prevent flambda eliminating closures.

760f638

yallop force-pushed the add-dynamic-funptr-type branch from af2b1d1 to 760f638 Compare December 6, 2019 20:18

yallop approved these changes Dec 6, 2019

View reviewed changes

yallop closed this Dec 7, 2019

yallop mentioned this pull request Dec 7, 2019

Reflect "managed" status in the types of fat pointers #619

Merged

hanw mentioned this pull request Dec 26, 2019

v0.13.0 is not on opam janestreet/async_ssl#30

Closed

yallop mentioned this pull request Jul 11, 2020

Remove the threaded/unthreaded split in ctypes-foreign. #651

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add [Foreign.dynamic_funptr] #595

Add [Foreign.dynamic_funptr] #595

tiash commented May 8, 2019

yallop commented May 8, 2019

fdopen commented May 13, 2019

tiash commented May 13, 2019

tiash commented May 14, 2019

andrewray commented Jul 27, 2019

yallop commented Sep 6, 2019 •

edited

Loading

tiash commented Sep 10, 2019 •

edited

Loading

yminsky commented Oct 24, 2019

yallop commented Oct 25, 2019

yallop commented Oct 25, 2019

tiash commented Oct 28, 2019

yallop commented Oct 28, 2019

tiash commented Oct 29, 2019

yallop commented Oct 29, 2019

tiash commented Oct 30, 2019 •

edited

Loading

yallop commented Oct 31, 2019

tiash commented Nov 14, 2019

yallop left a comment

tiash left a comment

tiash commented Nov 29, 2019

yallop commented Dec 4, 2019

yallop commented Dec 6, 2019

yallop commented Dec 7, 2019 •

edited

Loading

tiash commented Dec 7, 2019

Add [Foreign.dynamic_funptr] #595

Add [Foreign.dynamic_funptr] #595

Conversation

tiash commented May 8, 2019

yallop commented May 8, 2019

fdopen commented May 13, 2019

tiash commented May 13, 2019

tiash commented May 14, 2019

andrewray commented Jul 27, 2019

yallop commented Sep 6, 2019 • edited Loading

tiash commented Sep 10, 2019 • edited Loading

yminsky commented Oct 24, 2019

yallop commented Oct 25, 2019

yallop commented Oct 25, 2019

tiash commented Oct 28, 2019

yallop commented Oct 28, 2019

tiash commented Oct 29, 2019

yallop commented Oct 29, 2019

tiash commented Oct 30, 2019 • edited Loading

yallop commented Oct 31, 2019

tiash commented Nov 14, 2019

yallop left a comment

Choose a reason for hiding this comment

tiash left a comment

Choose a reason for hiding this comment

tiash commented Nov 29, 2019

yallop commented Dec 4, 2019

yallop commented Dec 6, 2019

yallop commented Dec 7, 2019 • edited Loading

tiash commented Dec 7, 2019

yallop commented Sep 6, 2019 •

edited

Loading

tiash commented Sep 10, 2019 •

edited

Loading

tiash commented Oct 30, 2019 •

edited

Loading

yallop commented Dec 7, 2019 •

edited

Loading