
Conversation

@MisterDA (Contributor)

The first step is to remove the dup of the log file descriptor used for log tailing, so that when/if we want to move the log file there are no open file descriptors to it. This can be done by switching to `Readonly when the log is finished.
The second step is to let the store decide the location of the log file. There's a bit of a chicken-and-egg problem here: the store was responsible for moving the log file, but the Db_store module creates it and sets its location, and the store doesn't have access to that location. By letting the store set the log file location, it can place the log either somewhere independent of the result directories, or in the original location and move it later if that causes no problem.

cc @patricoferris

- don't duplicate the log file descriptor; instead, switch to reading
  the file when the log finishes;
- replace the `Finished state with a `Readonly state.
Checking the result for the Docker backend requires asynchronous calls
to Docker.
For the currently available stores, the original location is retained.
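
Roughly, the state change looks like this (a minimal sketch; the constructor payloads are assumptions based on the diff below, not the exact Build_log definition):

```ocaml
type state = [
  | `Open of string * Lwt_unix.file_descr * unit Lwt_condition.t
    (* log path, write fd, and a condition to wake tailing readers *)
  | `Readonly of string  (* finished log: only the path is kept, so no
                            open descriptor pins the file in place *)
  | `Empty               (* no log data *)
]
```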
@talex5 (Contributor) left a comment

I'm not quite sure how this works. I would expect that simply reopening the same log path (but read-only) would have the same problem.

I would expect get_build to:

  1. Create a new Open log file.
  2. Perform the build.
  3. Close the file and change the log state to Finalising.
  4. Snapshot the result.
  5. Set the log state to Readonly with the final location, and wake any readers waiting for the end of the Finalising state.

Or possibly the Finalising state could simply be Busy. That would also work for creating the log, where we have a similar problem (currently solved messily by using set_log to resolve an Lwt promise).
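
A sketch of that sequence, with every helper name hypothetical; the point is the ordering (close the fd before the snapshot, publish the final path only afterwards):

```ocaml
open Lwt.Infix

let get_build t ~id ~build ~snapshot =
  open_log t id >>= fun log ->                (* 1. create an `Open log    *)
  build log >>= fun result ->                 (* 2. perform the build      *)
  let ready, set_ready = Lwt.wait () in
  close_log log ~next:(`Finalising ready) >>= fun () ->
                                              (* 3. close fd, `Finalising  *)
  snapshot result >>= fun final_log_path ->   (* 4. snapshot the result    *)
  set_log_state log (`Readonly final_log_path);
                                              (* 5. publish final location *)
  Lwt.wakeup set_ready ();                    (*    and wake any readers   *)
  Lwt.return result
```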

dst (Bytes.sub_string buf 0 n);
aux (i + avail)
| `Readonly path -> readonly_tail path buf i
| _ -> Lwt_result.return ()
Contributor:

Suggested change:
- | _ -> Lwt_result.return ()
+ | `Empty -> Lwt_result.return ()

- | `Open (fd, cond) ->
-   t.state <- `Finished;
+ | `Open (path, fd, cond) ->
+   t.state <- `Readonly path;
Contributor:

I think this has the same problem: a reader will open the path in the temporary clone, preventing it from being removed (unless a read-only FD is OK somehow?).

Instead, I think this needs to transition to a new Finalising of unit Lwt.t state or similar, where the promise resolves once the log is ready for reading in its new location.
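
Something like the following, as a hypothetical sketch of the reader side (readonly_tail is from the diff above; live_tail stands in for the existing `Open handling):

```ocaml
open Lwt.Infix

type state = [
  | `Open of Lwt_unix.file_descr * unit Lwt_condition.t
  | `Finalising of unit Lwt.t  (* resolves once the log is in place *)
  | `Readonly of string        (* final location, safe to reopen    *)
  | `Empty
]

let rec tail t buf i =
  match t.state with
  | `Finalising ready ->
    ready >>= fun () ->
    tail t buf i                 (* state is now `Readonly; retry     *)
  | `Readonly path -> readonly_tail path buf i
  | `Open (fd, cond) -> live_tail fd cond buf i  (* unchanged *)
  | `Empty -> Lwt_result.return ()
```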

  (** [delete t id] removes [id] from the store, if present. *)

- val result : t -> id -> string option
+ val result : t -> id -> string option Lwt.t
Contributor:

Why does this need to be async? It seems useful to have an atomic way of finding out whether something exists in the store. Otherwise, how do we know the result is still valid by the time it has returned?

@MisterDA (Author):

With the Docker backend I'm making a call to Docker to check whether the result exists as a Docker image. I use functions from Os, which call Lwt_process.exec and are asynchronous. I could call Unix.create_process instead and wait for it to terminate, to fix the TOCTOU.
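
For reference, the check is roughly of this shape (illustrative only, not the repo's actual Os wrapper): an asynchronous docker image inspect, which is what forces result to return a promise:

```ocaml
open Lwt.Infix

let docker_image_exists id =
  (* Exit status 0 from `docker image inspect` means the image exists. *)
  Lwt_process.exec ~stdout:`Dev_null ~stderr:`Dev_null
    ("docker", [| "docker"; "image"; "inspect"; id |])
  >|= function
  | Unix.WEXITED 0 -> true
  | _ -> false
```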

Contributor:

Let's move that to a separate PR then. It doesn't have anything to do with the build log problem.

val result : t -> id -> string option Lwt.t
(** [result t id] is the path of the build result for [id], if present. *)

val log_file : t -> id -> string Lwt.t
Contributor:

Why do we want a different log location for each store?

@patricoferris (Contributor) commented Nov 24, 2021

Thanks @MisterDA :))

I'm not sure if this fixes the log problem I was describing before (the moving-log-files bit is useful for patricoferris#5, however). Having the log files outside of the build directory should in theory solve the problem; I'm just not sure whether there's a reason the logs are currently saved in the build directory.

The problem I was seeing is that Db_store.get_build calls Raw.build, which in this case was Zfs. If the build fails, it then calls

Zfs.destroy t ds `Only >>= fun () ->

but as far as I can tell the log file is still open at that point, so trying to destroy the dataset fails with a "resource busy" error (only on macOS...).

@MisterDA MisterDA marked this pull request as draft November 30, 2021 14:27
@MisterDA MisterDA closed this Jan 5, 2022
@MisterDA MisterDA deleted the build-log branch January 5, 2022 14:46
@MisterDA MisterDA mentioned this pull request Jan 5, 2022