Make packages/io async #7

jclem · 2019-05-21T22:49:22Z

This pull request rewrites a bunch of logic in packages/io so that it's fully async for IO operations.

In the current iteration of packages/io, functions are returning promises, but are still internally calling the sync versions of fs modules.

@damccorm Let me know what you think. Another thing I wanted to point out was something w/async functions—when a function is declared to be async, it implicitly returns a promise—there's no need to create and then manually return one, rejecting and resolving, etc. Throwing inside of an async function will cause it to return a rejected promise (which will throw if that call is awaited).

- `CopyOptions` on `cp` now defaults to empty - Setting option values is easier now

damccorm · 2019-05-22T02:37:54Z

In the current iteration of packages/io, functions are returning promises, but are still internally calling the sync versions of fs modules

Its unclear to me why this is a bad thing. It looks to me like you're basically replacing each fs.fooSync with await fs.promises.foo - doesn't this functionally do the same thing (aka block until the operation is complete)? When we await async functions don't they basically just become synchronous? Maybe I'm missing something here but I don't see the advantage

Everything else here makes sense to me

when a function is declared to be async, it implicitly returns a promise

Cool, thanks for the pointer

jclem · 2019-05-22T03:08:29Z

When we await async functions don't they basically just become synchronous?

The vertical flow of the code may make it look like that way, but they don't actually become synchronous. fs.statSync is a blocking IO operation, meaning that everything in the entire Node.js VM blocks while that operation is happening (since just wrapping a function in async doesn't mean it runs in a separate thread).

For example, in the following two snippets, the one using Sync calls would potentially run much slower than the one using async because each Sync call fully blocks the rest of the VM:

Generally slower (IO is not concurrent, one file read at a time):

aBunchOfFilePaths.map(fs.readFileSync)

Generally faster (IO is concurrent, many files read at a time):

await Promise.all(aBunchOfFilePaths.map(async path => {
  return await fs.promises.readFile(path)
}))

// Note that because async promises are implicit, that's the same as:
await Promise.all(aBunchOfFilePaths.map(fs.promises.readFile))

In our CI context, it may be, however, that there isn't going to be too much concurrent IO happening—this isn't a web server. That said, generally I think that Node.js users are used to asynchronous APIs for file system interaction. Further, providing truly async APIs reduces the cost of doing lots of file IO. I would also be concerned that putting an async API in front of fully synchronous code would have some adverse side effects—users may expect to be able to map quickly over our IO API and get concurrent IO benefits, but behind the scenes we'd actually just be doing 1 thing at a time.

jclem · 2019-05-22T03:10:05Z

Whoops! Didn't mean to close.

bryanmacfarlane · 2019-05-22T04:04:07Z

While it's true that it's blocking for that whole node VM, in practice it's not critical since folks are writing sequential script CI steps. Each step / action is run on the agent and each action step is run out of proc with a new instance of the node process.

However, technically, someone could invoke these methods concurrently and it would matter.

The use cases though was inspired by shelljs which aimed to write bash style scripts in javascript so the goal was synchronous and sequential.

As jclem said, it's not a web server here so it's not as critical. In a web server any sync code would be a huge non starter.

My 2 cents is if we're going after async/await style actions then it's consistent to have them all be awaitable and if they're awaitable, let's just make them async.

Make sense? Thoughts?

bryanmacfarlane · 2019-05-22T12:07:53Z

I'm good to take these changes.

damccorm · 2019-05-22T12:39:37Z

Ah cool, thanks for the explanation. I'm good with these changes then.

jclem added 2 commits May 21, 2019 18:03

Use Boolean() to convert options

122e1de

- `CopyOptions` on `cp` now defaults to empty - Setting option values is easier now

Rewrite packages/io to be fully async

04cfb1e

jclem requested a review from damccorm May 21, 2019 22:49

jclem added 2 commits May 21, 2019 18:49

Move IS_WINDOWS into ioUtil

e566c89

DRY up cp/mv by moving shared code into move function

04be602

jclem closed this May 22, 2019

jclem reopened this May 22, 2019

damccorm merged commit 86228a8 into features/io May 22, 2019

damccorm deleted the features/io-patch-jclem branch May 22, 2019 12:39

damccorm mentioned this pull request May 22, 2019

Add io #5

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make packages/io async #7

Make packages/io async #7

jclem commented May 21, 2019

damccorm commented May 22, 2019

jclem commented May 22, 2019 •

edited

Loading

jclem commented May 22, 2019

bryanmacfarlane commented May 22, 2019

bryanmacfarlane commented May 22, 2019

damccorm commented May 22, 2019

Make packages/io async #7

Make packages/io async #7

Conversation

jclem commented May 21, 2019

damccorm commented May 22, 2019

jclem commented May 22, 2019 • edited Loading

jclem commented May 22, 2019

bryanmacfarlane commented May 22, 2019

bryanmacfarlane commented May 22, 2019

damccorm commented May 22, 2019

jclem commented May 22, 2019 •

edited

Loading