load in parallel #81

Closed
ViralBShah opened this issue Jun 23, 2011 · 10 comments

Comments

@ViralBShah
Member

What should load do in parallel? Do we assume an NFS-type filesystem? It seems to me that the right thing to do would be for the client to preprocess/compile the code and send it to the rest of the workers.

@JeffBezanson
Member

It would be much easier to implement a pload that sends the file everywhere, so you don't need a shared FS.

@ViralBShah
Member Author

Do we really need a pload? Can't load just do that automatically, if it detects parallel mode?

@StefanKarpinski
Member

In general, I think we need a simpler eval_file() primitive that just evals the contents of a file, as load() originally did. load() then needs to be a much more sophisticated mechanism built on top of that, one which:

  1. Searches a system load_path variable for where the file to load actually resides.
  2. Sends the file across the network to other nodes so that they can also load the file.

Is there a single node which "owns" all file system access? Or is this where we need a distributed file system?
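For illustration only, the layering proposed above can be sketched as follows. This is a language-neutral Python sketch, not Julia's actual API: `find_in_load_path`, the `load_path` list, and the `remote_senders` callbacks are hypothetical names standing in for whatever search path and transport the system would provide.

```python
import os


def eval_file(path, namespace):
    """Primitive: evaluate the contents of a single file in `namespace`."""
    with open(path, encoding="utf-8") as f:
        source = f.read()
    exec(compile(source, path, "exec"), namespace)


def find_in_load_path(name, load_path):
    """Return the first existing file called `name` in the load_path directories."""
    for directory in load_path:
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate):
            return candidate
    raise FileNotFoundError(f"{name} not found in load path {load_path!r}")


def load(name, load_path, local_namespace, remote_senders=()):
    """Step 1: resolve `name` on the load path and evaluate it locally.
    Step 2: hand the source text to each remote sender (hypothetical transport).
    """
    path = find_in_load_path(name, load_path)
    eval_file(path, local_namespace)
    with open(path, encoding="utf-8") as f:
        source = f.read()
    for send in remote_senders:
        send(name, source)
    return path
```

In this sketch only the node that calls `load` touches the filesystem; the other nodes receive source text, which is one possible answer to the "single owner" question.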

@JeffBezanson
Member

Agree.

@ViralBShah
Member Author

I prefer that a single node own all filesystem access. Until then, we can assume a distributed FS is available, so that we can just do

@bcast load(file)

@StefanKarpinski
Member

I feel like this is backwards: it makes sense to have a single node own all accesses until there's a distributed filesystem to rely on. No?

@ViralBShah
Member Author

Well, most clusters we are likely to run on in the early stages will probably have NFS. Also, in multi-core mode, where you run multiple julia processes, they all share the same fs (although it is not distributed).

@StefanKarpinski
Member

Maybe we should just assume that some fs will take care of this then and
just do the load on each machine without worrying about it.


@ViralBShah
Member Author

Except that it breaks down when your client is your laptop and the computation is in the cloud. But for now, we can assume that some fs will figure it out.

-viral


@StefanKarpinski
Member

Right. That's an excellent case for using the local copy and shipping it
elsewhere for evaluation. In fact, I find that completely convincing for
development work.
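As a rough sketch of the receiving side of that idea (again in Python rather than Julia, and with every name hypothetical): the `Worker` class below stands in for a remote process that cannot see the client's files, so it evaluates source text it is handed instead of opening a path.

```python
def eval_source(source, origin, namespace):
    """Evaluate source text received from the client.

    `origin` is only a label used in tracebacks; no file is opened here.
    """
    exec(compile(source, origin, "exec"), namespace)


class Worker:
    """Stand-in for a remote process with its own namespace and no shared FS."""

    def __init__(self):
        self.namespace = {}

    def receive(self, origin, source):
        eval_source(source, origin, self.namespace)


def ship(source, origin, workers):
    """Client side: send the locally read source to every worker."""
    for worker in workers:
        worker.receive(origin, source)
```

Because the workers never resolve a path, the laptop-to-cloud case needs no shared filesystem; the cost is that the client must transmit the file contents on every load.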


StefanKarpinski pushed a commit that referenced this issue Feb 8, 2018
Handle ellipsis in Tuple{} (Fix #81)
StefanKarpinski pushed a commit that referenced this issue Feb 8, 2018
Complete refactor of the dependency graph and related functions
cmcaine pushed a commit to cmcaine/julia that referenced this issue Sep 24, 2020
LilithHafner pushed a commit to LilithHafner/julia that referenced this issue Oct 11, 2021
Keno pushed a commit that referenced this issue Oct 9, 2023
Add support for breakpoints (!)
udesou pushed a commit to udesou/julia that referenced this issue Oct 20, 2023
3 participants