Fixed multiprocessing in generators. #7118

joeyearsley · 2017-06-24T17:51:45Z

Previously, applying the multiprocessing.Lock did not synchronize variables across processes, to ensure files are not duplicated in an epoch we need these variables to be synchronized.

To ensure they are synchronized the code now creates shared variables which are updated when the lock is applied. Therefore ensuring no duplicates in an epoch and batch.

I've witnessed no overhead of this lock (due to the processing on the batch and minimal time the lock is actually held by a process.

Sequences are recommended to keep order when using the prediction function since multiprocessing/threading can not guarantee threads will return in order. However, you can also use generators in your own multiprocessing pool and then use predict.

ahundt · 2017-06-24T18:49:59Z

General thought: could all processes be given a seed and an integer ID, which they all use to determine generate their batches modulo the total number of processes?

Specific comment: make sure it is possible for the user to specify the exact seed which is used, so the randomness source used does not change if they change the data size or # of workers

joeyearsley · 2017-06-24T19:36:28Z

@ahundt I looked into it. PID's aren't assigned from 0 onwards, they can be 6001-6009 (i.e. 8 workers), how would you then determine the correct modulo?
You could pass it in as an argument through the queue, however, you then are only getting pseudo random files, as each part of the list is split into N workers.
Finally, the shared variables add no visible overhead and I believe it is the correct way to ensure a random shuffle and no duplicates.

Exact seed? I've not altered the seed part from the original code, is it wrong in the original?

joeyearsley · 2017-06-26T20:06:24Z

Closed for now as there is a heisenbug, sometimes the processes all go to sleep with no hint at an error and other times it succeeds. Only happens in the validation stage.

joeyearsley · 2017-06-28T11:02:48Z

Found the issue, will be re-opening once I've cleaned the code up.

Dref360 · 2017-06-28T17:06:05Z

First of all great work. I just need some clarification :P

Maybe I do not understand this PR, but how does it fix multiprocessing in generators?
From what I understand, your PR is for the DirectoryIterator and not every generators right? Would there be an easy interface that we could extract from this to handle "most" generators?

ahundt · 2017-06-29T02:20:34Z

assuming all processes take the same amount of computation time, then all the files should be in order.

Could you elaborate? In general this is never true, even when executing the exact same static program twice, except on very carefully set up real time systems.

ahundt · 2017-06-29T19:33:07Z

My comment was to point out one of the assumptions since I happened to come across it. I see performance vs repeatability reasons for each. I think users will need different behavior depending on application, networks for research and publication vs running quick experiments on datasets for business. I don't want to prognosticate too much because I'm mostly using tfrecords right now so I don't have a specific need for this feature and would rather not ask you to write code I don't need yet, so I leave the decision up to you. :-)

joeyearsley · 2017-07-02T16:01:22Z

This is now resolved.

I still prefer @Dref360 Sequences, however for the sake of fixing generators now work properly with multiprocessing and threading. To achieve this it uses far too many locks, whiles and sleeps, yet it is now fully synchronized.

It achieves the same as the sequences, but uses global variables to keep count of who is allowed into what processes (either next or placing on the queue) at what time.

joeyearsley · 2017-07-03T16:24:14Z

Closing this issue, even this fix doesn't guarantee ordering, just reduces the likelihood. The proper way is to use sequences.

Updated Code to allow ordered multiprocessing

9a98204

joeyearsley changed the title ~~Fixed Preprocessing to allow generators to be ordered~~ Fixed preprocessing to ensure no duplicates in a batch when preprocessing is used. Jun 24, 2017

joeyearsley changed the title ~~Fixed preprocessing to ensure no duplicates in a batch when preprocessing is used.~~ Fixed preprocessing to ensure no duplicates in a batch when multiproc is used. Jun 24, 2017

This was referenced Jun 24, 2017

Fix the ordering bugs when using pickle_safe=True #6891

Merged

How to break symmetry in fit_generator() sample generator? #6586

Closed

Out of order issue of predict_generator #6512

Closed

ahundt mentioned this pull request Jun 26, 2017

Guide to handle large datasets #7140

Closed

Fix in Progress

f1f9c13

joeyearsley closed this Jun 26, 2017

Push back to server

f0b6fee

Joe Yearsley and others added 3 commits June 28, 2017 11:14

Fixed the Bug

5d1702c

Fixed Bugs

6b0f75a

Fixed bugs related to multi-processing

8c0009c

joeyearsley reopened this Jun 28, 2017

joeyearsley added 2 commits June 28, 2017 14:47

Fixed PEP8 and reset function

db23f9f

Added warning

566dc76

joeyearsley closed this Jun 28, 2017

Missed Kwarg

7e2c67a

joeyearsley reopened this Jun 28, 2017

joeyearsley added 2 commits June 28, 2017 15:18

Tests now pass

48e3f81

Forgot to place User Warning in

0d97951

joeyearsley changed the title ~~Fixed preprocessing to ensure no duplicates in a batch when multiproc is used.~~ Fixed multiprocessing in generators. Jun 28, 2017

joeyearsley closed this Jul 2, 2017

joeyearsley reopened this Jul 2, 2017

joeyearsley and others added 6 commits July 2, 2017 18:17

Fixed Permissions

cafbabd

Merge branch 'master' into master

64e0980

Fixed ordering such that it doesn't block

66994c1

Merge branch 'master' of github.com:joeyearsley/keras

e4c98d8

PEP8 fixed

76e95bb

Merge branch 'master' into master

ce4e2d9

joeyearsley closed this Jul 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed multiprocessing in generators. #7118

Fixed multiprocessing in generators. #7118

joeyearsley commented Jun 24, 2017 •

edited

Loading

ahundt commented Jun 24, 2017

joeyearsley commented Jun 24, 2017 •

edited

Loading

joeyearsley commented Jun 26, 2017

joeyearsley commented Jun 28, 2017

Dref360 commented Jun 28, 2017 •

edited

Loading

ahundt commented Jun 29, 2017

ahundt commented Jun 29, 2017

joeyearsley commented Jul 2, 2017 •

edited

Loading

joeyearsley commented Jul 3, 2017

Fixed multiprocessing in generators. #7118

Fixed multiprocessing in generators. #7118

Conversation

joeyearsley commented Jun 24, 2017 • edited Loading

ahundt commented Jun 24, 2017

joeyearsley commented Jun 24, 2017 • edited Loading

joeyearsley commented Jun 26, 2017

joeyearsley commented Jun 28, 2017

Dref360 commented Jun 28, 2017 • edited Loading

ahundt commented Jun 29, 2017

ahundt commented Jun 29, 2017

joeyearsley commented Jul 2, 2017 • edited Loading

joeyearsley commented Jul 3, 2017

joeyearsley commented Jun 24, 2017 •

edited

Loading

joeyearsley commented Jun 24, 2017 •

edited

Loading

Dref360 commented Jun 28, 2017 •

edited

Loading

joeyearsley commented Jul 2, 2017 •

edited

Loading