Configurable multiprocessing for large data import (Ecoinvent ecospold2) #258

Open
noahebrooks-cb opened this issue Apr 10, 2024 · 2 comments

Comments

@noahebrooks-cb

Using the ecospold2 importer:

import bw2io

# `path` (directory of .spold files) and `str_ecoinvent_version` are defined elsewhere
bw2io.bw2setup()
importer = bw2io.SingleOutputEcospold2Importer(
    dirpath=path,
    db_name=str_ecoinvent_version,
)
importer.apply_strategies()

There is a use_mp parameter that determines whether multiprocessing is used, but there doesn't appear to be any way to configure the number of cores or workers.

Does anyone have a solution for this, or strategies for running a large data import on limited hardware?
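To illustrate the kind of knob being asked for, here is a minimal sketch of a configurable worker count using only Python's standard library. It is not bw2io's API: `parse_file` and `extract_all` are hypothetical names, and `parse_file` is a placeholder for the real per-file extraction work.

```python
import multiprocessing as mp
import os


def parse_file(path):
    # Hypothetical stand-in for parsing one .spold file.
    return len(path)


def extract_all(paths, workers=None):
    """Process `paths` with at most `workers` processes (hypothetical helper)."""
    if workers is None:
        # Assumed default: leave one core free for the rest of the system.
        workers = max(1, (os.cpu_count() or 1) - 1)
    if workers == 1:
        # Serial fallback: no extra processes, lowest memory overhead.
        return [parse_file(p) for p in paths]
    with mp.Pool(processes=workers) as pool:
        # Pool.map preserves input order in its results.
        return pool.map(parse_file, paths)


if __name__ == "__main__":
    print(extract_all(["a.spold", "bb.spold"], workers=2))
```

With `workers=1` the function never starts a pool, which is the closest analogue to switching multiprocessing off entirely.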

@noahebrooks-cb
Author

Trying to extract the full ecoinvent dataset leads to an out-of-memory error.

RAM: 8GB
Chip: Apple M1

@cmutel
Member

cmutel commented May 5, 2024

@noahebrooks-cb Can you give some more details on exactly what you are importing? This shouldn't raise an error with 8 GB, and multiprocessing won't have much of a memory impact. You can always turn multiprocessing off to test that assumption, of course.

Check to make sure you are importing the right file: https://docs.brightway.dev/en/latest/content/faq/ecoinvent.html#which-ecoinvent-file-should-i-download
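Turning multiprocessing off for a test run might look like the sketch below. It relies on the `use_mp` parameter mentioned earlier in the thread; the function name `import_single_process` is hypothetical, and the sketch assumes a Brightway project is already selected and `bw2io.bw2setup()` has been run.

```python
def import_single_process(dirpath, db_name):
    """Run the ecospold2 import without multiprocessing (hypothetical helper)."""
    # Imported lazily so this module loads even where bw2io is not installed.
    import bw2io

    importer = bw2io.SingleOutputEcospold2Importer(
        dirpath=dirpath,
        db_name=db_name,
        use_mp=False,  # extract the files in the main process only
    )
    importer.apply_strategies()
    return importer
```

If memory use is the same with and without `use_mp`, the worker pool is unlikely to be the cause of the out-of-memory error.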
