You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Come up with a solution for ingesting CSVs into a backend that is more scalable than SQLite.
Context: The package cytominer-database is a key component of image-based profiling workflows: it is a small Python-based command line tool that ingests data generated from CellProfiler into a database. Currently, only SQLite is supported but it would be great if we could use something more scalable. Addressing this issue would equip researchers working with single cell imaging data to execute queries and perform analysis across tens of millions of cells. This would be particularly useful to analyze single cell data across all plates in an experiments, or across multiple experiments.
E.g. a recent 135 plate experiment had 100M single cells and there's no easy way to analyze this.
Come up with a solution for ingesting CSVs into a backend that is more scalable than SQLite.
Context: The package
cytominer-database
is a key component of image-based profiling workflows: it is a small Python-based command line tool that ingests data generated from CellProfiler into a database. Currently, only SQLite is supported but it would be great if we could use something more scalable. Addressing this issue would equip researchers working with single cell imaging data to execute queries and perform analysis across tens of millions of cells. This would be particularly useful to analyze single cell data across all plates in an experiments, or across multiple experiments.E.g. a recent 135 plate experiment had 100M single cells and there's no easy way to analyze this.
Here's how to started on this issue:
ingest
documentation herecytominer-database
by running the tests yourselfThe text was updated successfully, but these errors were encountered: