Input files required:
- The createtable.txt file with DDL statements.
- The queries.txt file with DML statements, more can be added if needed.
- 3 abundance files for metabolomics, proteomics and transcriptomics each.
- Annotations file for the genes that are present in the abundance files.
How to run the program: python .py [–-createdb] [–-loaddb] [–-querydb=n]
- The –-createdb option creates the database structure.
- The –-loaddb option parses the data files and inserts the relevant data into the database.
- The –-querydb option runs one of the queries specified in the queries file on the created and loaded database. The argument n is a number from 1 to 9.
How the code runs:
- Uses SQLite3 to interface with a SQL database.
- Based on user input at the command line, it can create, load or query the database. (or all three at the same time.)
- For creating, it uses the DDL statements provided to make the database file.
- For loading, for this design, it extracts the data from the input files and stores it in the tables in the database.
- Querying the database, based on user input of 1-9, it pulls that query from the query file and outputs the data to the console.
Notes: This program is very specific to this database design. Please don't load or query before creating this will throw an error, and don't query before loading, the output will be empty.