-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: lipid biomass composition #21
Comments
Possible solution:
Would be most versatile when implemented in either MATLAB or Excel. |
Pushed the wrong button..
where the coefficients represent measure composition of fatty acids chains, instead of having individual ISA reactions:
As is the case now. |
There are total about 176 ISA reactions in present yeast model. These reactions were from yeast 5. In general, it is difficult to understand these reactions as they lack evidences in the database (I am not searching all the reaction database). |
@edkerk as you point out, all these rxns convert chain-specific species to general species. Also, you are correct in the observation that the stoich. coeff. that these rxns get is based on chain length. The relationship is actually linear; as an example, I will focus on triglycerides (even though it should equally apply to all other species). Here are the 4 possible stoich. coeff. for all 32 triglyceride ISA rxns, based on total chain length from all three F.A. tails: So the more carbon the higher the assigned stoich. coeff. The idea then currently in Yeast7 is to allow the model to choose any triglyceride from the 32 options (through the ISA rxns), and to correct for the chain length, so it is "equally attractive" carbon-wise for the cell to produce any of them. However as you say, we should adjust these coefficients to account for saturations as well, so making them proportional to the molecular weight would solve the issue; in that sense it's good that @hongzhonglu is working on including chemical formulas for all these species. That being said, I think there are still mistakes in how the total abundance for each lipid is calculated: We know that we can get the abundance of each species in the model if we take the stoich. coeff. that is in the lipid pseudo-rxn, because the stoich. coeff. of the species
of that specific TAG (as
This of course should be done with molecular weights as stated before to improve precision, but the idea is the same. I hope this clears out some of the confusion. |
Answering to some other comments in the discussion:
Here I am not so sure. The TAG distribution can vary considerably between strains, so I think it might be safer to just leave it up to the modeler if he/she has specific data, but if not just allow the model to choose any TAG, making of course the corrections that I mentioned in my previous post. Or at the very least, let's first solve the mistakes in the composition, and maybe then we can try out to force the model to specific TAG distributions. How well are these distributions studied btw? For TAGs probably well enough, but for phospholipids?
Note that ISA rxns are actually pseudo-rxns, so they are not expected to appear in any database. Let me know what do you think about the solution that I presented in the previous post :) |
Finally, to add some more to the discussion, here is a breakdown for all lipids created through the 176 ISA rxns:
*partially: The only pseudo-metabolites that go into the lipid pseudo-rxn are the 3 in Golgi, as they get pooled in the And later transported to the cytoplasm (where they are used in the lipid pseudo-rxn): For the case of mitochondrion, even though equivalent isa rxns also exist, there is no transport to the cytoplasm, therefore those 3 pseudo-metabolites are dead-end metabolites For the case of ER, there're no isa rxns to begin with, therefore all 30 pseudo-metabolites are dead-end as well. |
@edkerk maybe this is something to fix? should complex sphingolipids be only produced in Golgi or can they also be produced in mitochondrion and ER? |
@BenjaSanchez I assume you're referring to the sphingolipids. There are two issues here:
One strength of these ISA reactions is that gene essentiality simulations will have less false positives, as the cell will have the choice to make different (complex sphingo)lipids, which apparently is the case in reality. So, instead of deleting ISA reactions, perhaps we should leave them in but set boundaries to 0. If one wants to do gene essentiality simulations, one has to switch those reactions on. |
@edkerk thanks for your feedback :)
However, in literature data (PJ Lahtvee et al. 2016) we don't see sphingolipids at all, so I'm not sure what to do with them for now. Keep the original abundance values? Or remove them entirely? |
update: PR #112 fixes this issue by using the newly defined SLIME reactions: lipids are now split into their 2 basic components, backbone and acyl-chains:
With this, separate lipid pseudoreactions are defined later for backbones and for acyl-chains. The stoichiometric coefficients are representing molecular weights, as the data used comes in g/gDW. More info is found on SysBioChalmers/SLIMEr. This issue will be closed when the changes are merged to master. |
The expansive description of lipid metabolism is confusing, so I might be wrong in the following. To my understanding, the model doesn't specify a distribution of different FA chain lengths, but demands all FA chains in equal amounts (for majority of lipid metabolism, sterol esters seem specific):
There are all these individual reactions that build up for instance TAG(16:0, 18:1, 18:1).
oleoyl-CoA[erm] + diglyceride (1-16:0, 2-18:1)[erm] => coenzyme A[erm] + triglyceride (1-16:0, 2-18:1, 3-18:1)[erm]
They use specific acyl-CoAs (so no pooled pseudometabolite), and nowhere in those reactions is there any specification on abundant each fatty acid is. With that in mind, it would be cheapest to make TAG(16:0, 16:1, 16:0), and this is what I actually see when I run FBA and minimize the number of fluxes.
The model has so-called 'ISA' reactions that 'converts' FA-chain specific TAG species into a generic TAG species:
triglyceride (1-16:0, 2-18:1, 3-18:1)[erm] => 0.67901 triglyceride[erm]
I don't understand what these coefficients mean, but they seem to connect to the chain length (e.g. triglyceride (1-18:0, 2-18:1, 3-16:0) gets the same coefficient, even though it has different number of saturations).
These generic lipid species are then used in the lipid pseudoreaction:
[...] + 0.000206 fatty acid[c] + [...] + 0.000781 triglyceride[c] + 1.5e-05 zymosterol[c] => lipid[c]
So nowhere along that path is there any specification of distribution of FA chain lengths & saturation, all TAG species are as likely to be made, with some correction for the amount of carbons (but not hydrogens, as the two species mentioned above with similar coefficients do have different molecular weights).
The ISA reactions for fatty acids have no influence in this for two reasons:
palmitate[c] => 0.61538 fatty acid[c]
1: the coefficients are again just representing the number of carbons, not any measured abundance
2: fatty acid[c] is only used in the lipid pseudoreaction to represent free fatty acids, it is not used to be incorporated in any other lipid species.
The text was updated successfully, but these errors were encountered: