-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support multiple resources #12
Comments
https://developers.google.com/sheets/api/guides/metadata this could be used to keep track where each piece of information comes from. |
@bjdmeest For this, do we want to try/do something specific with Targets? |
Currently, the single source also isn't done via Targets, so I'd first try to figure how to do it (without thinking too much about RML), and then have a look to see whether RML targets makes sense for this or not (I think we'll first need to add a Solid-target or equivalent to RMLMapper, Els is working on that) |
So what do you suggest as next step? Can you maybe put that in a separate issue? |
I think just doing this issue will be tricky enough on itself. For next steps I'd first need to see the outcome of this one |
But I don't understand from your initial comment what we should use? Something with Targets? Or something else in the RML space? Or just something engineered that works? |
I suggest that @sevrijss analyses the problem and suggests a course of action. We can then review and decide whether that course of action makes sense as is or should be enhanced (eg by using RML target to generate multiple local files), I don't know what the potential pitfalls may be so I can't give a more clear direction yet |
Possible courses of action:
Both approaches have advantages and disadvantages. The first is easier to setup but will require a lot of files. The second way is more RML based, but will be more complex to setup and maintain. |
Interesting! What would happen if you do a query over multiple resources, but you don't know which data is coming from which data source? How do you know which resources to update for which specific data? |
In a specific case, that might pose a problem. |
I got imagine that you would want to write the changes to the resource where originally the data came from. But indeed I don't think you can specify that with RML. Comunica might be able to tell us where every triple came from. But then we still need to see how that information can be used by RML. @bjdmeest What are your thoughts? |
for me, the most relevant case is when a single query result/row contains data from multiple sources. So I'd start with specifying which column updates should be written to which source. e.g a sheet mentioning my favorite tv shows and my personal ratings: the tv show metadata comes from dbpedia, the ratings come from my pod. my rating updates should be persisted in my pod, tv show metadata updates should be (ideally) feed back to dbpedia, but currently practicaly will just fail, and that's OK :) You can probably figure that out by combining the RML and SPARQL query ( |
@sevrijss Based on Ben's answer do you think do that the RML Target-based solution will work? Ignoring the potential complexity of the files. Also about this from your earlier comment
This also an issue when querying so I would not worry about that now. |
I think such a solution will definitely work. A lot of heavy lifting will be done by the RML api endpoint and not by the code. |
It's possible to read and write to a single resource, but certain use cases need multiple resources.
Reading from multiple resources is supported by Comunica, but for the writing we have to have a closer look at how we should do this in YARRRML/RML.
The text was updated successfully, but these errors were encountered: