UUID to Fedora Path does not work inside transactions #185

whikloj · 2016-04-12T15:57:18Z

The idea of using a UUID matched via the triplestore to a fedora path assumes that it is indexed. In the case of a transaction, nothing is done until the transaction is committed.

So...

> curl -i -XPOST http://localhost:8282/islandora/transaction
HTTP/1.1 201 Created
Date: Tue, 12 Apr 2016 15:48:44 GMT
Server: Apache/2.4.18 (Ubuntu)
Location: http://localhost:8080/fcrepo/rest/tx:9065bb33-803a-4b1c-8bb8-8238f83560c5
Expires: Tue, 12 Apr 2016 15:51:44 GMT
Cache-Control: private, must-revalidate
Content-Length: 0
Connection: close
Content-Type: text/html; charset=UTF-8

> curl -i -XPOST "http://localhost:8282/islandora/collection?tx=tx:9065bb33-803a-4b1c-8bb8-8238f83560c5"
HTTP/1.1 201 Created
Date: Tue, 12 Apr 2016 15:49:48 GMT
Server: Apache/2.4.18 (Ubuntu)
Cache-Control: must-revalidate, private
Location: http://localhost:8282/islandora/resource/05c0224b-4ace-4092-a8f6-603c94260d08
Content-Length: 77
Connection: close
Link: <http://localhost:8282/islandora/resource/05c0224b-4ace-4092-a8f6-603c94260d08/members>; rel="hub"
Content-Type: text/plain; charset=UTF-8

http://localhost:8282/islandora/resource/05c0224b-4ace-4092-a8f6-603c94260d08

> curl -i -XPOST "http://localhost:8282/islandora/collection/05c0224b-4ace-4092-a8f6-603c94260d08/member/5fa71ed6-f5f3-4831-8662-b1b132815a52?tx=tx:9065bb33-803a-4b1c-8bb8-8238f83560c5"
HTTP/1.1 404 Not Found
Date: Tue, 12 Apr 2016 15:51:27 GMT
Server: Apache/2.4.18 (Ubuntu)
Cache-Control: no-cache
Content-Length: 89
Content-Type: text/html; charset=UTF-8

Failed getting resource Path for "05c0224b-4ace-4092-a8f6-603c94260d08" from triple store

@Islandora-CLAW/7-x-2-x-committers : Ideas?

The text was updated successfully, but these errors were encountered:

whikloj · 2016-04-12T18:44:26Z

I am trying to think how you can resolve this in any fashion? Do we use a SQLite db?

We would need to handle transactions as well as regular resources.

So
curl -XPOST http://localhost:8282/islandora/collection generates a UUID and POSTs to Fedora, then links the UUID <-> fedora path in SQLite. No problem.

In a transaction, the above returns a transaction prefixed path, so we use that until the transaction is committed (or rolled back) then we need to update with the un-prefixed path.

How do we deal with abandoned transactions? Add a timestamp and update it for each action on a transaction, wipe them after N minutes of inactivity?

whikloj · 2016-04-12T18:52:31Z

Or we could submit our own quad to the triplestore from the microservices.
<http://localhost:8080/fcrepo/rest/tx:1234abcd-1234-abcd-78gh-56op78lm/path/to/resource> nfo:uuid "05c0224b-4ace-4092-a8f6-603c94260d08"^^xsd:string

Once it is committed or rolledback we can delete it as either:

The transaction is committed and the normal triple is added to the triplestore
The transaction is rolled back and the resource was not created.

Also we don't have to change the behaviour for normal actions, only items created in transactions.

acoburn · 2016-04-12T18:56:29Z

The advantage of using the triplestore for this seems mostly to be the fact that the triplestore already exists in the infrastructure. For this particular session-based interaction, I'd highly recommend using something like Redis that already supports key expiry. Also, all operations are entirely atomic, which means one less thing to worry about in a distributed context. The downside is that it's one more thing to install and keep running.

whikloj · 2016-04-12T19:32:13Z

Thanks @acoburn, I figured there would be something out there. Redis looks very simple and easy. I'll see about implementing it as simple key (UUID) value (fedora path) pair for now.

whikloj · 2016-04-15T20:34:32Z

Okay, so I played around with this for a bit and I have two problems. I have solutions but I'm sure there are better ways. So let me know if I am crazy or if you have a suggestion.

I need to link all the UUID -> Fedora URI pairs together with the transaction ID. Because any action on a transaction ID should update the expiry time on ALL the UUID -> F4 URI pairs acted on within that transaction.

So I think it is probably easiest to generate a JSON list of values ala:

{
   [
      { "UUID-1234" : "http://localhost:8080/fcrepo/rest/obj1" },
      {"UUID-5678" : "http://localhost:8080/fcrepo/rest/obj2" }
   ]
}

Then store that in Redis with the transaction ID as the key

> SET "tx:86dd0891-d975-42d8-8837-a24ad6041b59" "$json-array"

This would mean we would pull the entire object for each action on a transaction, but I think it is the easiest as we set a single expiry in Redis and update it if the transaction is acted upon.

When you PUT/POST to Fedora, we don't have the Fedora URI and UUID at the same time. After the PUT/POST action we need to use the Location: header and get the UUID from the object.

So I am thinking about a super simple transform like:

@prefix nfo : <http://www.semanticdesktop.org/ontologies/2007/03/22/nfo/v1.2/>
id      = . :: xsd:string ;
uuid  = nfo:uuid :: xsd:string ;

So after an object is created in Fedora we can get the transform

curl -i -H"Accept: application/ld+json" "http://localhost:8080/fcrepo/path/from/location/fcr:transform/uuidTransform"

This gives us the path and UUID in a simple JSON-LD object.

Which we add using the data structure in 1.

Thoughts?

ruebot · 2016-04-16T08:08:34Z

That makes sense to me.

DiegoPino · 2016-04-17T01:02:30Z

Hi sadly stil on the train and i have a lot of thoughts on this. How will you manage a tx session that expired or was rolled back, will to clean that how from reddis? If we are keeping resources involved in a transaction checked (means we keep track on them) there are a lot of better/faster ways than reddis right now (on year 2016) so we could maybe explore the options/discuss them before going further with this? Also, since we are passing TX around, assuming all resources belonging in a common tx will be done using the same client/server pair(means coming from the same source and using the same microservice..i hope we are not try resolve right now multi service same tx) we can simply make use of cookies and headers right? We can even use silex/symphony caching options to avoid putting yet.another.service right now to maintain. Train moves a lot!.

whikloj · 2016-04-18T16:10:37Z

So the idea is that Redis allows you to add an expiry to your entries.

So if nothing happens on them then they are automatically removed. I would also only use this for actions in a transaction and as a fail-over from the triplestore, because I would like to have the triplestore as the main source of this information.

I am dealing only within a single transaction, but if you had two clients using the same transaction ID. Then they could refer to objects the other generated as they would both get back the same object from Redis. Changes that each makes, might cause a problem however.

I am abstracting this with an interface so we should be able to put any implementation behind it that you want. So if you've got a better one, we can happily make that the default. Redis seems like a nice easy solution for now.

whikloj · 2016-04-18T16:16:36Z

This is very early, but just to give an idea of what I was thinking.
https://github.com/Islandora-CLAW/chullo/compare/master...whikloj:issue-185?expand=1

whikloj · 2016-04-18T16:23:41Z

Also, that doesn't do anything special (hence the name KeyCache). It could be made specialized and have it deal with the intricacies of the information internally.

For example: $service->get($txID, $uuid) could get whatever way you store the transaction information and locate it. So the organization of the information could be however works best for a specific use-case.

whikloj · 2016-05-16T21:15:37Z

@DiegoPino I have a problem. Where this keyCache would be useful is inside the idToUri. So if the triplestore query returns 0 rows then check the keyCache.

But I need access to the transaction ID.

We could pass the TX ID in each time, but is there another way to access the Request?

DiegoPino · 2016-05-17T00:52:08Z

@whikloj, of course.
->convert always gets as second argument the Request object. Just change the idToUri signature and add a Request type $request param (no need to change the call itself)
e.g
$callback = function ($post, Request $request) {
return new Post($request->attributes->get('slug'));
};
More fun here:
http://silex.sensiolabs.org/doc/master/usage.html

(Convert callbacks can be also services, but i suspect you don't like services so much! 👍 )

whikloj · 2016-06-21T19:07:43Z

Okay, so here are the 3 for this now.

Remove uuid key cache from here, moving to Crayfish chullo#49 - Strip it out of Chullo
Add IUuidCache functions for transactions Crayfish#4 - Add the cache layer at Crayfish
Add silex cache for use in TransactionService, also fix isToUri signa… islandora-deprecated/pdx#8 - Initialize the cache at PDX too

The cache is super simple, but the idea is you could easily replace it with APC, Memcache, Redis, etc using the same (https://github.com/moust/silex-cache-service-provider) library.

ruebot · 2016-06-24T16:18:33Z

Resolved with:
Islandora/Crayfish@128e43e
islandora-deprecated/pdx@a59e0be
Islandora/chullo@5d8487e

whikloj · 2016-06-24T16:39:52Z

FINALLY!!!!!!!!

DiegoPino · 2016-06-24T16:42:56Z

Found also an extra use for you cache Jared... in the future we could even block a resource to be touched by another TX or direct call if it's in the cache. Good work!

whikloj · 2016-06-24T16:45:49Z

We might want to refactor the cache, and namespace the different caches then. Like have my UuidCache class prepend "uuidcache:" to the keys. To allow for multiple separate caches.

DiegoPino · 2016-06-24T16:47:07Z

cool, next sprint

ruebot added this to the Community Sprint - 06 milestone Apr 18, 2016

ruebot added the PHP Services label Apr 23, 2016

ruebot modified the milestones: Community Sprint - 07, Community Sprint - 06 Apr 30, 2016

whikloj self-assigned this May 13, 2016

whikloj mentioned this issue May 16, 2016

May sprint; Kick-off call notes #227

Closed

This was referenced May 19, 2016

Add a cache to store UUID -> Fedora 4 path mappings Islandora/chullo#41

Merged

Add IUuidCache functions for transactions Islandora/Crayfish#4

Merged

ruebot modified the milestones: Community Sprint - 07, Community Sprint - 08 May 27, 2016

This was referenced Jun 20, 2016

June sprint; Kick-off call notes #284

Closed

Remove uuid key cache from here, moving to Crayfish Islandora/chullo#49

Merged

Add silex cache for use in TransactionService, also fix isToUri signa… islandora-deprecated/pdx#8

Merged

ruebot closed this as completed Jun 24, 2016

whikloj mentioned this issue Sep 19, 2016

Fcrepo-transform is dead #370

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UUID to Fedora Path does not work inside transactions #185

UUID to Fedora Path does not work inside transactions #185

whikloj commented Apr 12, 2016

whikloj commented Apr 12, 2016

whikloj commented Apr 12, 2016

acoburn commented Apr 12, 2016

whikloj commented Apr 12, 2016

whikloj commented Apr 15, 2016

ruebot commented Apr 16, 2016

DiegoPino commented Apr 17, 2016

whikloj commented Apr 18, 2016

whikloj commented Apr 18, 2016

whikloj commented Apr 18, 2016

whikloj commented May 16, 2016

DiegoPino commented May 17, 2016 •

edited

Loading

whikloj commented Jun 21, 2016

ruebot commented Jun 24, 2016

whikloj commented Jun 24, 2016

DiegoPino commented Jun 24, 2016

whikloj commented Jun 24, 2016

DiegoPino commented Jun 24, 2016

UUID to Fedora Path does not work inside transactions #185

UUID to Fedora Path does not work inside transactions #185

Comments

whikloj commented Apr 12, 2016

whikloj commented Apr 12, 2016

whikloj commented Apr 12, 2016

acoburn commented Apr 12, 2016

whikloj commented Apr 12, 2016

whikloj commented Apr 15, 2016

ruebot commented Apr 16, 2016

DiegoPino commented Apr 17, 2016

whikloj commented Apr 18, 2016

whikloj commented Apr 18, 2016

whikloj commented Apr 18, 2016

whikloj commented May 16, 2016

DiegoPino commented May 17, 2016 • edited Loading

whikloj commented Jun 21, 2016

ruebot commented Jun 24, 2016

whikloj commented Jun 24, 2016

DiegoPino commented Jun 24, 2016

whikloj commented Jun 24, 2016

DiegoPino commented Jun 24, 2016

DiegoPino commented May 17, 2016 •

edited

Loading