Json package report try #2 #810

imalsogreg · 2019-02-22T01:33:23Z

Replacement for #399

Add two endpoints, which reuse existing URLs but use the requested content type to choose a JSON response instead of HTML. The new endpoints provide basic information about a package (author, description, license), and a listing of available versions for a package, along with each version's deprecation status.

For example:

/package/avro-0.4.1.2

{
    "author": "Thomas M. DuBuisson", 
    "description": "Avro serialization and deserialization support for Haskell", 
    "license": "BSD-3-Clause", 
    "metadata_revision": 0
}

/package/avro

{
    "0.4.1.1": "normal", 
    "0.4.1.2": "deprecated", 
    "0.4.1.4": "normal"
}

For the first (single-version) view, the URL can specify a metadata revision:
/packages/avro-0.4.1.1/revision/1

{
    "author": "Thomas M. DuBuisson", 
    "description": "Avro serialization and deserialization support!", 
    "license": "BSD-3-Clause", 
    "metadata_revision": 1
}

(when no revision is specified, the most recent one is used)

NOTE: There is no caching implemented for these endpoints. Is it Ok to assume that the computations being done (parsing several cabal files per request) are cheap enough to justify not adding some AcidState caching?

There is no dependencies data in this PR. I'm thinking of adding that in a second PR, after talking with @hvr about his ideas for formatting there.

/cc @hvr @alexcmcdaniel @gbaz

imalsogreg · 2019-02-22T02:08:43Z

There's at least one error: package/avro returns the JSON result even when we aren't requesting json.

alexcmcdaniel · 2019-02-22T15:52:43Z

Would it be possible to include homepage as well?

imalsogreg · 2019-02-23T03:30:20Z

Fixed and cleaned up. Ready for review :)

hvr · 2019-02-25T16:46:23Z

there is no caching implemented for these endpoints. Is it Ok to assume that the computations being done (parsing several cabal files per request) are cheap enough to justify not adding some AcidState caching?

Depends... there's some packages which have 150+ releases and there's also .cabal files which are not that cheap to parse and take a significant amount of time and space to parse. But I don't think you need to parse all .cabal files for any single service request here?

Btw, when there's a description field, I'd also expect the synopsis-field to be present. And if you extract the license-information, then you also ought to extract the copyright: property imo.

imalsogreg · 2019-02-25T17:13:04Z

@hvr re: synopsis and copyright, agreed! Done.

re: caching, yes, requests for the version listing only force parsing of the cabal files for that package. Requests for a particular version only parse that one version. Would it be too optimistic to assume that when we are only using these top-level fields, laziness saves us from having to parse the whole file?

alexcmcdaniel · 2019-02-25T19:37:00Z

I noticed that the sample endpoints do not have .json, is that intentional?

imalsogreg · 2019-02-25T19:54:43Z

@alexcmcdaniel Thanks, yep. If you just type those example URLs into a browser you'll get html. Implicitly I meant for an Accept: application/json header to be attached to the requests.

alexbiehl · 2019-02-25T21:13:22Z

Would it be too optimistic to assume that when we are only using these top-level fields, laziness saves us from having to parse the whole file?

Yes. To claim a successful parse the whole file needs to be examined. Usually lazy parsing only works well if your format supports laziness explicitly.

alexcmcdaniel · 2019-03-11T18:49:03Z

@imalsogreg any updates?

imalsogreg · 2019-03-11T23:37:59Z

@alexcmcdaniel I'm slowly working on the caching part.

alexcmcdaniel · 2019-03-19T16:29:41Z

@imalsogreg how is the caching coming?

imalsogreg · 2019-04-06T13:01:33Z

I added AcidState actions for the API endpoints. It does read-through caching, with a hook on package change to delete the cache lines for a package.

I did not add any backup logic, since all the data that would be backed up is a redundant view of other already-backed-up data. And our API is cheap to call. So the backup would cost us more in maintenance and versioning than it would help us with site integrity. Think so?

I've rebased to clean up the commit history, added a couple small tests and gone through a cleanup pass. Is there anything else we should do before final review and merge? Any more fields to add to the package description API?

alexbiehl · 2019-04-11T05:34:31Z

Distribution/Server/Features/PackageInfoJSON/State.hs

+-- | Basic information about a package. These values are
+--   used in the `/package/:packagename` JSON endpoint
+data PackageBasicDescription = PackageBasicDescription
+  { pbd_license           :: License


Can we make these fields strict? We put that into a map in memory, it would be a shame to introduce a leak.

alexbiehl · 2019-04-11T05:35:00Z

Distribution/Server/Features/PackageInfoJSON/State.hs

+import           Data.Aeson           ((.=), (.:))
+import           Data.Acid            (Query, Update, makeAcidic)
+import qualified Data.HashMap.Strict  as HashMap
+import qualified Data.Map             as Map


Also it might make sense to use Data.Map.Strict here.

Note: You should use Data.Map.Strict instead of this module if: You will eventually need all the values stored. The stored values don't represent large virtual data structures to be lazily computed.

Sounds like me! 👍

imalsogreg · 2019-04-13T21:59:22Z

Anyone available to approve/request changes? I'd love to get this in to hackage :)

alexbiehl · 2019-04-14T06:21:16Z

Distribution/Server/Features/PackageInfoJSON/State.hs

+setDescriptionFor  pkgId descr = State.modify $ \p ->
+  case descr of
+    Just d  -> p {descriptions = Map.alter (const (Just d)) pkgId (descriptions p)}
+    Nothing -> p {descriptions = Map.filterWithKey (\pkgId' _ -> fst pkgId' /= fst pkgId) (descriptions p)}


@iamalsogreg can you explain why we wouldn't use Map.delete here?

Yah - since descriptions is keyed on (PackageIdentifier, Maybe Int), if I use delete, I would be deleting just a particular package/metadata combination. But what I want is for a change to the package to delete every package entry for some fixed package, at every metadata revision.

If there were a deleteWithKey function that transforms the key before deciding what to delete, that could be nicer than the filterWithKey I use.

alexbiehl · 2019-04-14T11:51:16Z

Ah makes sense, I didn't see both fst!

alexcmcdaniel · 2019-04-15T20:23:23Z

@imalsogreg hey sorry to make a last second request but would it be possible to add the homepage as well?

alexcmcdaniel · 2019-04-22T20:42:29Z

oh never mind I see that you added it in the last commit, any update on the timing of the release? @imalsogreg @hvr

aj-arena · 2019-05-06T15:52:38Z

I am also interested in this pull request, any update?

@gbaz

Package JSON API (replacement of #810)

gbaz · 2022-03-28T22:52:55Z

obsoleted by #996

imalsogreg force-pushed the jsonPackageReport2 branch from 86a75cd to 1786e2d Compare February 23, 2019 01:18

imalsogreg mentioned this pull request Feb 23, 2019

Json package report #399

Closed

Add JSON endpoints for basic package information

a73f029

imalsogreg force-pushed the jsonPackageReport2 branch from 8e4ac86 to a73f029 Compare April 6, 2019 12:41

alexbiehl reviewed Apr 11, 2019

View reviewed changes

Use strict record fields and strict map in PackageInfoJSON

b7cebc6

imalsogreg force-pushed the jsonPackageReport2 branch from 0da8279 to b7cebc6 Compare April 13, 2019 00:28

alexbiehl approved these changes Apr 13, 2019

View reviewed changes

alexbiehl reviewed Apr 14, 2019

View reviewed changes

Kleidukos mentioned this pull request Dec 27, 2021

Package JSON API (replacement of #810) #996

Merged

gbaz added a commit that referenced this pull request Feb 24, 2022

Merge pull request #996 from Kleidukos/api/package

e8cf5ec

Package JSON API (replacement of #810)

gbaz closed this Mar 28, 2022

Json package report try #2 #810

Json package report try #2 #810

Uh oh!

Conversation

imalsogreg commented Feb 22, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

imalsogreg commented Feb 22, 2019

Uh oh!

alexcmcdaniel commented Feb 22, 2019

Uh oh!

imalsogreg commented Feb 23, 2019

Uh oh!

hvr commented Feb 25, 2019

Uh oh!

imalsogreg commented Feb 25, 2019

Uh oh!

alexcmcdaniel commented Feb 25, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

imalsogreg commented Feb 25, 2019

Uh oh!

alexbiehl commented Feb 25, 2019

Uh oh!

alexcmcdaniel commented Mar 11, 2019

Uh oh!

imalsogreg commented Mar 11, 2019

Uh oh!

alexcmcdaniel commented Mar 19, 2019

Uh oh!

imalsogreg commented Apr 6, 2019

Uh oh!

alexbiehl Apr 11, 2019

Choose a reason for hiding this comment

Uh oh!

alexbiehl Apr 11, 2019

Choose a reason for hiding this comment

Uh oh!

imalsogreg Apr 12, 2019

Choose a reason for hiding this comment

Uh oh!

imalsogreg commented Apr 13, 2019

Uh oh!

alexbiehl Apr 14, 2019

Choose a reason for hiding this comment

Uh oh!

imalsogreg Apr 14, 2019

Choose a reason for hiding this comment

Uh oh!

alexbiehl commented Apr 14, 2019

Uh oh!

alexcmcdaniel commented Apr 15, 2019

Uh oh!

alexcmcdaniel commented Apr 22, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aj-arena commented May 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gbaz commented Mar 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

imalsogreg commented Feb 22, 2019 •

edited

Loading

alexcmcdaniel commented Feb 25, 2019 •

edited

Loading

alexcmcdaniel commented Apr 22, 2019 •

edited

Loading

aj-arena commented May 6, 2019 •

edited

Loading