1) Yes, you should only use info supplied by the API. This should mean that the API is helping to ensure the client never sends the user down a UX dead-end, and also that the client does not break when the API changes.
But...
2) I don't like the thought of collections having a resource identifier for a page. As items are deleted from the collection, those identifiers now represent modified resources and break caching (delete item 3 of a 5-item-per-page collection, and every logical page in the collection has implicitly been modified).
I much prefer using query string for this, as it is a query on a collection.
But... the API should still be the thing that generates/builds these URLs, the client should never do this work. The client must use the provided first|prev|self|next|last URLs, otherwise the client may break in the future.
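An example of how I prefer pagination (the JSON shape here is just for illustration, not prescriptive):

    GET /api/things?limit=25&offset=50

    {
      "items": [{},{}],
      "links": {
        "first": "/api/things?limit=25&offset=0",
        "prev":  "/api/things?limit=25&offset=25",
        "self":  "/api/things?limit=25&offset=50",
        "next":  "/api/things?limit=25&offset=75",
        "last":  "/api/things?limit=25&offset=975"
      }
    }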
Where 'limit' and 'offset' are provided by the query string (but default to 25 and 0 respectively), and the API takes care of generating valid links (like not including 'prev' if you are on the first page).
This way the client does what it should: just use the links provided.
And whilst I'm here... one of the things I dislike about link relations is how they presently fail to mention the method you should use (let alone the content-types acceptable by that end-point).
In the example above every link is a GET; in the example on this page http://restcookbook.com/Basics/hateoas/ they're probably POST. In the link relation assignments http://www.iana.org/assignments/link-relations/link-relation... 'edit' is probably a PUT. There's an 'edit', but not a 'delete', so it's not as if just sending more than one 'rel' might describe the method.
Today the audience of an API is a developer, so it's fine to just point them at the documentation. But tomorrow it may well be a computer.
Aren't you worried that there is too much "protocol" information in your returned object? I'm struggling with this now. I've come to think the API should return exactly what was requested. So instead of a very specific structured object...
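...so, for the sake of argument, something like this (the envelope and field names are purely illustrative):

    {
      "total": 1234,
      "limit": 25,
      "offset": 0,
      "links": {
        "self": "/api/things?limit=25&offset=0",
        "next": "/api/things?limit=25&offset=25"
      },
      "items": [{},{}]
    }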
It generically returns what was asked for. The items array...
[{},{}]
I have seen the "total records" value returned in the header with a "206 Partial Content" response, but that still leaves the very important "links" information. Right now I am putting a "link-href" value on each object returned, but I don't know where to put the collection's link information.
Where does the meta information about the resource belong? In the object/resource returned, OR as meta information in the header (with other HTTP information)? Is this a needless hindrance?
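For what it's worth, the header-only version looks roughly like this (the 'items' range unit is non-standard, and the Link header is RFC 5988):

    HTTP/1.1 206 Partial Content
    Content-Range: items 0-24/1234
    Link: </api/things?limit=25&offset=25>; rel="next"

    [{},{}]

...which keeps the body clean but scatters the meta information across two places.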
There's a purity vs pragmatism argument, and I have tried the purity approach several times without success. This time I'm just going for pragmatism... giving developers an easy-to-use interface that is predictable and consistent.
I just didn't find it constructive to continue pursuing purity when the developers trying to implement the APIs just wanted to get the job done as simply as possible, with all the info at hand, and for them to return to what they were trying to do (usually solve some problem for a user).
So what if you want to have a list of pages to jump to - send back 80 links?
What is a client supposed to do when a new "nextToLast" appears? I've yet to hear of any API consumer that is driven like a browser.
What's the actual point, other than some purity to some abstract concept? Just make it part of the API to take /api/things?page=x and do the right thing.
What's "tomorrow it may well be a computer" supposed to mean? We'll have AI? Or there will be some sort of new WSDL "... for REST" invented?
> What's the actual point, other than some purity to some abstract concept?
The actual point is decoupling. It's generally accepted in software architecture that decoupled designs are a little harder to build up front, but survive change over time.
As a 'real life' example, because Twitter exposed internal details (a tweet ID), clients sorted based on that number, because they assumed it'd be monotonically increasing. Once sequential IDs became a problem with Twitter at scale, they had an issue: if they switched to GUIDs, clients would break, because they were sorting based on the ID value. They had to invent an entirely new algorithm (Snowflake) to adapt to this change. This wouldn't have happened if they hadn't leaked internal details. It's basic encapsulation.
I'm not sure how the Twitter example is remotely related to what I'm asking. You're saying that if Twitter returned long URLs as opaque tokens to get, then it'd have been ok. Sure, but it'd have been fine if they had made the tweet id an opaque token either way. Surely returning named URL pairs isn't "basic encapsulation", and "GET /Tweet/<token>" has no reason to break.
I can see how returning hypermedia adds yet another layer of abstraction (and potentially plenty more round trips!). I'm just unsure how it helps. I don't understand how actual client code (besides a browser) can deal with arbitrary hypermedia. I'm cautious when I don't understand why people are hyped up about something, but I've yet to see any "real life" examples that demonstrate real benefits of this approach.
I can't point to them because they're internal, but there are some good ones I've used. Pagination was one useful part, where we saw the API change but didn't have to change the client. A second was being able to explore parent relationships, so we could start somewhere in the tree and move up until we hit the level we were interested in.
There are always agreements between the API designers and client designers, because the code isn't intelligent, so it doesn't know what 'parent' means. If the name of that changes, then the client would have to change. What can be different is the URL structure used to get the 'parent'. Maybe there's a new token that has to be there, or the naming of something has been changed, or whatever. Those changes can be made without breaking the clients. Putting this in the responses means two things:
1) Small changes don't break everything (pagination is a great example: it's /pages/1 one day, then ?page=1, then ?offset=25, then ?pageSize is optional, then it's required, etc.).
2) Fewer assumptions baked into the client. Getting the tweets for a user you've loaded would be nicer as load(user.tweets) than load("/tweets/" + user.id) - see the sketch below.
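For instance, if the user representation carried the link itself (shape entirely hypothetical):

    {
      "id": 123,
      "name": "alice",
      "tweets": "/api/users/123/tweets"
    }

then load(user.tweets) keeps working however the server restructures its URLs.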
About the pagination example you mentioned: rather than use link relations, pagination might be better done with URI Templates. This is a spec (RFC 6570) [1] that allows passing a URI with placeholders to the client, which can then fill them in without every option being described up front by the server. It's still a new-ish area, but there are library implementations for most major languages now.
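A HAL-style sketch of what that can look like (names illustrative):

    {
      "_links": {
        "page": { "href": "/api/things{?page,limit}", "templated": true }
      }
    }

The client expands the template per RFC 6570 (e.g. to /api/things?page=5&limit=25) without the server enumerating every page link.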
Explicit link relations aren't the only way to do hypermedia and the best practices are still moving/being discovered.
I do something similar, with the only exception being that I've never bothered calculating the number of pages (or links to other pages) on the server side, preferring to just let the client figure it out.
Mainly, I've found that for dynamic data sets the links can change between the first page and the 87th, as can the number of pages, which makes precomputed page links impractical.
Give them the number of records per page, the offset, the number of records, and let the clients determine the rest (ideally you'd have documentation showing how to paginate, whether it be ?page=1 or ?offset=20, or whatever).
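In other words, return just the raw numbers (field names here are mine):

    {
      "limit": 25,
      "offset": 50,
      "total": 1234,
      "items": [{},{}]
    }

and the client can derive the rest, e.g. totalPages = ceil(total / limit).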
edit and edit-media are covered by AtomPub, but rel is not an attribute with a fixed range of values. Which means that the information a developer needs isn't there with the link; instead, for each link relation, one has to go off and find the original citation, hopefully in a spec, which defines the method to use for that particular relation. Then the developer has to hope that the API implementor also knew about that spec and did the right thing.
My criticism is that link relations fail to describe the valid verbs for a link, and there is no appropriate place to put that information. We're being told about a link, but not how to interact with it.
We're given half of the story: this link relates to this item in this way (rel attribute), and you can expect it to hold this content type (type attribute) and it's over here (href attribute).
But the missing part of that story is "and to make a call to this particular end point you need to use one of these methods: HEAD, or GET".
We could use OPTIONS of course; that's the point of it. But pragmatism kicks in: when I've gone down a purity route in the past and had people do things like this, developers start to kick back. They end up with a very chatty API and lots more code than they need to perform a simple action. The audience of an API remains the developer, and keeping to a purity line just causes most developers pain (not all devs; some prefer purity).
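To illustrate the chattiness: before each action the client has to ask (end point hypothetical, Allow header standard HTTP):

    OPTIONS /account/12345/deposit HTTP/1.1
    Host: api.example.com

    HTTP/1.1 200 OK
    Allow: GET, POST, HEAD, OPTIONS

One extra round trip per resource, just to learn which verbs are valid.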
Going back to the example in the linked cookbook, they had a bank and a deposit resource. You can reasonably expect that the API permits a new deposit, permits fetching information about an existing deposit, but in the case of a bank account won't allow you to edit or delete a deposit once it's been made... you need to make a new deposit to fix that.
So reasonably links should have been returned that pretty much said:
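Something like this (illustrative only; 'methods' is the made-up attribute I come back to below):

    {
      "links": [
        { "rel": "self",    "href": "/account/12345",         "methods": ["GET"] },
        { "rel": "deposit", "href": "/account/12345/deposit", "methods": ["GET", "POST"] }
      ]
    }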
DELETE wasn't allowed for either, and PUT and POST depended on the end point.
I shouldn't do that though, as 'methods' is a gibberish attribute I just made up and no client will know what to do with that.
I should use OPTIONS, but then if you follow that through why include a 'type' attribute as you could've got that info from OPTIONS? (The entity-body of the OPTIONS response could give you the type information and even describe the schema of the resource).
I guess where I'm at is that once you start printing links according to what the user (not client) of the API can or cannot do, you're effectively echoing permissions through the use of links. And that permissions go beyond which resources can be touched, and into what actions you can perform on those resources. Meaning that to express this properly, we start needing to be able to communicate which verbs are good for this user, for a given resource. Requiring a second HTTP request for everything to check these permissions seems a bit crazy in practice (very chatty APIs) though great in theory (conformance to every spec there is).
And what we're doing at the moment is opaque, as the information on the interactions the user can perform is held in different places: in the link, in OPTIONS, and additionally in specs. A developer implementing against this isn't given an easy way to just answer the question "What can I do right now?"... we give them half the information they need and leave it to them to figure out the rest.
BTW: Respect for your book, thanks for replying as I'd love to hear your views on the above.
> BTW: Respect for your book, thanks for replying as I'd love to hear your views on the above.
<3. Trying to get more good stuff out there...
> Then the developer has to hope that the API implementor also knew about that spec and did the right thing.
This is why relations that aren't specified in your media type or in the IANA list are supposed to be URIs. If they're not, they're breaking the Web Linking spec.
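e.g. an extension relation expressed as a URI (URI hypothetical):

    { "rel": "http://api.example.com/rels/deposit", "href": "/account/12345/deposit" }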
> But the missing part of that story is "and to make a call to this particular end point you need to use one of these methods: HEAD, or GET".
There is no reason that text cannot be in every link relation. It's just not. When defining your own, absolutely add that in.
> And that permissions go beyond which resources can be touched, and into what actions you can perform on those resources.
This is certainly a good insight.
> Requiring a second HTTP request for everything to check these permissions seems a bit crazy in practice
Technically, there is no difference at all in how they function.
But I only use //example.com/ if an API is truly available on both http and https (very unlikely; nearly all APIs should be https-only if they use some form of access token in the query string for auth), and I only use https://example.com/ if the end point exists on some other domain.
As there is no technical difference I opt to save some bytes in the bandwidth.