Hypermedia Workflow Questions

Following a talk I gave about hypermedia APIs, I received a question from an Abiel Woldu on how to handle hypermedia support when the same backend operation is called from different workflows. Here’s part of the Abielâ€™s email:

â€œSay you have an end point for validating address; call it /validateAddress. Now this endpoint is called from two work flows.

When a user updates his account settings (changes a new address)
When a user tries to buy a product and enters the shipment address

In both cases the /validateAddress should give different set of links and forms as part of the response of validation (next step affordances) because the flow is different. In this case what is the set of the next links and forms returned from the endpoint? Is it the union of the two workflows and the client knows how to get what it needs? Or does the client send information of which flow it is in and the server uses the information to figure out what response to give it?â€

Decoupling Backend Processes from Public URIs This kind of question comes up frequently. Essentially, there are a couple assumptions here that are worth exploring. The first is the idea that a backend operation (e.g. â€œvalidateAddress()â€) is exposed over HTTP as a single endpoint, no matter the calling context. This is not a requirement. In fact, it is advantageous to decouple public addresses (URI) from private operations on the server. HTTP (whether using HTTP-CRUD, Hypermedia-REST or some other model) offers the advantage of using multiple public URIs to point to the same backend operation. For example, it is perfectly correct to publish both /validateExistingAddress and /validateNewAddress URIs each of which points to the same â€œvalidateAddress()â€ operation on the server.

Not Everything Needs a URI Just because the backend server has an operation such as â€œvalidateAddress()â€ does not mean there has to be a URI associated with that operation. For example, the â€œuser updates his account settingsâ€ workflow need not have a direct URI call to â€œvalidateAddress()â€. Instead, there could be an account settings resource (/account-settings/) that supports the HTTP.PUT method and accepts a body containing (among other things) a modified address. Executing this client-side operation (PUT /account-settings/) passes data to the server and â€“ along with other operations â€“ the server calls the â€œvalidateAddress()â€ operation itself and reports the results to the client.

The same can be done in the case of â€œuser tries to buy a product and enters the shipment addressâ€. This address validation could be a small part of the server-side operation and processing of an HTTP.POST to a /check-out/ resource.

Mapping Actions to URI & Method In the HTTP-CRUD model, the focus is on using URIs to identify entities and/or operations and using the protocol methods to perform actions. For example, an /addresses/ resource that supports adding (POST), modifying (PUT), removing (DELETE) and retrieving (GET) addresses associated with a context (logged in user, check-out processing etc.) In this case, POSTing or PUTing a resource body to the server allows the server to call the â€œvalidateAddress()â€ operation (among other things) and report results to the client.

Mapping Actions to Hypermedia Controls In the hypermedia model, actions are described using a hypermedia control such as a link or form. The URI is not important in this model. Instead the control has an identifier (e.g. â€œvalidateâ€œ), indicates a protocol action (â€œPOSTâ€œ) and lists state data to include in the payload.

In Siren it might look like this:

"actions": [
 {
     "name": "validate",
     "title": "Validate an Address",
     "method": "POST",
     "href": "...",
     "type": "application/x-www-form-urlencoded",
     "fields": [
           { "name" : "Street", "type" : "text", "value" : "123 Main Street" },
           { "name" : "City",   "type" : "text", "value" : "Byteville"},
           { "name" : "State",  "type" : "text", "value" : "MD" },
           { "name" : "ZIP",    "type" : "text", "value" : "12345"}
     ]
     }
 ]

Note that I didn’t bother to enter a value for the href in this example. It could be any valid URL; I just left it out.

Tracking Workflow Progress Within Messages Here’s another question from Abiel Woldu’s email:

â€œThe concept of which work flow the client is going through â€“ is it code that should reside in the API code itself or it’s something that sits outside in some other gateway or something?â€

When implementing processes over HTTP, it’s wise not to rely on stateful multi-request chains. In other words, don’t expect either the client or server to keep track of where some request belongs in a workflow. Instead, include that information in the request and response bodies themselves. This pattern of including all the important context information with each request and response not only assures that the request can be handled independently (e.g. in a load-balanced cluster), it also helps clients and servers to do work within varying time-spans (e.g. a clients can cache the last request to disk and pick things up a day later). In the REST model, Fielding described this as making messages â€œself-descriptiveâ€.

For example, there might be a use case that prompts human users to provide quite a lot of information (across various UI tabs) before finally submitting this completed set of work to the server for final validation and processing. One way to support this over HTTP is to allow clients to store â€œwork-in-progressâ€ (WIP) records on the server. As each â€œtabâ€ (or other UI affordance) is completed, the client app is free to execute a POST or PUT operation with the payload to a URI supplied by the server. The stored data would include a value that indicates how far along in the workflow the user has progressed. This same client app could also recall stored WIP records, inspect the workflow indicator and prompt the user to pick up where she left off. Once all the required elements were supplied, the work could be forwarded for final validation and processing.

Dynamic Workflow via Hypermedia Finally, in some cases, the series of steps in a workflow might vary greatly at runtime. For example, a service might support a multi-tenant model where each instance of â€œsupply all the details for this workâ€ has different steps or the same steps appear in differing order. The â€œnext stepâ€ need not be memorized by the client code. Instead, hypermedia servers can inspect the current server-side configuration, check the current progress by the user and then supply the correct â€œnext stepâ€ for this particular instance.

In this way, the client app can support a wide range of workflow details without needing custom code ahead of time (or even downloaded code-on-demand). Instead, the client app only needs to be able to recognize the â€œnext stepâ€ link and navigate to that resource.

In Summary In general, when using HTTP:

There is no rule that you must expose internal methods as public URIs
You may use more than one URI for the same backend operation
In the HTTP-CRUD model, you usually map operations by linking URIs and methods
In the hypermedia model, you usually map operations by linking controls and state variables
It is best to use â€œself-descriptiveâ€ messages to track workflow progress statelessly
The hypermedia model supports dynamic workflow progress using the â€œnext stepâ€ link pattern

Thanks to Abiel for his questions and his generous permission for me to use his email and name in this blog post. If you’ve got a question that I havenâ€™t answered online before, feel free to ping me via twitter (@mamund) and fire away.