[service-orientated-architecture] Vinoski on RESTing with Erlang

Gervas Douglas Fri, 04 Apr 2008 08:42:05 -0700

<<Ever see the famous "Apache vs. Yaws" graphs<http://www.sics.se/%7Ejoe/apachevsyaws.html> and wonder whether you,too, should be using Yaws? The graphs show what at first seems to be anunbelievably huge scalability advantage for Yaws, with its ability toscale to over 80000 parallel connections while Apache keels over at only4000. Reactions to these graphs tend to be quite polarized, typicallyeither one of "there's no way these graphs are accurate" or "they musthave misconfigured Apache", to the opposite reaction of "Wow, I need totry using Yaws!"

Regardless of whether you believe the Yaws comparison graphs or not,Yaws <http://yaws.hyber.org/> is a solid web server for serving dynamiccontent. Claes Wikström wrote Yaws - "Yet Another Web Server" - inErlang <http://www.erlang.org/>, a programming language createdspecifically to support long-running, concurrent, highly reliabledistributed systems. (To learn more about Erlang, get a copy of thewonderful book /Programming Erlang/<http://www.pragprog.com/titles/jaerlang>, written by the language'screator, Joe Armstrong.) The flexibility of Yaws combined with severalunique features of Erlang makes them a compelling combination for aRESTful web services platform. If you're serving static pages, grablighttpd <http://www.lighttpd.net/> or nginx <http://nginx.net/>instead, but if you're writing dynamic RESTful web services, then Yawsis definitely worth exploring. In this article, I'll relate some of myexperiences with using Yaws and Erlang for web services development.



   Yaws Basics

Yaws provides several ways of serving dynamic web content and supportingRESTful web services:


   *

     /Embedding Erlang code within static pages/. With this approach,
     you embed Erlang code within a function named |out/1| within ...
     tags directly into static content. Files of this nature have a
     |.yaws|... tags with the result of executing the |out/1| function
     they're expected to contain. In Erlang terms, |out/1| is a
     function of arity 1, i.e., a function taking one argument. Its
     argument is expected to be a Yaws |arg| record, which is a data
     structure that Yaws uses to communicate details for incoming
     requests to the code handling them. For example, an |arg| record
     supplies information such as the request URI, incoming headers,
     |POST| data, etc. extension, by which Yaws knows to process the
     file and replace the

   *

     /Application Modules (appmods)/. The Yaws appmod facility lets
     application code take control of URIs. In the approach described
     above, Erlang code is embedded within static files whose URIs are
     determined by their pathnames relative to the web server's
     document root. With an appmod, however, the application controls
     the meaning of URIs, and such URIs usually do not correspond to
     any file system artifacts. Appmods are basically Erlang modules
     that export an |out/1| function. Such modules are configured in
     the Yaws configuration file to correspond to a URI path element.
     When a request is made containing a path element associated with a
     registered appmod, Yaws invokes that module's |out/1| function,
     passing it an |arg| record. The appmod's |out/1| function can then
     examine the rest of the URI to determine the precise resource that
     is the target of the incoming request, and respond accordingly.

   *

     /Yaws applications (yapps)/. Unlike appmods which are usually just
     single Erlang modules, Yaws yapps are full-fledged applications.
     Each yapp has its own document root, and each can have its own set
     of appmods. Specifically, yapps are Erlang/OTP applications. OTP,
     which stands for "Open Telecom Platform," is a set of well-proven
     libraries and frameworks that provide Erlang applications with
     powerful capabilities. OTP encapsulates idioms and approaches for
     achieving distribution, event handling, and high reliability,
     among many other things. Erlang/OTP has been proven in real-world
     field usage within a variety of telecom systems, for example, some
     of which mark their downtime in just a few milliseconds per year.

All three of these approaches, which are detailed at the Yaws website<http://yaws.hyber.org/>, can be usefully applied within a RESTful webservice, depending on the specific nature of the service itself.However, in my experience, yapps and appmods work the best, because theyprovide the most control to the web application.



   RESTful Design

Since we want to develop RESTful web services, let's look at somedetails of REST, which stands for "Representational State Transfer." RoyT. Fielding coined the term "REST" in his doctoral thesis<http://www.ics.uci.edu/%7Efielding/pubs/dissertation/top.htm> todescribe an architectural style suitable for large-scale distributedsystems like the web. HTTP is essentially an implementation of REST. Theterm "representational state transfer" refers to the fact that RESTfulsystems operate via the exchange of representations of resource state inrequests and replies. For example, the typical web page retrieved withan HTTP GET is an HTML representation of the web resource identified bythe URI targeted by the GET.

When developing a RESTful web service, these are the key areas to payattention to:


   *

     Resources and resource identifiers

   *

     Methods supported by each resource

   *

     Formats of data interchanged between client and server

   *

     Status codes

   *

     Applicable HTTP headers for each request and response

Let's consider each of these areas in the context of Yaws and Erlang.


   Resource Identifiers

Designing a RESTful web service requires you to think about whatresources comprise your service, how to best identify them, and how theyrelate to one another. RESTful resources are identified by URIs.Normally, related resources have URIs that are themselves related,sharing common path elements. For example, in a web-based bug trackingsystem, all the bugs for imaginary project "Phoenix" might be foundunder the URI |http://www.example.com/projects/Phoenix/bugs/|, whereasthe specific bug numbered 12345 might be under|http://www.example.com/projects/Phoenix/bugs/12345/|. RESTful resourcesalso tend to provide URIs for other resources within their own staterepresentations. This allows clients retrieving a particular resource'sstate to use the URIs returned within the state representation tonavigate to other portions of the overall web application.


out(Arg) ->
    Uri = yaws_api:request_url(Arg),

Path = string:tokens(Url#url.path, "/"),Once you have the request URI, I've found that it's handy to tokenizethe request path as shown above, by splitting it on its forward slashes.The result is a list of path elements that begin at the URI point whereyou've tied your appmod. For example, let's assume we've tied an appmodonto the "projects" path element in the URI|http://www.example.com/projects/|. If a request is made on any URIcontaining this URI as its prefix, the appmod's |out/1| function willwind up with a list of separated path elements indicating the targetresource of the request. For example, a request for URI|http://www.example.com/projects/Phoenix/bugs/| will result in thefollowing Erlang list of path elements in the |Path| variable afterexecuting the code shown above:


["projects", "Phoenix", "bugs"]

The utility of splitting the URI is that it makes further dispatchingquite simple, thanks to Erlang's pattern matching. For example, we canwrite a separate function, let's call it |out/2|, to handle thisspecific URI by defining the function head like this:

out(Arg, ["projects", Project, "bugs"]) ->% code to handle this URI goes here.

This |out/2| function will handle all requests for bug lists for allprojects we know about, with the variable |Project|, which is availableto the function body, being set to the specific project name beingrequested. Supporting additional URIs is equally as simple: just addmore variants of the |out/2| function. You can also feel free to use aname other than |out| for these functions if you wish, since they arenot invoked directly by the Yaws framework.

Note that properly defining your resource URIs yields significantbenefits. With appmods and yapps, having a rich URI space is quitesimple because of the simplicity of tying different appmods ontodifferent URI path elements, and the ease of dispatching. Erlang patternmatching makes handling requests for different URIs trivial. Contrastthis with the poor style traditionally used for defining non-RESTfulservices, where all services are given the same URI. This URI typicallypoints to a script that uses information provided within the requestbody or through URI query strings to determine where to actuallydispatch the request. The URIs that result from the Erlang/Yawsdispatching technique shown above are far cleaner than the overloadedURIs with seemingly endless parameter lists that result from thetraditional approach.



   Resource Methods

The methods that web clients can invoke on a web resource are defined byHTTP's verbs, primarily |GET|, |PUT|, |POST|, and |DELETE|. However,individual resources tend to support only a subset of those verbs. Whenyou design your web service, you need to determine what methods each ofyour resources supports, bearing in mind the semantics expected for eachHTTP verb as defined in RFC 2616 <http://www.ietf.org/rfc/rfc2616.txt>.

In Yaws, the request method is found in the |http_request| record,accessible via the |arg| record:


Method = (Arg#arg.req)#http_request.method

This returns an Erlang atom representing the request method, which canthen be added into our pattern-matching dispatching approach. We can adda new parameter to our |out| function, turning it into |out/3|, toinclude the request method:


out(Arg, 'GET', ["projects", Project, "bugs"]) ->
    % code to handle GET for this URI goes here.


out(Arg, 'GET', ["projects", Project, "bugs"]) ->
    % code to handle GET for this URI goes here;
out(Arg, 'POST', ["projects", Project, "bugs"]) ->
    % code to handle POST for this URI goes here;
out(Arg, _Method, ["projects", _Project, "bugs"]) ->
    [{status, 405}].

Just as with URI dispatching, Erlang pattern matching makes dispatchingto separate functions to handle separate HTTP verbs trivial.



   Representation Formats

When designing a RESTful web service, you need to consider whatrepresentation(s) each resource supports. Web service resources oftensupport XML or JSON representations, for example. Erlang supplies thexmerl library <http://www.erlang.org/doc/apps/xmerl/index.html> forcreating and reading XML, and Yaws provides a straightforward JSONmodule. Both work quite well.


Accept_hdr = (Arg#arg.headers)#headers.accept

If your resource supports multiple representations, you can check thisheader to see if the client indicated which representation it prefers.If the client did not send an |Accept| header, the |Accept_hdr| variableshown above will be set to the atom |undefined|, and your resource cansupply whatever representation it deems best. Otherwise, your servicecan parse the |Accept_hdr| value to determine which representation tosend. If the client requests representations that your resource cannotfulfill, it can return HTTP status 406, which means "not acceptable,"along with a body indicating what formats are acceptable:


case Accept_hdr of
    undefined ->
        % return default representation;
    "application/xml" ->
        % return XML representation;
    "application/json" ->
        % return JSON representation;?
    _Other ->
        Msg = "Accept: application/xml, application/json",
        Error = "Error 406",
        [{status, 406},
         {header, {content_type, "text/html"}},
         {ehtml,
          [{head, [], [{title, [], Error}]},
           {body, [],
            [{h1, [], Error},
             {p, [], Msg}]}]}]

end.The Erlang code above checks the |Accept_hdr| value to see if it'seither |application/xml| or |application/json|. If it's either of those,the resource returns a suitable representation, but if not, the codereturns an HTTP status 406 along with an HTML document indicating therepresentations the resource is willing to provide.

Another way of handling the desired representation is - you guessed it -adding it as another parameter to our |out| handler function. This way,Erlang pattern matching ensures that our request gets dispatched to theright handler for the requested URI/method/representation combination.This avoids cluttering handlers with case statements like the one above.

By the way, this example also shows the Yaws |ehtml| type, which is away of representing HTML as a series of Erlang terms. I find |ehtml|quite intuitive to write because it directly follows the structure ofHTML, but is far more compact and eliminates the tedium and errors ofmatching tags that you face when writing literal HTML.



   Status Codes

RESTful web services must return proper HTTP status codes, as indicatedby RFC 2616. Returning the right status is easy with Yaws: simplyinclude a |status| tuple in the result of your |out/1| function. See thecase statement above for an example of returning the appropriate statuscode. If your code does not explicitly set a status, Yaws will set astatus 200 for you, indicating success.



   HTTP Headers

Retrieving request headers and setting reply headers with Yaws isstraightforward, too. We've already seen an example of retrieving the|Accept| header from the headers record; other request headers can beretrieved in the same fashion. Setting reply headers simply requiresputting a |header| tuple in the outgoing reply, like this:


{header, {content_type, "text/html"}}

This sets the |Content-type| header to "text/html," for example.Similarly, in our previous example where we returned status 405 toindicate a "method not allowed" error, we should have also included thefollowing header:

{header, {"Allow", "GET, POST"}}


   Appmods or Yapps?

So far we've seen how Yaws and Erlang make it almost trivial to handlemany of the most important concerns for RESTful web services. Oneremaining question is about choosing appmods vs. yapps, and the answerdepends on what your services do. If you're writing web services thathave to interact with other back-end services, then yapps are probablyyour best bet. Since they're full-blown Erlang/OTP applications, theytypically have initialization and termination functions whereconnections to the back end can be created and shut down. If your yappis an Erlang/OTP |gen_server|, for example, your |init/1||gen_server|framework will provide to you, and allow you to modify, every time itcalls you back due to an incoming call to your server. Besides, usingyapps also means you can use appmods as well, so it's not really amatter of choosing one over the other. Finally, yapps can participate inErlang/OTP supervision trees, where supervisor processes can monitoryour yapps and restart them if they should fail. Supervisor trees play asignificant role in the reliability of long-running Erlang systems.function can establish state that the

This article is geared toward RESTful web services based on back endsother than relational databases. If you're writing a traditional webserver on top of a relational database, you should check out Erlyweb<http://erlyweb.org/>, a framework for such web services, which is alsobased on Yaws and Erlang.



   Conclusion

A significant aspect of writing RESTful web services is choosing theright programming language. We've seen numerous service frameworks in avariety of programming languages come and go over the years, and mostwere failures simply because they were a poor match to the problem. Yawsand Erlang do not specifically provide a RESTful web services framework,yet the facilities they provide are a better match for RESTfuldevelopment than many other language frameworks that were builtspecifically for that purpose.

While an article of this nature necessarily can't dive deeply into thedetails of Yaws, Erlang, and RESTful web services, it has hopefullytouched on the important topics and provided, through its minimal codeexamples, an idea of how to address them. In my experience, buildingRESTful web applications with Yaws and Erlang is very straightforward,and the resulting code is easy to read, easy to maintain, and easy toextend.>>


*You can read this at:
*

*http://www.infoq.com/articles/vinoski-erlang-rest;jsessionid=DE23AB2982FB4FF34989ADE0A1F536AD
*

*Gervas*

[service-orientated-architecture] Vinoski on RESTing with Erlang

Reply via email to