Re: [Rd] Exposing native httpd server

Ben Bolker Sun, 08 Dec 2024 12:20:05 -0800

I absolutely appreciate the desire for minimalism. On the other hand, Rserve has no dependencies other than R >= 1.5.0 (!!!), so you would in any case be cutting your dependencies way down (`servr` has 16 recursive dependencies, of which 5 seem to be base/recommended; presumably this is where your count of 12 came from; `Rserve` has none).


On 12/8/24 14:57, Jiří Moravec wrote:

Dear Simon and Jeroen,
thank you for your answers. I have to reiterate that I am out of my depth here. My knowledge of http is clicking links and not much beyond that.
I will definitely look into `webutils` and `Rserve`.
One of the reasons why I brought this issue up is that I have a static site generator that uses the package `servr` to serve the static site locally, before I push it to GitHub Pages.
This allowed me to remove some 12 dependencies.
For this, the internal R webserver seems to be completely sufficient and I thought that it would be nice to have this functionality without it being "illegal" (i.e., replacing an internal function)
and possibly documented so that the limitations are clear.
As for the limitations, IMHO when implemented as I did (Sys.sleep(Inf), setting the path, and resetting it on exit), it behaves like most shiny apps I saw, or many apps in general. So when I think about it as a kind of user interface within the browser, as an alternative to one written in something like tcl/tk, rather than as a part of internet infrastructure, it feels quite sufficient to me.
Lately, I have been quite minimalist and I found great joy in discovering that base is quite a bit more powerful than people often think, so I am quite happy finding out that the internal R server is fully sufficient for me,
but can't speak for other people and their intended use.
So we can leave it at that. Maybe in a few more years, when I am more familiar with web architecture and R internals, I can make a better argument, hopefully followed with some rad code.
-- Jirka

On 6/12/24 20:05, Simon Urbanek wrote:
Jiří,
in a sense there are two quite different issues that you are touching upon. On one hand, your request for exposing the http server is something I was pretty much expecting. In order to judge the appetite for it I have included the support for custom handlers back then as unofficial API specifically so that if anyone cares we could work on refining it (really only Jeff and Hadley ever asked and/or provided feedback). But I would argue over time it became more clear that it's probably not the way to go.
The real problem is that we don't really want to "just" expose the server because of the implications that you mentioned indirectly: the server is deliberately run in the current R session, which is pretty much exactly what we want for the help system, but it is something that is in most cases undesirable for several reasons. Firstly, a normal R user does not expect http requests to mess with their analysis (e.g. changing the working directory would certainly not be welcome), so we don't want random code to execute and interfere with the user's work. Secondly, http services are usually expected to be scalable and not interfere with each other, which is not possible directly here with the server as-is since it is fully serial within the user's session. What is truly desired strongly depends on the use-case: some applications would prefer a forked session for each connection, others may want co-operation in a separate environment. It is all doable, but beyond the scope of R's internal http server.
Moreover the internal http server is based on the Rserve package and you always have much greater flexibility there. There are also higher level abstractions like RestRserve. So if you like the internal server then you can seamlessly use Rserve as the API was derived from there. Of course there are other alternatives in package space like httpuv. We typically don't want to fold things into core R unless it's absolutely necessary, i.e., not if they can happily live in package space.
In short, I'm still not convinced that you really want to use the built-in server. Although it is a fully featured http server, it was included for a very specific purpose, and it's not clear that it would be a good fit for other purposes.
That said, I'm interested in ideas about what users would want to use it for. There may be use-cases which do fit the design so we could make it happen. I would recommend looking at Rserve first, because anything implemented there is trivial to add to R (as it is the same code base) if it would make sense. So I'm open to suggestions, but they should be centered around what cannot be done already.
Cheers,
Simon
On Dec 5, 2024, at 2:43 PM, Jiří Moravec <[email protected]> wrote:
R has a native HTTP server that is used for serving R help pages interactively, at least on the loopback device (127.0.0.1).
But all of the workings are internal, not exposed to the user and not documented. This is quite a shame since the server seems to be fully capable of handling basic tasks,
be it serving static websites or even interactively processing queries.
This was previously noticed by Jeffrey Horner, the author of the Rook package.
I am just a guy who found it interesting.

The basic working is as follows:
The user needs to either overwrite the internal `tools:::httpd` function or add their hook into the internal environment `tools:::.httpd.handlers.env`.
In the former case, the user will be in full control of the server; in the latter case, the `app` will be hooked to `/custom/app` instead. All that is needed then is to run the interactive help that starts the webserver.
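A minimal sketch of the second (hook) route described above, not the code from the gist. The handler name `hello` and its response text are made up for illustration, and the whole thing relies on unexported internals of `tools` that may change between R versions:

```r
# Hypothetical handler; the name "hello" and the response text are illustrative.
# Handlers are called as fun(path, query, ...) and return a list whose first
# two elements are the payload and its content type.
hello_handler <- function(path, query, ...) {
  list(
    payload = paste0("Hello from ", path, "\n"),
    "content-type" = "text/plain"
  )
}

# The handler environment is unexported, so it is reached through the
# tools namespace (an internal detail, not a documented API).
handlers <- get(".httpd.handlers.env", envir = asNamespace("tools"))
assign("hello", hello_handler, envir = handlers)

# Starting the dynamic help server also starts serving the custom handler.
if (interactive()) {
  port <- tools::startDynamicHelp(NA)  # NA: start only if not already running
  utils::browseURL(sprintf("http://127.0.0.1:%d/custom/hello/", port))
}
```

The server start is guarded by `interactive()` so that sourcing the snippet in a batch session only registers the handler.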
Based on the breadcrumbs left along the way, I was able to write a server that emulates the much more complex `servr` package that I have previously used to test my blog locally.
https://gist.github.com/J-Moravec/497d71f4a4b7a204235d093b3fa69cc3

You can see that I am forced to do some illegal procedures:
* tools:::httpd needs to be replaced
* the server doesn't have knowledge of a directory so setwd needs to be set
* the function must not end, otherwise the directory is changed during the server lifetime (and depends on the current working directory)
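One possible way around the `setwd` issue, sketched here under assumptions rather than taken from the gist: a handler built as a closure can remember its own root directory. The function name `make_static_handler`, the `/custom/site` prefix, and the small MIME table are all hypothetical, and the four-element error list assumes the server reads the result positionally as payload, content type, headers, status code:

```r
# Hypothetical helper: builds a handler that serves files below `dir`
# without touching the session's working directory.
make_static_handler <- function(dir, prefix = "/custom/site") {
  root <- normalizePath(dir, winslash = "/", mustWork = TRUE)
  # Deliberately tiny lookup table; extend as needed.
  mime <- c(html = "text/html", css = "text/css", js = "text/javascript",
            txt = "text/plain", png = "image/png")

  function(path, query, ...) {
    rel <- sub(prefix, "", path, fixed = TRUE)
    if (rel == "" || endsWith(rel, "/")) rel <- paste0(rel, "/index.html")
    file <- normalizePath(file.path(root, rel), winslash = "/", mustWork = FALSE)

    # Refuse anything that resolves outside the root (e.g. "/../secret").
    inside <- startsWith(file, paste0(root, "/"))
    if (!inside || !file.exists(file) || dir.exists(file)) {
      # Assumed positional layout: payload, content type, headers, status code.
      return(list(payload = "Not found", "content-type" = "text/plain",
                  headers = NULL, "status code" = 404L))
    }

    ext <- tolower(tools::file_ext(file))
    type <- if (ext %in% names(mime)) mime[[ext]] else "application/octet-stream"
    list(file = file, "content-type" = type)
  }
}
```

Such a handler could then be registered in the handler environment under the name `site`, so that `/custom/site/` maps to the chosen directory while the working directory of the session stays untouched.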
I would like to suggest exposing the native http server, and to probe for willingness to do so.
This would include:
* de-hardcoding the server so that we can register other functions, not just httpd
* exporting many functions and renaming them (such as mime_type)
* writing better interfaces, `startDynamicHelp` is kind of hard to work with, something like httpd_start(dir, fun, port), httpd_stop(port) and httpd_status(port) would be much cleaner.
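To make the last point concrete, here is a rough sketch of what thin wrappers over the current internals might look like. It is not the proposed API: the session has a single server, so the `dir`, `fun` and `port` arguments are left out, and the unexported `tools:::httpdPort()` is used on the assumption that it keeps returning the current port (positive when running):

```r
# Hypothetical wrappers around the existing, single per-session server.
httpd_status <- function() {
  port <- tools:::httpdPort()  # unexported; assumed > 0 only when running
  list(running = isTRUE(port > 0L), port = port)
}

httpd_start <- function() {
  # NA starts the server only if it is not already running.
  invisible(tools::startDynamicHelp(NA))
}

httpd_stop <- function() {
  # startDynamicHelp(FALSE) errors when nothing is running, so check first.
  if (httpd_status()$running) tools::startDynamicHelp(FALSE)
  invisible(httpd_status())
}
```

Even this small wrapper shows the gap: without changes to the internals there is no way to pick a directory, a handler function, or run more than one server.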
I would like to say that I have no idea what I am doing, I don't understand webtech or the internal implementation, so if there are reasons why this isn't a great idea...
I am happy to make a PR for the R part. https://github.com/wch/r-source/blob/trunk/src/library/tools/R/dynamicHelp.R
The C part with R's C internals looks to me like black magic and I don't feel confident enough. https://github.com/wch/r-source/blob/trunk/src/modules/internet/Rhttpd.c
See this old stackoverflow answer, where someone was looking for an equivalent of `python -m SimpleHTTPServer 8080`
https://stackoverflow.com/q/12636764/4868692

______________________________________________
[email protected] mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel


--
Dr. Benjamin Bolker
Professor, Mathematics & Statistics and Biology, McMaster University
Director, School of Computational Science and Engineering

* E-mail is sent at my convenience; I don't expect replies outside of working hours.


