Re: New Asterix REST API design

Mike Carey Fri, 15 Apr 2016 22:34:53 -0700

True. I was mainly thinking of the web console; usually one will wantone format for an app, I totally agree.

For the console it might be fun to be able to switch around.

I was thinking that either way the total effort put into computing thefinal (formatted) result would be the same, and also that the per-nodeinvestment in computing it would be the same - so that the timecomplexity would be the same.However, I agree that this would potentially increase the overallretrieval latency since we'd be doing serial just-in-time formatting asthe pickups occur.I'm fine either way - I was just thinking maybe things would be easier(from a boundary-finding standpoint) the binary way.

(Not sure!)
Cheers,
Mike


On 4/15/16 8:25 PM, Till Westmann wrote:

I think that it’s a trade-off. Either we do the work when the job is
evaluated or when the job is picked up. If we did it on pick-up, we could
pick it up more than once in different formats, but I don't think thatmanyapplications would need that (the web console might as somebodysitting in
front of it might want to look at it). The nice thing about the current
solution is, that we can do the serialization easily in parallel and the
pickup can happen sequentially and we don't have to interleave that with
more computation.

On 15 Apr 2016, at 17:39, Mike Carey wrote:
In a more perfect world, the query results would perhaps be persistedin binary ADM form still, and would be just-in-time reformatted whenthey are picked up for delivery back to the requester. At least thatseems like it would be better... No?
On 4/15/16 5:22 PM, Ildar Absalyamov wrote:
I agree that the example where CSV is embedded into return JSONlooks quirky (and I am not the big fan of it either).I believe the tradeoff here is following: do we want to keep numberof API calls just to get the data minimum, or logically separatemetadata (like plans, execution time metrics, etc) from the data onthe endpoint level.I have tried to address the former case, however left an option tomake this logical separation if the user is wiling to do that (viainclude-results parameter). There is no real way to do it other wayaround, since the plans, etc are generated before query is scheduledand any results could be returned.
On Apr 15, 2016, at 17:13, Till Westmann <ti...@apache.org> wrote:
Yes, this API is not ideal for "just getting the data". However,Ildar’sgoal was to separate the data from the HTML and to build an APIthat can bethe basis for the Web-interface - and I think that the API looksgood for
that :)
I'm wondering if an endpoint to get the data should be an option onthis oneor a different endpoint. The reason is, that all of the additionalrequestmetadata that we can ask for (plan, metrics, warnings, ..) cannoteasily bereturned with such an API. An API that play well with curl mighteven put
the format into the URI, e.g.:
curl http://host:19100/query/csv?statment=select+element+1+as+one;> one.csv
Thoughts? Trade-offs?

Cheers,
Till

On 15 Apr 2016, at 16:48, Cameron Samak wrote:
That hop is exactly what I think should be (optionally) avoidablethough
because
1. The user still needs to parse both JSON (to get the URL)along with
   the other format (i.e. CSV)
Consider curl {myquery} > myoutput.csv. That's harder with theproposed
   API.
2. It's an unnecessary round trip back to the server (which,depending
   on the environment, can be significant esp. with quick queries).


Understood for the result distribution + serialization.


Cameron
On Fri, Apr 15, 2016 at 4:24 PM, Till Westmann <ti...@apache.org>wrote:
I had a misunderstanding that I think I clarified now. I believedthat wedon’t have the separation into tuples anymore after resultdistribution andthat we only have bytes that we pass to the client. In that caselimiting
in
the HTTP server would have had to choose between
a) limiting based on the number of bytes or
b) re-establishing tuple boundaries.
However, even though result distribution has serialized thetuples towhatever format (ADM, JSON, CSV), we still send frames and so weshould be
able to separate the tuples (and limit the number that we return).
So I think that it should be feasible to add that (feature creepis coming
... :) )

Cheers,
Till


On 15 Apr 2016, at 14:55, Mike Carey wrote:
I read this much more simply: Can we enhance the API, in thecase where
you start with a handle and know that the results are ready now,to fetchthe results in blocks instead of as one giant result? So stillcomputingthe giant result - just not pushing it all back at once - seemslike it
might help?


On 4/15/16 2:48 PM, Till Westmann wrote:
Hi Wail,
I’m not completely sure that I understand how to implement theidea. If
we
do this only in the API, it might be tricky to get theboundaries betweenrecords right (e.g. if we do indentation on the server).However, if we
want
to push this into the query engine, we need to understandenough of the
query/statements to put the limit clause in.
Both approaches don't look great to me.

What did you have in mind?

Cheers,
Till

On 15 Apr 2016, at 13:19, Wail Alkowaileet wrote:

Hi Ildar,
I think if there's something I would love to have is gettingpartial
result
instead of all result at once. This can be beneficial for result
pagination. When I use AsterixDB UI, 50% of the time my tabcrashes (I
forget to limit the result).

Thanks...

On Fri, Apr 15, 2016 at 1:23 AM, Ildar Absalyamov <
ildar.absalya...@gmail.com> wrote:

Hi Devs,
Recently there have been a number of conversations about thefuture of
our
REST (aka HTTP) API. I summarized these discussions in anoutline of
the
new API design:
https://cwiki.apache.org/confluence/display/ASTERIXDB/New+HTTP+API+Design
<
https://cwiki.apache.org/confluence/display/ASTERIXDB/New+HTTP+API+Design
.
The need to refactor existing API came from differentdirections (and
from
different people), and is explained in motivation section.Thus I
believe
it’s about the time to take an effort and improve existingAPI, so
that it
will not drag us down in the future. However during thetransition
step I
believe it would be better to keep exiting API endpoints, sothat we
would
not break people’s current experimental setup.

It would be good to know feedback from the folks, who have been
contributing to that part of the systems recently.

Best regards,
Ildar
--

*Regards,*
Wail Alkowaileet
Best regards,
Ildar

Re: New Asterix REST API design

Reply via email to