Re: Error during the file ingestion

2016-06-02 Thread Yingyi Bu
> > > Subhasis Dasgupta > > University of California, > San Diego > San Diego Supercomputer Center, MC 0505 > 10100 Hopkins Drive > La Jolla, CA 92093-0505 > > > Office : 858-534-3693 > email : sudasgu...@ucsd.edu > > > On Jun 2, 2016, at 3:29 PM, Yingy

Re: Error during the file ingestion

2016-06-02 Thread Yingyi Bu
Subhasis, Are you running from Eclipse? If so, you need to run "mvn clean; mvn package -DskipTests" from the command line to build the project first. Thanks! Best, Yingyi On Thu, Jun 2, 2016 at 3:25 PM, Dasgupta, Subhasis wrote: > my Skype is

dataset for ASTERIXDB-1061?

2016-06-03 Thread Yingyi Bu
Does anyone know/have the dataset for reproducing ASTERIXDB-1061? https://issues.apache.org/jira/browse/ASTERIXDB-1061 https://code.google.com/archive/p/asterixdb/issues/918 It seems to be some issue dataset. Best, Yingyi

Re: dataset for ASTERIXDB-1061?

2016-06-03 Thread Yingyi Bu
Could you upload that to the issue? Thanks! Best, Yingyi On Fri, Jun 3, 2016 at 12:53 PM, Ian Maxon <ima...@uci.edu> wrote: > Yeah I think I have it. It was the google code issues JSON data. > > On Fri, Jun 3, 2016 at 11:31 AM, Yingyi Bu <buyin...@gmail.com> wrote: > &

Re: Unsigned integers data types

2016-06-16 Thread Yingyi Bu
>> Is there any database or SQL implementation supporting that? Ok, it turns out MySQL supports that, while Postgres, MS SQL and Hive do not have that. Best, Yingyi On Thu, Jun 16, 2016 at 3:40 PM, Yingyi Bu <buyin...@gmail.com> wrote: > >> I guess part of the reason why

Re: new AsterixDB web interface demo up and running

2016-06-16 Thread Yingyi Bu
u have some time can you give me an AQl query which > will produce something like that. > > thanks in advance, > Kaveen > > On 16 June 2016 at 22:28, Yingyi Bu <buyin...@gmail.com> wrote: > > > Awesome! Thanks, Kaveen! > > > > Best, > > Yingy

Re: Dead links

2016-07-29 Thread Yingyi Bu
tha...@gmail.com> wrote: > Thank you Yingyi. > > On Jul 29, 2016, at 10:01 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > > Hi Vignesh, > > > >We're still working on finalizing the SQL++ doc for AsterixDB. Here > is > > a draft that you can take a lo

Re: Dead links

2016-07-29 Thread Yingyi Bu
Hi Vignesh, We're still working on finalizing the SQL++ doc for AsterixDB. Here is a draft that you can take a look at: https://github.com/sigmod/sqlpp/blob/master/document.md The ETA of the completed version would be next week. Thanks! Best, Yingyi On Fri, Jul 29, 2016 at 9:21

Re: unstable builds

2016-08-17 Thread Yingyi Bu
It seems there is some existing instance on the docker instance and hence it complains that the port is already used. Best, Yingyi On Wed, Aug 17, 2016 at 10:37 AM, Kaveen Rodrigo wrote: > It seems some of the tests have failed, as evident from >

Fwd: [jira] [Commented] (ASTERIXDB-1327) Spatial-intersect between point and circle not working correctly

2017-02-01 Thread Yingyi Bu
Fwd to dev. Best, Yingyi -- Forwarded message -- From: Willy Aguirre (JIRA) Date: Wed, Feb 1, 2017 at 3:24 PM Subject: [jira] [Commented] (ASTERIXDB-1327) Spatial-intersect between point and circle not working correctly To:

Re: Choosing defaults for AsterixDB

2017-02-03 Thread Yingyi Bu
> > > Ok, since there’s a lot of agreement and no concerns, I’ll go ahead. > > > > Thanks, > > Till > > > > On 10 Jan 2017, at 9:22, Yingyi Bu wrote: > > > >> +100! > >> > >> On Tue, Jan 10, 2017 at 9:17 AM, Mike Carey <d

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
Also, use them when access > > the Web Interface or API call. > > > > Best, > > Taewoo > > > > On Tue, Jan 31, 2017 at 9:52 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > >> Yeah, sounds good to me. > >> > >> Best, > >

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
; > On Tue, Jan 31, 2017 at 9:00 PM Yingyi Bu <buyin...@gmail.com> wrote: > > > I have two proposals for data loaders: > > > > 1. add AWS S3 as an external data source; > > 2. support POSTing a user file on the client side through HTTP into an > >

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
smaller number of better vetted projects this year, but I > think we could still try for at least one spot. > Steven > > On Tue, Jan 31, 2017 at 9:21 PM Yingyi Bu <buyin...@gmail.com> wrote: > > > IMO, each of them could be a two-three month project. > > > > T

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
I have two proposals for data loaders: 1. add AWS S3 as an external data source; 2. support POSTing a user file on the client side through HTTP into an internal dataset. Thanks! Best, Yingyi On Thu, Jan 19, 2017 at 2:06 PM, Steven Jacobs wrote: > Hi All, > The Google

Re: [VOTE] Release Apache AsterixDB 0.9.0 and Hyracks 0.3.0 (RC2)

2017-01-21 Thread Yingyi Bu
+1 - signatures and hashes of all 5 archives ok - nc service binary works - version api agrees with the commit id on ASF repo - source compilation works Best, Yingyi On Sat, Jan 21, 2017 at 9:16 AM, Steven Jacobs wrote: > +1 > Steven > > On Sat, Jan 21, 2017 at 7:36 AM Till
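The hash check mentioned in this vote can be sketched as follows. This is a minimal illustration, not the release procedure itself: the choice of SHA-256 and the "<hex digest>  <filename>" layout of the published digest are assumptions, and `sha256_hex` and `hash_matches` are hypothetical helper names.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of `data` as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def hash_matches(data: bytes, published: str) -> bool:
    """Compare `data` against a published digest line.

    Assumption: the published line is either a bare hex digest or
    "<hex digest>  <filename>"; only the first token is compared,
    case-insensitively.
    """
    tokens = published.split()
    if not tokens:
        return False
    return sha256_hex(data) == tokens[0].lower()
```

In practice the archive would be read from disk in binary mode and the signature would be checked separately with a tool such as GnuPG; neither step is shown here.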

Improve test coverage

2017-02-21 Thread Yingyi Bu
Hi dev, In order to enhance the quality of our code base, I wrote up a test guideline [1] which could be a reference for future patches. More precisely, the guideline summarizes various kinds of tests that we currently have and contains pointers to examples. Going forward, it would be nice if

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
; > Best, > Taewoo > > On Thu, Sep 15, 2016 at 2:40 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Hi Taewoo, > > > > Recently I have added several string functions into *DB: > > initcap(title), > > regexp_like, > > regexp

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
Hi Taewoo, Are those fulltext search syntax extensions only syntactic sugar (i.e., a surface thing) that is translated into functions? In the not-too-distant future, we will need to surface fulltext search in SQL++, probably using the same functions as Oracle. If the AQL fulltext

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
to the AsterixDB column. Best, Yingyi On Thu, Sep 15, 2016 at 2:27 PM, Yingyi Bu <buyin...@gmail.com> wrote: > Hi Taewoo, > > Are those fulltext search syntax extensions are only a syntactic sugar > (i.e., surface) thing that is translated into functions? > In a not-too-distant

Re: Function name change: contains() -> string-contains()

2016-09-16 Thread Yingyi Bu
Cool, +1! Best, Yingyi On Fri, Sep 16, 2016 at 1:54 PM, Taewoo Kim wrote: > So, in summary, we agree to use a function format for the full-text search, > rather than using XQuery syntax. "contains" doesn't have to be > "string-contains" and "text" doesn't have to be a

Re: Retain inputs on index-search

2016-09-16 Thread Yingyi Bu
I'd like to join the discussion. Best, Yingyi On Fri, Sep 16, 2016 at 10:47 AM, Steven Jacobs wrote: > Hi, > I have come across an issue where retainInputs is always set to false when > creating an index search in the select case. This was because it was > previously

Re: [jira] [Commented] (ASTERIXDB-1587) upsert runtime assumes that the datatype comes from the same dataverse as the dataset

2016-08-17 Thread Yingyi Bu
Best, Yingyi On Wed, Aug 17, 2016 at 6:19 PM, Steven Jacobs <sjaco...@ucr.edu> wrote: > I already pushed the change :( > But in this case it seemed like the test was unnecessary, although I do > have tests on my BAD branch that require this change. > Steven > > On Wed, Aug

Re: [jira] [Commented] (ASTERIXDB-1587) upsert runtime assumes that the datatype comes from the same dataverse as the dataset

2016-08-17 Thread Yingyi Bu
Steven, Can you add a regression test before closing that issue? (Typically, fixing ASTERIXDB- should have a test case in the change.) Thanks! Best, Yingyi On Wed, Aug 17, 2016 at 6:05 PM, ASF subversion and git services (JIRA) < j...@apache.org> wrote: > > [

Re: Modified/Custom plan: Push-down SELECT for external source.

2016-08-26 Thread Yingyi Bu
Currently you can push Project into the source but not Select. You're welcome to enhance IMetadataProvider to support that. You can take a look at DataSourceScanPOperator: Pair p = mp.getScannerRuntime(dataSource, vars, projectVars,

Minor code style change proposal

2016-08-25 Thread Yingyi Bu
Hi committers, The current code style (http://asterixdb.apache.org/dev-setup.html) that we use clears line breaks for javadocs. I'm proposing a minor one-line change in the code style: Attached is the updated code style file. Please reply to this message if you have different

Re: Physical Operator Tag Change (SPLIT -> REPLICATE)

2016-09-27 Thread Yingyi Bu
this is merged, the review for the second step would be much easier. > > Best, > Taewoo > > On Tue, Sep 27, 2016 at 5:08 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Taewoo, > > > > I agree with you. > > However, can you have two separate changes: &g

Re: Physical Operator Tag Change (SPLIT -> REPLICATE)

2016-09-27 Thread Yingyi Bu
Taewoo, I agree with you. However, can you have two separate changes: 1. Your current change https://asterix-gerrit.ics.uci.edu/#/c/1196/. Call your "Split" "PartitionSplit" for now. 2. Do what you proposed in the next change which only does the renaming. That makes reviews less overwhelming

Re: Does Projection affect count() performance?

2016-10-01 Thread Yingyi Bu
PS, if you still have the OOM instance, can you do a Yourkit memory profile? Thanks! Best, Yingyi On Sat, Oct 1, 2016 at 9:43 AM, Yingyi Bu <buyin...@gmail.com> wrote: > Wail, > > Can you attach the query plan for query 1? > I tried >count( fo

Re: Does Projection affect count() performance?

2016-10-01 Thread Yingyi Bu
Wail, Can you attach the query plan for query 1? I tried count( for $x in dataset beers return $x ) and got the following plan, which seems OK: -- DISTRIBUTE_RESULT |UNPARTITIONED| exchange -- ONE_TO_ONE_EXCHANGE |UNPARTITIONED| aggregate [$$5]

Re: Location of test results

2016-10-01 Thread Yingyi Bu
+1. Best, Yingyi On Sat, Oct 1, 2016 at 6:21 PM, Till Westmann wrote: > Hi, > > in a recent change [1] the test result location changed from "a directory > in > the module folder" to "a directory in the target folder". So now > a) the test results will get removed with every

Re: Line limit (120) is not enforced for an assignment statement.

2016-10-25 Thread Yingyi Bu
>> @Yingyi: I wonder whether your proposed change (do not remove blank lines >> in the comment section) is already applied to the format file It looks that I haven't... You can update this row: Thanks! Yingyi On Tue, Oct 25, 2016 at 4:26 PM, Till Westmann wrote: > +1 > >

Re: [jira] [Commented] (ASTERIXDB-1694) Fail running Tweet Feed on Cluster of 16 nodes (while succeed on 4 nodes)

2016-10-14 Thread Yingyi Bu
>> Log4J in Twitter4j doesn't initialize properly. Can you explain that a little bit more? What exceptions do you hit, "class not found", or sth. else? Thx! On Fri, Oct 14, 2016 at 2:20 PM, Wail Alkowaileet (JIRA) wrote: > > [

Re: Function name and format change in the codebase

2016-10-14 Thread Yingyi Bu
+1. For (1), the eventual goal is to only have underscores. Currently it's a transition period -- all docs are based on underscores. For new functions, it's good to only support underscores. I think that we'll end the life of hyphens after some time. Best, Yingyi On Fri, Oct 14, 2016 at 6:36

Re: type name changes

2016-10-18 Thread Yingyi Bu
yi On Tue, Oct 18, 2016 at 8:52 AM, Taewoo Kim <wangs...@gmail.com> wrote: > So, can we use "int" for "bigint" to be consistent? > > Best, > Taewoo > > On Tue, Oct 18, 2016 at 7:34 AM, Yingyi Bu <buyin...@gmail.com> wrote: > > > >>A

Re: type name changes

2016-10-18 Thread Yingyi Bu
ition (before:int for int64, after:int for integer > (int32)) and outside users that Mike mentioned did not have numbers greater > than INT32 range, I think it's OK. > > Best, > Taewoo > > On Tue, Oct 18, 2016 at 8:59 AM, Yingyi Bu <buyin...@gmail.com> wrote: > >

Re: type name changes

2016-10-18 Thread Yingyi Bu
ndor for copies of their > current ADMs? We should just check so we know if we need to warn them when > we release > > On Oct 17, 2016 11:49 PM, "Yingyi Bu" <buyin...@gmail.com> wrote: > > > This is the change that changes "record" to "object&q

Re: type name changes

2016-10-17 Thread Yingyi Bu
mentation. We used int for the abbreviation > for INT64 (I assume that is now bigint?) type. Now, INT is an abbreviation > for INT32? I thought we converted the default type to INT64 (bigint). > Aren't INT32 type displaying i32 as suffix? > > -- Forwarded message --

Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread Yingyi Bu
Mingda, I'm not sure how much re-ordering can be done at the Hyracks level, i.e., the runtime level. In the optimizer (the asterixdb/algebricks level), we don't have re-ordering for joins, because: --- the cost model has not been added yet. I'm not sure about the timeline for

Re: Let one Operator finished the job before another one begin in Hyracks

2016-10-11 Thread Yingyi Bu
tially the very same thing in their use > cases for parallel sorts and interval joins. Hyracks has everything needed > for this, as it turns out, without a multi-job need. > > > > On 10/11/16 9:26 AM, Yingyi Bu wrote: > >> You can search the usage of waitForCompletion in th

Re: Re: Let one Operator finished the job before another one begin in Hyracks

2016-10-11 Thread Yingyi Bu
You can search the usage of waitForCompletion in the code base, e.g.: APIFramework.java: public void executeJobArray(IHyracksClientConnection hcc, JobSpecification[] specs, PrintWriter out) throws Exception { for (JobSpecification spec : specs) { spec.setMaxReattempts(0);

Re: What's the best way to execute only a single test in ExecutionTest

2016-10-13 Thread Yingyi Bu
Put the test into only.xml or only_sqlpp.xml? On Thu, Oct 13, 2016 at 2:14 PM, Taewoo Kim wrote: > In the past, usually I modified executeTest() of TestExecutor class to > bypass another test cases by checking cUnit.getName() method and execute > only one test case.

Re: Function Change

2016-10-12 Thread Yingyi Bu
Pouria, Here is the function documentation: https://ci.apache.org/projects/asterixdb/sqlpp/builtins.html >> If yes, then what is regexp_contains@2 function ? It checks if the first argument contains a regular expression pattern defined by the second argument. >> Can matches@2 and

Re: Function Change

2016-10-12 Thread Yingyi Bu
h a place and one really needs to keep an eye on the check-in > changes list if he uses functions extensively. > > Thanks again for your help and info. > > Pouria > > On Tue, Oct 11, 2016 at 11:45 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Pouria, > > &g

Re: Choosing defaults for AsterixDB

2017-01-10 Thread Yingyi Bu
+100! On Tue, Jan 10, 2017 at 9:17 AM, Mike Carey wrote: > +1 from me too for SQL++ and clean JSON. > > > > On 1/10/17 8:25 AM, Murtadha Hubail wrote: > >> +1 to SQL++ and clean JSON. >> >> Cheers, >> Murtadha >> >> On Jan 10, 2017, at 9:46 AM, Till Westmann

Exception refactoring proposal

2017-01-05 Thread Yingyi Bu
Hi dev, I recently made a change that moved some exception error message templates from the source code into properties files [1]. Here is an exception refactoring proposal [2]. I propose that: -- from now on, we all follow the proposal when we want to throw exceptions, and enforce

Re: Exception refactoring proposal

2017-01-06 Thread Yingyi Bu
Thoughts? > > Cheers, > Till > > > > On 5 Jan 2017, at 11:12, Yingyi Bu wrote: > > Hi dev, >> >> I recent made a change that moved some exception error message >> templates from the source code into properties files [1]. Here is an >> exceptio

Re: Exception refactoring proposal

2017-01-06 Thread Yingyi Bu
All right, sounds good! Best, Yingyi On Fri, Jan 6, 2017 at 3:36 PM, Till Westmann <ti...@apache.org> wrote: > Yes, that’s what I would think as well. > > Cheers, > Till > > On 6 Jan 2017, at 15:30, Yingyi Bu wrote: > > > Sounds good to me. > > I'm gues

Re: Time of Multiple Joins in AsterixDB

2016-12-20 Thread Yingyi Bu
ce they’re not needed for the desired message: > > picking the right join order matters. Nevertheless, I’d like to get a > > better understanding of what’s going on in the larger dataset regime. > > > > > > > > -Tyson > > > > > > > > Fr

Re: Time of Multiple Joins in AsterixDB

2016-12-21 Thread Yingyi Bu
0G. The efficiency for all the tests (good > and bad order) have been all improved to twice speed. > I will finish all the tests and update the result later. > > Bests, > Mingda > > On Tue, Dec 20, 2016 at 10:20 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Hi Mingd

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
e.buffercache.size": 8589934592 You don't need to reload data but only need to restart the AsterixDB instance. Thanks! Best, Yingyi On Wed, Dec 21, 2016 at 9:22 PM, Mike Carey <dtab...@gmail.com> wrote: > Nice!! > > On Dec 21, 2016 8:43 PM, "Yingyi Bu" <bu
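The size quoted above is given in bytes. A short sketch of the arithmetic, assuming binary units (1 GiB = 1024^3 bytes); `gib_to_bytes` and `bytes_to_gib` are hypothetical helper names, not part of any AsterixDB tooling:

```python
BYTES_PER_GIB = 1024 ** 3  # 1 GiB in bytes


def gib_to_bytes(gib: int) -> int:
    """Convert a size in GiB to the byte count a config file would hold."""
    return gib * BYTES_PER_GIB


def bytes_to_gib(num_bytes: int) -> float:
    """Convert a byte count back to GiB for a quick sanity check."""
    return num_bytes / BYTES_PER_GIB
```

Under that assumption, the value 8589934592 in the message corresponds to an 8 GiB buffer cache.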

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
No need to reload data anymore :-) Best, Yingyi On Thu, Dec 22, 2016 at 11:36 AM, Yingyi Bu <buyin...@gmail.com> wrote: > Indeed, the change was merged yesterday. > If you grab the latest master, the computation parallelism can be set by > the parameter compiler.parallelism: >

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
flexible > parallelism stuff handles this! (Where you'd specify storage parallelism > based on drives, and compute parallelism based on cores, both spread across > all of the cluster's resources.) > > > On 12/22/16 10:57 AM, Yingyi Bu wrote: > >> Mingda, >> >> &

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
maxi-core parallelism > subsequently, one would just specify a large positive value >= the number > of available cores? (E.g., 1) > > > > On 12/22/16 11:37 AM, Yingyi Bu wrote: > >> No need to reload data anymore :-) >> >> Best, >> Yingyi >> >

New cluster installation options

2017-03-16 Thread Yingyi Bu
Hi dev, We recently added two cluster installation options and their documentation is online now: Ansible: https://ci.apache.org/projects/asterixdb/ansible.html AWS: https://ci.apache.org/projects/asterixdb/aws.html With the Ansible option, you can deploy, start, stop, and

Re: New cluster installation options

2017-04-04 Thread Yingyi Bu
n. I'm talking about this: > https://ci.apache.org/projects/asterixdb/udf.html. Unless they don't > support it, it may be early to deprecate the managix? > > Best, > Taewoo > > On Thu, Mar 16, 2017 at 9:22 AM, Yingyi Bu <buyin...@gmail.com> wrote: > > > H

Re: New cluster installation options

2017-04-05 Thread Yingyi Bu
s seamlessly in the cloud too? (They > will be feeding Tweets from one of the Twitter adaptors into *DB and using > UDFs to process them on the way in and in queries.) > > > > On 4/4/17 4:04 PM, Yingyi Bu wrote: > >> Thanks for the feedback. >> It hasn't supported yet. Bu

Re: When is it appropriate to add reserved words to the AQL/SQL++ Grammar?

2017-04-12 Thread Yingyi Bu
+1 for HTTP API. The end point can be easily removed from a real deployment by using a config parameter. Best, Yingyi On Wed, Apr 12, 2017 at 6:12 PM, Till Westmann wrote: > Coming back to the question in the subject: If we need such functionality > e.g. for testing of

Re: Nonpure functions in the context of SQL++

2017-04-19 Thread Yingyi Bu
Hi Steven, As a temporary workaround, you can do the following before your query: SET inline_with = "false" If you want to fix this, please take a look at InlineWithExpressionVisitor. What you can do is to not inline if there is a non-pure function call in the expression.

Re: Add the Java driver for Asterix

2017-03-03 Thread Yingyi Bu
Hi dev, Maybe we should call it a "JDBC-like" driver rather than a "JDBC" driver, because SQL++ is different from the standard SQL in the following aspects [1]. The JDBC-like client can wrap the current query service REST API and provide some abstractions similar to those in JDBC, e.g.,

Re: What is the new path to check Hyracks jobs status in AsterixDB?

2017-04-07 Thread Yingyi Bu
Hi Jianfeng, The admin console has been removed but the REST APIs which return JSON results are still there. Let's take the sample cluster as an example. To check nodes: http://localhost:16001/rest/nodes/ http://localhost:16001/rest/nodes/red
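The two node URLs above follow one pattern, which can be sketched as a small helper. The host, port, and node name `red` come from the message; `nodes_url` and `fetch_json` are hypothetical names, and the assumption that each endpoint returns a JSON body rests only on the message's statement that the REST APIs return JSON results.

```python
import json
from typing import Optional
from urllib.parse import quote
from urllib.request import urlopen


def nodes_url(base_url: str, node_id: Optional[str] = None) -> str:
    """Build the URL for the node list, or for one node when node_id is given."""
    root = base_url.rstrip("/") + "/rest/nodes/"
    if node_id is None:
        return root
    # Percent-encode the node id so unusual names stay a single path segment.
    return root + quote(node_id, safe="")


def fetch_json(url: str, timeout: float = 5.0):
    """Fetch a URL and decode the body as JSON (needs a running cluster)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For the sample cluster, `nodes_url("http://localhost:16001")` gives the node list URL and `nodes_url("http://localhost:16001", "red")` gives the URL for the node named `red`.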

Re: Nested type + open-enforced-index question.

2017-07-13 Thread Yingyi Bu
Indeed, it's a bug! Best, Yingyi On Thu, Jul 13, 2017 at 9:52 PM, Mike Carey wrote: > Sounds like a bug to me. > > > > On 7/13/17 7:59 PM, Taewoo Kim wrote: > >> Currently, I am working on a field type propagation without using >> initializing the OptimizableSubTree in the

Re: Nested type + open-enforced-index question.

2017-07-13 Thread Yingyi Bu
hout index) should fail? Or the > second one (with an index) should succeed? > > Best, > Taewoo > > On Thu, Jul 13, 2017 at 9:58 PM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Indeed, it's a bug! > > > > Best, > > Yingyi > > > > On Thu, Jul

Re: Simplifying the creation of functions

2017-07-11 Thread Yingyi Bu
Hi Riyafa, -- My question is whether there's a simpler way to implement a -- function where the arguments would be passed in the deserialized format and -- then in function implementation we can simply return the result rather than -- serializing it before returning. The evaluator interface

Re: Simplifying the creation of functions

2017-07-11 Thread Yingyi Bu
on > logic? > > Thanks > Ahmed > > On Tue, Jul 11, 2017 at 10:21 AM, Yingyi Bu <buyin...@gmail.com> wrote: > > > Hi Riyafa, > > > > -- My question is whether there's a simpler way to implement a > > -- function where the arguments would be passed i

Re: Nested type + open-enforced-index question.

2017-07-14 Thread Yingyi Bu
t; - We should (ideally) support their use in index-NL joins. > >>> > >>> Reflecting on our earlier conversation(s), I think I can see why you're > >>> asking this. :-) The augmented type information that'll be needed to do > >>> this completely/properly

Re: Creating aggregate functions

2017-07-23 Thread Yingyi Bu
Sorry, a typo: AVG: that's the logical function in the logical plan. On Sun, Jul 23, 2017 at 10:29 AM, Yingyi Bu <buyin...@gmail.com> wrote: > >> I see AVG, LOCAL_AVG, INTERMEDIATE_AVG and GLOBAL_AVG. > > AVG: that's the local function in the local plan. > LOC
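The reply is cut off after the first name, so the sketch below is only an illustration of how an average is commonly split across partitions, under the assumption that the LOCAL, INTERMEDIATE, and GLOBAL variants correspond to per-partition, merge, and final steps. The Python function names are hypothetical and do not mirror the actual descriptor classes.

```python
def local_avg(values):
    """Per-partition step: reduce raw values to a (sum, count) partial."""
    return (sum(values), len(values))


def intermediate_avg(partials):
    """Merge step: combine several (sum, count) partials into one partial."""
    total = sum(s for s, _ in partials)
    count = sum(c for _, c in partials)
    return (total, count)


def global_avg(partials):
    """Final step: merge the remaining partials and produce the average."""
    total, count = intermediate_avg(partials)
    return total / count if count else None
```

Carrying (sum, count) pairs instead of per-partition averages is what keeps the final result correct when partitions have different sizes.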

Re: Creating aggregate functions

2017-07-23 Thread Yingyi Bu
there are so many things I still don't understand. One of them is for > the avg function itself there are several FuntionIdentifiers. What do they > all mean? > > I see AVG, LOCAL_AVG, INTERMEDIATE_AVG and GLOBAL_AVG. > > What do they all mean? > Please help > > On 1

Re: Creating aggregate functions

2017-07-19 Thread Yingyi Bu
Hi Riyafa, >> ScalarCountAggregateDescriptor It's used for counting a scalar array that appears inside a tuple. For example: SELECT u.id, array_count(u.friends) FROM users u; >> SerializableCountAggregateDescriptor Serialized aggregation descriptor implementations are only used

Re: How to write SQL 92 aggregate function?

2017-08-04 Thread Yingyi Bu
Hi Riyafa, >> I think it's because the function has not been implemented as an SQL 92 >> function. I would like to know how to do this? You're right. Please check the code that triggers SQL-92 rewrites here and you need to implement the SQL version for st_union:

Re: How to write SQL 92 aggregate function?

2017-08-04 Thread Yingyi Bu
Here you go: st_union([st_make_point(1.0,1.0),st_make_point(1.0,2.0)]); or st_union(SELECT VALUE gbu FROM [st_make_point(1.0,1.0),st_make_point(1.0,2.0)] as gbu); On Fri, Aug 4, 2017 at 3:58 AM, Riyafa Abdul Hameed wrote: > Hi, > > After learning about sql92 and

Re: ASTERIXDB-1371: Geo data type format question

2017-05-14 Thread Yingyi Bu
Hi Wail, They're documented at: https://ci.apache.org/projects/asterixdb/aql/builtins.html#SpatialFunctions The prefixes in the documentation are "create_". Best, Yingyi On Sun, May 14, 2017 at 10:51 AM, Wail Alkowaileet wrote: > Previously, AsterixDB used to have

Re: CASE capabilities in SQL++

2017-05-15 Thread Yingyi Bu
Any subquery returns a collection, so try the following: select case when (select value count (country) from Tweets1)[0] > 50 THEN "true" ELSE "false" END; On Mon, May 15, 2017 at 2:16 PM, Vignesh Raghunathan < vignesh.raghunatha...@gmail.com> wrote: > Hi Steven, > > Could you try indexing the
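The point about subqueries returning collections can be modeled outside SQL++. This is a rough Python analogy, not how the engine evaluates the query: `select_value_count` and `more_than_50` are hypothetical names, and counting only non-null values is an assumption about `count(country)`.

```python
def select_value_count(rows, field):
    """Model of `SELECT VALUE count(field) FROM rows`.

    The result is a one-element collection, not a bare number.
    """
    return [sum(1 for row in rows if row.get(field) is not None)]


def more_than_50(rows, field):
    """Model of the CASE expression: index the collection before comparing."""
    result = select_value_count(rows, field)
    return "true" if result[0] > 50 else "false"
```

Comparing the list itself to 50 would not be meaningful, which is why the query in the message adds `[0]` after the parenthesized subquery.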

Re: CASE capabilities in SQL++

2017-05-15 Thread Yingyi Bu
purposes, the following also > worked: > > select case when b > 50 THEN "true" ELSE "false" END from(select value > count(DataverseName) from Metadata.`Dataset`) b ; > > It sounds like this shouldn't have worked though since b is a collection? > > Stev

Re: String to int conversion

2017-06-08 Thread Yingyi Bu
Every type has a string constructor, e.g., bigint("123") Best, Yingyi On Thu, Jun 8, 2017 at 8:47 PM, Steven Jacobs wrote: > Hi all, > Does SQLPP offer any way to convert strings to ints? I can't see anything > in the documentation. Twitter data gives timestamps as a string

Re: Verify fails on SQLPP Execution Test

2017-06-08 Thread Yingyi Bu
I saw the same issue in Jenkins for my change too:

Re: String to int conversion

2017-06-08 Thread Yingyi Bu
> add it. > Steven > On Thu, Jun 8, 2017 at 8:58 PM Yingyi Bu <buyin...@gmail.com> wrote: > > > Every type has a string constructor, e.g., > > > > bigint("123") > > > > Best, > > Yingyi > > > > > > On Thu, Jun 8, 2017

Re: [COMP] Few questions about Query Optimizer

2017-06-24 Thread Yingyi Bu
Hi Wail, $22 should be a harmless bug -- it's related to the ordering of rules. For $19: we could potentially have a rule for that. Best, Yingyi On Sat, Jun 24, 2017 at 5:50 PM, Wail Alkowaileet wrote: > Hi Devs, > > I have few questions about the query optimizer.

Re: Extensions with Ansible or NCService

2017-05-23 Thread Yingyi Bu
Xikui has added the deployment mechanism for libraries. It's there in the master but hasn't been documented. Best, Yingyi On Tue, May 23, 2017 at 3:31 PM, Michael Blow wrote: > Steven, are you referring to configuration, or deployment? Configuration > of extensions is

Modules that could be removed?

2017-05-31 Thread Yingyi Bu
Hi dev, I wonder if the following potentially obsolete modules could be moved out of the AsterixDB code base: -- asterix-experiment -- asterix-tools -- hyracks-dist -- hyracks-server Any thoughts? Best, Yingyi

Re: Commit messages

2017-06-15 Thread Yingyi Bu
+1! Best, Yingyi On Wed, Jun 14, 2017 at 10:43 PM, Mike Carey wrote: > +1 !!! > > I think this is a GREAT proposal, and we can also then hopefully do the > equivalent of grep'ing the commits to identify things that we might want to > incorporate in a high-level set of

Re: Commit messages

2017-06-15 Thread Yingyi Bu
- STATS (statistics etc.) - SITE - STORAGE - SQL++ - TEST - TXN (transaction) - TYPE (data model) - UDF (user defined function) - UI Best, Yingyi On Thu, Jun 15, 2017 at 1:09 AM, Yingyi Bu <buyin...@gmail.com> wrote: > +1! > > Best, > Yingyi > > On Wed, Jun 14, 2017 at 1

Re: Commit messages

2017-06-15 Thread Yingyi Bu
] https://github.com/apache/spark/commits/master > > > On 15 Jun 2017, at 14:55, Mike Carey wrote: > > +1 >> >> >> On 6/15/17 1:19 PM, Yingyi Bu wrote: >> >>> Each commit message should >>>>> 1) reference 1 or more JIRA issues (that hopeful

Re: Commit messages

2017-06-15 Thread Yingyi Bu
hat confuses me (but I don’t have a good alternative) is "IGS". > Any other alternatives that come to mind? > > Cheers, > Till > > > On 15 Jun 2017, at 15:27, Yingyi Bu wrote: > > +1 for short acronyms: >> >> Here is a list of acronyms: >> - API >&

Re: Searching for duplicates during feed ingestion.

2017-05-08 Thread Yingyi Bu
Abdullah has a pending change that disables searches if there's no secondary indexes [1]. Auto-generated ID could be another case for which we can disable searches as well. Best, Yingyi [1] https://asterix-gerrit.ics.uci.edu/#/c/1711/ On Mon, May 8, 2017 at 4:30 AM, Wail Alkowaileet

Re: Modules that could be removed?

2017-05-31 Thread Yingyi Bu
nd move it there, the > same > >> way we did with asterix-bad? > >> We can also launch an automated build in Jenkins to verify it builds > >> against master, again the same way BAD is working. This package does not > >> have a lot of dependencies, s

Re: Time to deprecate AQL?

2017-09-07 Thread Yingyi Bu
+1! Best, Yingyi On Thu, Sep 7, 2017 at 11:44 AM, Michael Carey wrote: > As AsterixDB evolves, and additional features are added - e.g., DISTINCT > aggregate support, or properly implemented query-bodied functions, > supporting two query languages is hugely expensive:

Re: OptimizedHybridHashJoinOperatorDescriptor vs. HybridHashJoinOperatorDescriptor

2018-11-08 Thread Yingyi Bu
I'm not sure if it's still correct, but based on my understanding, OptimizedHybridHashJoinOperatorDescriptor does the role reversal optimization which was done by Pouria, while HybridHashJoinOperatorDescriptor was the old implementation before Pouria's work and probably could be deleted. Best,