Re: OptimizedHybridHashJoinOperatorDescriptor vs. HybridHashJoinOperatorDescriptor

2018-11-08 Thread Yingyi Bu
I'm not sure if it's still correct, but based on my understanding, OptimizedHybridHashJoinOperatorDescriptor does the role reversal optimization which was done by Pouria, while HybridHashJoinOperatorDescriptor was the old implementation before Pouria's work and probably could be deleted. Best, Yin
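To illustrate what "role reversal" means in general, here is a minimal in-memory sketch in Python. It is not the code of either operator descriptor: the function name `hash_join` and the row layout are hypothetical, and the real operator works on partitioned data that may spill to disk. The sketch shows only the core idea of building the hash table on whichever input is smaller.

```python
from collections import defaultdict


def hash_join(left, right, key):
    """Equi-join two lists of dicts on `key`, building on the smaller input."""
    # Role reversal: whichever side is smaller becomes the build side.
    reversed_roles = len(right) < len(left)
    build, probe = (right, left) if reversed_roles else (left, right)

    # Build phase: hash the (smaller) build side on the join key.
    table = defaultdict(list)
    for row in build:
        table[row[key]].append(row)

    # Probe phase: look up every probe row in the hash table.
    result = []
    for row in probe:
        for match in table.get(row[key], []):
            # Always emit (left_row, right_row), whichever side was built.
            result.append((row, match) if reversed_roles else (match, row))
    return result
```

The output pairs keep the (left, right) order regardless of which side was hashed, so callers do not see the swap.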

Re: Time to deprecate AQL?

2017-09-07 Thread Yingyi Bu
+1! Best, Yingyi On Thu, Sep 7, 2017 at 11:44 AM, Michael Carey wrote: > As AsterixDB evolves, and additional features are added - e.g., DISTINCT > aggregate support, or properly implemented query-bodied functions, > supporting two query languages is hugely expensive: Updating two grammars, >

Re: How to write SQL 92 aggregate function?

2017-08-04 Thread Yingyi Bu
Here you go: st_union([st_make_point(1.0,1.0),st_make_point(1.0,2.0)]); or st_union(SELECT VALUE gbu FROM [st_make_point(1.0,1.0),st_make_point(1.0,2.0)] as gbu); On Fri, Aug 4, 2017 at 3:58 AM, Riyafa Abdul Hameed wrote: > Hi, > > After learning about sql92 and sql++, I think that should

Re: How to write SQL 92 aggregate function?

2017-08-04 Thread Yingyi Bu
Hi Riyafa, >> I think it's because the function has not been implemented as an SQL 92 >> function. I would like to know how to do this? You're right. Please check the code that triggers SQL-92 rewrites here and you need to implement the SQL version for st_union: https://github.com

Re: Creating aggregate functions

2017-07-23 Thread Yingyi Bu
Sorry, a typo: AVG: that's the logical function in the logical plan. On Sun, Jul 23, 2017 at 10:29 AM, Yingyi Bu wrote: > >> I see AVG, LOCAL_AVG, INTERMEDIATE_AVG and GLOBAL_AVG. > > AVG: that's the local function in the local plan. > LOCAL_AVG, INTERMEDIATE
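The split into local, intermediate, and global variants follows the usual pattern for computing an aggregate across partitions. The Python sketch below shows that general pattern for an average; it is an assumption-labeled illustration, not AsterixDB's implementation, and the function names are hypothetical stand-ins that only borrow the LOCAL/INTERMEDIATE/GLOBAL naming from the thread.

```python
def local_avg(values):
    """Per-partition step: reduce raw values to a (sum, count) partial."""
    return (sum(values), len(values))


def intermediate_avg(partials):
    """Combine several (sum, count) partials into one partial."""
    total = sum(s for s, _ in partials)
    count = sum(c for _, c in partials)
    return (total, count)


def global_avg(partials):
    """Final step: merge the remaining partials and produce the average."""
    total, count = intermediate_avg(partials)
    return total / count if count else None


# Three partitions, each aggregated locally, then merged globally.
partitions = [[1, 2, 3], [4, 5], [6]]
partials = [local_avg(p) for p in partitions]
average = global_avg(partials)
```

Carrying (sum, count) pairs instead of per-partition averages is what keeps the final result correct when partitions have different sizes.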

Re: Creating aggregate functions

2017-07-23 Thread Yingyi Bu
But there are so many things I still don't understand. One of them is for > the avg function itself there are several FunctionIdentifiers. What do they > all mean? > > I see AVG, LOCAL_AVG, INTERMEDIATE_AVG and GLOBAL_AVG. > > What do they all mean? > Please help > >

Re: Creating aggregate functions

2017-07-19 Thread Yingyi Bu
Hi Riyafa, >> ScalarCountAggregateDescriptor It's used for counting a scalar array that appears inside a tuple. For example: SELECT u.id, array_count(u.friends) FROM users u; >> SerializableCountAggregateDescriptor Serialized aggregation descriptor implementations are only used i

Re: Nested type + open-enforced-index question.

2017-07-14 Thread Yingyi Bu
>> > >>> Reflecting on our earlier conversation(s), I think I can see why you're > >>> asking this. :-) The augmented type information that'll be needed to do > >>> this completely/properly will actually have to associate types with > field > >>

Re: Nested type + open-enforced-index question.

2017-07-13 Thread Yingyi Bu
fail? Or the > second one (with an index) should succeed? > > Best, > Taewoo > > On Thu, Jul 13, 2017 at 9:58 PM, Yingyi Bu wrote: > > > Indeed, it's a bug! > > > > Best, > > Yingyi > > > > On Thu, Jul 13, 2017 at 9:52 PM, Mike Carey

Re: Nested type + open-enforced-index question.

2017-07-13 Thread Yingyi Bu
Indeed, it's a bug! Best, Yingyi On Thu, Jul 13, 2017 at 9:52 PM, Mike Carey wrote: > Sounds like a bug to me. > > > > On 7/13/17 7:59 PM, Taewoo Kim wrote: > >> Currently, I am working on a field type propagation without using >> initializing the OptimizableSubTree in the current index access

Re: Simplifying the creation of functions

2017-07-11 Thread Yingyi Bu
> Thanks > Ahmed > > On Tue, Jul 11, 2017 at 10:21 AM, Yingyi Bu wrote: > > > Hi Riyafa, > > > > -- My question is whether there's a simpler way to implement a > > -- function where the arguments would be passed in the deserialized > format > > an

Re: Simplifying the creation of functions

2017-07-11 Thread Yingyi Bu
Hi Riyafa, -- My question is whether there's a simpler way to implement a -- function where the arguments would be passed in the deserialized format and -- then in function implementation we can simply return the result rather than -- serializing it before returning. The evaluator interface it

Re: [COMP] Few questions about Query Optimizer

2017-06-24 Thread Yingyi Bu
Hi Wail, $22 should be a harmless bug -- it's related to the ordering of rules. For $19: we could potentially have a rule for that. Best, Yingyi On Sat, Jun 24, 2017 at 5:50 PM, Wail Alkowaileet wrote: > Hi Devs, > > I have few questions about the query optimizer. > > *- Given the que

Re: Parse GeoJSON data into a record in AsterixDB

2017-06-21 Thread Yingyi Bu
>> type appears to be keyword `type` would make it valid. >> We can't use the defining type within the same type recursively (ie. GeometryType within GeometryType) We don't support recursive type definition. >> The type object cannot be resolved We don't have a builtin name for a completely op

Re: Commit messages

2017-06-15 Thread Yingyi Bu
I don’t have a good alternative) is "IGS". > Any other alternatives that come to mind? > > Cheers, > Till > > > On 15 Jun 2017, at 15:27, Yingyi Bu wrote: > > +1 for short acronyms: >> >> Here is a list of acronyms: >> - API >> - AQL >> - CLU

Re: Commit messages

2017-06-15 Thread Yingyi Bu
spark/commits/master > > > On 15 Jun 2017, at 14:55, Mike Carey wrote: > > +1 >> >> >> On 6/15/17 1:19 PM, Yingyi Bu wrote: >> >>> Each commit message should >>>>> 1) reference 1 or more JIRA issues (that hopefully provide a

Re: Commit messages

2017-06-15 Thread Yingyi Bu
- STATS (statistics etc.) - SITE - STORAGE - SQL++ - TEST - TXN (transaction) - TYPE (data model) - UDF (user defined function) - UI Best, Yingyi On Thu, Jun 15, 2017 at 1:09 AM, Yingyi Bu wrote: > +1! > > Best, > Yingyi > > On Wed, Jun 14, 2017 at 10:43 PM, Mike Carey wrote: >

Re: Commit messages

2017-06-15 Thread Yingyi Bu
+1! Best, Yingyi On Wed, Jun 14, 2017 at 10:43 PM, Mike Carey wrote: > +1 !!! > > I think this is a GREAT proposal, and we can also then hopefully do the > equivalent of grep'ing the commits to identify things that we might want to > incorporate in a high-level set of release notes. I also rea

Re: String to int conversion

2017-06-08 Thread Yingyi Bu
> Steven > On Thu, Jun 8, 2017 at 8:58 PM Yingyi Bu wrote: > > > Every type has a string constructor, e.g., > > > > bigint("123") > > > > Best, > > Yingyi > > > > > > On Thu, Jun 8, 2017 at 8:47 PM, Steven Jacobs wrote: >

Re: String to int conversion

2017-06-08 Thread Yingyi Bu
Every type has a string constructor, e.g., bigint("123") Best, Yingyi On Thu, Jun 8, 2017 at 8:47 PM, Steven Jacobs wrote: > Hi all, > Does SQLPP offer any way to convert strings to ints? I can't see anything > in the documentation. Twitter data gives timestamps as a string > representing the

Re: Verify fails on SQLPP Execution Test

2017-06-08 Thread Yingyi Bu
I saw the same issue in Jenkins for my change too: https://asterix-jenkins.ics.uci.edu/job/asterix-gerrit-asterix-app-sql-execution/org.apache.asterix$asterix-app/28/testReport/junit/org.apache.asterix.test.runtime/SqlppExecutionFullParallelismIT/test_SqlppExecutionFullParallelismIT_329__dml__load_

Re: Modules that could be removed?

2017-05-31 Thread Yingyi Bu
b less complicated, at least if that allowed one to use a pre-compiled >>> Asterix distribution for experiments rather than requiring the full >>> source >>> code to be available. >>> >>> Ceej >>> aka Chris Hillery >>> >>> On Wed

Re: Modules that could be removed?

2017-05-31 Thread Yingyi Bu
> >> way we did with asterix-bad? > >> We can also launch an automated build in Jenkins to verify it builds > >> against master, again the same way BAD is working. This package does not > >> have a lot of dependencies, so it will be fairly painless to maintain > i

Modules that could be removed?

2017-05-31 Thread Yingyi Bu
Hi dev, I wonder if the following potentially obsolete modules could be moved out of the AsterixDB code base: -- asterix-experiment -- asterix-tools -- hyracks-dist -- hyracks-server Any thoughts? Best, Yingyi

Re: Extensions with Ansible or NCService

2017-05-23 Thread Yingyi Bu
Xikui has added the deployment mechanism for libraries. It's there in the master but hasn't been documented. Best, Yingyi On Tue, May 23, 2017 at 3:31 PM, Michael Blow wrote: > Steven, are you referring to configuration, or deployment? Configuration > of extensions is supported in NCService /

Re: CASE capabilities in SQL++

2017-05-15 Thread Yingyi Bu
lso > worked: > > select case when b > 50 THEN "true" ELSE "false" END from(select value > count(DataverseName) from Metadata.`Dataset`) b ; > > It sounds like this shouldn't have worked though since b is a collection? > > Steven > > On Mon,

Re: CASE capabilities in SQL++

2017-05-15 Thread Yingyi Bu
Any subquery returns a collection, so try the following: select case when (select value count (country) from Tweets1)[0] > 50 THEN "true" ELSE "false" END; On Mon, May 15, 2017 at 2:16 PM, Vignesh Raghunathan < vignesh.raghunatha...@gmail.com> wrote: > Hi Steven, > > Could you try indexing the
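The point that a subquery yields a collection, even when it holds a single aggregate value, can be modeled outside SQL++. The Python sketch below is only an analogy: `select_value_count` is a hypothetical helper that mimics the shape of the subquery result, and the sample rows are made up for illustration.

```python
def select_value_count(rows):
    """Mimic a `SELECT VALUE count(...)` subquery: a one-element collection."""
    return [len(rows)]


def case_label(subquery_result, threshold=50):
    """Index into the collection first, as the [0] in the query does."""
    return "true" if subquery_result[0] > threshold else "false"


# Made-up sample data standing in for the Tweets1 dataset.
tweets = [{"country": "US"}] * 60
result = select_value_count(tweets)
label = case_label(result)
```

In this analogy, comparing `result` itself with 50 would compare a list with a number, which is why the element is extracted with `[0]` before the comparison.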

Re: ASTERIXDB-1371: Geo data type format question

2017-05-14 Thread Yingyi Bu
Hi Wail, They're documented at: https://ci.apache.org/projects/asterixdb/aql/builtins.html#SpatialFunctions The prefixes in the documentation are "create_". Best, Yingyi On Sun, May 14, 2017 at 10:51 AM, Wail Alkowaileet wrote: > Previously, AsterixDB used to have such functions [1] pr

Re: Searching for duplicates during feed ingestion.

2017-05-08 Thread Yingyi Bu
Abdullah has a pending change that disables searches if there are no secondary indexes [1]. Auto-generated ID could be another case for which we can disable searches as well. Best, Yingyi [1] https://asterix-gerrit.ics.uci.edu/#/c/1711/ On Mon, May 8, 2017 at 4:30 AM, Wail Alkowaileet wrote: >

Re: Nonpure functions in the context of SQL++

2017-04-19 Thread Yingyi Bu
Hi Steven, As a temporary workaround, you can do the following before your query: SET inline_with = "false" If you want to fix this, please take a look at InlineWithExpressionVisitor. What you can do is to not inline if there is a non-pure function call in the expression.
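Why inlining matters for non-pure functions can be shown with a small Python model. This is a sketch under stated assumptions, not the logic of InlineWithExpressionVisitor: `make_counter`, `eval_bound_once`, `eval_inlined`, and `should_inline` are hypothetical names, and the counter stands in for any function whose result changes between calls.

```python
import itertools


def make_counter():
    """Return a non-pure function: every call yields a different value."""
    counter = itertools.count(1)
    return lambda: next(counter)


def eval_bound_once(f):
    """Model of a WITH binding that is NOT inlined: f runs once."""
    x = f()
    return (x, x)


def eval_inlined(f):
    """Model of the same binding after inlining: f runs at every use."""
    return (f(), f())


def should_inline(expression_is_pure):
    """The guard suggested in the message: inline only pure expressions."""
    return expression_is_pure


kept = eval_bound_once(make_counter())
inlined = eval_inlined(make_counter())
```

The two evaluations disagree as soon as the bound expression is not pure, which is the behavior the suggested check is meant to prevent.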

Re: When is it appropirate to add reserved words to the AQL/SQL++ Grammar?

2017-04-12 Thread Yingyi Bu
+1 for HTTP API. The end point can be easily removed from a real deployment by using a config parameter. Best, Yingyi On Wed, Apr 12, 2017 at 6:12 PM, Till Westmann wrote: > Coming back to the question in the subject: If we need such functionality > e.g. for testing of performance measurements,

Re: What is the new path to check Hyracks jobs status in AsterixDB?

2017-04-07 Thread Yingyi Bu
Hi Jianfeng, The admin console has been removed but the REST APIs which return JSON results are still there. Let's take the sample cluster as an example. To check nodes: http://localhost:16001/rest/nodes/ http://localhost:16001/rest/nodes/red http://localhost:16001/r
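A small Python sketch for calling the two complete endpoints quoted above. The host, port, and the node name "red" come from the message's sample cluster; the helper names are hypothetical, and nothing is assumed about the fields of the returned JSON.

```python
import json
from urllib.request import urlopen

# Sample-cluster address taken from the message; adjust for a real deployment.
BASE = "http://localhost:16001"


def nodes_url(node_id=None, base=BASE):
    """Build the URL for all nodes, or for one node when node_id is given."""
    url = f"{base}/rest/nodes/"
    return url if node_id is None else f"{url}{node_id}"


def fetch_json(url, timeout=5.0):
    """GET the URL and decode the JSON body (requires a running cluster)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

With a cluster running, `fetch_json(nodes_url())` would return the decoded response for all nodes, and `fetch_json(nodes_url("red"))` the one for the node named "red".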

Re: New cluster installation options

2017-04-05 Thread Yingyi Bu
too? (They > will be feeding Tweets from one of the Twitter adaptors into *DB and using > UDFs to process them on the way in and in queries.) > > > > On 4/4/17 4:04 PM, Yingyi Bu wrote: > >> Thanks for the feedback. >> It hasn't supported yet. But it should b

Re: New cluster installation options

2017-04-04 Thread Yingyi Bu
talking about this: > https://ci.apache.org/projects/asterixdb/udf.html. Unless they don't > support it, it may be early to deprecate the managix? > > Best, > Taewoo > > On Thu, Mar 16, 2017 at 9:22 AM, Yingyi Bu wrote: > > > Hi dev, > > > >

New cluster installation options

2017-03-16 Thread Yingyi Bu
Hi dev, We recently added two cluster installation options and their documentation is online now: Ansible: https://ci.apache.org/projects/asterixdb/ansible.html AWS: https://ci.apache.org/projects/asterixdb/aws.html With the Ansible option, you can deploy, start, stop, and er

Re: Add the Java driver for Asterix

2017-03-03 Thread Yingyi Bu
Hi dev, Maybe we should call it a "JDBC-like" driver rather than a "JDBC" driver, because SQL++ is different from the standard SQL in the following aspects [1]. The JDBC-like client can wrap the current query service REST API and provide some abstractions similar to those in JDBC, e.g., resul

Improve test coverage

2017-02-21 Thread Yingyi Bu
Hi dev, In order to enhance the quality of our code base, I wrote up a test guideline [1] which could be a reference for future patches. More precisely, the guideline summarizes various kinds of tests that we currently have and contains pointers to examples. Going forward, it would be nice if

Re: Choosing defaults for AsterixDB

2017-02-03 Thread Yingyi Bu
eement and no concerns, I’ll go ahead. > > > > Thanks, > > Till > > > > On 10 Jan 2017, at 9:22, Yingyi Bu wrote: > > > >> +100! > >> > >> On Tue, Jan 10, 2017 at 9:17 AM, Mike Carey wrote: > >> > >>> +1 from me

Fwd: [jira] [Commented] (ASTERIXDB-1327) Spatial-intersect between point and circle not working correctly

2017-02-01 Thread Yingyi Bu
Fwd to dev. Best, Yingyi -- Forwarded message -- From: Willy Aguirre (JIRA) Date: Wed, Feb 1, 2017 at 3:24 PM Subject: [jira] [Commented] (ASTERIXDB-1327) Spatial-intersect between point and circle not working correctly To: notificati...@asterixdb.incubator.apache.org [ ht

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
or API call. > > > > Best, > > Taewoo > > > > On Tue, Jan 31, 2017 at 9:52 PM, Yingyi Bu wrote: > > > >> Yeah, sounds good to me. > >> > >> Best, > >> Yingyi > >> > >> On Tue, Jan 31, 2017 at 9:31 PM, Steven Ja

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
er vetted projects this year, but I > think we could still try for at least one spot. > Steven > > On Tue, Jan 31, 2017 at 9:21 PM Yingyi Bu wrote: > > > IMO, each of them could be a two-three month project. > > > > The project itself could be conceptually simple and

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
, Jan 31, 2017 at 9:00 PM Yingyi Bu wrote: > > > I have two proposals for data loaders: > > > > 1. add AWS S3 as an external data source; > > 2. support POSTing a user file on the client side through HTTP into an > > internal dataset. > > > > Thanks! > >

Re: Google Summer of Code

2017-01-31 Thread Yingyi Bu
I have two proposals for data loaders: 1. add AWS S3 as an external data source; 2. support POSTing a user file on the client side through HTTP into an internal dataset. Thanks! Best, Yingyi On Thu, Jan 19, 2017 at 2:06 PM, Steven Jacobs wrote: > Hi All, > The Google Summer of Code official

Re: An exception: Input/output error?

2017-01-29 Thread Yingyi Bu
Hi Taewoo, This error message usually indicates that there are some disk failures. Please check /var/log/messages on the machine to see if that's the case. Best, Yingyi On Sun, Jan 29, 2017 at 4:00 PM, Taewoo Kim wrote: > Hi all, > > I'm trying to load json files into a dataset. I have

Re: [VOTE] Release Apache AsterixDB 0.9.0 and Hyracks 0.3.0 (RC2)

2017-01-21 Thread Yingyi Bu
+1 - signatures and hashes of all 5 archives ok - nc service binary works - version api agrees with the commit id on ASF repo - source compilation works Best, Yingyi On Sat, Jan 21, 2017 at 9:16 AM, Steven Jacobs wrote: > +1 > Steven > > On Sat, Jan 21, 2017 at 7:36 AM Till Westmann wrote: >

Re: Choosing defaults for AsterixDB

2017-01-10 Thread Yingyi Bu
+100! On Tue, Jan 10, 2017 at 9:17 AM, Mike Carey wrote: > +1 from me too for SQL++ and clean JSON. > > > > On 1/10/17 8:25 AM, Murtadha Hubail wrote: > >> +1 to SQL++ and clean JSON. >> >> Cheers, >> Murtadha >> >> On Jan 10, 2017, at 9:46 AM, Till Westmann wrote: >>> >>> Hi, >>> >>> as you kn

Re: Exception refactoring proposal

2017-01-06 Thread Yingyi Bu
All right, sounds good! Best, Yingyi On Fri, Jan 6, 2017 at 3:36 PM, Till Westmann wrote: > Yes, that’s what I would think as well. > > Cheers, > Till > > On 6 Jan 2017, at 15:30, Yingyi Bu wrote: > > > Sounds good to me. > > I'm guessing that eventually W

Re: Exception refactoring proposal

2017-01-06 Thread Yingyi Bu
change: > Let’s not start the names of ErrorCodes with "ERROR_". > In the code they are usually used as > > ErrorCode.ERROR_THIS_IS_THE_PROBLEM > > and it seems that > > ErrorCode.THIS_IS_THE_PROBLEM > > would work just as well. > > Thoughts? >

Exception refactoring proposal

2017-01-05 Thread Yingyi Bu
Hi dev, I recently made a change that moved some exception error message templates from the source code into properties files [1]. Here is an exception refactoring proposal [2]. I propose that: -- from now on, we all follow the proposal when we want to throw exceptions, and enforce the

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
ism > subsequently, one would just specify a large positive value >= the number > of available cores? (E.g., 1) > > > > On 12/22/16 11:37 AM, Yingyi Bu wrote: > >> No need to reload data anymore :-) >> >> Best, >> Yingyi >> >> On Thu, Dec 22, 2

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
No need to reload data anymore :-) Best, Yingyi On Thu, Dec 22, 2016 at 11:36 AM, Yingyi Bu wrote: > Indeed, the change was merged yesterday. > If you grab the latest master, the computation parallelism can be set by > the parameter compiler.parallelism: > -- 0, the default, mean

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
andles this! (Where you'd specify storage parallelism > based on drives, and compute parallelism based on cores, both spread across > all of the cluster's resources.) > > > On 12/22/16 10:57 AM, Yingyi Bu wrote: > >> Mingda, >> >> >> Th

Re: Time of Multiple Joins in AsterixDB

2016-12-22 Thread Yingyi Bu
e.buffercache.size": 8589934592 You don't need to reload data but only need to restart the AsterixDB instance. Thanks! Best, Yingyi On Wed, Dec 21, 2016 at 9:22 PM, Mike Carey wrote: > Nice!! > > On Dec 21, 2016 8:43 PM, "Yingyi Bu" wrote: > > > C

Re: Time of Multiple Joins in AsterixDB

2016-12-21 Thread Yingyi Bu
tests (good > and bad order) have been all improved to twice speed. > I will finish all the tests and update the result later. > > Bests, > Mingda > > On Tue, Dec 20, 2016 at 10:20 PM, Yingyi Bu wrote: > > > Hi Mingda, > > > > I think that in your setting

Re: Time of Multiple Joins in AsterixDB

2016-12-20 Thread Yingyi Bu
": "red", > "partitions": [{ > "active": true, > "partition_id": "partition_15" > }], > "state": "ACTIVE", > "statsUri"

Re: Time of Multiple Joins in AsterixDB

2016-12-20 Thread Yingyi Bu
d message: > > picking the right join order matters. Nevertheless, I’d like to get a > > better understanding of what’s going on in the larger dataset regime. > > > > > > > > -Tyson > > > > > > > > From: Yingyi Bu [mailto:buyin...@gmail.com] >

Re: Time of Multiple Joins in AsterixDB

2016-12-20 Thread Yingyi Bu
Hi Mingda, It looks like you didn't attach the PDF? Thanks! Best, Yingyi On Tue, Dec 20, 2016 at 4:15 PM, mingda li wrote: > Sorry for the wrong version of cc.conf. I convert it to pdf version as > attachment. > > On Tue, Dec 20, 2016 at 4:06 PM, mingda li wrote: > >> Dear all, >> >

Re: About the Multiple Join Optimization on AsterixDB

2016-10-31 Thread Yingyi Bu
Mingda, I'm not sure how much re-ordering can be done at the Hyracks level, i.e., the runtime level. In the optimizer (the asterixdb/algebricks level), we don't have re-ordering for joins, because: --- the cost model has not been added yet. I'm not sure about the timeline for this

Re: Line limit (120) is not enforced for an assignment statement.

2016-10-25 Thread Yingyi Bu
>> @Yingyi: I wonder whether your proposed change (do not remove blank lines >> in the comment section) is already applied to the format file It looks like I haven't... You can update this row: Thanks! Yingyi On Tue, Oct 25, 2016 at 4:26 PM, Till Westmann wrote: > +1 > > > On 25 Oct 2016,

Re: type name changes

2016-10-18 Thread Yingyi Bu
tion (before:int for int64, after:int for integer > (int32)) and outside users that Mike mentioned did not have numbers greater > than INT32 range, I think it's OK. > > Best, > Taewoo > > On Tue, Oct 18, 2016 at 8:59 AM, Yingyi Bu wrote: > > > Taewoo, > > &

Re: type name changes

2016-10-18 Thread Yingyi Bu
t, Yingyi On Tue, Oct 18, 2016 at 8:52 AM, Taewoo Kim wrote: > So, can we use "int" for "bigint" to be consistent? > > Best, > Taewoo > > On Tue, Oct 18, 2016 at 7:34 AM, Yingyi Bu wrote: > > > >>Actually I think Taewoo is right about having

Re: type name changes

2016-10-18 Thread Yingyi Bu
es of their > current ADMs? We should just check so we know if we need to warn them when > we release > > On Oct 17, 2016 11:49 PM, "Yingyi Bu" wrote: > > > This is the change that changes "record" to "object". > > https://asterix-ger

Re: type name changes

2016-10-17 Thread Yingyi Bu
This is the change that changes "record" to "object". https://asterix-gerrit.ics.uci.edu/#/c/1295/ The existing record functions will still work. If anyone thinks that the change breaks the current use case, please let me know. Best, Yingyi On Mon, Oct 17, 2016 at 10:26

Re: type name changes

2016-10-17 Thread Yingyi Bu
ion. We used int for the abbreviation > for INT64 (I assume that is now bigint?) type. Now, INT is an abbreviation > for INT32? I thought we converted the default type to INT64 (bigint). > Aren't INT32 type displaying i32 as suffix? > > -- Forwarded message -- >

Re: Function name and format change in the codebase

2016-10-14 Thread Yingyi Bu
+1. For (1), the eventual goal is to only have underscores. Currently it's a transition period -- all docs are based on underscores. For new functions, it's good to only support underscores. I think that we'll end the life of hyphens after some time. Best, Yingyi On Fri, Oct 14, 2016 at 6:36 PM,

Re: [jira] [Commented] (ASTERIXDB-1694) Fail running Tweet Feed on Cluster of 16 nodes (while succeed on 4 nodes)

2016-10-14 Thread Yingyi Bu
>> Log4J in Twitter4j doesn't initialize properly. Can you explain that a little bit more? What exceptions do you hit, "class not found", or sth. else? Thx! On Fri, Oct 14, 2016 at 2:20 PM, Wail Alkowaileet (JIRA) wrote: > > [ https://issues.apache.org/jira/browse/ASTERIXDB-1694? > page=com

Re: What's the best way to execute only a single test in ExecutionTest

2016-10-13 Thread Yingyi Bu
Put the test into only.xml or only_sqlpp.xml? On Thu, Oct 13, 2016 at 2:14 PM, Taewoo Kim wrote: > In the past, usually I modified executeTest() of TestExecutor class to > bypass another test cases by checking cUnit.getName() method and execute > only one test case. However, nowadays, even if I

Re: Function Change

2016-10-12 Thread Yingyi Bu
o keep an eye on the check-in > changes list if he uses functions extensively. > > Thanks again for your help and info. > > Pouria > > On Tue, Oct 11, 2016 at 11:45 PM, Yingyi Bu wrote: > > > Pouria, > > > > Here is the function documentation: > > h

Re: Function Change

2016-10-11 Thread Yingyi Bu
Pouria, Here is the function documentation: https://ci.apache.org/projects/asterixdb/sqlpp/builtins.html >> If yes, then what is regexp_contains@2 function ? It checks if the first argument contains a regular expression pattern defined by the second argument. >> Can matches@2 and regexp_contains

Re: Let one Operator finished the job before another one begin in Hyracks

2016-10-11 Thread Yingyi Bu
thing in their use > cases for parallel sorts and interval joins. Hyracks has everything needed > for this, as it turns out, without a multi-job need. > > > > On 10/11/16 9:26 AM, Yingyi Bu wrote: > >> You can search the usage of waitForCompletion in the code base, e.g.: &

Re: Re: Let one Operator finished the job before another one begin in Hyracks

2016-10-11 Thread Yingyi Bu
You can search the usage of waitForCompletion in the code base, e.g.: APIFramework.java: public void executeJobArray(IHyracksClientConnection hcc, JobSpecification[] specs, PrintWriter out) throws Exception { for (JobSpecification spec : specs) { spec.setMaxReattempts(0);

Re: Location of test results

2016-10-01 Thread Yingyi Bu
+1. Best, Yingyi On Sat, Oct 1, 2016 at 6:21 PM, Till Westmann wrote: > Hi, > > in a recent change [1] the test result location changed from "a directory > in > the module folder" to "a directory in the target folder". So now > a) the test results will get removed with every "mvn clean" and > b

Re: Does Projection affect count() performance?

2016-10-01 Thread Yingyi Bu
PS, if you still have the OOM instance, can you do a Yourkit memory profile? Thanks! Best, Yingyi On Sat, Oct 1, 2016 at 9:43 AM, Yingyi Bu wrote: > Wail, > > Can you attach the query plan for query 1? > I tried >count( for $x in dataset beers >

Re: Does Projection affect count() performance?

2016-10-01 Thread Yingyi Bu
Wail, Can you attach the query plan for query 1? I tried count( for $x in dataset beers return $x ) and got the following plan, which seems OK: -- DISTRIBUTE_RESULT |UNPARTITIONED| exchange -- ONE_TO_ONE_EXCHANGE |UNPARTITIONED| aggregate [$$5] <

Re: Physical Operator Tag Change (SPLIT -> REPLICATE)

2016-09-27 Thread Yingyi Bu
w for the second step would be much easier. > > Best, > Taewoo > > On Tue, Sep 27, 2016 at 5:08 PM, Yingyi Bu wrote: > > > Taewoo, > > > > I agree with you. > > However, can you have two separate changes: > > 1. Your current change https://ast

Re: Physical Operator Tag Change (SPLIT -> REPLICATE)

2016-09-27 Thread Yingyi Bu
Taewoo, I agree with you. However, can you have two separate changes: 1. Your current change https://asterix-gerrit.ics.uci.edu/#/c/1196/. Call your "Split" "PartitionSplit" for now. 2. Do what you proposed in the next change which only does the renaming. That makes reviews less overwhelming and

Re: Function name change: contains() -> string-contains()

2016-09-16 Thread Yingyi Bu
Cool, +1! Best, Yingyi On Fri, Sep 16, 2016 at 1:54 PM, Taewoo Kim wrote: > So, in summary, we agree to use a function format for the full-text search, > rather than using XQuery syntax. "contains" doesn't have to be > "string-contains" and "text" doesn't have to be a reserved word. > > The pos

Re: Retain inputs on index-search

2016-09-16 Thread Yingyi Bu
I'd like to join the discussion. Best, Yingyi On Fri, Sep 16, 2016 at 10:47 AM, Steven Jacobs wrote: > Hi, > I have come across an issue where retainInputs is always set to false when > creating an index search in the select case. This was because it was > previously assumed that variables be

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
ewoo > > On Thu, Sep 15, 2016 at 2:40 PM, Yingyi Bu wrote: > > > Hi Taewoo, > > > > Recently I have added several string functions into *DB: > > initcap(title), > > regexp_like, > > regexp_position, > > ltrim, > >

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
All right, thanks! Best, Yingyi On Thu, Sep 15, 2016 at 2:48 PM, Taewoo Kim wrote: > @Yingyi: will add the mapping for "string-contains()" in AQL and > "contains()" in SQL++. > > Best, > Taewoo > > On Thu, Sep 15, 2016 at 2:45 PM, Yingyi Bu wrote: >

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
"contains_text", "contains text" or "containstext". It would be nice if > one form of function is used for both AQL and SQL++. Currently, to follow > the Xquery spec, this doesn't work. > > Best, > Taewoo > > On Thu, Sep 15, 2016 at 2:27 PM, Yingyi

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
can add them to the AsterixDB column. Best, Yingyi On Thu, Sep 15, 2016 at 2:27 PM, Yingyi Bu wrote: > Hi Taewoo, > > Are those fulltext search syntax extensions are only a syntactic sugar > (i.e., surface) thing that is translated into functions? > In a not-too-distant-futur

Re: Function name change: contains() -> string-contains()

2016-09-15 Thread Yingyi Bu
Hi Taewoo, Are those fulltext search syntax extensions only syntactic sugar (i.e., a surface thing) that is translated into functions? In the not-too-distant future, we will need to surface fulltext search in SQL++, probably using the same functions as Oracle. If the AQL fulltext synta

Re: Modified/Custom plan: Push-down SELECT for external source.

2016-08-26 Thread Yingyi Bu
Currently you can push Project into the source but not Select. You're welcome to enhance IMetadataProvider to support that. You can take a look at DataSourceScanPOperator: Pair p = mp.getScannerRuntime(dataSource, vars, projectVars, scan.isProjectPushed(), scan.getMinFilterVars(), scan.ge

Minor code style change proposal

2016-08-25 Thread Yingyi Bu
Hi committers, The current code style (http://asterixdb.apache.org/dev-setup.html) that we use clears line breaks for javadocs. I'm proposing a one-line minor change in the code style: Attached is the updated code style file. Please reply to this message if you have different though

Re: [jira] [Commented] (ASTERIXDB-1587) upsert runtime assumes that the datatype comes from the same dataverse as the dataset

2016-08-17 Thread Yingyi Bu
Best, Yingyi On Wed, Aug 17, 2016 at 6:19 PM, Steven Jacobs wrote: > I already pushed the change :( > But in this case it seemed like the test was unnecessary, although I do > have tests on my BAD branch that require this change. > Steven > > On Wed, Aug 17, 2016 at 6:15 PM, Yingy

Re: [jira] [Commented] (ASTERIXDB-1587) upsert runtime assumes that the datatype comes from the same dataverse as the dataset

2016-08-17 Thread Yingyi Bu
Steven, Can you add a regression test before closing that issue? (Typically, fixing ASTERIXDB- should have a test case in the change.) Thanks! Best, Yingyi On Wed, Aug 17, 2016 at 6:05 PM, ASF subversion and git services (JIRA) < j...@apache.org> wrote: > > [ https://iss

Re: unstable builds

2016-08-17 Thread Yingyi Bu
It seems there is some existing instance on the docker instance and hence it complains that the port is already used. Best, Yingyi On Wed, Aug 17, 2016 at 10:37 AM, Kaveen Rodrigo wrote: > It seems some of the tests have failed, as evident from > https://asterix-jenkins.ics.uci.edu/job/asterix-

Re: Dead links

2016-07-29 Thread Yingyi Bu
tha...@gmail.com> wrote: > Thank you Yingyi. > > On Jul 29, 2016, at 10:01 PM, Yingyi Bu wrote: > > > > Hi Vignesh, > > > >We're still working on finalizing the SQL++ doc for AsterixDB. Here > is > > a draft that you can take a look: > >

Re: Dead links

2016-07-29 Thread Yingyi Bu
Hi Vignesh, We're still working on finalizing the SQL++ doc for AsterixDB. Here is a draft that you can take a look at: https://github.com/sigmod/sqlpp/blob/master/document.md The ETA of the completed version would be next week. Thanks! Best, Yingyi On Fri, Jul 29, 2016 at 9:21 A

Re: questions about index-only change

2016-07-08 Thread Yingyi Bu
xJjlg3S67zRY-M/edit?usp=sharing > > > > > Best, > Taewoo > > On Fri, Jul 8, 2016 at 1:09 PM, Yingyi Bu wrote: > >> Hi Taewoo, >> >> I have a few questions regarding to your index-only change (I'm cc-ing to >> dev just in case more people are in

questions about index-only change

2016-07-08 Thread Yingyi Bu
Hi Taewoo, I have a few questions regarding your index-only change (I'm cc-ing dev in case more people are interested in the topic): 1. Is there any design doc or write-up for the index-only change? 2. Do you have DDLs/queries that are designed for index-only performance testing?

Re: Unsigned integers data types

2016-06-16 Thread Yingyi Bu
>> Is there any database or SQL implementation supporting that? Ok, it turns out MySQL supports that, while Postgres, MS SQL and Hive do not have that. Best, Yingyi On Thu, Jun 16, 2016 at 3:40 PM, Yingyi Bu wrote: > >> I guess part of the reason why we do that is because Ja

Re: Unsigned integers data types

2016-06-16 Thread Yingyi Bu
>> I guess part of the reason why we do that is because Java used to lack native support of unsigned integers. Is there any database or SQL implementation supporting that? FYI: http://dba.stackexchange.com/questions/53050/why-arent-unsigned-integer-types-available-in-the-top-database-platforms Be

Re: new AsterixDB web interface demo up and running

2016-06-16 Thread Yingyi Bu
can you give me an AQl query which > will produce something like that. > > thanks in advance, > Kaveen > > On 16 June 2016 at 22:28, Yingyi Bu wrote: > > > Awesome! Thanks, Kaveen! > > > > Best, > > Yingyi > > > > On Thu, Jun 16, 2016 at 9:56 AM, K

Re: new AsterixDB web interface demo up and running

2016-06-16 Thread Yingyi Bu
Awesome! Thanks, Kaveen! Best, Yingyi On Thu, Jun 16, 2016 at 9:56 AM, Kaveen Rodrigo wrote: > Yikes, Thanks Yingyi, > > I never expected the results array to contain values, I'll get on it > > On 16 June 2016 at 21:48, Yingyi Bu wrote: > > > Very cool!!

Re: new AsterixDB web interface demo up and running

2016-06-16 Thread Yingyi Bu
Very cool!! It seems that there is an assumption that returned results are records? For example, you can try the following query: Q1: for $m in dataset Metadata.Dataset return $m.DatasetName; Q2: 1+1; Best, Yingyi On Thu, Jun 16, 2016 at 12:20 AM, Kaveen Rodrigo wrote: > Hello all, > > I h

Re: dataset for ASTERIXDB-1061?

2016-06-06 Thread Yingyi Bu
B9fobkjZFASiUEROOUJNVDZHZlk/view?usp=sharing > > On Fri, Jun 3, 2016 at 12:56 PM, Yingyi Bu wrote: > > > Could you upload that to the issue? > > Thanks! > > > > Best, > > Yingyi > > > > On Fri, Jun 3, 2016 at 12:53 PM, Ian Maxon wrote: > > &

Re: dataset for ASTERIXDB-1061?

2016-06-03 Thread Yingyi Bu
Could you upload that to the issue? Thanks! Best, Yingyi On Fri, Jun 3, 2016 at 12:53 PM, Ian Maxon wrote: > Yeah I think I have it. It was the google code issues JSON data. > > On Fri, Jun 3, 2016 at 11:31 AM, Yingyi Bu wrote: > > > Does anyone know/have the dataset for rep

dataset for ASTERIXDB-1061?

2016-06-03 Thread Yingyi Bu
Does anyone know/have the dataset for reproducing ASTERIXDB-1061? https://issues.apache.org/jira/browse/ASTERIXDB-1061 https://code.google.com/archive/p/asterixdb/issues/918 It seems to be some issue dataset. Best, Yingyi
