The video should be available for posting to Youtube soon. On Mon, Jan 18, 2016 at 10:37 PM, Kuien Liu <[email protected]> wrote:
> Hi Greg, > > Is the video replay available for guys outside Pivotal network to watch? > I made a call with Chenliang Wang in 7:15pm~8:13pm yesterday evening, he > explained his case in evaluating the property prices, he is thinking to > implement this function (GWR) inner databases. I explain the usage of > Madlib and how to enable PostGIS in GPDB. I think Gautum's presentation > on BayesianAnalysis is a good tutorial for him to kick-off his plan. > > Cheers, > Kuien Liu > > On Fri, Jan 15, 2016 at 10:53 AM, Greg Chase <[email protected]> wrote: > >> We always share the presentation and post the video replay as well. >> >> You can also drink lots of coffee, and join us if you are crazy enough. >> >> We like crazy people. >> >> This email encrypted by tiny buttons & fat thumbs, beta voice >> recognition, and autocorrect on my iPhone. >> >> On Jan 14, 2016, at 6:35 PM, Kuien Liu <[email protected]> wrote: >> >> Yes, 2AM on Saturday... May you please share Gautam's representation >> video after the call? >> >> Cheers, >> Kuien Liu >> >> On Wed, Jan 13, 2016 at 6:36 PM, Greg Chase <[email protected]> wrote: >> >>> As I said, our next call is not China-friendly: >>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%3CCAMg1VtnKB-WoyVqCstfMNCcJVOn2HKQQ6wNfqdovhgnB7zd5cw%40mail.gmail.com%3E >>> >>> This is this Friday, 10AM Pacifc Standard Time which is 2AM Saturday >>> Beijing time. >>> >>> We will arrange a next call in a couple weeks at an Asia friendly time >>> to support contributors in Asia. >>> >>> However, if you make the next call, we will make time for you to talk :) >>> >>> Regards, >>> >>> -Greg >>> >>> On Wed, Jan 13, 2016 at 2:18 AM, Kuien Liu <[email protected]> wrote: >>> >>>> Great, I would like to join it, please send me an invitation if >>>> possible. >>>> >>>> Cheers, >>>> Kuien Liu >>>> >>>> On Wed, Jan 13, 2016 at 6:10 PM, Greg Chase <[email protected]> wrote: >>>> >>>>> Perhaps ChenLiang would like to join a call with the MADlib community >>>>> and discuss his contribution? >>>>> >>>>> We have a call this Friday 10AM PST which is not a friendly time for >>>>> China, but we can schedule a next call at a friendlier time. >>>>> >>>>> This email encrypted by tiny buttons & fat thumbs, beta voice >>>>> recognition, and autocorrect on my iPhone. >>>>> >>>>> > On Jan 13, 2016, at 1:53 AM, Ivan Novick <[email protected]> wrote: >>>>> > >>>>> > Cool! >>>>> > >>>>> >> On Wed, Jan 13, 2016 at 5:52 PM, Kuien Liu <[email protected]> wrote: >>>>> >> >>>>> >> Got it, I think I can have a (f2f) talk with Chenliang Wang, as he >>>>> was >>>>> >> graduated from an institute of CAS which is not far from our Beijing >>>>> >> office, and I am familiar with his supervisor and lab director. So >>>>> I think >>>>> >> it is highly possible to find him directly in Beijing. >>>>> >> >>>>> >> Cheers, >>>>> >> Kuien Liu >>>>> >> >>>>> >>> On Wed, Jan 13, 2016 at 3:05 PM, Ivan Novick <[email protected]> >>>>> wrote: >>>>> >>> >>>>> >>> Hello ChenLiang, >>>>> >>> >>>>> >>> I have read your description of the interface and to my >>>>> understanding >>>>> >>> this is a supervised machine learning algorithm that supports >>>>> geometry >>>>> >>> data. Am I correct? >>>>> >>> >>>>> >>> What could be a good industrial use case for this model for some >>>>> >>> examples? Could you train a system based on locations and weather >>>>> to find >>>>> >>> bad signals for cell phone? Can you provide any real world example >>>>> >>> scenario where this type of model will be useful for end users? >>>>> >>> >>>>> >>> Also I am adding CC to some of my colleagues at work. Kuien, Max, >>>>> >>> Yandong can you provide any feedback on this proposal from your >>>>> Point of >>>>> >>> View? >>>>> >>> >>>>> >>> >>>>> >>> >>>>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%[email protected]%3E >>>>> >>> >>>>> >>> Cheers, >>>>> >>> Ivan >>>>> >>> >>>>> >>> >>>>> >>> On Wed, Jan 13, 2016 at 11:20 AM, WangChenLiang < >>>>> [email protected]> >>>>> >>> wrote: >>>>> >>> >>>>> >>>> Sorry, the link of attachment (http://1drv.ms/1ZjAiCg) is lost >>>>> in the >>>>> >>>> previous letter. >>>>> >>>> >>>>> >>>>> From: [email protected] >>>>> >>>>> To: [email protected] >>>>> >>>>> Subject: RE: How to contribute a spatial module to MADlib >>>>> manipulating >>>>> >>>> objects from PostGIS >>>>> >>>>> Date: Wed, 13 Jan 2016 11:09:17 +0800 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> Hi ,Caleb and Ivan! >>>>> >>>>> Thanks for your attention and help. I reviewed the previous >>>>> draft >>>>> >>>> and find >>>>> >>>>> something inappropriate. The archive containing the new draft and >>>>> >>>> example code >>>>> >>>>> is attached in the letter which would be more reasonable than >>>>> the >>>>> >>>> earlier edition. >>>>> >>>>> Please go over the manuscript and give suggestion again . >>>>> >>>>> The following are my answers to Caleb's questions. >>>>> >>>>> - Does this function require PostGIS to also be >>>>> >>>>> installed? If yes, it would be better >>>>> >>>>> if we disable the function if >>>>> >>>>> PostGIS is not present rather than introduce PostGIS >>>>> >>>>> as a dependency. (Similar >>>>> >>>>> to what we do with our requirement on the xml module with our >>>>> PMML >>>>> >>>> export >>>>> >>>>> functionality). >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> A:Yes. I am trying to avoid >>>>> >>>>> input any spatial datatypes in the interface of GWR. >>>>> >>>>> But I have no >>>>> >>>>> idea if it is necessary to provide simple alternative when >>>>> PostGIS is >>>>> >>>> not >>>>> >>>>> available. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> - What are the exact datatypes in the function >>>>> >>>>> definition for regression_location >>>>> >>>>> and prediction_location? >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> A:I changed the datatype >>>>> >>>>> to TEXT as the name of POINT or MULTIPOLYGON >>>>> >>>>> (centroid of >>>>> >>>>> each polygon for estimation for GWR). >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> - In the description it describes >>>>> >>>>> regression_location as "The length of >>>>> >>>>> regression_location must be equal to the length of >>>>> >>>>> source_table", which signals to me that it is likely intended to >>>>> be a >>>>> >>>>> column of the source table? If not then how is >>>>> >>>>> this length represented? >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> A: In the previous >>>>> >>>>> interface, I was trying to input a geometry field which could be >>>>> >>>>> from another >>>>> >>>>> table having different row number. Now, I alter the argument >>>>> >>>>> definition and make it >>>>> >>>>> to TEXT. It must be the name of geometry field in the >>>>> >>>>> source table. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> - You didn't mark regression_location as >>>>> >>>>> (optional). Due to the way Postgres >>>>> >>>>> functions work all optional arguments >>>>> >>>>> must come after all required arguments, >>>>> >>>>> so having a non-optional argument in >>>>> >>>>> the middle of the optional list must be >>>>> >>>>> avoided. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> A:Thanks for >>>>> >>>>> reminding me of this mistake. It is really my fault. The order of >>>>> >>>>> argument is changed in this edition. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> - I haven't read through the literature, but it is >>>>> >>>>> not immediately clear to me why >>>>> >>>>> prediction_location is a parameter to >>>>> >>>>> gwregr_train() rather than gwregr_predict(). >>>>> >>>>> Can you provide a brief >>>>> >>>>> description to the way that prediction_location is used in >>>>> >>>>> the model and its >>>>> >>>>> relationship to training and prediction. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> A: Actually, >>>>> >>>>> there are three kinds location data including location of sample >>>>> data, >>>>> >>>>> regression and prediction in the modeling of GWR. >>>>> >>>>> >>>>> >>>>> Locations of sample data indicate where is sample >>>>> >>>>> data. Locations of regression >>>>> >>>>> indicate where regression should be conducted. If >>>>> >>>>> it is identical to data location >>>>> >>>>> (in most instances),diagnostic information can >>>>> >>>>> be calculated. >>>>> >>>>> >>>>> >>>>> Locations of >>>>> >>>>> prediction indicate where coefficients should be predicted. It >>>>> should >>>>> >>>> be a >>>>> >>>>> parameter for a predict function. Putting regression_location >>>>> into >>>>> >>>> training >>>>> >>>>> function is just for omitting kernel arguments and maybe not >>>>> >>>> appropriate. In the process of >>>>> >>>>> training, GWR estimates weight and coefficients with distance >>>>> >>>>> between data_loctions and regression_loctions. Then, diagnostic >>>>> >>>> information are >>>>> >>>>> estimated when these two locations are identical. We can treat >>>>> >>>> data_locationas regression_location to simplify the process not >>>>> taking >>>>> >>>> different locations from >>>>> >>>>> data location in the training step. >>>>> >>>>> >>>>> >>>>> In the process of >>>>> >>>>> prediction , there are two new information including new >>>>> >>>>> independent variables and new locations. Therefore, coefficients >>>>> and >>>>> >>>> weight >>>>> >>>>> vector must be estimated >>>>> >>>>> again. GWR can >>>>> >>>>> estimate coefficients in any positions >>>>> >>>>> using independent variables of sample data. >>>>> >>>>> If we also provide independent >>>>> >>>>> variables in any positions,we can also obtain >>>>> >>>>> dependent variable in any position. So if we treat coefficients >>>>> at >>>>> >>>> prediction_location as a training result to put >>>>> >>>>> coefficients into prediction >>>>> >>>>> directly, it is reasonable to put it into training function. But >>>>> if we >>>>> >>>> treat it as a part of prediction, it is appropriate to set >>>>> >>>> predicton_location within predict function. And then, prediction >>>>> function >>>>> >>>> must require kernel >>>>> >>>>> parameters in addition to new data and locations for prediction. >>>>> Maybe >>>>> >>>> this way >>>>> >>>>> is more clear >>>>> >>>>> and reasonable, and is similar with others GWR packages in R. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> I >>>>> >>>>> rewrote the description of interface taking your suggestion into >>>>> >>>> account. I >>>>> >>>>> moved >>>>> >>>>> prediction_location into predict function and modified >>>>> >>>>> some mistake and >>>>> >>>>> unnecessary arguments. The new draft of interface design is >>>>> attached >>>>> >>>> in the >>>>> >>>>> letter. >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> Regards, >>>>> >>>>> >>>>> >>>>> ChenLiang Wang >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>>> From: [email protected] >>>>> >>>>>> Date: Tue, 5 Jan 2016 10:31:20 -0800 >>>>> >>>>>> Subject: Re: How to contribute a spatial module to MADlib >>>>> >>>> manipulating objects from PostGIS >>>>> >>>>>> To: [email protected] >>>>> >>>>>> >>>>> >>>>>> Hi ChenLiang, >>>>> >>>>>> >>>>> >>>>>> Thanks for taking the next step to flush this out. >>>>> >>>>>> >>>>> >>>>>> As a whole: >>>>> >>>>>> - naming and basic interface seems consistent with existing >>>>> >>>> conventions. >>>>> >>>>>> - names are descriptive. >>>>> >>>>>> - references to the literature is provided. >>>>> >>>>>> - functionality is complementary to the library. >>>>> >>>>>> >>>>> >>>>>> What is not clear to me is: >>>>> >>>>>> - Does this function require PostGIS to also be installed? If >>>>> yes, >>>>> >>>> it >>>>> >>>>>> would be better if we disable the function if PostGIS is not >>>>> present >>>>> >>>> rather >>>>> >>>>>> than introduce PostGIS as a dependency. (Similar to what we do >>>>> with >>>>> >>>> our >>>>> >>>>>> requirement on the xml module with our PMML export >>>>> functionality). >>>>> >>>>>> - What are the exact datatypes in the function definition for >>>>> >>>>>> regression_location and prediction_location? >>>>> >>>>>> - In the description it describes regression_location as "The >>>>> length >>>>> >>>> of >>>>> >>>>>> regression_location must be equal to the length of >>>>> source_table", >>>>> >>>> which >>>>> >>>>>> signals to me that it is likely intended to be a column of the >>>>> source >>>>> >>>>>> table? If not then how is this length represented? >>>>> >>>>>> - You didn't mark regression_location as (optional). Due to >>>>> the way >>>>> >>>>>> Postgres functions work all optional arguments must come after >>>>> all >>>>> >>>> required >>>>> >>>>>> arguments, so having a non-optional argument in the middle of >>>>> the >>>>> >>>> optional >>>>> >>>>>> list must be avoided. >>>>> >>>>>> - I haven't read through the literature, but it is not >>>>> immediately >>>>> >>>> clear to >>>>> >>>>>> me why prediction_location is a parameter to gwregr_train() >>>>> rather >>>>> >>>> than >>>>> >>>>>> gwregr_predict(). Can you provide a brief description to the >>>>> way >>>>> >>>> that >>>>> >>>>>> prediction_location is used in the model and its relationship to >>>>> >>>> training >>>>> >>>>>> and prediction. >>>>> >>>>>> >>>>> >>>>>> Regards, >>>>> >>>>>> Caleb >>>>> >>>>> ChenLiang 要与你在 OneDrive >>>>> >>>> 上共享一个文件。要查看该文件,请单击下面的链接。 >>>>> >>>> gwr4madlib.rar >>>>> >> >>>>> >>>> >>>> >>> >> >
