Hi Greg,

Is the video replay available for guys outside Pivotal network to watch?
I made a call with Chenliang Wang in 7:15pm~8:13pm yesterday evening, he
explained his case in evaluating the property prices, he is thinking to
implement this function (GWR) inner databases. I explain the usage of
Madlib and how to enable PostGIS in GPDB. I think Gautum's presentation
on BayesianAnalysis is a good tutorial for him to kick-off his plan.

Cheers,
Kuien Liu

On Fri, Jan 15, 2016 at 10:53 AM, Greg Chase <[email protected]> wrote:

> We always share the presentation and post the video replay as well.
>
> You can also drink lots of coffee, and join us if you are crazy enough.
>
> We like crazy people.
>
> This email encrypted by tiny buttons & fat thumbs, beta voice recognition,
> and autocorrect on my iPhone.
>
> On Jan 14, 2016, at 6:35 PM, Kuien Liu <[email protected]> wrote:
>
> Yes, 2AM on Saturday... May you please share Gautam's representation video
> after the call?
>
> Cheers,
> Kuien Liu
>
> On Wed, Jan 13, 2016 at 6:36 PM, Greg Chase <[email protected]> wrote:
>
>> As I said, our next call is not China-friendly:
>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%3CCAMg1VtnKB-WoyVqCstfMNCcJVOn2HKQQ6wNfqdovhgnB7zd5cw%40mail.gmail.com%3E
>>
>> This is this Friday, 10AM Pacifc Standard Time which is 2AM Saturday
>> Beijing time.
>>
>> We will arrange a next call in a couple weeks at an Asia friendly time to
>> support contributors in Asia.
>>
>> However, if you make the next call, we will make time for you to talk :)
>>
>> Regards,
>>
>> -Greg
>>
>> On Wed, Jan 13, 2016 at 2:18 AM, Kuien Liu <[email protected]> wrote:
>>
>>> Great, I would like to join it, please send me an invitation if possible.
>>>
>>> Cheers,
>>> Kuien Liu
>>>
>>> On Wed, Jan 13, 2016 at 6:10 PM, Greg Chase <[email protected]> wrote:
>>>
>>>> Perhaps ChenLiang would like to join a call with the MADlib community
>>>> and discuss his contribution?
>>>>
>>>> We have a call this Friday 10AM PST which is not a friendly time for
>>>> China, but we can schedule a next call at a friendlier time.
>>>>
>>>> This email encrypted by tiny buttons & fat thumbs, beta voice
>>>> recognition, and autocorrect on my iPhone.
>>>>
>>>> > On Jan 13, 2016, at 1:53 AM, Ivan Novick <[email protected]> wrote:
>>>> >
>>>> > Cool!
>>>> >
>>>> >> On Wed, Jan 13, 2016 at 5:52 PM, Kuien Liu <[email protected]> wrote:
>>>> >>
>>>> >> Got it, I think I can have a (f2f) talk with Chenliang Wang, as he
>>>> was
>>>> >> graduated from an institute of CAS which is not far from our Beijing
>>>> >> office, and I am familiar with his supervisor and lab director. So I
>>>> think
>>>> >> it is highly possible to find him directly in Beijing.
>>>> >>
>>>> >> Cheers,
>>>> >> Kuien Liu
>>>> >>
>>>> >>> On Wed, Jan 13, 2016 at 3:05 PM, Ivan Novick <[email protected]>
>>>> wrote:
>>>> >>>
>>>> >>> Hello ChenLiang,
>>>> >>>
>>>> >>> I have read your description of the interface and to my
>>>> understanding
>>>> >>> this is a supervised machine learning algorithm that supports
>>>> geometry
>>>> >>> data.  Am I correct?
>>>> >>>
>>>> >>> What could be a good industrial use case for this model for some
>>>> >>> examples?  Could you train a system based on locations and weather
>>>> to find
>>>> >>> bad signals for cell phone?  Can you provide any real world example
>>>> >>> scenario where this type of model will be useful for end users?
>>>> >>>
>>>> >>> Also I am adding CC to some of my colleagues at work.  Kuien, Max,
>>>> >>> Yandong can you provide any feedback on this proposal from your
>>>> Point of
>>>> >>> View?
>>>> >>>
>>>> >>>
>>>> >>>
>>>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%[email protected]%3E
>>>> >>>
>>>> >>> Cheers,
>>>> >>> Ivan
>>>> >>>
>>>> >>>
>>>> >>> On Wed, Jan 13, 2016 at 11:20 AM, WangChenLiang <
>>>> [email protected]>
>>>> >>> wrote:
>>>> >>>
>>>> >>>> Sorry, the link of attachment (http://1drv.ms/1ZjAiCg) is lost in
>>>> the
>>>> >>>> previous letter.
>>>> >>>>
>>>> >>>>> From: [email protected]
>>>> >>>>> To: [email protected]
>>>> >>>>> Subject: RE: How to contribute a spatial module to MADlib
>>>> manipulating
>>>> >>>> objects from PostGIS
>>>> >>>>> Date: Wed, 13 Jan 2016 11:09:17 +0800
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> Hi   ,Caleb and Ivan!
>>>> >>>>>   Thanks for your attention and help. I reviewed the previous
>>>> draft
>>>> >>>> and find
>>>> >>>>> something inappropriate. The archive containing the new draft and
>>>> >>>> example code
>>>> >>>>> is attached in the letter which would be more reasonable  than the
>>>> >>>> earlier edition.
>>>> >>>>> Please go over the manuscript and give suggestion again .
>>>> >>>>> The following are my answers to Caleb's questions.
>>>> >>>>> - Does this function require PostGIS to also be
>>>> >>>>> installed? If yes, it would be better
>>>> >>>>> if we disable the function if
>>>> >>>>> PostGIS is not present rather than introduce PostGIS
>>>> >>>>> as a dependency. (Similar
>>>> >>>>> to what we do with our requirement on the xml module with our PMML
>>>> >>>> export
>>>> >>>>> functionality).
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> A:Yes. I am trying to avoid
>>>> >>>>> input any spatial datatypes in the interface of GWR.
>>>> >>>>> But I have no
>>>> >>>>> idea if it is necessary to provide simple alternative when
>>>> PostGIS is
>>>> >>>> not
>>>> >>>>> available.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> - What are the exact datatypes in the function
>>>> >>>>> definition for regression_location
>>>> >>>>> and prediction_location?
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> A:I changed the datatype
>>>> >>>>> to TEXT as the name of POINT or MULTIPOLYGON
>>>> >>>>> (centroid of
>>>> >>>>> each polygon for estimation for GWR).
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> - In the description it describes
>>>> >>>>> regression_location as "The length of
>>>> >>>>> regression_location must be equal to the length of
>>>> >>>>> source_table", which signals to me that it is likely intended to
>>>> be a
>>>> >>>>> column of the source table? If not then how is
>>>> >>>>> this length represented?
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> A: In the previous
>>>> >>>>> interface, I was trying to input a geometry field which could be
>>>> >>>>> from another
>>>> >>>>> table having different row number. Now, I alter the argument
>>>> >>>>> definition and make it
>>>> >>>>> to TEXT. It must be the name of geometry field in the
>>>> >>>>> source table.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> - You didn't mark regression_location as
>>>> >>>>> (optional). Due to the way Postgres
>>>> >>>>> functions work all optional arguments
>>>> >>>>> must come after all required arguments,
>>>> >>>>> so having a non-optional argument in
>>>> >>>>> the middle of the optional list must be
>>>> >>>>> avoided.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> A:Thanks for
>>>> >>>>> reminding me of this mistake. It is really my fault. The order of
>>>> >>>>> argument is changed in this edition.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> - I haven't read through the literature, but it is
>>>> >>>>> not immediately clear to me why
>>>> >>>>> prediction_location is a parameter to
>>>> >>>>> gwregr_train() rather than gwregr_predict().
>>>> >>>>> Can you provide a brief
>>>> >>>>> description to the way that prediction_location is used in
>>>> >>>>> the model and its
>>>> >>>>> relationship to training and prediction.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> A: Actually,
>>>> >>>>> there are three kinds location data including location of sample
>>>> data,
>>>> >>>>> regression and prediction in the modeling of GWR.
>>>> >>>>>
>>>> >>>>> Locations of sample data indicate where is sample
>>>> >>>>> data. Locations of regression
>>>> >>>>> indicate where regression should be conducted. If
>>>> >>>>> it is identical to data location
>>>> >>>>> (in most instances),diagnostic information can
>>>> >>>>> be calculated.
>>>> >>>>>
>>>> >>>>> Locations of
>>>> >>>>> prediction indicate where coefficients should be predicted. It
>>>> should
>>>> >>>> be a
>>>> >>>>> parameter for a predict function. Putting regression_location into
>>>> >>>> training
>>>> >>>>> function is just for omitting kernel arguments and maybe not
>>>> >>>> appropriate.  In the process of
>>>> >>>>> training, GWR estimates weight and coefficients with distance
>>>> >>>>> between data_loctions and regression_loctions. Then, diagnostic
>>>> >>>> information are
>>>> >>>>> estimated when these two locations are identical. We can treat
>>>> >>>> data_locationas regression_location to simplify the process not
>>>> taking
>>>> >>>> different locations from
>>>> >>>>> data location in the training step.
>>>> >>>>>
>>>> >>>>>  In the process of
>>>> >>>>> prediction , there are two new information including new
>>>> >>>>> independent variables and new locations. Therefore, coefficients
>>>> and
>>>> >>>> weight
>>>> >>>>> vector must be estimated
>>>> >>>>> again. GWR can
>>>> >>>>> estimate coefficients in any positions
>>>> >>>>> using independent variables of sample data.
>>>> >>>>> If we also provide independent
>>>> >>>>> variables in any positions,we can also obtain
>>>> >>>>> dependent variable in any position. So if we treat coefficients at
>>>> >>>> prediction_location as a training result to put
>>>> >>>>> coefficients into prediction
>>>> >>>>> directly, it is reasonable to put it into training function. But
>>>> if we
>>>> >>>> treat it as a part of prediction, it is appropriate to set
>>>> >>>> predicton_location within predict function. And then, prediction
>>>> function
>>>> >>>> must require kernel
>>>> >>>>> parameters in addition to new data and locations for prediction.
>>>> Maybe
>>>> >>>> this way
>>>> >>>>> is more clear
>>>> >>>>> and reasonable, and is similar with others GWR packages in R.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>       I
>>>> >>>>> rewrote the description of interface taking your suggestion into
>>>> >>>> account. I
>>>> >>>>> moved
>>>> >>>>> prediction_location into predict function and modified
>>>> >>>>> some mistake and
>>>> >>>>> unnecessary arguments.  The new draft of interface design is
>>>> attached
>>>> >>>> in the
>>>> >>>>> letter.
>>>> >>>>>
>>>> >>>>>
>>>> >>>>> Regards,
>>>> >>>>>
>>>> >>>>> ChenLiang Wang
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>
>>>> >>>>>> From: [email protected]
>>>> >>>>>> Date: Tue, 5 Jan 2016 10:31:20 -0800
>>>> >>>>>> Subject: Re: How to contribute a spatial module to MADlib
>>>> >>>> manipulating objects from PostGIS
>>>> >>>>>> To: [email protected]
>>>> >>>>>>
>>>> >>>>>> Hi ChenLiang,
>>>> >>>>>>
>>>> >>>>>> Thanks for taking the next step to flush this out.
>>>> >>>>>>
>>>> >>>>>> As a whole:
>>>> >>>>>> - naming and basic interface seems consistent with existing
>>>> >>>> conventions.
>>>> >>>>>> - names are descriptive.
>>>> >>>>>> - references to the literature is provided.
>>>> >>>>>> - functionality is complementary to the library.
>>>> >>>>>>
>>>> >>>>>> What is not clear to me is:
>>>> >>>>>> - Does this function require PostGIS to also be installed?  If
>>>> yes,
>>>> >>>> it
>>>> >>>>>> would be better if we disable the function if PostGIS is not
>>>> present
>>>> >>>> rather
>>>> >>>>>> than introduce PostGIS as a dependency.  (Similar to what we do
>>>> with
>>>> >>>> our
>>>> >>>>>> requirement on the xml module with our PMML export
>>>> functionality).
>>>> >>>>>> - What are the exact datatypes in the function definition for
>>>> >>>>>> regression_location and prediction_location?
>>>> >>>>>> - In the description it describes regression_location as "The
>>>> length
>>>> >>>> of
>>>> >>>>>> regression_location must be equal to the length of source_table",
>>>> >>>> which
>>>> >>>>>> signals to me that it is likely intended to be a column of the
>>>> source
>>>> >>>>>> table?  If not then how is this length represented?
>>>> >>>>>> - You didn't mark regression_location as (optional).  Due to the
>>>> way
>>>> >>>>>> Postgres functions work all optional arguments must come after
>>>> all
>>>> >>>> required
>>>> >>>>>> arguments, so having a non-optional argument in the middle of the
>>>> >>>> optional
>>>> >>>>>> list must be avoided.
>>>> >>>>>> - I haven't read through the literature, but it is not
>>>> immediately
>>>> >>>> clear to
>>>> >>>>>> me why prediction_location is a parameter to gwregr_train()
>>>> rather
>>>> >>>> than
>>>> >>>>>> gwregr_predict().  Can you provide a brief description to the way
>>>> >>>> that
>>>> >>>>>> prediction_location is used in the model and its relationship to
>>>> >>>> training
>>>> >>>>>> and prediction.
>>>> >>>>>>
>>>> >>>>>> Regards,
>>>> >>>>>>  Caleb
>>>> >>>>>                                    ChenLiang 要与你在 OneDrive
>>>> >>>> 上共享一个文件。要查看该文件,请单击下面的链接。
>>>> >>>>                                                    gwr4madlib.rar
>>>> >>
>>>>
>>>
>>>
>>
>

Reply via email to