Hello All,
Seeking some advice regarding the following:
I have a JSON ETL task. You know we all done some ETL in our lives before -
extract data, apply some transformation to it, and load it back.
I have a fairly huge amount of JSON that I need to iterate over and check for
the
How are you folks getting over the learning curves associated with things like
Nifi and AirFlow ?
> On May 28, 2016, at 9:50 AM, Suneel Marthi wrote:
>
> Debo,
>
> On Tue, May 17, 2016 at 9:18 PM, Andrew Palumbo wrote:
>
>> We are certainly interested
To the community, active committers, etc.
> On Jun 1, 2016, at 11:01 AM, Suneel Marthi <smar...@apache.org> wrote:
>
> Was that question directed to the community or were u asking urself loud ?
>
> On Wed, Jun 1, 2016 at 10:48 AM, Khurrum Nasim <khurrum.n
Sounds good - I’ll take a look.
Thanks,
Khurrum
> On Mar 15, 2016, at 5:12 PM, Khurrum Nasim <khurrum.na...@useitc.com> wrote:
>
> Hi,
>
> How do I get committer access to this project ? I am interested in becoming
> an active contributor.
>
>
> Thanks,
> Khurrum
>
Hi,
How do I get committer access to this project ? I am interested in becoming an
active contributor.
Thanks,
Khurrum
Thanks Dimirtry.
I take a look at see where I can start pitching in. Do I need contributor
access ? how would I create feature branch of my work ?
Khurrum
> On Mar 30, 2016, at 1:12 PM, Dmitriy Lyubimov wrote:
>
> Oh but of course! please do!
>
> You may work on any
:57 PM, Dmitriy Lyubimov <dlie...@gmail.com> wrote:
>
> PS You may also want to sign up with ASF Jira so we can assign issues to
> yourself.
>
> On Wed, Mar 30, 2016 at 11:52 AM, Dmitriy Lyubimov <dlie...@gmail.com>
> wrote:
>
>>
>>
>> On We
you with the algorithms and linear algebra stuff.
>
>
> Welcome aboard !!
>
>
> On Wed, Mar 30, 2016 at 3:05 PM, Khurrum Nasim <khurrum.na...@useitc.com>
> wrote:
>
>> Thanks for the advice Dimitry. I’m already signed up on ASF jira.My
>> handle is “
@Prakash - Albeit I’m a Mahout noob - if you can represent your problem as a
network with 2d input then yes Mahout can be used (so i’ve heard).
IMO - every machine based computation problem can be represented as a graph -
although this may not always be optimal.
Taking this notion of fuzzy
ome samples later if
> you're interested.
>
> Sent from my iPhone
>
>> On Apr 28, 2016, at 10:41 AM, Khurrum Nasim <khurrum.na...@useitc.com> wrote:
>>
>>
>> What type of JSON payload size are we talking about here ?
>>
>>> On Apr 28, 2
@Saikat- why use EL instead of Lucene directly.
> On Apr 28, 2016, at 12:08 PM, Saikat Kanjilal wrote:
>
> This is great information thank you, based on this recommendation I won't
> create a JIRA but start work on my project and when the code approaches the
>
at I plan on tying into
> mahout. I plan on wrapping some of the clustering algorithms that I
> implement using Mahout and Spark as a service which can then make calls into
> other services (namely elasticsearch and neo4j graph service).
>
> Sent from my iPhone
>
>> O
Interesting.
> On May 21, 2016, at 10:30 AM, Steven NASa wrote:
>
> Hi Pat,
>
> Thank you for your reply, I fully understand that core algorithms and data
> are 2 different part of the system, this is why we have 2 major idea: "Big
> data" and "Machine Learning".
>
> My
Sounds more like demand prediction to me.
However your system should be able to interact with other non-C/C++ systems.
There is something called Apache Thrift.
Which brings me to the following - would it be a valuable feature to Mahout
library to provide
connectivity with other systems
hey all ,
#1. where is all the matrix operations code lying in mahout or which packages
i should say ?
#2. i have a fairly large JSON string - My question is how can I apply mahout
library to it to analyze this string by providing/creating a training model for
this string and hopefully
Hey All,
I’d like to know if Mahout uses any randomized algorithms. I’m thinking it
probably does. Can somebody point me to the packages that utilized randomized
algos.
Thanks,
Khurrum
Re: Mahout contributions
>>
>> I want to start with social data as an example, for example data returned
>> from FB graph API as well user Twitter data, will send some samples later if
>> you're interested.
>>
>> Sent from my iPhone
>>
>>> On Apr 2
case of spark), mostly are using their standard hash samplers (which are in
> math speak are uniform multinomial samplers really).
>
> On Mon, May 2, 2016 at 9:25 AM, Khurrum Nasim <khurrum.na...@useitc.com>
> wrote:
>
>> Hey Dimitri -
>>
>> Yes I meant probabil
ugh i have done some things
> outside the public domain, nothing has been contributed.
>
> You are very welcome to try something if you don't have big constraints on
> oss contribution.
>
> -d
>
> On Mon, May 2, 2016 at 7:49 AM, Khurrum Nasim <khurrum.na...@useitc.com>
>
ayes inference.
>>
>> the parallelization strategies are are just standard spark mechanisms (in
>> case of spark), mostly are using their standard hash samplers (which are in
>> math speak are uniform multinomial samplers really).
>>
>> On Mon, May 2, 2016 at
>>
>> the parallelization strategies are are just standard spark mechanisms (in
>> case of spark), mostly are using their standard hash samplers (which are in
>> math speak are uniform multinomial samplers really).
>>
>> On Mon, May 2, 2016 at 9:25 AM, Khurrum Nasim &l
.
>
>
> Welcome aboard !!
>
>
> On Wed, Mar 30, 2016 at 3:05 PM, Khurrum Nasim <khurrum.na...@useitc.com>
> wrote:
>
>> Thanks for the advice Dimitry. I’m already signed up on ASF jira.My
>> handle is “nasimk”
>>
>> Do I need to be a
w for larger problems though, my
>>> investigation points that there are architectural problems in spark that
>>> are hard to overcome at this point for high IO algorithms.
>>>
>>> On Mon, Apr 18, 2016 at 11:49 AM, Khurrum Nasim <khurrum.na...@useit
Congrats.
Sent from my iPhone
> On Apr 20, 2016, at 8:33 PM, Andrew Palumbo wrote:
>
> Thanks you guys!
>
> Original message
> From: Andrew Musselman
> Date: 04/20/2016 8:14 PM (GMT-05:00)
> To: dev@mahout.apache.org,
andy is the popular guy !
> On Apr 21, 2016, at 11:21 AM, Pat Ferrel wrote:
>
> Congratulations Andy, well deserved.
>
> On Apr 21, 2016, at 6:01 AM, Shannon Quinn wrote:
>
> Thanks Suneel for your excellent leadership.
>
> Congratulations
yes - stickers would be nice.
Thanks,
Khurrum.
On Jan 31, 2017, 6:28 AM -0500, Sharan F , wrote:
> Hi All
>
> Just for info - I've been talking to Andrew Palumbo about getting some
> Mahout stickers printed for the community to use and also generally to
> see if there was
Codacy is free for open source projects. And does a decent job of reviewing
your code.
Might be worthwhile to have it review mahout forks and branches.
Khurrum
> On Sep 26, 2016, at 1:21 PM, Suneel Marthi wrote:
>
> @Tiramisu most sparse networks like DBNs are modeled
Can mahout be used for self driving tech ?
Thanks,
Khurrum.
On Apr 24, 2017, 10:34 PM -0400, KHATWANI PARTH BHARAT
, wrote:
> @Trevor and @Dmitriy
>
> Tough Bug in Aggregating Transpose is fixed. One issue is still left which
> is causing hindrance in
t 4:57 PM, Andrew Palumbo <ap@outlook.com> wrote:
>
> > I disagree with it being too bland- I find the open space and the
> > formatting much easier to navigate and read docs from.
> >
> >
> >
> > From: Khurrum Nasim <
Where is the conference.
Sent from my iPhone
> On May 28, 2017, at 2:33 PM, Andrew Palumbo wrote:
>
> I won't be attending but would be happy to help any way I can, given the
> timeline, and my schedule .. (I have some time restraints over the next 6 -
> 8 weeks, so
Too bland looking
Thanks,
Khurrum.
On May 8, 2017, 1:53 PM -0400, Trevor Grant , wrote:
> Hey all,
>
> The new website is staged. You can view it here
>
> http://mahout.staging.apache.org/
>
> Won't be publishing for a bit yet- there are still a few JIRAs left to do
>
31 matches
Mail list logo