question

2016-06-01 Thread Khurrum Nasim
Hello All, Seeking some advice regarding the following: I have a JSON ETL task. You know we all done some ETL in our lives before - extract data, apply some transformation to it, and load it back. I have a fairly huge amount of JSON that I need to iterate over and check for the

Re: [NEW member] Hi

2016-06-01 Thread Khurrum Nasim
How are you folks getting over the learning curves associated with things like Nifi and AirFlow ? > On May 28, 2016, at 9:50 AM, Suneel Marthi wrote: > > Debo, > > On Tue, May 17, 2016 at 9:18 PM, Andrew Palumbo wrote: > >> We are certainly interested

Re: [NEW member] Hi

2016-06-01 Thread Khurrum Nasim
To the community, active committers, etc. > On Jun 1, 2016, at 11:01 AM, Suneel Marthi <smar...@apache.org> wrote: > > Was that question directed to the community or were u asking urself loud ? > > On Wed, Jun 1, 2016 at 10:48 AM, Khurrum Nasim <khurrum.n

Re: [jira] [Created] (MAHOUT-1815) dsqDist(X,Y) and dsqDist(X) failing in flink tests.

2016-03-15 Thread Khurrum Nasim
Sounds good - I’ll take a look. Thanks, Khurrum > On Mar 15, 2016, at 5:12 PM, Khurrum Nasim <khurrum.na...@useitc.com> wrote: > > Hi, > > How do I get committer access to this project ? I am interested in becoming > an active contributor. > > > Thanks, > Khurrum >

Re: [jira] [Created] (MAHOUT-1815) dsqDist(X,Y) and dsqDist(X) failing in flink tests.

2016-03-15 Thread Khurrum Nasim
Hi, How do I get committer access to this project ? I am interested in becoming an active contributor. Thanks, Khurrum

Re: [jira] [Commented] (MAHOUT-1788) spark-itemsimilarity integration test script cleanup

2016-03-30 Thread Khurrum Nasim
Thanks Dimirtry. I take a look at see where I can start pitching in. Do I need contributor access ? how would I create feature branch of my work ? Khurrum > On Mar 30, 2016, at 1:12 PM, Dmitriy Lyubimov wrote: > > Oh but of course! please do! > > You may work on any

Re: [jira] [Commented] (MAHOUT-1788) spark-itemsimilarity integration test script cleanup

2016-03-30 Thread Khurrum Nasim
:57 PM, Dmitriy Lyubimov <dlie...@gmail.com> wrote: > > PS You may also want to sign up with ASF Jira so we can assign issues to > yourself. > > On Wed, Mar 30, 2016 at 11:52 AM, Dmitriy Lyubimov <dlie...@gmail.com> > wrote: > >> >> >> On We

Re: [jira] [Commented] (MAHOUT-1788) spark-itemsimilarity integration test script cleanup

2016-03-31 Thread Khurrum Nasim
you with the algorithms and linear algebra stuff. > > > Welcome aboard !! > > > On Wed, Mar 30, 2016 at 3:05 PM, Khurrum Nasim <khurrum.na...@useitc.com> > wrote: > >> Thanks for the advice Dimitry. I’m already signed up on ASF jira.My >> handle is “

Re: About reuters-fkmeans-centroids

2016-04-28 Thread Khurrum Nasim
@Prakash - Albeit I’m a Mahout noob - if you can represent your problem as a network with 2d input then yes Mahout can be used (so i’ve heard). IMO - every machine based computation problem can be represented as a graph - although this may not always be optimal. Taking this notion of fuzzy

Re: Mahout contributions

2016-04-28 Thread Khurrum Nasim
ome samples later if > you're interested. > > Sent from my iPhone > >> On Apr 28, 2016, at 10:41 AM, Khurrum Nasim <khurrum.na...@useitc.com> wrote: >> >> >> What type of JSON payload size are we talking about here ? >> >>> On Apr 28, 2

Re: Mahout contributions

2016-04-28 Thread Khurrum Nasim
@Saikat- why use EL instead of Lucene directly. > On Apr 28, 2016, at 12:08 PM, Saikat Kanjilal wrote: > > This is great information thank you, based on this recommendation I won't > create a JIRA but start work on my project and when the code approaches the >

Re: Mahout contributions

2016-04-28 Thread Khurrum Nasim
at I plan on tying into > mahout. I plan on wrapping some of the clustering algorithms that I > implement using Mahout and Spark as a service which can then make calls into > other services (namely elasticsearch and neo4j graph service). > > Sent from my iPhone > >> O

Re: [Hello] from NASa

2016-05-22 Thread Khurrum Nasim
Interesting. > On May 21, 2016, at 10:30 AM, Steven NASa wrote: > > Hi Pat, > > Thank you for your reply, I fully understand that core algorithms and data > are 2 different part of the system, this is why we have 2 major idea: "Big > data" and "Machine Learning". > > My

Re: [Hello] from NASa

2016-05-20 Thread Khurrum Nasim
Sounds more like demand prediction to me. However your system should be able to interact with other non-C/C++ systems. There is something called Apache Thrift. Which brings me to the following - would it be a valuable feature to Mahout library to provide connectivity with other systems

Re: LLR quick clarification

2016-05-12 Thread Khurrum Nasim
hey all , #1. where is all the matrix operations code lying in mahout or which packages i should say ? #2. i have a fairly large JSON string - My question is how can I apply mahout library to it to analyze this string by providing/creating a training model for this string and hopefully

stochastic nature

2016-05-02 Thread Khurrum Nasim
Hey All, I’d like to know if Mahout uses any randomized algorithms. I’m thinking it probably does. Can somebody point me to the packages that utilized randomized algos. Thanks, Khurrum

Re: Mahout contributions

2016-05-02 Thread Khurrum Nasim
Re: Mahout contributions >> >> I want to start with social data as an example, for example data returned >> from FB graph API as well user Twitter data, will send some samples later if >> you're interested. >> >> Sent from my iPhone >> >>> On Apr 2

Re: stochastic nature

2016-05-02 Thread Khurrum Nasim
case of spark), mostly are using their standard hash samplers (which are in > math speak are uniform multinomial samplers really). > > On Mon, May 2, 2016 at 9:25 AM, Khurrum Nasim <khurrum.na...@useitc.com> > wrote: > >> Hey Dimitri - >> >> Yes I meant probabil

Re: stochastic nature

2016-05-02 Thread Khurrum Nasim
ugh i have done some things > outside the public domain, nothing has been contributed. > > You are very welcome to try something if you don't have big constraints on > oss contribution. > > -d > > On Mon, May 2, 2016 at 7:49 AM, Khurrum Nasim <khurrum.na...@useitc.com> >

Re: stochastic nature

2016-05-03 Thread Khurrum Nasim
ayes inference. >> >> the parallelization strategies are are just standard spark mechanisms (in >> case of spark), mostly are using their standard hash samplers (which are in >> math speak are uniform multinomial samplers really). >> >> On Mon, May 2, 2016 at

Re: stochastic nature

2016-05-03 Thread Khurrum Nasim
>> >> the parallelization strategies are are just standard spark mechanisms (in >> case of spark), mostly are using their standard hash samplers (which are in >> math speak are uniform multinomial samplers really). >> >> On Mon, May 2, 2016 at 9:25 AM, Khurrum Nasim &l

Re: [jira] [Commented] (MAHOUT-1788) spark-itemsimilarity integration test script cleanup

2016-04-18 Thread Khurrum Nasim
. > > > Welcome aboard !! > > > On Wed, Mar 30, 2016 at 3:05 PM, Khurrum Nasim <khurrum.na...@useitc.com> > wrote: > >> Thanks for the advice Dimitry. I’m already signed up on ASF jira.My >> handle is “nasimk” >> >> Do I need to be a

Re: [jira] [Commented] (MAHOUT-1788) spark-itemsimilarity integration test script cleanup

2016-04-19 Thread Khurrum Nasim
w for larger problems though, my >>> investigation points that there are architectural problems in spark that >>> are hard to overcome at this point for high IO algorithms. >>> >>> On Mon, Apr 18, 2016 at 11:49 AM, Khurrum Nasim <khurrum.na...@useit

Re: Congratulations to our new Chair

2016-04-20 Thread khurrum . nasim
Congrats. Sent from my iPhone > On Apr 20, 2016, at 8:33 PM, Andrew Palumbo wrote: > > Thanks you guys! > > Original message > From: Andrew Musselman > Date: 04/20/2016 8:14 PM (GMT-05:00) > To: dev@mahout.apache.org,

Re: Congratulations to our new Chair

2016-04-21 Thread Khurrum Nasim
andy is the popular guy ! > On Apr 21, 2016, at 11:21 AM, Pat Ferrel wrote: > > Congratulations Andy, well deserved. > > On Apr 21, 2016, at 6:01 AM, Shannon Quinn wrote: > > Thanks Suneel for your excellent leadership. > > Congratulations

Re: FOSDEM 2017 Open Source Conference - Brussels

2017-01-31 Thread Khurrum Nasim
yes - stickers would be nice. Thanks, Khurrum. On Jan 31, 2017, 6:28 AM -0500, Sharan F , wrote: > Hi All > > Just for info - I've been talking to Andrew Palumbo about getting some > Mahout stickers printed for the community to use and also generally to > see if there was

code review

2016-10-04 Thread Khurrum Nasim
Codacy is free for open source projects. And does a decent job of reviewing your code. Might be worthwhile to have it review mahout forks and branches. Khurrum > On Sep 26, 2016, at 1:21 PM, Suneel Marthi wrote: > > @Tiramisu most sparse networks like DBNs are modeled

Re: Trying to write the KMeans Clustering Using "Apache Mahout Samsara"

2017-04-25 Thread Khurrum Nasim
Can mahout be used for self driving tech ? Thanks, Khurrum. On Apr 24, 2017, 10:34 PM -0400, KHATWANI PARTH BHARAT , wrote: > @Trevor and @Dmitriy > > Tough Bug in Aggregating Transpose is fixed. One issue is still left which > is causing hindrance in

Re: New Website is Staged

2017-05-09 Thread Khurrum Nasim
t 4:57 PM, Andrew Palumbo <ap@outlook.com> wrote: > > > I disagree with it being too bland- I find the open space and the > > formatting much easier to navigate and read docs from. > > > > > > > > From: Khurrum Nasim <

Re: Looking for help with a talk

2017-05-28 Thread Khurrum Nasim
Where is the conference. Sent from my iPhone > On May 28, 2017, at 2:33 PM, Andrew Palumbo wrote: > > I won't be attending but would be happy to help any way I can, given the > timeline, and my schedule .. (I have some time restraints over the next 6 - > 8 weeks, so

Re: New Website is Staged

2017-05-08 Thread Khurrum Nasim
Too bland looking Thanks, Khurrum. On May 8, 2017, 1:53 PM -0400, Trevor Grant , wrote: > Hey all, > > The new website is staged. You can view it here > > http://mahout.staging.apache.org/ > > Won't be publishing for a bit yet- there are still a few JIRAs left to do >