Re: Memory leak Executor pio train UR

2017-05-24 Thread Pat Ferrel
Can you give me more of the stack trace? This is on a Spark executor? Wild guess is that you need more memory available to the JVM on the executor machine. On May 24, 2017, at 1:21 AM, Dennis Honders wrote: I receive the following error when training with UR 0.6.0

Re: Only one result Universal Recommender

2017-05-23 Thread Pat Ferrel
I assume you are using 0.6.0 so you’ll have to wait for docs in progress. Remember that you are substituting the cart id for the typical user id so do a user-based query with the cart id but passed in as “user”, or send a list of items as “item-set” in the query. curl -H "Content-Type:

The Universal Recommender v0.6.0

2017-05-23 Thread Pat Ferrel
This is a major release with several new features. Get tag 0.6.0 or pull from the master branch here: https://github.com/actionml/universal-recommender The AML doc site for the UR is being updated so be patient; most docs still apply with a

Re: GitHub primary repo

2017-05-19 Thread Pat Ferrel
72hour lazy consensus if not an all out vote first. In general I like it though On May 18, 2017 5:15 PM, "Pat Ferrel" <p...@occamsmachete.com> wrote: > Ok, is there a catch? Why would we not do this? > > If someone wants to talk about GH going down, I’d still take the r

Re: Host PIO on Github (not ASF Git)

2017-05-19 Thread Pat Ferrel
https://issues.apache.org/jira/browse/INFRA-14191 <https://issues.apache.org/jira/browse/INFRA-14191> On May 18, 2017, at 4:56 PM, Pat Ferrel <p...@occamsmachete.com> wrote: Heard it from Mahout. I agree it’s not completely clear. Unless someone can illuminate us I’ll follow u

Re: mahout spark-rowsimilarity error

2017-05-19 Thread Pat Ferrel
and average items/user for the feature are also useful when compared to the same for conversion or primary feature/indicator. On May 19, 2017, at 9:20 AM, Pat Ferrel <p...@occamsmachete.com> wrote: Some ideas are left on the SO question On May 18, 2017, at 6:14 PM, Daniel Gabrieli &

Re: Host PIO on Github (not ASF Git)

2017-05-18 Thread Pat Ferrel
On Thu, May 18, 2017 at 3:26 PM Pat Ferrel <p...@occamsmachete.com> wrote: > BTW there seems now to be a way to host on Github (not ASF Git) by linking > our ASF accounts to our git accounts. This would make a bunch of things > much easier like PRs can be reviewed and merged dire

Host PIO on Github (not ASF Git)

2017-05-18 Thread Pat Ferrel
BTW there seems now to be a way to host on Github (not ASF Git) by linking our ASF accounts to our git accounts. This would make a bunch of things much easier like PRs can be reviewed and merged directly, direct merging of doc changes by users, lots of nice stuff that is GUI related. This

Re: GitHub primary repo

2017-05-18 Thread Pat Ferrel
d turn on two-factor authentication. and talk to infra Trevor Grant Data Scientist https://github.com/rawkintrevo http://stackexchange.com/users/3002022/rawkintrevo http://trevorgrant.org *"Fortunate is he, who is able to know the causes of things." -Virgil* On Thu, May 18, 2017

Re: GitHub primary repo

2017-05-18 Thread Pat Ferrel
What, actual hosting on GH so merging PRs and reviewing with the GUI? This would open up a whole new toolchain, +1 I’ll write that Jira if that’s what you mean. BTW github is mirroring the ASF git server, not svn afaik. On May 17, 2017, at 7:39 AM, Trevor Grant

PIO 0.12.0 and Graduation Time?

2017-05-18 Thread Pat Ferrel
Should we apply for graduation to Apache TLP soon, like maybe the next release? I started a Google doc for features we want and any Mentors can add things we need to demonstrate for graduation since some may not be Jira-able. All users should feel free to give input.

Re: UR v0.6.0m RC1

2017-05-17 Thread Pat Ferrel
This should be fixed in RC2, now in the UR develop branch Thanks again Bolmo! On May 17, 2017, at 8:15 AM, Pat Ferrel <p...@occamsmachete.com> wrote: Indeed a bug. This is a blocker (no work around) in some conditions so tracking it down now. On May 15, 2017, at 12:02 PM, Pat Fer

Re: Data UR

2017-05-17 Thread Pat Ferrel
"item", "entityId" : "31", "properties" : { category: ["5", "8"], manufacturer: "55", label: "test-item", price: "$1-$5" } } And the eventdate can be applied to event cart-transa

Re: UR v0.6.0m RC1

2017-05-17 Thread Pat Ferrel
Indeed a bug. This is a blocker (no work around) in some conditions so tracking it down now. On May 15, 2017, at 12:02 PM, Pat Ferrel <p...@occamsmachete.com> wrote: I’ll check that, thanks. On May 15, 2017, at 10:10 AM, Bolmo Joosten <bolmo.joos...@gmail.com <mailto:bolmo.joos.

Re: Data UR

2017-05-17 Thread Pat Ferrel
entityId: 17 entityType: “user" targetEntityId: 31 targetEntityType: “item” }, ... ] with another event { "event" : "item", "entityType" : "item", "entityId" : "31", "properties" : {

Re: Problem scaling UR

2017-05-15 Thread Pat Ferrel
single partition…? > On May 10, 2017, at 6:09 PM, Pat Ferrel <p...@occamsmachete.com > <mailto:p...@occamsmachete.com>> wrote: > > What is the physical architecture? Do you have HBase, Elasticsearch, and > Spark running on separate machines? If the CPU load is low then it

Re: UR For New Products

2017-05-15 Thread Pat Ferrel
Yes, to all Mars said. There are several methods so search through the group archives. They amount to: 1) metadata similarity with some example product. So if you are viewing a product, similar products can be recommended based on item properties. This is seldom all that useful but you may have

Re: UR v0.6.0m RC1

2017-05-15 Thread Pat Ferrel
event. Input data: Purchase (userid, itemid A, 1 B, 3 C, 3 View: A, 1 B, 2 C, 2 If I remove the last event from view, training fails. Bolmo On Sun, May 14, 2017 at 9:01 AM Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>> wrote: If anyone want to try out RC1 of the

UR v0.6.0m RC1

2017-05-14 Thread Pat Ferrel
If anyone wants to try out RC1 of the UR with pio 0.11.0, get the `develop` branch of the UR here: https://github.com/actionml/universal-recommender Pull the repo, then `git checkout develop` before you build it with pio. It defaults to Spark 1.6, ES 1.7 and Scala 2.10; a UR v0.6.1 will have new

Re: Website Incident

2017-05-13 Thread Pat Ferrel
+1, headline: "A serendipitous mistake leads to fast action at Mahout” We now know Trevor is a webdev god :-) On May 13, 2017, at 8:21 AM, Andrew Musselman wrote: Trevor, thanks for the late night repairing things; I'm a +1 and will scrub this weekend for any

Re: PredictionIO quickstart

2017-05-12 Thread Pat Ferrel
Elasticsearch is required because it performs the last part of the algorithm rather than acting only as a store, which Postgres cannot do. However, Postgres can be used for the EventStore and MetaStore, replacing HBase, but if you do you will likely face scaling issues because recommenders are big-data

Re: Problem scaling UR

2017-05-10 Thread Pat Ferrel
Hbase was unable to process this amount of data on the cluster. This means we can't provide any personalized recommendations, but that is ok for now. 2017-05-10 10:22 GMT-07:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: You can’t bypass HBase, you can im

Re: UR PredictionIO quickstart

2017-05-10 Thread Pat Ferrel
. Half of the customers have bought at least one product according to this training data. 1500 categories. 150 manufacturers. Currently a maximum of 5 properties for the products and customers. What setup do you recommend? On 10 May 2017 at 20:44, Pat Ferrel <p...@occamsmachete.com

Re: Docs Universal Recommender

2017-05-10 Thread Pat Ferrel
nar...@gmail.com> wrote: So in your opinion, do you think that the NLP task should be done in the Engine part using a library like Mallet, or should it be implemented in an algorithm-focused library such as Mahout? 2017-05-10 23:52 GMT+04:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>

Re: UR PredictionIO quickstart

2017-05-10 Thread Pat Ferrel
uction) with the UR? 2017-05-10 19:13 GMT+02:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: Yes unless you have large-ish data. We also have an AWS AMI all set up here: http://actionml.com/docs/awssetupguide <http://actionml.com/docs/awssetupguide&

Re: UR PredictionIO quickstart

2017-05-10 Thread Pat Ferrel
Yes unless you have large-ish data. We also have an AWS AMI all set up here: http://actionml.com/docs/awssetupguide . Both should be fine for experimentation but will be too small for big-data. BTW all are being updated to the UR V0.6.0 and PIO 0.11.0

Re: Docs Universal Recommender

2017-05-10 Thread Pat Ferrel
ducts (never bought/sold), will recommendations be made with content-based filtering techniques? If so, what techniques are used in UR? 2017-05-08 19:02 GMT+02:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: yes to all for UR v0.5.0 UR v0.6.0 is sitting

Re: New Website is Staged

2017-05-09 Thread Pat Ferrel
Are you guys ready for serious comments on the new design or is this just a first running version? On May 9, 2017, at 8:20 AM, Trevor Grant wrote: In the interest of getting this thing up and running, use DFW Meetup video as a place holder for time being? Trevor

Re: LLR and other similarity metrics graph in The Universal Recommender with CCO slide

2017-05-04 Thread Pat Ferrel
That was generated using the old Mahout MapReduce recommenders, which had pluggable similarity metrics. I ran it on a very large e-commerce dataset from a real ecom site. The data covered 6 months of sales. We did cross-validation with an 80% training set and a 20% held-out probe/test set. The test
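
The 80/20 hold-out split described in this message can be sketched roughly as below. This is only an illustration, not the code used to produce the slide: the function name `holdout_split`, the fixed seed, and the sample (user, item) pairs are all assumptions.

```python
import random

def holdout_split(events, train_fraction=0.8, seed=42):
    """Shuffle events and split them into a training set and a held-out test set."""
    shuffled = list(events)
    random.Random(seed).shuffle(shuffled)  # seeded so the split is repeatable
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# Hypothetical (user, item) purchase pairs, for illustration only.
purchases = [(f"u{i % 10}", f"i{i}") for i in range(100)]
train, test = holdout_split(purchases)
print(len(train), len(test))
```

A random split is the simplest choice; a time-based split (train on older events, test on newer ones) is another common option for sales data.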

Re: How $set events works together?

2017-05-04 Thread Pat Ferrel
Correct. The purchase is associated with the user so any properties would be attached to user objects and for the UR there are no user objects in the EventStore. Users are represented by the collection of their events, not properties. On May 3, 2017, at 10:47 PM, Vaghawan Ojha
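
As a rough sketch of the idea that users are represented by the collection of their events rather than by properties, the snippet below groups events by user. The event field names follow the general shape of PredictionIO event JSON; the sample ids and the helper `user_histories` are hypothetical and are not part of the UR.

```python
from collections import defaultdict

# Hypothetical events, for illustration only.
events = [
    {"event": "purchase", "entityType": "user", "entityId": "u1",
     "targetEntityType": "item", "targetEntityId": "i31"},
    {"event": "view", "entityType": "user", "entityId": "u1",
     "targetEntityType": "item", "targetEntityId": "i7"},
    {"event": "$set", "entityType": "item", "entityId": "i31",
     "properties": {"category": ["5", "8"]}},
]

def user_histories(events):
    """Group user events by user id and event name; skip property ($set) events."""
    histories = defaultdict(lambda: defaultdict(list))
    for e in events:
        if e["entityType"] == "user" and not e["event"].startswith("$"):
            histories[e["entityId"]][e["event"]].append(e["targetEntityId"])
    return histories

histories = user_histories(events)
print(dict(histories["u1"]))
```

In this sketch the `$set` event only describes the item, so it never shows up in a user's history.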

Attack email

2017-05-03 Thread Pat Ferrel
If anyone gets a Google Docs share from me don’t click it. The URL is https:// accounts.google.com……. but it is an attack to get your contacts. Delete it.

Re: New logo

2017-05-03 Thread Pat Ferrel
nterlocking solid yellow/blue background 3rd is simple letter M as wireframe but prefer the diagram be in yellow. I don't care for the loopy curved logos (sorry Andrew!) Good luck!! Ellen Friedman On Thu, Apr 27, 2017 at 12:56 PM, Pat Ferrel <p...@occamsmachete.com <mailto:p...@occ

Re: Items blacklisted in the query made to Elasticsearch by UR

2017-05-03 Thread Pat Ferrel
eir “facet” history? Otherwise I’m not sure where they’d come from. Yes I confirm that this user has those items in his facet history. 2017-04-05 18:18 GMT+02:00 Pat Ferrel <p...@occamsmachete.com >: Ok thanks for ruling out a couple things, I’ll take a look at this. BTW I assume "u

Re: Use user property for recommendation

2017-05-03 Thread Pat Ferrel
This is done automatically since we know the user’s history of purchases. In fact they may have many tendencies, not just one and the UR also compares these to other users to find what similar users bought. This is the essence of collaborative filtering, finding from the data what tendencies

Re: Scaling up spark Iitem similarity on big data data sets

2017-05-01 Thread Pat Ferrel
I just ran into the opposite case Sebastian mentions, where a very large % of users have only one interaction. They come from social media or search, see only one thing, and leave. Processing this data turned into a huge job but led to virtually no change in the model since users with very few

Re: Developing Evaluator Similar product template

2017-05-01 Thread Pat Ferrel
Cross validation for item-based recs is problematic and of dubious value. I’d A/B test changes by starting from the defaults and going from there. On May 1, 2017, at 8:34 AM, Dennis Honders wrote: Hi, I'm currently working on an Evaluator for the Similar product template.

Re: New logo

2017-04-27 Thread Pat Ferrel
hu, Apr 27, 2017 at 5:54 PM, Pat Ferrel <p...@occamsmachete.com> wrote: > Fair enough, I think Trevor feels the same. > > The blue man can continue, all it takes is a -1 > > > On Apr 27, 2017, at 3:50 PM, Ted Dunning <ted.dunn...@gmail.com> wrote: > &

Re: New logo

2017-04-27 Thread Pat Ferrel
uggest a better path and I hate negative feedback. But there it is. On Thu, Apr 27, 2017 at 3:48 PM, Pat Ferrel <p...@occamsmachete.com> wrote: > Do you have constructive input (guidance or opinion is welcome input) or > would you like to discontinue the contest? If the latter, -1 now. >

Re: New logo

2017-04-27 Thread Pat Ferrel
Apr 27, 2017 at 3:36 PM, Pat Ferrel <p...@occamsmachete.com> wrote: > Yes, -1 means you hate them all or think the designers are not worth > paying. We have to pay to continue, I’ll foot the bill (donations > appreciated) but don’t want to unless people think it will lead t

Re: New logo

2017-04-27 Thread Pat Ferrel
ments/84/84017/attachment_84017937 >> >> I like the stylized and simple "M" and it reminds me of diagrams showing >> vector multiplication. >> >> On Thu, Apr 27, 2017 at 12:56 PM, Pat Ferrel <p...@occamsmachete.com> >> wrote: >> >>> We

Re: New logo

2017-04-27 Thread Pat Ferrel
you have 24 hours to vote Here’s my +1 to continue refining. On Apr 27, 2017, at 11:41 AM, Pat Ferrel <p...@occamsmachete.com> wrote: Here is a second group, hopefully picked to be unique. https://99designs.com/contests/poll/vl7xed We got a lot of responses, these 2 polls contain th

Re: New logo

2017-04-27 Thread Pat Ferrel
Here is a second group, hopefully picked to be unique. https://99designs.com/contests/poll/vl7xed We got a lot of responses, these 2 polls contain the best afaict. On Apr 27, 2017, at 11:25 AM, Pat Ferrel <p...@occamsmachete.com> wrote: Vote: https://99designs.com/contests/poll/rqcg

New logo

2017-04-27 Thread Pat Ferrel
Vote: https://99designs.com/contests/poll/rqcgif We asked for something “mathy” and asked for no elephant and rider. We have the rest of the week to tweak so leave comments about what you like or would like to change. We don’t have to pick one of these, so if you hate them all, make that known

Re: Knowing the probability of purchase

2017-04-26 Thread Pat Ferrel
t you would like to buy" Thank you for your prompt reply. Thanks On Wed, Apr 26, 2017 at 12:28 AM, Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>> wrote: I read this dissertation and came away wondering why it was important. The job of a recommender is n

New site and logo

2017-04-24 Thread Pat Ferrel
The Mahout site is moving to Jekyll with a bit of a new look and so it might be nice to get an update of the logo. I think the consensus was to keep the Mahout name but I didn’t get a feel for the logo. One concern mentioned is that Mahout is no longer attached to Hadoop (the elephant) so

Re: [VOTE] Release Apache PredictionIO 0.11.0 (incubating) RC2

2017-04-23 Thread Pat Ferrel
mply with the licensing. >> >> Here's my +1, not waiting for anything to be fixed. >> >> John >> >> On Sat, Apr 22, 2017 at 7:43 PM Andrew Purtell <andrew.purt...@gmail.com> >> wrote: >> >>> I will too, and then you will have t

Re: [VOTE] Release Apache PredictionIO 0.11.0 (incubating) RC2

2017-04-22 Thread Pat Ferrel
But is it worth doing yet another podling RC and release vote? If it is, please vote -1, at least we won’t be left waiting and we thank you for being the one who took a look either way. We are just trying to move, out if possible or iterate if not. These issues have not changed from the

Re: [VOTE] Release Apache PredictionIO 0.11.0 (incubating) RC2

2017-04-22 Thread Pat Ferrel
There have been no binding votes, thanks. On Apr 22, 2017, at 11:31 AM, Joe Schaefer <joes...@gmail.com> wrote: How many binding votes do you need at this point? On Wed, Apr 19, 2017 at 12:34 PM Pat Ferrel <p...@occamsmachete.com> wrote: > +1 non-binding > > Next rel

Re: Incubator Governance Change

2017-04-22 Thread Pat Ferrel
er all of the votes that come up. >> >> Julian >> >> >>> On Apr 22, 2017, at 8:46 AM, Pat Ferrel <p...@occamsmachete.com> wrote: >>> >>> Probably the wrong place for this but… >>> >>> What do people think about a

Re: Proposal for changing Mahout's Git branching rules

2017-04-22 Thread Pat Ferrel
p, any preference? On Sat, Apr 22, 2017 at 10:30 AM, Pat Ferrel <p...@occamsmachete.com> wrote: > It hasn't been often but I’ve been bitten by it and had to ask users of a > dependent project to check out a specific commit, nasty. > > The main effect would be to automation efforts that are cu

Re: Proposal for changing Mahout's Git branching rules

2017-04-22 Thread Pat Ferrel
er/dev branch approach is solid. On Sat, Apr 22, 2017 at 10:06 AM, Pat Ferrel <p...@occamsmachete.com> wrote: > I’ve been introduced to what is now being called git-flow, which at its > simplest is just a branching strategy with several key benefits. The most > important part of

Re: Incubator Governance Change

2017-04-22 Thread Pat Ferrel
commit to voting on two release candidates per year. There are so many members of the IPMC that this would easily cover all of the votes that come up. Julian > On Apr 22, 2017, at 8:46 AM, Pat Ferrel <p...@occamsmachete.com> wrote: > > Probably the wrong place for this but… >

Proposal for changing Mahout's Git branching rules

2017-04-22 Thread Pat Ferrel
I’ve been introduced to what is now being called git-flow, which at its simplest is just a branching strategy with several key benefits. The most important part of it is that the master branch is rock solid all the time because we use the “develop” branch for integrating Jiras, PRs, features,

Incubator Governance Change

2017-04-22 Thread Pat Ferrel
all cases) so a veto role by the IPMC seems to have minimum danger to the ASF system of checks and balances. On Apr 19, 2017, at 9:33 AM, Pat Ferrel <p...@occamsmachete.com> wrote: +1 non-binding Next release we could exclude the doc site. Do build files like .sbt require licenses? I s

Re: Use whitelist or blacklist in query

2017-04-21 Thread Pat Ferrel
not have A2 in property 2)" as query? It seems that this cannot be done in the version you mentioned, since the query is used as "and" but not "or"? Any advice? Thank you so much! Best regards, Amy Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>&

Re: Dose v.011 support Spark ML, DataFrame and Pipeline

2017-04-20 Thread Pat Ferrel
PIO, which are using RDD instead of DataFrame. Can I also implement a template algorithm with SparkML and DataFrame? Is there any guide online? @Pat Ferrel <mailto:p...@occamsmachete.com> Is the template that you maintaining on the github? If yes, could you provide the link? Many Thanks, F

PredictionServer Timeouts

2017-04-19 Thread Pat Ferrel
I have a PredictionServer timeout when issuing a query for a non-existent user. We can only use a week’s worth of data, so we run the db clean every day to drop events that are too old, without touching $whatever events. Users are never created with $set. So we are asking for a list of event

Re: [VOTE] Release Apache PredictionIO 0.11.0 (incubating) RC2

2017-04-19 Thread Pat Ferrel
+1 non-binding Next release we could exclude the doc site. Do build files like .sbt require licenses? I suppose it can be done in comments. But again can we push to next release? Can other binding voters have a look? I know everyone is busy but hey, tax day is past ;-) On Apr 18, 2017, at

Re: Cart/Complementary purchase recommendation with UR

2017-04-19 Thread Pat Ferrel
"fields": [ > { > "name": "store_id", > "values": ["2"] > } > ] > }' http://10.10.10.2:8123/queries.json <http://10.10.10.2:8123/queries.json> It works without bias. But I don't think it's a valid

Re: Dose v.011 support Spark ML, DataFrame and Pipeline

2017-04-19 Thread Pat Ferrel
There is no restriction in templates for what they use of Spark. The ones you are looking at simply don’t need those interfaces. If you need them and are writing templates you can use them. In fact I maintain a template that does not use Spark for the Algorithm, only for IO. If you think some

Re: Similar product template

2017-04-18 Thread Pat Ferrel
preference for every user event, and category preference as well. Let's say I trained the model and queried with the preference parameter, in that case is it using multi model for each preference? If you could describe a bit about this, it would be reall On Thu, Apr 13, 2017 at 9:15 PM, Pat Ferr

Re: Lambda and Kappa CCO

2017-04-17 Thread Pat Ferrel
> Pat- > > What can we do from the Mahout side? Would we need any new data > structures? Trevor and I were just discussing some of the troubles of > near real time matrix streaming. > ---------- > *From:* Pat Ferrel <p...@occamsmachete.com> > *

Re: Use whitelist or blacklist in query

2017-04-17 Thread Pat Ferrel
New exclusion rules based on item properties are in the upcoming v0.6.0 release, which will come out shortly after PIO v0.11.0 and Mahout v0.13.0; both are being released now. “Bias” in The Universal Recommender v0.6.0: 0: exclude items with properties specified (must_not in ES

Re: [VOTE] Apache PredictionIO (incubating) 0.11.0 Release (RC2)

2017-04-13 Thread Pat Ferrel
+1 binding was able to build and run integration tests on the UR, which has special needs for ES1 On Apr 11, 2017, at 2:41 PM, Chan Lee wrote: [+1] binding accept On Mon, Apr 10, 2017 at 1:22 PM, Donald Szeto wrote: > Thanks! Just a reminder that

Re: Similar product template

2017-04-13 Thread Pat Ferrel
I’m surprised that ALS seemed clear because it is based on a complicated matrix factorization algorithm that transforms the user vectors into a lower-dimensional space that is composed of “important” features. These are not interactions with items like “buys”, they can only be described as

Re: Terminology in event creation API

2017-04-12 Thread Pat Ferrel
xample this : https://github.com/detrevid/predictionio-template-classification-dl4j <https://github.com/detrevid/predictionio-template-classification-dl4j> it's on your templates but it not working anymore so ? ᐧ 2017-04-12 17:01 GMT+01:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...

Re: Terminology in event creation API

2017-04-12 Thread Pat Ferrel
Actually each Template has a different “event” interpretation. @Vaghawan is giving the interpretation of a recommender. Other Templates will use the event fields in different ways so we should back up to what a Template is and does and what the rest of PIO does. - Template: code that implements

Re: classification template

2017-04-12 Thread Pat Ferrel
We support the Universal Recommender here. See the PredictionIO mailing list for further support (CCed) On Apr 12, 2017, at 7:16 AM, YOUSSEF ASSATA wrote: hello everyone please can you suggest me a classification template or a tuto based on items and their

Re: Errors please help

2017-04-12 Thread Pat Ferrel
For PredictionIO-related questions, ask the PredictionIO user group (CCed). This group is for the Universal Recommender and accompanying startup instructions. We do not support Postgres for the UR. On Apr 12, 2017, at 4:06 AM, gautamrbharad...@gmail.com wrote: Hi, I am trying to install

Re: How Can I count events In The Event Server?

2017-04-11 Thread Pat Ferrel
As you may know from setting up other databases, the storage and indexes of a DB define its performance, so the EventServer is tuned for use with Templates. There are a few queries (thanks Marius) but they are not extensive. I suggest you keep them in your own DB alongside the EventServer if

Re: Existing javascript SDK for predicitonIO (0.10) ?

2017-04-10 Thread Pat Ferrel
On Mon, Apr 10, 2017 at 8:13 AM Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>> wrote: using Javascript from the client is a problem because you will make it possible for some malicious agent to see your access key by examining your client code. Although PIO s

Re: [PIO 0.10.0][Universtal Recommender 0.5] Exception when using remote Elasticsearch while training

2017-04-10 Thread Pat Ferrel
not an immediate issue but just wanted to make that clear. On Apr 10, 2017, at 8:27 AM, Marius Rabenarivo <mariusrabenar...@gmail.com> wrote: Yes, I set the clustername in Elasticsearch config too. Thank you for your advices about fields. 2017-04-10 19:07 GMT+04:00 Pat Ferrel <p...@occamsma

Re: microsoft azure

2017-04-10 Thread Pat Ferrel
Get an Ubuntu instance and install PredictionIO On Apr 10, 2017, at 8:18 AM, YOUSSEF ASSATA <assatayous...@gmail.com> wrote: ah :/ and so how ? please ? ᐧ 2017-04-10 16:17 GMT+01:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: I believe I said “

Re: [PIO 0.10.0][Universtal Recommender 0.5] Exception when using remote Elasticsearch while training

2017-04-10 Thread Pat Ferrel
m monday. 2017-04-07 22:23 GMT+04:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: We use a clustername different from hostname all the time. In fact if you have more than one node you will have to make them different since there are multiple hostnames and one

Re: [PIO 0.10.0][Universtal Recommender 0.5] Exception when using remote Elasticsearch while training

2017-04-07 Thread Pat Ferrel
effect as the code I mentioned below but I still getting the same exception. 2017-04-07 5:03 GMT+04:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: BTW the docs for sparkConf are here: http://actionml.com/docs/ur_config#ur-spark-conf <http://actionml.com/

Re: [PIO 0.10.0][Universtal Recommender 0.5] Exception when using remote Elasticsearch while training

2017-04-06 Thread Pat Ferrel
BTW the docs for sparkConf are here: http://actionml.com/docs/ur_config#ur-spark-conf <http://actionml.com/docs/ur_config#ur-spark-conf> On Apr 6, 2017, at 4:52 PM, Pat Ferrel <p...@occamsmachete.com> wrote: PIO in the past used the “transport client” of Elasticsearch, which is o

Re: Adding a new application is slow

2017-04-06 Thread Pat Ferrel
Hmm, interesting. I have heard of this being a problem in PredictionIO but haven’t heard this explanation before IIRC. On Apr 6, 2017, at 9:47 AM, juha.syrj...@gmail.com wrote: Ah, I just found the problem: a fresh AWS instance takes a long time to generate enough entropy so that a random

Re: Loading data from files - Samsara

2017-04-04 Thread Pat Ferrel
Mahout-Samsara has a couple CLI drivers but these are mostly for examples. They read from csv files but may not do what you want. Mahout can also run in a Spark Shell or as a library to your app, which gives you all the data loading functions of Spark or Scala. For instance I use

Re: Items blacklisted in the query made to Elasticsearch by UR

2017-04-04 Thread Pat Ferrel
": { "appName": "piourcluster", "indexName": "urindex", "typeName": "items", "eventNames": ["facet", "view"], "blacklistEvents": [], "maxEventsPerEve

Re: Error while pio status

2017-04-03 Thread Pat Ferrel
<http://actionml.com/> here http://actionml.com/docs/single_machine <http://actionml.com/docs/single_machine> Its all i have copied from. And it also suggest aml version of prediction which is also incorrect. On Mon, 3 Apr 2017 at 8:41 PM, Pat Ferrel <p...@occamsmachete.com <mailt

Re: Error while pio status

2017-04-03 Thread Pat Ferrel
magination To Reality Skype:- infoquestsolutions Gtalk:- infoquestindia On Mon, Apr 3, 2017 at 6:52 PM, Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>> wrote: If you are still using the UR you don’t need HDFS as a storage backend. In setup instructions, “some-m

Re: Reg:-Integrating Mahout with Solr

2017-04-02 Thread Pat Ferrel
017 at 23:46, Pat Ferrel <p...@occamsmachete.com> wrote: > You want to create “Behavioral Search”? This is where you boost items that > have the search terms in them more likely to be favored by the individual > user? > > You want to use the CCO algorithm in Mahout. You need to

Re: Reg:-Integrating Mahout with Solr

2017-04-01 Thread Pat Ferrel
You want to create “Behavioral Search”? This is where you boost items that have the search terms in them more likely to be favored by the individual user? You want to use the CCO algorithm in Mahout. You need to collect behavioral information like conversions, detailed page views, etc. Run each

Re: Can I train and deploy on different machine

2017-03-30 Thread Pat Ferrel
>>: For the host where we run the training, do we have to put the path to ES_CONF_DIR and HADOOP_CONF_DIR in pio-env.sh even if we use remote ES and Hadoop clusters? 2017-03-30 21:58 GMT+04:00 Pat Ferrel <p...@occamsmachete.com <mailto:p...@occamsmachete.com>>: To run locally in th
