Re: Can The Classification Engine Handle Multiple Sets of Independent Data

2016-09-11 Thread Kenneth Chan
you can filter the data you need by using PEventStore API or modifying the DataSource.scala. Then train different engine instance with different configuration. On Wed, Sep 7, 2016 at 4:57 PM, Donald Szeto wrote: > Hi, > > If you are using the stock classification engine template, your easiest

Re: Batch import, Java

2016-09-11 Thread Kenneth Chan
I think the Java SDK already supports it creating events to a file, but it's not documented. https://github.com/PredictionIO/PredictionIO-Java-SDK/commit/6691144ebf1382aa1d060770a4fb7c0268f849d3 On Fri, Sep 9, 2016 at 7:59 AM, Pat Ferrel wrote: > The page is now live > > > On Sep 8, 2016, at 1

Re: Remove engine registration

2016-09-16 Thread Kenneth Chan
Pat, would you explain more about the 'instanceId' as in `pio register --variant path/to/some-engine.json --instanceId some-REST-compatible-resource-id` ? Currently PIO also has a concept of engineInstanceId, which is output of train. I think you are referring to different thing, right? Kenneth

Re: delay of engines

2016-09-26 Thread Kenneth Chan
re: kappa vs lambda. as far as i understand, at high-level, kappa is more like a subset of lambda (ie. only keep the real-time part) https://www.ericsson.com/research-blog/data-knowledge/data-processing-architectures-lambda-and-kappa/ Gerog, would you be more specific when you talk about "latency

Re: "Unifying Multichannel Data with Webhooks" - user identities

2016-09-26 Thread Kenneth Chan
more details of how to use channel is here: http://predictionio.incubator.apache.org/datacollection/channel/ there are some implementation here (segment, mailchimp) https://github.com/apache/incubator-predictionio/tree/develop/data/src/main/scala/org/apache/predictionio/data/webhooks basically it

Re: How to access Spark Context in predict?

2016-09-26 Thread Kenneth Chan
Hasan, Spark randomforest algo doesn't need RDD. much simpler to simply serialize it and use in local memory in predict(). see example here. https://github.com/PredictionIO/template-scala-parallel-leadscoring/blob/develop/src/main/scala/RFAlgorithm.scala For accessing evernt store in predict(), y

Re: Very few predictions

2016-09-26 Thread Kenneth Chan
1: Why model data is so small and why I'm getting predictions only for a couple of items? the model size depends on number of items. It only save the "item-vector" of each item. 2: Is this event data quality problem? If yes, How can I test and improve the data quality? could be because your data

Re: Complementary Purchase Engine Template

2016-09-26 Thread Kenneth Chan
there is no MLLib used in this template. It's simple implementation of the basic idea in Spark code. as explained here http://predictionio.incubator.apache.org/templates/complementarypurchase/dase/#algorithm But i know there is also "Associate Rule Algo" implemented in latest Spark version. http

Re: delay of engines

2016-09-27 Thread Kenneth Chan
rk SQL) prior the > model can be evaluated. Here, a higher latency would be OK. > Regarding the low volume data: some features will require some sort of SQL > for extraction. > > > > Kenneth Chan schrieb am Di., 27. Sep. 2016 um > 07:43 Uhr: > >> re: kappa vs lambda

Re: Docker setup

2016-09-29 Thread Kenneth Chan
there are also some community contributed http://predictionio.incubator.apache.org/community/projects/#docker-installation-for-predictionio https://github.com/tobilg/docker-predictionio Kenneth On Thu, Sep 29, 2016 at 2:31 AM, Tom McCaul wrote: > Thanks, Marcin. I’ll check this out and loo

Re: predictionIO classpath

2016-10-05 Thread Kenneth Chan
did you also follow what Chan suggested? " you can install the latest version by following the instructions here http://predictionio.incubator.apache.org/install/install-sourcecode/ and using the develop branch when running ./make-distribution.sh. " Kenneth On Wednesday, October 5, 2016, Cristin

Re: predictionIO classpath

2016-10-05 Thread Kenneth Chan
i meant did you also install PIO from source code (dev branch) and run make-distribution.sh as described by Chan? On Wednesday, October 5, 2016, Cristina Giannone < cristina.giann...@gmail.com> wrote: > I modified the the classpaths in build.sbt and in the scala file as > suggested by Chan. > > B

Re: Predictionio on Windows

2016-10-06 Thread Kenneth Chan
predictionio on windows is not supported. but you can use vagrant to bring up VM and install predictionio for simple testing. http://predictionio.incubator.apache.org/install/install-vagrant/ On Thursday, October 6, 2016, Saurav Sarkar wrote: > Hi All, > > We are evaluating predictionio for o

Re: ecommerce template prediction based on price

2016-10-17 Thread Kenneth Chan
You could add the price as "properties" of item. Then modify the Query class to add 'price' as one of field Modify isCandidate() function to take the price into account https://github.com/apache/incubator-predictionio-template-ecom-recommender/blob/23daa4b66015af0a45f0f3cb0115c547ca00958a/src/main

Re: Recommendation template, Implicit Feedback, only "buy" events

2017-02-15 Thread Kenneth Chan
1. yes. try follow this to modify template to use trainImplicit http://predictionio.incubator.apache.org/templates/recommendation/training-with-implicit-preference/ 2. just import event to PIO as { event: buy entitiy: user entityId: uid targetEntity: article targetEntityId: article_id eventTime:

Re: template-similar-product, import_eventserver error, status: 401 Invalid accessKey

2017-02-16 Thread Kenneth Chan
would you check if your access key is the same as the one you see in $pai app list ? On Thu, Feb 16, 2017 at 9:47 AM Pat Ferrel wrote: > Is the EventServer running? > > > On Feb 16, 2017, at 9:01 AM, Сергей Купрюшин wrote: > > yes, i try that. > And try insert *<**My_Access_Key**> *in script

Re: Issue while deploying OpenNLP Sentiment Analysis Template

2017-02-19 Thread Kenneth Chan
would you try change EmptyParams() to new EmptyEvaluationInfo() ? On Sun, Feb 19, 2017 at 10:12 PM, infoquest india wrote: > Hi > > I am trying to convert OpenNLP Sentiment Analysis Template to work with > prediction-0.10.0-incubating after replacing prediction.io with > org.apache.predictioni

Re: Issue while deploying OpenNLP Sentiment Analysis Template

2017-02-20 Thread Kenneth Chan
RROR] [Console$] [error] (compile:compile) Compilation failed > > [ERROR] [Console$] [error] Total time: 11 s, completed Feb 20, 2017 > 2:34:17 PM > > [ERROR] [Console$] Return code of previous step is 1. Aborting. > > > > > Thanks > Gaurav > > > On Mon,

Re: custom Access_KEY

2017-02-20 Thread Kenneth Chan
try specify the access-key param when you create the app again pai app new ... --access-key XYZ.. On Mon, Feb 20, 2017 at 5:09 PM Malay Tripathi wrote: > Hi All, > > I had to change & recreate my app/engine & that changed my old accessKey > with new one. > Due to some reasons I can't change t

Re: Saving predictions on training data with unsupervised learning

2017-03-04 Thread Kenneth Chan
I guess your use case is not for real time label classify for unseen data? batch prediction is basically the same as batch eval. see if this example helps? http://predictionio.incubator.apache.org/templates/recommendation/batch-evaluator/ On Sat, Mar 4, 2017 at 11:56 AM Mars Hall wrote: >

Re: Help requested: Multiple events and algorithms on one engine

2017-03-04 Thread Kenneth Chan
whether combining multiple into one or separate them as diff engines really depends on your use case and design choice. just like having single program with some logic to handle different things so they can share some common logic and configured VS individual program for each one for clean separat

Re: Recommendation template, Implicit feedback problems

2017-03-04 Thread Kenneth Chan
may not be 100% up to dated but the basic idea should be the same. which part you have question? On Wed, Mar 1, 2017 at 1:46 AM Kasper Bjerke wrote: > Modifying the Recommendation template using: > > http://predictionio.incubator.apache.org/templates/recommendation/training-with-implicit-pref

Re: Recommendation template, Implicit Feedback, only "buy" events

2017-03-04 Thread Kenneth Chan
. I > can't seem to find out how to include the eventTime parameter in the > create_event function. Does anyone have experience importing lots of data > to the app using Ruby? Is there an alternative to the FileExporter in > python for Ruby? > > 16. feb. 2017 08:56 skrev "Ke

Re: Eventserver API in an Engine?

2017-06-30 Thread Kenneth Chan
I agree that there is confusion regarding event server VS event storage and the unclear usage definition of types of data storage (e.g. meta-data vs model) but i'm not sure if bundling Event Server with Engine Server (or Pat calls it PredictionServer) is a good solution. currently PIO has 3 "ty

Re: Eventserver API in an Engine?

2017-07-08 Thread Kenneth Chan
secure Server. And here > multi-tenancy is super lightweight. Since most users have only one > Template, they may have to install supporting compute engines or stores. > This is a one time issue for them and Templates should come with containers > and scripts to compose them. We’re alrea

Re: Eventserver API in an Engine?

2017-07-08 Thread Kenneth Chan
ut data problem, maybe other way is that the template should also provide a "event validator" which can be loaded into event server and advanced user can also customize it. On Sat, Jul 8, 2017 at 12:31 AM, Kenneth Chan wrote: > # re: " I see it as objects you see it as data sto

Re: Import Inital Data Model

2017-07-08 Thread Kenneth Chan
re: " i’m always getting an exception with the initial load from User & Product" what's the error you got? On Fri, Jul 7, 2017 at 12:46 PM, Sebastian Fix wrote: > Hello together, > > I finally got the System running and would like to import some initial > data now. > > I’m using the E-Commerce-T

Re: Kafka support as Event Store

2017-07-08 Thread Kenneth Chan
thomas, if you always your own data store backend for events or your application data, you don't have to use EventServer. Just implement a PIO engine which directly reads from your data source. On Thu, Jul 6, 2017 at 7:32 AM, Pat Ferrel wrote: > The EventStore is queryable in flexible ways and

Re: Getting the probability values in the random forest classification template of PredictionIO

2017-07-08 Thread Kenneth Chan
see here https://spark.apache.org/docs/latest/mllib-ensembles.html#prediction you can ask in Apache Spark's mailing list too. On Fri, Jun 30, 2017 at 4:09 AM, Pat Ferrel wrote: > You probably should ask on PredictionIO mailing list. > > Signup here: http://predictionio.incubator.apache.org/supp

Re: Exception: Job aborted due to stage failure: Task 0 in stage 3.0 failed 1 times,

2017-07-08 Thread Kenneth Chan
java.lang.NullPointerException >> >> Driver stacktrace: >> at org.apache.spark.scheduler.DAGScheduler.org$apache$spark$sch >> eduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1431) >> Caused by: java.lang.NullPointerException >> >> >> Best rega

Re: Eventserver API in an Engine?

2017-07-09 Thread Kenneth Chan
in the client for a long time but since the events looked > perfectly fine to the PIO EventServer, the errors were never detected and > the data was in fact ignored. Within a day of replaying exported PIO events > to the prototype server the issue was resolved and fixed in the client. >

Re: Eventserver API in an Engine?

2017-07-10 Thread Kenneth Chan
streams can have db TTLs to age out old data for Lambda Engines. The system > is always self-cleaning with no heavyweight operation required to keep just > the right data (the db cleaner is heavyweight and slow), the data does not > grow forever by design. This was never addressed as a desig

Re: Eventserver API in an Engine?

2017-07-10 Thread Kenneth Chan
it's all same set of events collected for my application and i can create multiple engine to use these data for different purpose.

Re: Accumulo support

2017-07-10 Thread Kenneth Chan
yes. HBase is currently supported, which seems similar to Accumulo. it s not difficult to add another storage backend support option - just need to implement the stroage layer interface. Contribution is welcome! On Mon, Jul 10, 2017 at 8:54 AM Miller, Clifford < clifford.mil...@phoenix-opsgroup.

Re: Eventserver API in an Engine?

2017-07-11 Thread Kenneth Chan
often not under ASF > rules. > > In any case—how much of your problem is workflow vs installation vs > bundling of APIs? Can you explain it more? > > > On Jul 11, 2017, at 9:37 AM, Mars Hall wrote: > > > On Jul 10, 2017, at 18:03, Kenneth Chan wrote: > > > >

Re: Eventserver API in an Engine?

2017-07-11 Thread Kenneth Chan
me machine? On Tue, Jul 11, 2017 at 12:49 PM, Kenneth Chan wrote: > I think we are having wrong impression that every template are supposed to > work together out of the box. > > The templates are meant to be examples and demonstration - that's why they > are called templ

Re: Eventserver API in an Engine?

2017-07-12 Thread Kenneth Chan
gured in engine.json. > > I think to resolve Mars immediate need, we can implement embedded event > server in a couple phases. Roughly it would be wiring the existing event > server in (with some refactoring) and mark it experimental, then continue > toward a clean, app-specific

Re: Eventserver API in an Engine?

2017-07-12 Thread Kenneth Chan
f docs & discussion suggesting its share-ability, > there is precious little documentation that presents how the multi-backend > Storage really works in PIO. (I didn't understand it until I read a bunch > of Storage source code.) > > > From Kenneth Chan on Tue, 1