Re: Would like to add PredictionIO to Spark Third-Party wiki page...

2016-09-26 Thread Donald Szeto
Hi Sarasija,

Thanks for helping out on adding links in the Spark wiki. It is managed by
the Apache Spark PMC and we do not have access. I have the best way to go
is to subscribe to d...@spark.apache.org and suggest an edit to the Spark
PMC.

Regards,
Donald

On Mon, Sep 26, 2016 at 1:31 PM, Sarasija Parthasarthy <
sparthasar...@salesforce.com> wrote:

> Hi:
>
> I would like to add PredictionIO as part of Infrastructure projects to this
> wiki:
> https://cwiki.apache.org/confluence/display/SPARK/Third+Party+Projects
>
> I registered, but do not seem to have edit rights to the wiki.  Can you
> give me access?
>
> Thanks,
> Sarasija
>


[GitHub] incubator-predictionio issue #302: Bind fail when running pio

2016-09-26 Thread dszeto
Github user dszeto commented on the issue:

https://github.com/apache/incubator-predictionio/issues/302
  
Hi, please redirect usage questions to 
u...@predictionio.incubator.apache.org for more exposure to the rest of the 
community. Subscription instructions can be found at 
http://predictionio.incubator.apache.org/support/.

Do you already have something using port 7070? You may also take a look at 
`pio.log` in the working directory (i.e. where you ran `pio eventserver`) for 
more details.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] incubator-predictionio issue #297: Update document

2016-09-26 Thread dszeto
Github user dszeto commented on the issue:

https://github.com/apache/incubator-predictionio/pull/297
  
@djKooks No problem. Please keep updating. Thanks for your contribution!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (PIO-35) Add integration tests for major templates

2016-09-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/PIO-35?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15524988#comment-15524988
 ] 

ASF GitHub Bot commented on PIO-35:
---

Github user dszeto commented on a diff in the pull request:


https://github.com/apache/incubator-predictionio/pull/300#discussion_r80616820
  
--- Diff: tests/pio_tests/scenarios/basic_app_usecases.py ---
@@ -61,17 +61,13 @@ class BasicAppUsecases(BaseTestCase):
   def setUp(self):
 random.seed(3)
 self.log.info("Setting up the engine")
-
-template_path = pjoin(
-self.test_context.engine_directory, "recommendation-engine")
+template_path = 
"https://github.com/chanlee514/template-scala-parallel-recommendation;
--- End diff --

That's a good idea. In fact, the templates have been donated and are in the 
process of migrating to Apache. I will ping you when that's done, so you can 
start using those new Git URLs as well.


> Add integration tests for major templates
> -
>
> Key: PIO-35
> URL: https://issues.apache.org/jira/browse/PIO-35
> Project: PredictionIO
>  Issue Type: Improvement
>Reporter: Chan
>
> Developers of engine templates should be able to test that their template 
> works with the latest changes in PredictionIO. As a starting point, we can 
> expand the integration test suite to all previously "official" templates.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[RESULT] [VOTE]: Apache PredictionIO (incubating) 0.10.0 Release (RC4)

2016-09-26 Thread Donald Szeto
The vote passes, with 4 +1 votes (4 binding) and no -1 votes.

+1 Alex Merritt (binding)
+1 Donald Szeto (binding)
+1 Pat Ferrel (binding)
+1 Suneel Marthi (binding)

Thanks all for voting.

Apache PredictionIO (incubating) 0.10.0 RC4 has been accepted as the
release. The voting will now proceed to the Incubator PMC.

Regards,
Donald


[jira] [Commented] (PIO-38) add Apache Parquet as a data source

2016-09-26 Thread Wojciech Indyk (JIRA)

[ 
https://issues.apache.org/jira/browse/PIO-38?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15523861#comment-15523861
 ] 

Wojciech Indyk commented on PIO-38:
---

Hello [~Ziemin]! Sorry for late response.
I would like to have a chance to provide events to PredictionIO using my 
current place of storing events. As I can see PredictionIO can work with a pair 
of Elasticsearch+HBase. Therefore to use Elasticsearch as a backend I need to 
use HBase as an event-store. I don't know PredictionIO so good, so correct me 
if I'm wrong.
I don't want to use HBase, because it enlarges my technology stack and has no 
benefit in case of training model in batch. Parquet is more suitable to this 
case, when I append my archive of events once a day, then can use this data 
(subset) to train a recommendation model without duplication data in HBase.
Is it clear enough?

> add Apache Parquet as a data source
> ---
>
> Key: PIO-38
> URL: https://issues.apache.org/jira/browse/PIO-38
> Project: PredictionIO
>  Issue Type: New Feature
>Reporter: Wojciech Indyk
>  Labels: features
>
> Apache Parquet (https://parquet.apache.org/) is a columnar data store, native 
> for Apache Spark and very well suited to storing batch data (as an input) for 
> PredictionIO Engine.
> Parquet is very popular to archive clickstream, so it would enable to use 
> PredictionIO without additional import of data (and duplication) to HBase.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)