[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2014-12-10 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
https://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=78rev2=79

Comment:
Update HDP, cull intel entry :-(, add disclaimer at end of MapR's assertions of 
superiority. Someone needs to add GCE, RAX, MSFT cloud offerings  RHAT entries.

  ## and generally get overexcited. This is not an advertisement site.
  ## 3. This is about products built from Hadoop or designed to run on it -not 
replacements.
  ## 4. Please don't be rude about the ASF products or any of your competitors.
+ ##If you make claims that your product is better than the ASF releases 
then we may cut them or at least demand evidence of validity against the 
current ASF releases.
  ## 5. You write it -you get to keep the links up to date. If they break we'll 
just cut the links entirely.
  
  = Products that include Apache Hadoop or derivative works and Commercial 
Support =
@@ -19, +20 @@

  
  The sole products that can be called a release of Apache Hadoop come from 
[[http://www.apache.org/dyn/closer.cgi/hadoop/|apache.org]]. Some companies 
release or sell products that include the official Apache Hadoop release files, 
and/or their own and other useful tools. Other companies or organizations 
release products that include artifacts build from modified or extended 
versions of the Apache Hadoop source tree. Such derivative works are not 
supported by the Apache Team: all support issues must be directed to the 
suppliers themselves.
  
- The Apache Software Foundation strongly encourages users of Hadoop -in any 
form- to get involved in the Apache-hosted mailing lists. Even though you may 
only get support through the supplier of any derivative work of Apache Hadoop, 
by participating in the Hadoop user and developer lists, you can become an 
active part of the Hadoop community. Your needs may be addressed in future 
versions of the code, and you will be able to get in touch with many other 
users of the technology.
+ The Apache Software Foundation strongly encourages users of Hadoop —in any 
form— to get involved in the Apache-hosted mailing lists. Even though you may 
only get support through the supplier of any derivative work of Apache Hadoop, 
by participating in the Hadoop user and developer lists, you can become an 
active part of the Hadoop community. Your needs may be addressed in future 
versions of the code, and you will be able to get in touch with many other 
users of the technology.
  
  The Hadoop developers would like you to be aware that filing JIRA issues is 
not a way to get support to get your Hadoop installation up and running. Those 
bug reports are not bugs, and will [[InvalidJiraIssues|be closed as invalid]]. 
Either use the [[http://hadoop.apache.org/general_lists.html#User|hadoop-user]] 
mailing list, the organisations providing support listed below, or delve into 
the Hadoop source itself.
  
@@ -71, +72 @@

  
   * [[http://www.hortonworks.com|Hortonworks]]
* Major contributors to Apache Hadoop and dedicated to working with the 
community to make Apache Hadoop more robust and easier to install, manage, use, 
integrate and extend.
-   * Provides 
[[http://www.hortonworks.com/technology/hortonworksdataplatform/|Hortonworks 
Data Platform Powered by Apache Hadoop]], which is a 100% open source big-data 
platform based upon Apache Hadoop. HDP-2 is built on Apache Hadoop 2.2.
+   * Provides 
[[http://www.hortonworks.com/technology/hortonworksdataplatform/|Hortonworks 
Data Platform Powered by Apache Hadoop]], which is a 100% open source big-data 
platform based upon Apache Hadoop. HDP-2.2 is built on Apache Hadoop 2.6.
* Provider of [[http://www.hortonworks.com/support/|expert technical 
support]], [[http://www.hortonworks.com/training/|training]] and 
[[http://www.hortonworks.com/partners/|partner-enablement services]] for both 
end-user organizations and technology vendors.
  
   * [[http://www.hstreaming.com|HStreaming]]
@@ -95, +96 @@

* The HCM (Hadoop Cluster Management) tool is a solution that automates the 
cluster setup and management activities, thus reducing the overall time, cost 
and effort required for setting up and managing Hadoop clusters. 
[[http://bigdata.impetus.com/hadoop_mgmt_cluster_data#|More info about HCM 
@Impetus]]
* With a strong focus, established thought leadership and open source 
contributions in the area of Big Data analytics and consulting services, 
Impetus uses its Global Delivery Model to help technology businesses and 
enterprises evaluate and implement solutions tailored to their specific 
context, without being biased towards a particular solution. 
[[http://bigdata.impetus.com/#|More info about BigData @Impetus]]
  
-  * [[http://hadoop.intel.com/|Intel]]
-   * the [[http://hadoop.intel.com/|Intel® Distribution for Apache Hadoop]] 

[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2014-06-16 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
https://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=72rev2=73

Comment:
add comments at the top providing guidelines to contributors; add the Apache 
prefix in more places.

  ## page was renamed from Distribution
+ ##
+ ## EDITORS:
+ ## 1. Use Apache Hadoop when appropriate -and do the same for Apache HBase 
and other ASF projects.
+ ## 2. Do not go overboard in explaining how wonderful your product is, stick 
in more URLs than a myspace page#
+ ## and generally get overexcited. This is not an advertisement site.
+ ## 3. This is about products built from Hadoop or designed to run on it -not 
replacements.
+ ## 4. Please don't be rude about the ASF products or any of your competitors.
+ ## 5. You write it -you get to keep the links up to date. If they break we'll 
just cut the links entirely.
+ 
  = Products that include Apache Hadoop or derivative works and Commercial 
Support =
  The following companies provide products that include Apache Hadoop, a 
derivative work thereof, commercial support, and/or tools and utilities related 
to Hadoop.
  
@@ -19, +28 @@

   * [[http://aws.amazon.com/|Amazon Web Services]]
* Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]].
  
-  * [[https://incubator.apache.org/bigtop|Bigtop]]
+  * [[https://incubator.apache.org/bigtop|Apache Bigtop]]
-   * Bigtop is a project for the development of packaging and tests of the 
Apache Hadoop ecosystem. This includes testing at various levels (packaging, 
platform, runtime, upgrade, etc...) developed by a community with a focus on 
the system as a whole, rather than individual projects.
+   * Apache Bigtop is a project for the development of packaging and tests of 
the Apache Hadoop ecosystem. This includes testing at various levels 
(packaging, platform, runtime, upgrade, etc...) developed by a community with a 
focus on the system as a whole, rather than individual projects.
-   * Bigtop doesn't provide binary artifacts of its releases, it is source 
only project.
+   * Apache Bigtop doesn't provide binary artifacts of its releases, it is 
source only project.
  
   * [[http://www.cascading.org/|Cascading]] - Cascading is a popular 
feature-rich API for defining and executing complex and fault tolerant data 
processing workflows on a Apache Hadoop cluster. Cascading 2.0 is 
Apache-licensed.
  
@@ -36, +45 @@

* DAS Log File Aggregator is a plug-in to DAS that makes it easy to import 
large numbers of log files stored on disparate servers.
  
   * [[http://www.dataminelab.com|Data Mine Lab]]
-   * Data Mine Lab is a London based consultancy developing solutions based on 
Hadoop, Mahout, HBase and Amazon Web Services. Data Mine Lab uses combination 
of cloud computing, MapReduce, columnar databases and open source Business 
Intelligence tools to develop solutions that add value to their customers' 
businesses and the data they collect.
+   * Data Mine Lab is a London-based consultancy developing solutions based on 
Apache Hadoop, Apache Mahout, Apache HBase and Amazon Web Services. Data Mine 
Lab uses combination of cloud computing, MapReduce, columnar databases and open 
source Business Intelligence tools to develop solutions that add value to their 
customers' businesses and the data they collect.
  
   * [[http://www.datasalt.com/|Datasalt]]
-   * Datasalt is a Hadoop consulting company which has released two 
open-source products on top of Hadoop ([[http://pangool.net|Pangool]], an 
easier low-level API for Hadoop, and [[http://sploutsql.com|Splout SQL]], a 
low-latency SQL serving engine on top of Hadoop). Datasalt provides commercial 
support, public / private training and custom Hadoop development.
+   * Datasalt is an Apache Hadoop consulting company which has released two 
open-source products on top of Hadoop ([[http://pangool.net|Pangool]], an 
easier low-level API for Hadoop, and [[http://sploutsql.com|Splout SQL]], a 
low-latency SQL serving engine on top of Hadoop). Datasalt provides commercial 
support, public / private training and custom Hadoop development.
  
   * [[http://www.datastax.com|DataStax]]
-   * DataStax provides a product which fully integrates 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-hadoop|hadoop]]
 with 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-cassandra|Apache
 Cassandra]] and 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-solr|Apache
 Solr]] in its 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise|DataStax
 Enterprise platform]]. DataStax Enterprise is completely free to use for 
development environments with no 

[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2014-02-15 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
https://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=66rev2=67

Comment:
review, including: remove/update version numbers, move Platform under IBM, drop 
obsolete critiques of Hadoop from MapR

   * [[http://www.cascading.org/|Cascading]] - Cascading is a popular 
feature-rich API for defining and executing complex and fault tolerant data 
processing workflows on a Apache Hadoop cluster. Cascading 2.0 is 
Apache-licensed.
  
   * [[http://www.cloudera.com/|Cloudera]]
-   * Cloudera distributes a platform of open source Apache projects called 
[[http://www.cloudera.com/downloads/|Cloudera's Distribution including Apache 
Hadoop]] or CDH. In addition, Cloudera offers its enterprise customers a family 
of 
[[http://www.cloudera.com/content/cloudera/en/products-and-services.html|product
 and services]] that complement the open-source Apache Hadoop platform. These 
include comprehensive 
[[http://www.cloudera.com/content/cloudera/en/training.html|training 
sessions]], 
[[http://www.cloudera.com/content/cloudera/en/products-and-services/professional-services.html|architectural
 services]] and 
[[http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-support.html|technical
 support]] for Hadoop  clusters in development or in production. We serve a 
wide range of 
[[http://www.cloudera.com/content/cloudera/en/our-customers.html|customers]] 
including retail, government, financial service, healthcare, life sciences, 
digital media, advertising, networking and telephony enterprises.
+   * Cloudera distributes a platform of open-source projects called 
[[http://www.cloudera.com/downloads/|Cloudera's Distribution including Apache 
Hadoop]] or CDH. In addition, Cloudera offers its enterprise customers a family 
of 
[[http://www.cloudera.com/content/cloudera/en/products-and-services.html|product
 and services]] that complement the open-source Apache Hadoop platform. These 
include comprehensive 
[[http://www.cloudera.com/content/cloudera/en/training.html|training 
sessions]], 
[[http://www.cloudera.com/content/cloudera/en/products-and-services/professional-services.html|architectural
 services]] and 
[[http://www.cloudera.com/content/cloudera/en/products-and-services/cloudera-support.html|technical
 support]] for Hadoop  clusters in development or in production. We serve a 
wide range of 
[[http://www.cloudera.com/content/cloudera/en/our-customers.html|customers]] 
including retail, government, financial service, healthcare, life sciences, 
digital media, advertising, networking and telephony enterprises.
  
   * [[http://www.cloudspace.com/|Cloudspace]]
* Cloudspace is a web technology consulting company, since 1996. Cloudspace 
uses Apache Hadoop to scale client and internal projects on Amazon's EC2 and 
bare metal architectures.
@@ -39, +39 @@

* Data Mine Lab is a London based consultancy developing solutions based on 
Hadoop, Mahout, HBase and Amazon Web Services. Data Mine Lab uses combination 
of cloud computing, MapReduce, columnar databases and open source Business 
Intelligence tools to develop solutions that add value to their customers' 
businesses and the data they collect.
  
   * [[http://www.datasalt.com/|Datasalt]]
-   * Datasalt is a Hadoop consulting company which released two open-source 
products on top of Hadoop ([[http://pangool.net|Pangool]], an easier low-level 
API for Hadoop, and [[http://sploutsql.com|Splout SQL]], a low-latency SQL 
serving engine on top of Hadoop). Datasalt provides commercial support, public 
/ private training and custom Hadoop development.
+   * Datasalt is a Hadoop consulting company which has released two 
open-source products on top of Hadoop ([[http://pangool.net|Pangool]], an 
easier low-level API for Hadoop, and [[http://sploutsql.com|Splout SQL]], a 
low-latency SQL serving engine on top of Hadoop). Datasalt provides commercial 
support, public / private training and custom Hadoop development.
  
   * [[http://www.datastax.com|DataStax]]
-   * DataStax provides a product of 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-hadoop|Hadoop]]
 which fully integrates Apache Hadoop with 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-cassandra|Apache
 Cassandra]] and 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-solr|Apache
 Solr]] in its 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise|DataStax
 Enterprise platform]]. DataStax Enterprise is completely free to use for 
development environments with no restrictions. In addition, DataStax supplies 
[[http://www.datastax.com/what-we-offer/products-services/datastax-opscenter|OpsCenter]]
 for visual management and monitoring, 

[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2013-03-25 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=62rev2=63

Comment:
fix up use of the phrase hadoop distribution, insert section on Intel's 
product

  Entries are listed alphabetically by company name.
  
   * [[http://aws.amazon.com/|Amazon Web Services]]
-   * Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
+   * Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
  
   * [[https://incubator.apache.org/bigtop|Bigtop]]
* Bigtop is a project for the development of packaging and tests of the 
Apache Hadoop ecosystem. This includes testing at various levels (packaging, 
platform, runtime, upgrade, etc...) developed by a community with a focus on 
the system as a whole, rather than individual projects.
* Bigtop doesn't provide binary artifacts of its releases, it is source 
only project.
  
-  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature-rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Apache Hadoop cluster. Cascading 2.0 is Apache-licensed. 
+  * [[http://www.cascading.org/|Cascading]] - Cascading is a popular 
feature-rich API for defining and executing complex and fault tolerant data 
processing workflows on a Apache Hadoop cluster. Cascading 2.0 is 
Apache-licensed. 
  
   * [[http://www.cloudera.com/|Cloudera]]
* Cloudera distributes a platform of open source Apache projects called 
[[http://www.cloudera.com/downloads/|Cloudera's Distribution including Apache 
Hadoop]] or CDH. In addition, Cloudera offers its enterprise customers a family 
of [[http://www.cloudera.com/products-services/|product and services]] that 
complement the open-source Apache Hadoop platform. These include comprehensive 
[[http://www.cloudera.com/hadoop-training/|training sessions]], 
[[http://www.cloudera.com/hadoop-services/|architectural services]] and 
[[http://www.cloudera.com/hadoop-support/|technical support]] for Hadoop  
clusters in development or in production. We serve a wide range of 
[[http://www.cloudera.com/customers-partners/|customers]] including retail, 
government, financial service, healthcare, life sciences, digital media, 
advertising, networking and telephony enterprises.
@@ -39, +39 @@

* Data Mine Lab is a London based consultancy developing solutions based on 
Hadoop, Mahout, HBase and Amazon Web Services. Data Mine Lab uses combination 
of cloud computing, MapReduce, columnar databases and open source Business 
Intelligence tools to develop solutions that add value to their customers' 
businesses and the data they collect.
  
   * [[http://www.datastax.com|DataStax]]
-   * DataStax provides a distribution of 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-hadoop|Hadoop]]
 that is fully integrated with 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-cassandra|Apache
 Cassandra]] and 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-solr|Apache
 Solr]] in its 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise|DataStax
 Enterprise platform]]. DataStax Enterprise is completely free to use for 
development environments with no restrictions. In addition, DataStax supplies 
[[http://www.datastax.com/what-we-offer/products-services/datastax-opscenter|OpsCenter]]
 for visual management and monitoring, along with 
[[http://www.datastax.com/what-we-offer/products-services/support|expert 
support]], 
[[http://www.datastax.com/what-we-offer/products-services/training|training]], 
and 
[[http://www.datastax.com/what-we-offer/products-services/consulting|consulting 
services]] for Hadoop, Cassandra, and Solr.
+   * DataStax provides a product of 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-hadoop|Hadoop]]
 which fully integrates Apache Hadoop with 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-cassandra|Apache
 Cassandra]] and 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise/apache-solr|Apache
 Solr]] in its 
[[http://www.datastax.com/what-we-offer/products-services/datastax-enterprise|DataStax
 Enterprise platform]]. DataStax Enterprise is completely free to use for 
development environments with no restrictions. In addition, DataStax supplies 
[[http://www.datastax.com/what-we-offer/products-services/datastax-opscenter|OpsCenter]]
 for visual management and monitoring, along with 
[[http://www.datastax.com/what-we-offer/products-services/support|expert 

[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2013-01-23 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=58rev2=59

  The sole products that can be called a release of Apache Hadoop come from 
[[http://www.apache.org/dyn/closer.cgi/hadoop/|apache.org]]. Some companies 
release or sell products that include the official Apache Hadoop release files, 
and/or their own and other useful tools. Other companies or organizations 
release products that include artifacts build from modified or extended 
versions of the Apache Hadoop source tree. Such derivative works are not 
supported by the Apache Team: all support issues must be directed to the 
suppliers themselves.
  
  The Apache Software Foundation strongly encourages users of Hadoop -in any 
form- to get involved in the Apache-hosted mailing lists. Even though you may 
only get support through the supplier of any derivative work of Apache Hadoop, 
by participating in the Hadoop user and developer lists, you can become an 
active part of the Hadoop community. Your needs may be addressed in future 
versions of the code, and you will be able to get in touch with many other 
users of the technology. 
+ 
+ The Hadoop developers would like you to be aware that filing JIRA issues is 
not a way to get support to get your Hadoop installation up and running. Those 
bug reports are not bugs, and will [[InvalidJiraIssues|be closed as invalid]]. 
Either use the [[http://hadoop.apache.org/general_lists.html#User|hadoop-user]] 
mailing list, the organisations providing support listed below, or delve into 
the Hadoop source itself.
  
  Entries are listed alphabetically by company name.
  


[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2012-10-11 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=56rev2=57

Comment:
fix hdp-1 version, remove all commitments for the future

  
   * [[http://www.hortonworks.com|Hortonworks]]
* Major contributors to Apache Hadoop and dedicated to working with the 
community to make Apache Hadoop more robust and easier to install, manage, use, 
integrate and extend.
-   * Provides 
[[http://www.hortonworks.com/technology/hortonworksdataplatform/|Hortonworks 
Data Platform Powered by Apache Hadoop]], which is a 100% open source 
distribution of Apache Hadoop. Version 1 is based upon Hadoop-0.20.205 and 
Version 2 will be based upon Hadoop-0.23. 
+   * Provides 
[[http://www.hortonworks.com/technology/hortonworksdataplatform/|Hortonworks 
Data Platform Powered by Apache Hadoop]], which is a 100% open source 
distribution of Apache Hadoop. Version 1 is based upon Hadoop-1.0.3. 
* Provider of [[http://www.hortonworks.com/support/|expert technical 
support]], [[http://www.hortonworks.com/training/|training]] and 
[[http://www.hortonworks.com/partners/|partner-enablement services]] for both 
end-user organizations and technology vendors.
  
   * [[http://www.hstreaming.com|HStreaming]]


[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2011-10-12 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=45rev2=46

Comment:
Make the greenplum entry slightly less marketing, and don't call it a 
distribution of Hadoop, instead try and separate the HD brand from Hadoop -as 
in the enterprise edition it is something different

* A Debian package of Apache Hadoop is available. Please see the 
[[http://wiki.debian.org/Hadoop|Debian Wiki on Hadoop]].
  
   * [[http://www.greenplum.com|Greenplum, A Division of EMC]]
-   * Greenplum HD enables you to take advantage of big data analytics without 
the overhead and complexity of a project built from scratch. Available in 
Community and Enterprise editions, Greenplum HD software provides a complete 
platform, including installation, training, global support, and value-add 
beyond simple packaging of the Apache Hadoop distribution. In addition, the 
Greenplum HD Module combines Hadoop and the Greenplum Database in one 
purpose-built Data Computing Appliance. Greenplum HD makes Hadoop faster, more 
dependable, and easier to use.
-   * Twelve technology companies have partnered with Greenplum to create a 
vibrant and powerful partner ecosystem, offering additional business 
intelligence, data transfer, and other technology capabilities on top of 
Greenplum’s Hadoop offerings. Greenplum HD partners include Concurrent, CSC, 
Datameer, Informatica, Jaspersoft, Karmasphere, Microstrategy, Pentaho, SAS, 
SnapLogic, Talend and VMWare.
+   * Greenplum HD offers two products based on Apache Hadoop that offer big 
data analytics. Available in Community and Enterprise editions, Greenplum HD 
software provides a complete platform, including installation, training, and 
global support. In addition, the Greenplum HD Module combines Hadoop and the 
Greenplum Database in one purpose-built Data Computing Appliance. Greenplum HD 
makes Hadoop faster, more dependable, and easier to use.
+   * Twelve technology companies have partnered with Greenplum offering 
additional business intelligence, data transfer, and other technology 
capabilities on top of Greenplum’s HD products. The Greenplum HD partners 
include Concurrent, CSC, Datameer, Informatica, Jaspersoft, Karmasphere, 
Microstrategy, Pentaho, SAS, SnapLogic, Talend and VMWare.
* The Greenplum HD Enterprise Edition is based on technology from MapR 
Technologies.
  
   * [[http://www.hortonworks.com|Hortonworks]]
@@ -88, +88 @@

   * [[http://www.pervasivedatarush.com|Pervasive Software]]
* Provides [[http://www.pervasivedatarush.com|Pervasive DataRush]], a 
parallel dataflow framework which improves performance of Apache Hadoop and 
MapReduce jobs by exploiting fine-grained parallelism on multicore servers.  
[[mailto:i...@pervasivedatarush.com|(contact)]]
   * [[http://www.platform.com|Platform Computing]]
-   *[[http://www.platform.com/mapreduce|Platform Computing]] provides an 
Enterprise Class MapReduce solution for Big Data Analytics with high 
scalability and fault tolerance. 
[[http://www.platform.com/products/mapreduce|Platform MapReduce]] provides 
unique scheduling capabilities and its architecture is based on almost two 
decades of distributed computing research and development. Based on the same 
low-latency distributed architecture deployed in the leading financial 
institutions on Wallstreet, the solution meets the needs of the most demanding 
enterprise customers. With comprehensive GUI management tools and commercial 
support available for HDFS, the solution also supports other distributed file 
systems. 
+   *[[http://www.platform.com/mapreduce|Platform Computing]] provides an 
Enterprise Class MapReduce solution for Big Data Analytics with high 
scalability and fault tolerance. 
[[http://www.platform.com/products/mapreduce|Platform MapReduce]] provides 
unique scheduling capabilities and its architecture is based on almost two 
decades of distributed computing research and development. Based on the same 
low-latency distributed architecture deployed in the leading financial 
institutions on Wall Street, the solution meets the needs of the most demanding 
enterprise customers. With comprehensive GUI management tools and commercial 
support available for HDFS, the solution also supports other distributed file 
systems. 
   * [[http://www.sematext.com/|Sematext International]]
* Provides consulting services around Apache Hadoop and Apache HBase, along 
with large-scale search using Apache Lucene, Apache Solr, and Elastic Search.
* Runs the popular [[http://search-hadoop.com/|search-hadoop.com]] search 
service.


[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2011-10-12 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=46rev2=47

Comment:
Flush out IBM entry; not merged with Platform. 

   * [[http://aws.amazon.com/|Amazon Web Services]]
* Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
  
-  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature-rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Apache Hadoop cluster.
+  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature-rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Apache Hadoop cluster. Cascading 2.0 is Apache-licensed. 
  
   * [[Cloudera]]: [[http://www.cloudera.com/downloads/|Cloudera's Distribution 
including Apache Hadoop]] currently includes:
* [[http://www.cloudera.com/downloads/|Docs and Setup Guide]]
@@ -61, +61 @@

* Available as [[http://www.hstreaming.com/products/cloud/|cloud service]] 
and as a [[http://www.hstreaming.com/products/enterprise/|software license]].
  
   * [[http://www.alphaworks.ibm.com/tech/idah|IBM]]
-   * IBM offers a repackaged version of Apache Hadoop that IBM supports on IBM 
JVMs.
+   * IBM offers a derivative version of Apache Hadoop that IBM supports on IBM 
JVMs on a number of platforms/operating systems. Their 
[http://www-01.ibm.com/software/data/infosphere/biginsights/|IBM BigInsights] 
product is built upon Apache Hadoop. 
  
   * [[http://www.impetus.com/ |Impetus]]
*Impetus' LADAP system is built for large enterprises and Websites to 
effectively derive intelligence out of raw data from discrete sources. With 
LADAP, an in-depth analysis can be undertaken on data from many different 
sources including social networks, to find out the patterns and structures 
within it. [[http://bigdata.impetus.com/big_data_analytics_platform# | More 
info about LADAP @Impetus]]


[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2011-09-30 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=41rev2=42

Comment:
roll back some of the hype, remove personal we claims, use Apache name more 
thoroughly

   * [[http://aws.amazon.com/|Amazon Web Services]]
* Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
  
-  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Hadoop cluster.
+  * [[http://www.cascading.org/|Cascading]] - Cascading is a feature-rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Apache Hadoop cluster.
  
   * [[Cloudera]]: [[http://www.cloudera.com/downloads/|Cloudera's Distribution 
including Apache Hadoop]] currently includes:
* [[http://www.cloudera.com/downloads/|Docs and Setup Guide]]
-   * Tested and integrated packages for related Hadoop projects (hive, pig, 
zookeeper, hbase, flume, sqoop, oozie, hue)
+   * Tested and integrated packages for related Hadoop projects (Apache hive, 
pig, zookeeper, hbase, flume, sqoop, oozie, hue)
* Standard Linux service management for all Hadoop services
* RPM and Debian packages for redhat / ubuntu based systems in binary and 
source form
 * Public YUM and APT repository for distribution and updates
@@ -34, +34 @@

  * High performance bare metal cloud with 
[[http://www.softlayer.com|Softlayer]] ([[mailto:i...@cloudera.com|contact]])
  
   * [[http://www.cloudspace.com/|Cloudspace]]
-   * Cloudspace is a web technology consulting company, since 1996. Cloudspace 
uses Hadoop to scale client and internal projects on Amazon's EC2 and bare 
metal architectures.
+   * Cloudspace is a web technology consulting company, since 1996. Cloudspace 
uses Apache Hadoop to scale client and internal projects on Amazon's EC2 and 
bare metal architectures.
  
   * [[http://www.datameer.com|Datameer]]
-   * Datameer Analytics Solution (DAS) is the first Hadoop-based solution for 
big data analytics that includes data source integration, storage, an analytics 
engine and visualization.
+   * Datameer Analytics Solution (DAS) is a Hadoop-based solution for big data 
analytics that includes data source integration, storage, an analytics engine 
and visualization.
* DAS Log File Aggregator is a plug-in to DAS that makes it easy to import 
large numbers of log files stored on disparate servers.
  
   * [[http://www.debian.org|Debian]]
@@ -53, +53 @@

* Available as [[http://www.hstreaming.com/products/cloud/|cloud service]] 
and as a [[http://www.hstreaming.com/products/enterprise/|software license]].
  
   * [[http://www.alphaworks.ibm.com/tech/idah|IBM]]
-   * IBM now offers a repackaged version of Apache Hadoop that IBM supports on 
IBM JVMs.
+   * IBM offers a repackaged version of Apache Hadoop that IBM supports on IBM 
JVMs.
  
   * [[http://www.impetus.com/ |Impetus]]
*Impetus' LADAP system is built for large enterprises and Websites to 
effectively derive intelligence out of raw data from discrete sources. With 
LADAP, an in-depth analysis can be undertaken on data from many different 
sources including social networks, to find out the patterns and structures 
within it. [[http://bigdata.impetus.com/big_data_analytics_platform# | More 
info about LADAP @Impetus]]
@@ -61, +61 @@

*With a strong focus, established thought leadership and open source 
contributions in the area of Big Data analytics and consulting services, 
Impetus uses its Global Delivery Model to help technology businesses and 
enterprises evaluate and implement solutions tailored to their specific 
context, without being biased towards a particular solution. 
[[http://bigdata.impetus.com/# | More info about BigData @Impetus]]
  
   * [[http://www.karmasphere.com/|Karmasphere]]
-   * Distributes [[http://www.hadoopstudio.org/|Karmasphere Studio for 
Hadoop]], which allows cross-version development and management of Hadoop jobs 
in a familiar integrated development environment.
+   * Distributes [[http://www.hadoopstudio.org/|Karmasphere Studio for 
Hadoop]], which allows cross-version development and management of Apache 
Hadoop jobs in a familiar integrated development environment.
  
   * [[http://lucene.apache.org/mahout|Mahout]]
* Another Apache project using Hadoop to build scalable machine learning 
algorithms like canopy clustering, k-means and many more.
@@ -69, +69 @@

   * [[http://mapr.com|MapR Technologies]]
* MapR sells a high performance map-reduce framework based on Apache Hadoop 
that includes the standard eco-system components.  A significant amount of 
re-engineering of the file system and the 

[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2011-07-15 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=31rev2=32

Comment:
There's always a SPOF. And trim the maketing from the AWS entry. 

  Entries are listed alphabetically by company name.
  
   * [[http://aws.amazon.com/|Amazon Web Services]]
+   * Amazon offers a version of Apache Hadoop on their EC2 infrastucture, sold 
as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
-   * We provide [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic 
MapReduce]]. It's a web service that provides a hosted Hadoop framework running 
on the web-scale infrastructure of Amazon Elastic Compute Cloud (Amazon EC2) 
and Amazon Simple Storage Service (Amazon S3).
-   * Our customers can instantly provision as much or as little capacity as 
they like to perform data-intensive tasks for applications such as web 
indexing, data mining, log file analysis, machine learning, financial analysis, 
scientific simulation, and bioinformatics research.
  
   * [[http://www.cascading.org/|Cascading]] - Cascading is a feature rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Hadoop cluster.
  
@@ -63, +62 @@

* Another Apache project using Hadoop to build scalable machine learning 
algorithms like canopy clustering, k-means and many more.
  
   * [[http://mapr.com|MapR Technologies]]
-   * MapR sells a high performance map-reduce framework based on Apache Hadoop 
that includes the standard eco-system components.  A significant amount of 
re-engineering of the file system and the map-reduce components allows 
significantly higher performance than standard Hadoop while eliminating 
problems with single points of failure and allowing full read-write access to 
the cluster file store via NFS.
+   * MapR sells a high performance map-reduce framework based on Apache Hadoop 
that includes the standard eco-system components.  A significant amount of 
re-engineering of the file system and the map-reduce components allows 
significantly higher performance than standard Hadoop while eliminating the 
primary single points of failure and allowing full read-write access to the 
cluster file store via NFS.
  
   * [[http://lucene.apache.org/nutch|Nutch]] - flexible web search engine 
software
  


[Hadoop Wiki] Update of Distributions and Commercial Support by SteveLoughran

2011-07-15 Thread Apache Wiki
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Hadoop Wiki for change 
notification.

The Distributions and Commercial Support page has been changed by 
SteveLoughran:
http://wiki.apache.org/hadoop/Distributions%20and%20Commercial%20Support?action=diffrev1=32rev2=33

Comment:
make clear that the Hadoop-layer SPOFs are fixed in MapR, review text and typos 
elsewhere

  Entries are listed alphabetically by company name.
  
   * [[http://aws.amazon.com/|Amazon Web Services]]
-   * Amazon offers a version of Apache Hadoop on their EC2 infrastucture, sold 
as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
+   * Amazon offers a version of Apache Hadoop on their EC2 infrastructure, 
sold as  [[http://aws.amazon.com/elasticmapreduce|Amazon Elastic MapReduce]]. 
  
   * [[http://www.cascading.org/|Cascading]] - Cascading is a feature rich API 
for defining and executing complex and fault tolerant data processing workflows 
on a Hadoop cluster.
  
   * [[Cloudera]]: [[http://www.cloudera.com/downloads/|Cloudera's Distribution 
including Apache Hadoop]] currently includes:
* [[http://www.cloudera.com/downloads/|Docs and Setup Guide]]
-   * Tested and integrated packages for related Hadoop projects (hive, pig, 
zookeeper, hbase, ,flume, sqoop, oozie, hue)
+   * Tested and integrated packages for related Hadoop projects (hive, pig, 
zookeeper, hbase, flume, sqoop, oozie, hue)
* Standard Linux service management for all Hadoop services
* RPM and Debian packages for redhat / ubuntu based systems in binary and 
source form
 * Public YUM and APT repository for distribution and updates
@@ -62, +62 @@

* Another Apache project using Hadoop to build scalable machine learning 
algorithms like canopy clustering, k-means and many more.
  
   * [[http://mapr.com|MapR Technologies]]
-   * MapR sells a high performance map-reduce framework based on Apache Hadoop 
that includes the standard eco-system components.  A significant amount of 
re-engineering of the file system and the map-reduce components allows 
significantly higher performance than standard Hadoop while eliminating the 
primary single points of failure and allowing full read-write access to the 
cluster file store via NFS.
+   * MapR sells a high performance map-reduce framework based on Apache Hadoop 
that includes the standard eco-system components.  A significant amount of 
re-engineering of the file system and the map-reduce components allows 
significantly higher performance than standard Hadoop while eliminating 
Hadoop's single points of failure (the NameNode and JobTracker) and allowing 
full read-write access to the cluster file store via NFS.
  
   * [[http://lucene.apache.org/nutch|Nutch]] - flexible web search engine 
software
  
   * [[http://pentaho.com|Pentaho]] – Open Source Business Intelligence
-   * Pentaho provides the only complete, end-to-end open  source BI 
alternative to proprietary offerings like Oracle, SAP and  IBM
+   * Pentaho provides the only complete, end-to-end open  source BI 
alternative to proprietary offerings like Oracle, SAP and IBM.
-   * We provide an easy-to-use, graphical ETL tool that  is integrated with 
Hadoop for managing data and coordinating Hadoop related  tasks in the broader 
context of your ETL and Business Intelligence  workflow
+   * We provide an easy-to-use, graphical ETL tool that  is integrated with 
Hadoop for managing data and coordinating Hadoop related tasks in the broader 
context of your ETL and Business Intelligence workflow.
-   * We also provide Reporting and Analysis capabilities  against big data in 
Hadoop
+   * We also provide Reporting and Analysis capabilities against big data in 
Hadoop.
-   * Learn more at 
[[http://www.pentaho.com/hadoop/|http://www.pentaho.com/hadoop]]
+   * Learn more at 
[[http://www.pentaho.com/hadoop/|http://www.pentaho.com/hadoop]].
  
   * [[http://www.pervasivedatarush.com|Pervasive Software]]
* We provide[[http://www.pervasivedatarush.com|Pervasive DataRush]], a 
parallel dataflow framework which improves performance of Hadoop and MapReduce 
jobs by exploiting fine-grained parallelism on multicore servers.  
[[mailto:i...@pervasivedatarush.com|(contact)]]
   * [[http://www.platform.com|Platform Computing]]
-   *[[http://www.platform.com/mapreduce|Platform Computing]] provides an 
Enterprise Class MapReduce solution for Big Data Analytics with high 
scalability and fault tolerance. 
[[http://www.platform.com/products/mapreduce|Platform MapReduce]] provides 
unique scheduling capabilities and its architecture is based on almost two 
decades of distributed computing research and development. Based on the same 
low-latency distributed architecture deployed in the leading financial 
institutions on Wallstreet, the solution meets the needs of the most demanding 
enterprise customers. With comprehensive GUI mangementment tools and commercial 
support available for HDFS, the solution also supports