On Wed, 10 Sep 2008 23:33:44 +0800
"Amber" <[EMAIL PROTECTED]> wrote:
> Yes, we know both Greenplum and Netezza are PostgreSQL based MPP
> solutions, but they are commercial packages. I'd like to know are
> there open source ones, and I would suggest the PostgreSQL Team to
> start a MPP version o
From: "Joshua Drake" <[EMAIL PROTECTED]>
Sent: Wednesday, September 10, 2008 11:27 PM
To: "Amber" <[EMAIL PROTECTED]>
Cc: "Mark Roberts" <[EMAIL PROTECTED]>;
Subject: Re: [GENERAL] What's size of your PostgreSQL Database?
> On Wed, 10 Sep 2008
On Wed, 10 Sep 2008 23:17:40 +0800
"Amber" <[EMAIL PROTECTED]> wrote:
>
> 1. Some kind of MPP.
> 2. No single point of failure.
> 3. Convenient and multiple access interfaces.
>
> And following the is the solutions we have examined:
http://www.greenplum.com/
Joshua D. Drake
--
The PostgreSQL
> Yahoo has a 2PB Postgres single instance Postgres database (modified
> engine), but the biggest pure Pg single instance I've heard of is 4TB.
> The 4TB database has the additional interesting property in that they've
> done none of the standard "scalable" architecture changes (such as
> partition
> 8. We have a master and a replica. We have plans to move to a
> cluster/grid Soon(TM). It's not an emergency and Postgres can easily
> handle and scale to a 3TB database on reasonable hardware (<$30k).
>
I'd like to know what's your progress of choosing the cluster/grid solution, we
are also
On Thu, 2008-08-21 at 22:17 +0800, Amber wrote:
> Another question, how many people are there maintaining this huge database.
> We have about 2T of compressed SAS datasets, and now considering load them
> into a RDBMS database,
> according to your experience, it seems a single PostgreSQL instanc
right?
--
From: "Amber" <[EMAIL PROTECTED]>
Sent: Thursday, August 21, 2008 9:51 PM
To: "Mark Roberts" <[EMAIL PROTECTED]>
Cc:
Subject: Re: [GENERAL] What's size of your PostgreSQL Database?
>
>
&
> On Sat, 2008-08-16 at 11:42 +0800, Amber wrote:
>> Dear all:
>> We are currently considering using PostgreSQL to host a read only
>> warehouse, we would like to get some experiences, best practices and
>> performance metrics from the user community, following is the question list:
>> 1.
> Just out of curiosity, how do you replicate that amount of data?
When I started working here, we used Slony-I to replicate our aggregate
fact tables. A little over a year ago our data volume had grown to the
point that the Slony was regularly unable to keep up with the data
volume and around t
Mark Roberts wrote:
1. 2.5-3TB, several others that are of fractional sisize.
...
5. They do pretty well, actually. Our aggregate fact tables regularly
join to metadata tables and we have an average query return time of
10-30s. We do make some usage of denormalized mviews for
chained/hier
On Sat, 2008-08-16 at 11:42 +0800, Amber wrote:
> Dear all:
> We are currently considering using PostgreSQL to host a read only
> warehouse, we would like to get some experiences, best practices and
> performance metrics from the user community, following is the question list:
> 1. What's s
On Tue, 2008-08-19 at 07:34 -0400, Bill Moran wrote:
>
> In theory, you can have so many disks that the bottleneck moves to
> some
> other location, such as the IO bus or memory or the CPU, but I've
> never
> heard of that happening to anyone. Also, you want to get fast, high-
> quality disks, a
On Tue, 2008-08-19 at 02:28 -0400, David Wilson wrote:
> On Fri, Aug 15, 2008 at 11:42 PM, Amber <[EMAIL PROTECTED]> wrote:
> > Dear all:
> >We are currently considering using PostgreSQL to host a read only
> > warehouse,
> we would like to get some experiences, best practices and performance
In response to Ow Mun Heng <[EMAIL PROTECTED]>:
> On Mon, 2008-08-18 at 11:01 -0400, justin wrote:
> > Ow Mun Heng wrote:
> > > -Original Message-
> > > From: Scott Marlowe <[EMAIL PROTECTED]>
> > >
> > > > If you're looking at read only / read
> > > > mostly, then RAID5 or 6 might be
On Fri, Aug 15, 2008 at 11:42 PM, Amber <[EMAIL PROTECTED]> wrote:
> Dear all:
>We are currently considering using PostgreSQL to host a read only
> warehouse, we would like to get some experiences, best practices and
> performance metrics from the user community, following is the question lis
I have a db (tables with up to 5,000,000 records, up to 70 columns x 1,500,000
records, around 50Gb of disk space for the database (incl data, indexes, etc)
Most records have PostGIS geometry columns, which work very well.
For read performance this is on a (2 yr old) Linux box with 2x software
On Mon, 2008-08-18 at 11:01 -0400, justin wrote:
> Ow Mun Heng wrote:
> > -Original Message-
> > From: Scott Marlowe <[EMAIL PROTECTED]>
> >
> > > If you're looking at read only / read
> > > mostly, then RAID5 or 6 might be a better choice than RAID-10. But
> > > RAID 10 is my default
Ow Mun Heng wrote:
-Original Message-
From: Scott Marlowe <[EMAIL PROTECTED]>
If you throw enough drives on a quality RAID controller at it you can
get very good throughput. If you're looking at read only / read
mostly, then RAID5 or 6 might be a better choice than RAID-10. But
RAID
-Original Message-
From: Scott Marlowe <[EMAIL PROTECTED]>
>If you throw enough drives on a quality RAID controller at it you can
>get very good throughput. If you're looking at read only / read
>mostly, then RAID5 or 6 might be a better choice than RAID-10. But
>RAID 10 is my default cho
On Sun, 17 Aug 2008, Amber wrote:
what I am wondering is how multiple agent process share page caches.
The database allocates a block of shared memory (sized by the
shared_buffers parameter) that all the client processes share for caching
pages.
--
* Greg Smith [EMAIL PROTECTED] http://www
>> 7. How many concurrent readers of your database, and what's the average
>> transfer rate, suppose all readers are doing one table scaning.
>
> Concurrent but idle connections in production are around 600. Active
> connections at a time are in the dozens. I can read at about 60 to 70
> Megs a
On Fri, Aug 15, 2008 at 9:42 PM, Amber <[EMAIL PROTECTED]> wrote:
> Dear all:
>We are currently considering using PostgreSQL to host a read only
> warehouse, we would like to get some experiences, best practices and
> performance metrics from the user community, following is the question list
Dear all:
We are currently considering using PostgreSQL to host a read only
warehouse, we would like to get some experiences, best practices and
performance metrics from the user community, following is the question list:
1. What's size of your database?
2. What Operating System are you using
23 matches
Mail list logo