Barry,

Honestly, I would love all of that, but it is not absolutely necessary.

I installed the trial version of Verity Ultraseek.  It limits you to
25,000 documents (or database records).  It was awesome: very fast, and
it did all the relevancy and near-matching type stuff.  However, as
someone mentioned, it is even more expensive than the Google solution,
although I get the feeling there is a lot of flexibility in their
pricing.

My biggest problem is that we are a small company that has been
compiling information for 16 years, so we have built up a database of
2.2 million records.  Everything is expensive after about 500,000
records (documents), and I am talking $30,000 to over $100,000.

But Ultraseek was cool, and I imagine the Google appliance would
be just like the google.com website in speed and usability.

For years I have been using an interface where my clients enter
search criteria into a large form.  This works great for advanced
users but is overwhelming for beginners.  It is my impression
that people expect Google-like usability and convenience when they
search for stuff on other sites, such as mine, hence my interest
in providing that sort of convenience for them.

It is looking more and more like I will be using my existing form
interface and maybe extracting recent data from the total database.  I
would estimate that over 80% of my users are searching for data no
older than 6 months to 1 year.  So I could provide an archive search
that would access a database storing the older information and have a
separate search that would access the newer stuff.  But then, is it
really that big of a difference if the dates are part of the query?
Thinking out loud now.
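
To make the "dates as part of the query" idea concrete, here is a rough
sketch.  It assumes a hypothetical `records` table with an indexed
`created` column, and it uses SQLite only so the example runs anywhere;
none of the names come from my real schema, and MSSQL syntax would differ
slightly.  The point is that one table plus an optional date filter can
serve both the "recent" and the "archive" search:

```python
import sqlite3


def make_db():
    """Build a tiny in-memory stand-in for the real database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE records (id INTEGER PRIMARY KEY, title TEXT, created TEXT)"
    )
    # An index on the date column lets the engine narrow by date range
    # instead of scanning every row.
    conn.execute("CREATE INDEX idx_records_created ON records (created)")
    conn.executemany(
        "INSERT INTO records (title, created) VALUES (?, ?)",
        [
            ("Bridge permit, Salt Lake", "2006-03-15"),
            ("Bridge repair bid", "2001-07-02"),
            ("School roof permit", "2005-12-01"),
        ],
    )
    return conn


def search(conn, term, since=None, before=None):
    """Search titles, optionally restricted to a date window.

    since  -> only rows created on or after this ISO date (the "recent" search)
    before -> only rows created before this ISO date (the "archive" search)
    """
    sql = "SELECT title FROM records WHERE title LIKE ?"
    params = [f"%{term}%"]
    if since is not None:
        sql += " AND created >= ?"
        params.append(since)
    if before is not None:
        sql += " AND created < ?"
        params.append(before)
    sql += " ORDER BY created DESC"
    return [row[0] for row in conn.execute(sql, params)]


if __name__ == "__main__":
    conn = make_db()
    cutoff = "2005-10-27"  # roughly six months before this thread
    print(search(conn, "bridge", since=cutoff))   # recent search
    print(search(conn, "bridge", before=cutoff))  # archive search
```

Whether this is fast enough on 2.2 million rows depends on the indexes and
on how selective the date range is, so treat it as something to measure,
not a conclusion.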

Thanks for all your help and I would be interested in hearing from any
others that have anything else to add.  I have yet to research the
open source stuff mentioned above.

BTW, I have been playing with a full-text catalog, but I am not seeing
a huge benefit.  Can anyone chime in on the advantages of full text
over traditional database queries, other than "near" matches and real
text stuff?
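
For anyone following along, here is a toy illustration of the difference
as I understand it.  This is NOT how the MSSQL full-text engine is
implemented; it is a minimal word index in plain Python, with made-up
sample rows and hypothetical function names, compared against a
substring scan (the rough equivalent of LIKE '%term%'):

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each word to the set of record ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def search_index(index, query):
    """Return ids of records containing every query word, in any order."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


def search_scan(docs, query):
    """Substring scan over every record, like LIKE '%query%'."""
    needle = query.lower()
    return {doc_id for doc_id, text in docs.items() if needle in text.lower()}


if __name__ == "__main__":
    docs = {
        1: "Water main replacement on Main Street",
        2: "Street lighting upgrade",
        3: "Replacement of the main water pump",
    }
    index = build_index(docs)
    print(search_index(index, "water main"))  # matches on words, order-free
    print(search_scan(docs, "water main"))    # matches the exact substring only
```

Two things stand out in the sketch: the index lookup only touches the
records listed under each word instead of reading every row, and it
matches on words rather than on one exact substring, so word order does
not matter.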

Thanks everyone.

On 4/27/06, Barry Beattie <[EMAIL PROTECTED]> wrote:
> David, are you going the whole hog and having keyword highlighting, %
> of relevance, etc?
>
> just curious
>
> On 4/28/06, Britton, Michael (NIH/NIEHS) [C] <[EMAIL PROTECTED]> wrote:
> > I think Verity Ultraseek ('VU' hereforward) costs more than Google
> > Appliance, but its collection management interface is great if you're
> > going to support searching across a large enterprise.  We're using VU
> > with FarCry and have the two working together nicely.  If I ever need to
> > build another search engine and have the deep pockets for it, I'll
> > probably opt for whatever the next incarnation of VU is (Verity
> > Uuberseek?).
> >
> > -------------------------------------------
> >
> > Mike Britton
> > Programmer / Analyst (Contractor)
> > NTP Information Systems Support
> > 919 541-0642
> > Email: [EMAIL PROTECTED]
> > NIEHS, MD EC-03, P.O. BOX 12233, Research Triangle Park, NC  27709
> > -----Original Message-----
> > From: David Mineer [mailto:[EMAIL PROTECTED]
> > Sent: Thursday, April 27, 2006 1:35 AM
> > To: [email protected]
> > Subject: Re: [CFCDev] Google style search for Database Content
> >
> > I have a full-text catalog that was populating when I left work.  It had
> > been running for a couple of hours.  I will continue to explore this
> > option tomorrow.  I will have to look into Verity more.  I know of it
> > but have never used it, except as part of the old Allaire Forums that I
> > once used.
> >
> > Thanks.
> >
> > On 4/26/06, David Ross <[EMAIL PROTECTED]> wrote:
> > > MSSQL has a built-in full-text search engine on certain versions -
> > > that may be the easiest thing to do. I think 2 million entries is
> > > beyond what the Verity that comes with CF allows, but you could look
> > > into a separate Verity license or Lucene, an open-source alternative.
> > >
> > > -Dave
> > > >>> [EMAIL PROTECTED] 04/26/06 1:24 PM >>>
> > > I have a MSSQL server database with approx 2 million records.  I would
> > > really like a simple 'Google Like' interface search for my clients
> > > that would search for the given value in all fields of all those
> > > records.
> > >
> > > I have been looking at the Google Search Appliance, which has direct
> > > database indexing built in, but it is VERY expensive (min $30K).  I
> > > have also seen the 'Google Mini', which costs less ($2K) but does not do
> > > the direct database indexing.  Instead I have seen the recommendation
> > > that I build a dynamic page with URL links to each record.  Would this
> > > work with so many records involved?
> > >
> > > Has anyone seen or does anyone have any ideas on making the entire
> > > contents of a large database easily searchable?
> > >
> > > TIA,
> > >
> > > --
> > > David Mineer Jr
> > > ---------------------
> > > The critical ingredient is getting off your butt and doing something.
> > > It's as simple as that. A lot of people have ideas, but there are few
> > > who decide to do something about them now.
> > > Not tomorrow. Not next week. But today. The true entrepreneur is a
> > > doer.
> > >
> > >
> > > ----------------------------------------------------------
> > > You are subscribed to cfcdev. To unsubscribe, send an email to
> > > [email protected] with the words 'unsubscribe cfcdev' as the subject
> > > of the email.
> > >
> > > CFCDev is run by CFCZone (www.cfczone.org) and supported by CFXHosting
> > > (www.cfxhosting.com).
> > >
> > > An archive of the CFCDev list is available at
> > > www.mail-archive.com/[email protected]
> > >
> > >
> > >
> > >
> > > -----------------------------------------
> > > CONFIDENTIALITY NOTICE: This email and any attachments may contain
> > > confidential information that is protected by law and is for the sole
> > > use of the individuals or entities to which it is addressed.
> > > If you are not the intended recipient, please notify the sender by
> > > replying to this email and destroying all copies of the communication
> > > and attachments. Further use, disclosure, copying, distribution of, or
> > > reliance upon the contents of this email and attachments is strictly
> > > prohibited. To contact Albany Medical Center, or for a copy of our
> > > privacy practices, please visit us on the Internet at www.amc.edu.
> > >
> > >
> > >
> >
> >
> > --
> > David Mineer Jr
> >
> >
>
>


--
David Mineer Jr
---------------------
The critical ingredient is getting off your butt and doing
something. It's as simple as that. A lot of people have ideas,
but there are few who decide to do something about them now.
Not tomorrow. Not next week. But today. The true entrepreneur
is a doer.

