Glen, You might want consider structuring the XML as a faux relational database... #1) full text #2) keyword index and frequency #3) titles, volume, number, etc.
During search, you only have to trawl through #2, which would have cross-ref IDs for numbers 1 and 3. Then pull #1 for presentation purposes and possible search term highlighting. ...Rob -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Glen Pike Sent: Thursday, February 21, 2008 11:25 AM To: Flash Coders List Subject: Re: [Flashcoders] CDROM XML search The system can use AS3 - as it is a CDROM. I asked about the data size - at the moment, a sample XML file, generated by an automatic tool is about 500k, gulp. That means, 6MB per year, 60MB per decade at the moment. I have asked to see the file, because there may be a lot of rubbish that can be eliminated - I hope so.. Glen Merrill, Jason wrote: > First questions to get out of the way is which version of Actionscript > and potentially how much data (in k)? > > Jason Merrill > Bank of America > GT&O L&LD Solutions Design & Development eTools & Multimedia > > Bank of America Flash Platform Developer Community > > > Are you a Bank of America associate interested in innovative learning > ideas and technologies? > Check out our internal GT&O Innovative Learning Blog and & subscribe. > > > > > > > >>> -----Original Message----- >>> From: [EMAIL PROTECTED] >>> [mailto:[EMAIL PROTECTED] On Behalf Of Glen >>> Pike >>> Sent: Thursday, February 21, 2008 10:50 AM >>> To: Flash Coders List >>> Subject: [Flashcoders] CDROM XML search >>> >>> Hi, >>> >>> I have been asked to look at a search facility for a CDROM >>> project. >>> >>> The customer is archiving magazines, 1 a month, for a decade per >>> CD and wants a simple search engine. >>> >>> The magazines will be archived as scanned images plus XML data >>> containing page text content. >>> >>> Loading in an XML file and searching / filtering is pretty easy >>> in principle, but I am guessing I may run into performance issues as >>> the amount of data is scaled up. >>> >>> Google is proving fairly useless today, so has anyone had much >>> experience of this and have any recommendations. >>> >>> Thanks >>> >>> Glen >>> -- >>> >>> Glen Pike >>> 01736 759321 >>> www.glenpike.co.uk <http://www.glenpike.co.uk> >>> _______________________________________________ >>> Flashcoders mailing list >>> [email protected] >>> http://chattyfig.figleaf.com/mailman/listinfo/flashcoders >>> >>> > _______________________________________________ > Flashcoders mailing list > [email protected] > http://chattyfig.figleaf.com/mailman/listinfo/flashcoders > > > -- Glen Pike 01736 759321 www.glenpike.co.uk <http://www.glenpike.co.uk> _______________________________________________ Flashcoders mailing list [email protected] http://chattyfig.figleaf.com/mailman/listinfo/flashcoders _______________________________________________ Flashcoders mailing list [email protected] http://chattyfig.figleaf.com/mailman/listinfo/flashcoders

