Glen,

You might want to consider structuring the XML as a faux relational database...
#1) full text
#2) keyword index and frequency
#3) titles, volume, number, etc.

During search, you only have to trawl through #2, which would have cross-ref
IDs for numbers 1 and 3. Then pull #1 for presentation purposes and possible
search term highlighting.
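
A minimal sketch of that three-part layout, for illustration only. It is
written in TypeScript as a stand-in for AS3, and it assumes the XML has
already been parsed into in-memory structures. Every name here (Posting,
Meta, addPage, search, issueNumber) is hypothetical and not from any
existing tool; the word-splitting rule is also just an assumption.

```typescript
// Hypothetical record shapes; none of these names come from the thread.
interface Posting { id: number; count: number }   // one #2 entry: cross-ref ID + frequency
interface Meta { title: string; volume: number; issueNumber: number }   // one #3 entry

const fullText = new Map<number, string>();         // #1: full text, keyed by page ID
const keywordIndex = new Map<string, Posting[]>();  // #2: keyword -> pages containing it
const metadata = new Map<number, Meta>();           // #3: titles etc., keyed by the same ID

// Store one page and update the keyword index with its word frequencies.
function addPage(id: number, meta: Meta, text: string): void {
  fullText.set(id, text);
  metadata.set(id, meta);
  const counts = new Map<string, number>();
  // Assumption: a "keyword" is any lowercase run of letters or digits.
  for (const word of text.toLowerCase().match(/[a-z0-9]+/g) ?? []) {
    counts.set(word, (counts.get(word) ?? 0) + 1);
  }
  counts.forEach((count, word) => {
    const postings = keywordIndex.get(word) ?? [];
    postings.push({ id, count });
    keywordIndex.set(word, postings);
  });
}

// Search touches only #2, then follows the IDs into #1 and #3 for display.
function search(term: string): { meta: Meta; text: string; count: number }[] {
  const postings = keywordIndex.get(term.toLowerCase()) ?? [];
  return postings
    .slice()                              // copy so the index itself is not reordered
    .sort((a, b) => b.count - a.count)    // most frequent first
    .map(p => ({ meta: metadata.get(p.id)!, text: fullText.get(p.id)!, count: p.count }));
}
```

Call addPage once per page while loading, then search("steam") returns the
matching pages ordered by how often the term occurs. On a CDROM the index
would more likely be built ahead of time and shipped as its own XML file,
so only #2 has to be loaded up front.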

...Rob


-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Glen Pike
Sent: Thursday, February 21, 2008 11:25 AM
To: Flash Coders List
Subject: Re: [Flashcoders] CDROM XML search

The system can use AS3 - as it is a CDROM.

I asked about the data size. At the moment, a sample XML file, generated by
an automatic tool, is about 500k, gulp.
At one magazine a month, that means 6MB per year, 60MB per decade at the moment.

I have asked to see the file, because there may be a lot of rubbish that can
be eliminated - I hope so..

Glen



Merrill, Jason wrote:
> The first questions to get out of the way are which version of ActionScript
> and potentially how much data (in k)?
>
> Jason Merrill
> Bank of America
> GT&O L&LD Solutions Design & Development eTools & Multimedia
>
>>> -----Original Message-----
>>> From: [EMAIL PROTECTED]
>>> [mailto:[EMAIL PROTECTED] On Behalf Of Glen 
>>> Pike
>>> Sent: Thursday, February 21, 2008 10:50 AM
>>> To: Flash Coders List
>>> Subject: [Flashcoders] CDROM XML search
>>>
>>> Hi,
>>>
>>>    I have been asked to look at a search facility for a CDROM 
>>> project.
>>>
>>>    The customer is archiving magazines, 1 a month, for a decade per 
>>> CD and wants a simple search engine.
>>>
>>>    The magazines will be archived as scanned images plus XML data 
>>> containing page text content.
>>>
>>>    Loading in an XML file and searching / filtering is pretty easy 
>>> in principle, but I am guessing I may run into performance issues as 
>>> the amount of data is scaled up.
>>>
>>>    Google is proving fairly useless today, so has anyone had much 
>>> experience of this, and do you have any recommendations?
>>>
>>>    Thanks
>>>
>>>    Glen
>>> --
>>>
>>> Glen Pike
>>> 01736 759321
>>> www.glenpike.co.uk <http://www.glenpike.co.uk> 
>>> _______________________________________________
>>> Flashcoders mailing list
>>> [email protected]
>>> http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
>>>
>>>       

-- 

Glen Pike
01736 759321
www.glenpike.co.uk <http://www.glenpike.co.uk>