Hi there. I have been working on a semantic-web-type application for a
while, and it appears to me that PyLucene could give me a much-needed
speed boost. I have a few questions to start.
My schema is fairly involved and has more than 100 elements. It would be
great to be able to search on any of these, but how practical is that?
Should I create a summary-type search using a smaller index of, say,
10-12 fields, and then offer a detailed search based on a larger number
of fields in a different index? Any recommendations would be helpful.
There is no doubt I will have to write a query parser for my app.
My second question is how I can get pydoc-style documentation so that I
know what functionality exists and what the syntax is. I see examples in
the samples folder, but some description of the classes and methods is
important. Is this available somewhere?
My last initial question has to do with sorting. I see that there are
advanced possibilities for sorting and filtering with the indexes. How
advisable is it to use sorting on large result sets? For example, say a
search returns 20,000 records. Because this will have a web interface, I
will likely show only the first 20, so the results will be batched. Is
sorting and filtering highly memory-intensive?
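To make the scenario concrete, here is a rough sketch in plain Python of
the batching I have in mind. It does not use any PyLucene calls; the hit
list, the first_page helper and the title sort key are all made up for
illustration, and I am only assuming that something similar is possible
on the index side.

```python
import heapq

def first_page(hits, sort_key, page_size=20):
    """Return the first page_size hits in sort order.

    Uses a bounded heap instead of sorting the whole result set,
    so only page_size hits are tracked while scanning.
    """
    return heapq.nsmallest(page_size, hits, key=sort_key)

# Hypothetical result set: 20000 (doc_id, title) pairs in arbitrary order.
hits = [(i, "title %05d" % ((i * 7919) % 20000)) for i in range(20000)]

# Only the 20 hits that would appear on the first web page are kept.
page = first_page(hits, sort_key=lambda hit: hit[1])
```

What I am really asking is whether sorted searches behave roughly like
this, or whether all 20,000 hits have to be held and sorted in memory
before the first page can be shown.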
Hopefully, someone can provide some initial guidance. Many thanks.
Regards,
David