Hi there. I have been working with a semantic web type of application for a while, and it appears to me that PyLucene could give me a much-needed speed boost. I have a couple of questions to start.

My schema is fairly involved and has more than 100 elements. It would be great to be able to search on any of these, but how practical is that? Should I create a summary-type search using a smaller index of, say, 10 to 12 fields, and then offer a detailed search based on a larger number of fields in a different index? Any recommendations would be helpful. There is no doubt I will have to create a query parser for my app.

My second question is how I can get pydocs so that I am aware of what functionality exists and of the syntax. I see examples in the samples folder, but some description of the classes and methods is important. Is this available somewhere?

My last initial question has to do with sorting. I see that there are advanced possibilities for sorting and filtering with the indexes. How advisable is it to use sorting for large record sets? For example, say a search returns 20000 records. Because this will have a web interface, I will likely show only the first 20, so the results will be batched. Are sorting and filtering highly memory intensive?

Hopefully, someone can provide some initial guidance. Many thanks.

Regards,
David


_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev
