On Jul 22, 2005, at 4:37 AM, Namrata Kumari wrote:
- Well, the application I want to develop is more like storing xml files and with each of them having different structure. And then performing search on them that in turn can depend on the structure of the xml doc and user's
requirement.

That's still a pretty generic requirement. What type of queries? XPath?

- Moreover, I did not exactly understood as to how I can store the xml
document. I mean, I went through the java doc and couldnot figure out the
api's that could be used for this purpose. Can you guide me in this?

Look at the various types of fields. There is a "stored" attribute on Field that allows the field to be stored.

- But the biggest question is: Is Lucene a good option [which now I doubt on
the basis of what I have read till now :-(]

It really all depends. I built a search engine for the Rossetti Archive (http://www.rossettiarchive.org/rose/) which indexes XML files like this:

    http://www.rossettiarchive.org/docs/1-1847.s244.raw.xml

XPath queries are not possible into the XML, but that is also not a use case for the system. Highly structured queries such as this one are supported because the indexing process extracted detailed information from the XML files:

http://www.rossettiarchive.org/rose/?query=%2Bgenre%3Asonnet+%2B% 28author%3Arossetti+OR+author%3Adgr%29+%2Byear%3A%5B1850+TO+1870%5D

I still do not have a clear cut understanding of your needs and thus still not sure if Lucene is suitable or not. Certainly for full-text searches it is a fine choice, but the structured queries are a different story.

    Erik



Regards,
Namrata


-----Original Message-----
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Friday, July 22, 2005 2:11 PM
To: general@lucene.apache.org
Subject: Re: Lucene as xml store


On Jul 22, 2005, at 1:07 AM, Namrata Kumari wrote:



hi,

I am a beginner to lucene , So kindly excuse me if the questions
mentioned a bit naive.
- Can I use lucene as an xml store + search engine?
- What I understood is that if we want to perform search on xml doc.
we need to parse xml document, form indexes and on the basis of fields
perform search.
- So, does this mean, that even if we use lucene as xml store (IF WE
CAN!!), we need to parse it to form indexes?


Lucene is a search engine and only deals with text (Strings essentially).
Lucene is also a flat document space and doing queries for things
hierarchical is not how it was designed, but it can be done to a limited
degree depending on how data is indexed.

Yes, Lucene can store text as well as make it searchable - so you could
store an XML document in it as well.

You have not provided any information on the types of queries you need to support or what the user experience will be like. There are many ways to
use Lucene and whether it is suitable solution to your
application depends on that information.   Tell us more about what
you're wanting to do and we can guide you further.


Please reply to this as soon as possible


That's what they all say!   :)   No need to say such a thing - if you
have well articulated questions that are straightforward enough to answer,
you'll get responses quickly here.

     Erik


Reply via email to