I have about 1000 XML files loaded, averaging about 200 KB each. I tried to build indexes on all elements and attributes. In my application, wildcard-browsing queries are important.
I.e., users will want to explore unfamiliar data sets with queries like: //[EMAIL PROTECTED]'SLIDE_NUMBER']
With my dataset (using the latest CVS version), such queries take 20-50 seconds to complete.
I loaded the same dataset into a 30-day trial version of eXcelon. The same query, on eXcelon, completes in 1.4 seconds.
Have I built the Xindice indexes improperly? Or is that the way it is right now?
$XINDICE_HOME/bin/xindiceadmin add_collection -c $SERVER:$PORT/db/mycollection -n data
$XINDICE_HOME/bin/xindiceadmin add_indexer -c $SERVER:$PORT/db/mycollection/data -n starE -p "*"
$XINDICE_HOME/bin/xindiceadmin add_indexer -c $SERVER:$PORT/db/mycollection/data -n starA -p "[EMAIL PROTECTED]"
$XINDICE_HOME/bin/xindiceadmin import -c $SERVER:$PORT/db/mycollection -n data -f data
sample data:

<?xml version="1.0"?>
<XSIL Name="user_description">
  <XSIL Name="system">
    <XSIL Name="displayView" category="CCB">
      <XSIL Name="displayView" category="BBT">
        <XSIL Name="displayView" category="saddam" />
      </XSIL>
    </XSIL>
    <Param Name="description">Afferent_bbt_L.10.9.538</Param>
    <Param Name="owner">saddam</Param>
    <Param Name="group">CCB</Param>
    <Param Name="template">BBT</Param>
    <Param Name="doc_id">BBT-saddam-01_01_1994-23</Param>
    <Param Name="creationDate">01_01_1994</Param>
  </XSIL>
etc.....
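To make the expected result concrete, here is a minimal sketch of a wildcard-style lookup ("any element whose attribute has a given value") run against a trimmed, closed copy of the sample above, using Python's standard xml.etree.ElementTree. The helper name find_by_attribute and the attribute values used are illustrative assumptions; this is not the exact query from the post, and it says nothing about how Xindice or eXcelon evaluate such a query internally.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the sample data, with the outer element closed so it parses.
SAMPLE = """<?xml version="1.0"?>
<XSIL Name="user_description">
  <XSIL Name="system">
    <XSIL Name="displayView" category="CCB">
      <XSIL Name="displayView" category="BBT">
        <XSIL Name="displayView" category="saddam" />
      </XSIL>
    </XSIL>
    <Param Name="owner">saddam</Param>
    <Param Name="group">CCB</Param>
  </XSIL>
</XSIL>"""


def find_by_attribute(xml_text, attr, value):
    """Return every element, whatever its tag, whose `attr` equals `value`.

    Hypothetical helper for illustration only: it walks the whole tree
    in document order and checks each element in turn.
    """
    root = ET.fromstring(xml_text)
    return [el for el in root.iter() if el.get(attr) == value]


if __name__ == "__main__":
    matches = find_by_attribute(SAMPLE, "Name", "displayView")
    # Show the tag and category of each matching element.
    print([(el.tag, el.get("category")) for el in matches])
```

Since this walks every element of a single document, it only shows which nodes a browsing query of this kind should return; it is not a performance comparison.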
-- /* Sandy Pittendrigh >--oO0> * [EMAIL PROTECTED] * http://cns.montana.edu/~sandy */