Hi Luceners,
this is my 1st experience with ARQ, LARQ & Lucene;
everyth. went smooth so far, however the slope seems to be getting
steeper suddenly.
The initial problem was to develop a Java app to build, then to browse
through, a repository of RDF data. With TDB/ARQ, this is now running
smoothly.
Then came 2 decisions:
1) localize most of the RDF litterals, & have the repository hold them all; so
far
20 Western languages + 3 Eastern, & counting;
2) allow queries to include full-text searches over the litterals.
The problem now is that I expect new languages to be added w/out prior
warning, & I'll have to cope with most of them decently, right from
day-zero. As with most of the 20 already identified, for that matter.
So far, Lucene has exactly the right architecture for a Java indexing
component; LARQ seems to have more or less what it takes to both index &
query RDF based on language info.
The only thing I need is the middle layer: a Java component extending
Lucene, that'd pull a plausible Analyzer out of its magic hat, for every
ISO 639-1 language tag however unlikely that turns up in the RDF input.
Not just an analyzer, mark: a *plausible* one. I mean one that'll
generate usable indexes right out of the box in most cases; I cannot
afford to bring the system back into dev & study the arcanes of
automatic indexing in just every language we're working with. So
lucene-contrib + covering the rest with StandardAnalyzer/English
stopwords is not an option.
On the other hand, to mitigate the "however unlikely" part: the
localized litterals are repair instructions, so I don't reckon we miss
support for Middle-High Sumerian any time soon.
I had a quick glance at Solr; it would appear to be a fair candidate
since it must needs have such a component nested in its server-side
software, & it's free.
Only I would rather have the component in question offered as a separate
release, or at least with backward compatibility guaranteed.
What do you advise; should I apple-C apple-V part of Solr? Else, what alternate
freeware might do?
Best regards,
François Jurain.
____________________________________________________
Retrouvez les 10 conseils pour économiser votre carburant sur Voila :
http://actu.voila.fr/evenementiel/LeDossierEcologie/l-eco-conduite/
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]