Re: [VOTE] release 3.3
+1 Tommaso

2011/6/23 Robert Muir rcm...@gmail.com:
> Artifacts here: http://s.apache.org/lusolr33rc0
> Working release notes here:
> http://wiki.apache.org/lucene-java/ReleaseNote33
> http://wiki.apache.org/solr/ReleaseNote33
> I ran the automated release test script in trunk/dev-tools/scripts/smokeTestRelease.py, and ran 'ant test' at the top level 50 times on Windows. Here is my +1

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
RE: [VOTE] release 3.3
+1

I did the following:
- compared the Solr and Lucene binary .zip and .tgz archives' contents for any differences (other than line endings)
- skimmed Changes.html for generation problems
- looked at random pages from each module's javadocs
- ran the Lucene demo, indexed and searched
- ran the Solr example server, indexed and searched
- eyeballed all modules' Maven artifacts and sanity-checked their POMs
- ran all tests from the Solr and Lucene source tarballs, separately

Two non-release-blocking nits:

1. In the Solr source tarball, solr/example/README.txt recommends running './post.sh *.xml' from solr/example/exampledocs/, but post.sh does not have executable permissions. In the binary tarball, however, post.sh does have executable permissions.

2. I checked the source for references to older versions, and I found the following; I think these just point to a missing item in the release todo (post-branching), and should not block the release:

./lucene/contrib/analyzers/common/src/java/org/apache/lucene/analysis/fr/FrenchStemFilter.java: @Deprecated // TODO remove in 3.2 (present twice in this file)
./lucene/src/java/org/apache/lucene/index/ConcurrentMergeScheduler.java: /** @deprecated remove all this test mode code in lucene 3.2! */
./lucene/contrib/analyzers/common/src/java/org/apache/lucene/analysis/br/BrazilianAnalyzer.java: // TODO make this private in 3.1 (present twice in this file)
./lucene/contrib/demo/src/java/org/apache/lucene/demo/IndexFiles.java: /** Index all text files under a directory. See http://lucene.apache.org/java/3_1/demo.html. */
./lucene/contrib/demo/src/java/org/apache/lucene/demo/IndexFiles.java: + See http://lucene.apache.org/java/3_1/demo.html for details.

Steve

-----Original Message-----
From: Robert Muir [mailto:rcm...@gmail.com]
Sent: Thursday, June 23, 2011 4:18 PM
To: dev@lucene.apache.org
Subject: [VOTE] release 3.3

> [...]
[jira] [Created] (SOLR-2619) two sfields in geospatial search
two sfields in geospatial search

Key: SOLR-2619
URL: https://issues.apache.org/jira/browse/SOLR-2619
Project: Solr
Issue Type: Wish
Components: clients - php
Affects Versions: 3.2
Environment: Using with Drupal
Reporter: jose rodriguez
Fix For: 3.2

Is it possible to create a query with two sfield parameters (geospatial search)? That is, with two different pt and d values, one for each field. If I need a from - to search, then I need fields around the 'from' coordinate and around the 'to' coordinate. Thanks.

--
This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
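For what it's worth, Solr's spatial filter accepts its parameters as local params, so two geofilt filter queries, each with its own sfield, pt, and d, can be combined in one request. A rough sketch of what the question seems to be asking for (the field names from_coord/to_coord and the coordinates are made up for illustration):

```text
q=*:*
  &fq={!geofilt sfield=from_coord pt=40.27,-74.00 d=10}
  &fq={!geofilt sfield=to_coord   pt=41.50,-73.20 d=10}
```

Each fq is evaluated independently, so documents must fall within d km of both points to match.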
[jira] [Commented] (SOLR-2382) DIH Cache Improvements
[ https://issues.apache.org/jira/browse/SOLR-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054263#comment-13054263 ]

Noble Paul commented on SOLR-2382:
----------------------------------

bq. cacheInit() in EntityProcessorBase specifically passes only the parameters that apply to the current situation

It doesn't matter. It can use any params that are relevant to it. Anyway, you can't define what params are required for a future DIHCache impl. Look at a Transformer implementation: it can read anything it wants. The cache should be initialized the same way.

DIH Cache Improvements
----------------------

Key: SOLR-2382
URL: https://issues.apache.org/jira/browse/SOLR-2382
Project: Solr
Issue Type: New Feature
Components: contrib - DataImportHandler
Reporter: James Dyer
Priority: Minor
Attachments: SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch

Functionality:
1. Provide a pluggable caching framework for DIH so that users can choose a cache implementation that best suits their data and application.
2. Provide a means to temporarily cache a child entity's data without needing to create a special cached implementation of the entity processor (such as CachedSqlEntityProcessor).
3. Provide a means to write the final (root entity) DIH output to a cache rather than to Solr. Then provide a way for a subsequent DIH call to use the cache as an entity input. Also provide the ability to do delta updates on such persistent caches.
4. Provide the ability to partition data across multiple caches that can then be fed back into DIH and indexed either to varying Solr shards, or to the same core in parallel.

Use Cases:
1. We needed a flexible, scalable way to temporarily cache child-entity data prior to joining to parent entities.
   - Using SqlEntityProcessor with child entities can cause an n+1 select problem.
   - CachedSqlEntityProcessor only supports an in-memory HashMap as a caching mechanism and does not scale.
   - There is no way to cache non-SQL inputs (e.g. flat files, XML, etc.).
2. We needed the ability to gather data from long-running entities with a process that runs separately from our main indexing process.
3. We wanted the ability to do a delta import of only the entities that changed.
   - Lucene/Solr requires entire documents to be re-indexed, even if only a few fields changed.
   - Our data comes from 50+ complex SQL queries and/or flat files.
   - We do not want to incur the overhead of re-gathering all of this data if only one entity's data changed.
   - Persistent DIH caches solve this problem.
4. We want the ability to index several documents in parallel (using 1.4.1, which did not have the threads parameter).
5. In the future, we may need to use shards, creating a need to easily partition our source data into shards.

Implementation Details:
1. De-couple EntityProcessorBase from caching.
   - Created a new interface, DIHCache, with two implementations:
     - SortedMapBackedCache - an in-memory cache, used as the default with CachedSqlEntityProcessor (now deprecated).
     - BerkleyBackedCache - a disk-backed cache, dependent on bdb-je, tested with je-4.1.6.jar.
       - NOTE: the existing Lucene contrib db project uses je-3.3.93.jar. I believe this may be incompatible due to generics usage.
       - NOTE: I did not modify the ant script to automatically get this jar, so to use or evaluate this patch, download bdb-je from http://www.oracle.com/technetwork/database/berkeleydb/downloads/index.html
2. Allow entity processors to take a cacheImpl parameter to cause the entity data to be cached (see EntityProcessorBase and DIHCacheProperties).
3. Partially de-couple SolrWriter from DocBuilder.
   - Created a new interface, DIHWriter, with two implementations:
     - SolrWriter (refactored)
     - DIHCacheWriter (allows DIH to write ultimately to a cache).
4. Create a new entity processor, DIHCacheProcessor, which reads a persistent cache as DIH entity input.
5. Support a partition parameter with both DIHCacheWriter and DIHCacheProcessor to allow for easy partitioning of source entity data.
6. Change the semantics of entity.destroy().
   - Previously, it was called on each iteration of DocBuilder.buildDocument().
   - Now it does one-time cleanup tasks (like closing or deleting a disk-backed cache) once the entity processor is completed.
   - The only out-of-the-box entity processor that previously implemented destroy() was LineEntityProcessor, so this is not a very invasive change.

General Notes:
We are near completion in converting our search functionality from a legacy search engine to Solr. However, I found that DIH did not support caching to the level of
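To make the "pluggable cache" idea above concrete, here is a minimal, self-contained sketch of what such an interface and an in-memory implementation could look like. This is a hypothetical illustration, not the actual interface from the SOLR-2382 patch, which may differ in names and signatures.

```java
import java.util.*;

// Hypothetical sketch of a pluggable DIH cache; the real DIHCache
// interface in the SOLR-2382 patch may look different.
interface DIHCache {
    void add(Object key, Map<String, Object> row);  // cache one entity row under a key
    List<Map<String, Object>> lookup(Object key);   // retrieve all rows cached for a key
    void close();                                   // one-time cleanup (e.g. delete disk files)
}

// In-memory variant, analogous in spirit to SortedMapBackedCache.
class SortedMapCacheSketch implements DIHCache {
    private final SortedMap<Object, List<Map<String, Object>>> data = new TreeMap<>();

    public void add(Object key, Map<String, Object> row) {
        // Group rows by key, as a child entity can yield many rows per parent key.
        data.computeIfAbsent(key, k -> new ArrayList<>()).add(row);
    }

    public List<Map<String, Object>> lookup(Object key) {
        return data.getOrDefault(key, Collections.emptyList());
    }

    public void close() {
        data.clear();
    }
}
```

A disk-backed implementation (like the BerkleyBackedCache described above) would implement the same interface, which is what lets entity processors stay unaware of the storage mechanism.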
[jira] [Issue Comment Edited] (SOLR-2382) DIH Cache Improvements
[ https://issues.apache.org/jira/browse/SOLR-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054263#comment-13054263 ]

Noble Paul edited comment on SOLR-2382 at 6/24/11 6:42 AM:
----------------------------------------------------------

bq. cacheInit() in EntityProcessorBase specifically passes only the parameters that apply to the current situation

It doesn't matter. It can use any params that are relevant to it. Anyway, you can't define what params are required for a future DIHCache impl. Look at a Transformer implementation: it can read anything it wants. The cache should be initialized the same way. Why should the DocBuilder even be aware of DIHCache? Should it not be kept local to the EntityProcessor?

was (Author: noble.paul): [...]

DIH Cache Improvements
----------------------

Key: SOLR-2382
URL: https://issues.apache.org/jira/browse/SOLR-2382
Project: Solr
Issue Type: New Feature
Components: contrib - DataImportHandler
Reporter: James Dyer
Priority: Minor
Re: [VOTE] release 3.3
I checked the clustering contrib and went through the Solr example (on Ubuntu). One thing I noticed: we have a duplicated log4j*.jar in the distribution; the one in the clustering contrib is not needed, in fact, because we use slf4j for logging anyway (and that jar is picked up from the war's WEB-INF/lib). I'll file an issue to remove it.

Dawid

On Fri, Jun 24, 2011 at 8:15 AM, Steven A Rowe sar...@syr.edu wrote:
> +1 I did the following:
> - compared the Solr and Lucene binary .zip and .tgz archives' contents for any differences (other than line endings)
> [...]
> - ran all tests from the Solr and Lucene source tarballs, separately
> [...]
[jira] [Created] (SOLR-2620) Remove log4j jar from the clustering contrib (uses slf4j).
Remove log4j jar from the clustering contrib (uses slf4j).
----------------------------------------------------------

Key: SOLR-2620
URL: https://issues.apache.org/jira/browse/SOLR-2620
Project: Solr
Issue Type: Improvement
Components: contrib - Clustering
Affects Versions: 3.3
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Trivial
Fix For: 3.3, 4.0
Re: [VOTE] release 3.3
Is there a code freeze on 3x or can I apply SOLR-2620 to it?

Dawid

On Fri, Jun 24, 2011 at 8:51 AM, Dawid Weiss dawid.we...@cs.put.poznan.pl wrote:
> I checked the clustering contrib, went through the Solr example (on ubuntu). One thing I noticed: we have duplicated log4j*.jar in the distribution [...] I'll file an issue to remove it.
[jira] [Updated] (SOLR-2620) Remove log4j jar from the clustering contrib (uses slf4j).
[ https://issues.apache.org/jira/browse/SOLR-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Weiss updated SOLR-2620:
------------------------------
Fix Version/s: (was: 4.0)

Remove log4j jar from the clustering contrib (uses slf4j).
Key: SOLR-2620
URL: https://issues.apache.org/jira/browse/SOLR-2620
Project: Solr
Issue Type: Improvement
Components: contrib - Clustering
Affects Versions: 3.3
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Trivial
Fix For: 3.3
[jira] [Commented] (LUCENE-3229) Overlaped SpanNearQuery
[ https://issues.apache.org/jira/browse/LUCENE-3229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054270#comment-13054270 ]

Paul Elschot commented on LUCENE-3229:
--------------------------------------

Try and set 3.4 as the fix version for this; 3.3 is already on the way out. It might also help to add some text for a CHANGES.txt entry.

Overlapped SpanNearQuery
------------------------

Key: LUCENE-3229
URL: https://issues.apache.org/jira/browse/LUCENE-3229
Project: Lucene - Java
Issue Type: Bug
Components: core/search
Affects Versions: 3.1
Environment: Windows XP, Java 1.6
Reporter: ludovic Boutros
Priority: Minor
Attachments: LUCENE-3229.patch, LUCENE-3229.patch, SpanOverlap.diff, SpanOverlap2.diff, SpanOverlapTestUnit.diff

While using span queries, I think I've found a little bug. With a document like this (from the TestNearSpansOrdered unit test):

w1 w2 w3 w4 w5

if I search with this span query:

spanNear([spanNear([field:w3, field:w5], 1, true), field:w4], 0, true)

the above document is returned, and I think it should not be, because 'w4' is not after 'w5'. The two spans are not ordered, because there is an overlap. I will add a test patch to the TestNearSpansOrdered unit test, and a patch to solve this issue too. Basically, it modifies the two docSpansOrdered functions to make sure that the spans do not overlap.
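The report boils down to which ordering predicate is used on span boundaries. Below is a standalone sketch of the two checks, using [start, end) token positions; Lucene's actual NearSpansOrdered.docSpansOrdered implementation differs in detail, so treat this as an illustration of the bug, not the real code.

```java
// Illustrative sketch of span ordering checks; not the actual Lucene code.
final class SpanOrderSketch {
    // Lenient check: the first span merely has to start no later than the
    // second. Overlapping spans can still pass, which is the reported bug.
    static boolean orderedLenient(int start1, int end1, int start2, int end2) {
        return start1 < start2 || (start1 == start2 && end1 < end2);
    }

    // Strict check: the first span must end before the second begins,
    // so overlapping spans are rejected (the behavior the patch aims for).
    static boolean orderedStrict(int start1, int end1, int start2, int end2) {
        return end1 <= start2;
    }
}
```

For the document "w1 w2 w3 w4 w5", the inner spanNear(w3, w5) match covers positions [2, 5) and w4 covers [3, 4): the lenient check accepts that pair, while the strict check rejects it.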
Re: [VOTE] release 3.3
There is no code freeze anywhere... in my opinion, if you find little things to fix, just commit! (And backport also to http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_3/.)

I named the RC with the revision number; if we decide to go with it anyway, we just use that rev for svn tagging. But if lots of good things are found/fixed, then 'svn update' + 'ant prepare-release' + sftp to create a second release candidate is no big deal.

On Fri, Jun 24, 2011 at 2:55 AM, Dawid Weiss dawid.we...@cs.put.poznan.pl wrote:
> Is there a code freeze on 3x or can I apply SOLR-2620 to it?
>
> Dawid
> [...]
Re: [VOTE] release 3.3
Ok, thanks Robert. This is a trivial correction. I didn't find log4j references anywhere under contrib/clustering (just to make sure), so I think it won't break anything.

Dawid

On Fri, Jun 24, 2011 at 9:03 AM, Robert Muir rcm...@gmail.com wrote:
> there is no code freeze anywhere... in my opinion, if you find little
> things to fix, just commit! (and backport also to
> http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_3/)
> [...]
[jira] [Resolved] (SOLR-2620) Remove log4j jar from the clustering contrib (uses slf4j).
[ https://issues.apache.org/jira/browse/SOLR-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Weiss resolved SOLR-2620.
-------------------------------
Resolution: Fixed

Applied to the 3x and 3_3 branches.

Remove log4j jar from the clustering contrib (uses slf4j).
Key: SOLR-2620
URL: https://issues.apache.org/jira/browse/SOLR-2620
Project: Solr
Issue Type: Improvement
Components: contrib - Clustering
Affects Versions: 3.3
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Trivial
Fix For: 3.3
[jira] [Updated] (LUCENE-3206) FST package API refactoring
[ https://issues.apache.org/jira/browse/LUCENE-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Weiss updated LUCENE-3206:
--------------------------------
Fix Version/s: (was: 3.3)
               3.4

Moving to 3.4, if at all :)

FST package API refactoring
---------------------------

Key: LUCENE-3206
URL: https://issues.apache.org/jira/browse/LUCENE-3206
Project: Lucene - Java
Issue Type: Improvement
Components: core/FSTs
Affects Versions: 3.2
Reporter: Dawid Weiss
Assignee: Dawid Weiss
Priority: Minor
Fix For: 3.4, 4.0
Attachments: LUCENE-3206.patch

The current API is still marked @experimental, so I think there's still time to fiddle with it. I've been using the current API for some time and I do have some ideas for improvement. This is a placeholder for these -- I'll post a patch once I have a working proof of concept.
[JENKINS] Lucene-Solr-tests-only-trunk - Build # 9031 - Failure
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/9031/

1 tests failed.

REGRESSION: org.apache.lucene.index.TestIndexReaderReopen.testThreadSafety

Error Message:
Error occurred in thread Thread-95: /usr/home/hudson/hudson-slave/workspace/Lucene-Solr-tests-only-trunk/checkout/lucene/build/test/7/test4767747692tmp/_e_4.frq (Too many open files in system)

Stack Trace:
junit.framework.AssertionFailedError: Error occurred in thread Thread-95: /usr/home/hudson/hudson-slave/workspace/Lucene-Solr-tests-only-trunk/checkout/lucene/build/test/7/test4767747692tmp/_e_4.frq (Too many open files in system)
    at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1425)
    at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1343)
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-tests-only-trunk/checkout/lucene/build/test/7/test4767747692tmp/_e_4.frq (Too many open files in system)
    at org.apache.lucene.index.TestIndexReaderReopen.testThreadSafety(TestIndexReaderReopen.java:822)

Build Log (for compile errors):
[...truncated 3546 lines...]
Re: [VOTE] release 3.3
On Fri, Jun 24, 2011 at 2:15 AM, Steven A Rowe sar...@syr.edu wrote: +1 I did the following: - compared the Solr and Lucene binary .zip and .tgz archives' contents for any differences (other than line endings) - skimmed Changes.html for generation problems - looked at random pages from each module's javadocs - ran the Lucene demo, indexed and searched - ran the Solr example server, indexed and searched - eyeballed all modules' Maven artifacts and sanity-checked their POMs - ran all tests from the Solr and Lucene source tarballs, separately

Thanks Steven, these look like good checks, and I think it would be great if we could add as many of these as possible to the 'smokeTestRelease.py' script in dev-tools/scripts (e.g. start up Solr, index the example docs, and do a search). I could also imagine that sometime soon we might even want to have this release tester run against nightly builds or something, so we catch problems continuously.

Two non-release-blocking nits: 1. In the Solr source tarball, solr/example/README.txt recommends using the command ./post.sh *.xml from solr/example/exampledocs/, but post.sh does not have executable permissions. In the binary tarball, however, post.sh has executable permissions. 2. I checked the source for references to older versions, and I found the following; I think these just point to a missing item in the release todo (post-branching), and should not block the release:

I took care of these: the deprecations are already nuked in trunk, and I don't think we achieve much by nuking them in a 3.x minor release. As for the demo links, they were completely broken, so I replaced them with a description of what the code is doing (which seems more useful).
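As an aside, the executable-bit nit above is exactly the kind of thing a release smoke tester could check automatically. A minimal sketch in Java of such a check (a hypothetical helper, not part of the actual smokeTestRelease.py, which is a Python script):

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.attribute.PosixFilePermission;
import java.util.Set;

/** Sketch of an extra release smoke-test check: verify that a shipped
 *  script, such as solr/example/exampledocs/post.sh, carries the
 *  owner-execute bit. Hypothetical helper, not existing release tooling. */
public class SmokeCheck {
    /** Pure check on a rwxrwxrwx-style permission string, e.g. "rw-r--r--". */
    static boolean ownerCanExecute(String unixPerms) {
        // position 2 of "rwx......" is the owner execute flag
        return unixPerms.length() >= 3 && unixPerms.charAt(2) == 'x';
    }

    /** Convenience wrapper for a real file on a POSIX file system. */
    static boolean ownerCanExecute(Path script) throws Exception {
        Set<PosixFilePermission> perms = Files.getPosixFilePermissions(script);
        return perms.contains(PosixFilePermission.OWNER_EXECUTE);
    }
}
```

A smoke tester would run the check once against the extracted source tarball and once against the binary tarball, flagging any disagreement.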
[jira] [Commented] (SOLR-2617) Support git.
[ https://issues.apache.org/jira/browse/SOLR-2617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054302#comment-13054302 ] Stefan Trcek commented on SOLR-2617: For .gitignore, I preferred to generate it automatically, assuming the git repo is git-svn based; however, that didn't work. As a git mirror is sufficient for making patches, I suggest adding .gitignore to the repo, as this enables the use of a git mirror without git-svn. Support git. Key: SOLR-2617 URL: https://issues.apache.org/jira/browse/SOLR-2617 Project: Solr Issue Type: New Feature Components: Build Reporter: David Smiley Apache has git mirrors of Lucene/Solr, as well as many other projects. Presently, if git is used to check out Lucene/Solr, there are only a couple of small problems to address, but it otherwise works fine. * a .gitignore is needed. * empty directories need to be dealt with (git doesn't support them)
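For illustration, a minimal .gitignore of the kind being proposed would cover the ant build output and common IDE metadata. The entries below are assumptions for the sake of example, not the list actually committed for SOLR-2617:

```gitignore
# build output of the ant build (illustrative entries)
build/
dist/
*.class
# IDE metadata
.classpath
.project
.settings/
*.iml
```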
Re: [VOTE] release 3.3
On Fri, Jun 24, 2011 at 1:48 AM, Robert Muir rcm...@gmail.com wrote: Artifacts here: http://s.apache.org/lusolr33rc0 working release notes here: http://wiki.apache.org/lucene-java/ReleaseNote33 http://wiki.apache.org/solr/ReleaseNote33 Thanks for leading this release Robert. I compared SolrJ's pom.xml with past releases and I noticed that since v3.1.0, SolrJ has a dependency on lucene-core. I'm not sure why SolrJ needs that dependency. Also in v3.1, SolrJ had a test dependency on solr-test-framework but it was removed in v3.2. I have been missing from the action since Solr v1.4 so I'm not sure if those changes were intentional or mistakes. -- Regards, Shalin Shekhar Mangar.
Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 9031 - Failure
I fixed this test; I had inadvertently allowed nightly runs to give a large multiplier to its 'n', which is not just the number of docs but also controls the number of threads/readers used in this test... that's why it's been creating so many open files on Hudson lately. On Fri, Jun 24, 2011 at 3:39 AM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/9031/ 1 tests failed. REGRESSION: org.apache.lucene.index.TestIndexReaderReopen.testThreadSafety
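The fix described above amounts to decoupling what the multiplier scales: document counts may grow freely under nightly runs, but the thread/reader count must stay bounded or the OS runs out of file handles. A sketch of the idea (hypothetical base values and cap, not LuceneTestCase's actual code):

```java
/** Sketch of scaling test size under a nightly multiplier: docs scale
 *  linearly, but the number of threads/readers is capped so the test
 *  cannot exhaust OS file descriptors. Values are illustrative. */
public class NightlyScaling {
    static final int MAX_THREADS = 8; // assumed cap on concurrent readers

    static int numDocs(int base, int multiplier) {
        return base * multiplier; // more docs only costs time, not handles
    }

    static int numThreads(int base, int multiplier) {
        // each thread holds open index files, so cap this dimension
        return Math.min(base * multiplier, MAX_THREADS);
    }
}
```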
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054316#comment-13054316 ] Shalin Shekhar Mangar commented on SOLR-2610: - bq. But you might want to (in fact, I do this). If you are really done with a core, if you really want to remove it, what do you need the config files around for anymore? I was approaching this particular issue more from the angle of making it useful for SolrCloud. I can see how deleting configs can be useful to some people, but is it worth introducing such an inconsistency, i.e. you can delete config but cannot add it back? Anyway, it is best handled via a separate issue. Add an option to delete index through CoreAdmin UNLOAD action - Key: SOLR-2610 URL: https://issues.apache.org/jira/browse/SOLR-2610 Project: Solr Issue Type: Improvement Components: multicore Reporter: Shalin Shekhar Mangar Assignee: Shalin Shekhar Mangar Priority: Minor Fix For: 3.3, 4.0 Attachments: SOLR-2610-branch3x.patch, SOLR-2610.patch Right now, one can unload a Solr Core but the index files are left behind and consume disk space. We should have an option to delete the index when unloading a core.
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054320#comment-13054320 ] Shalin Shekhar Mangar commented on SOLR-2610: - {quote} I can think of a corollary core action I'd like to see – the ability on a core RELOAD to entirely delete the index from a core and replace it with a fresh empty index that will start building at segment _0. I would do this to my build core before using it, and later after swapping it with the live core and ensuring it's good, to free up disk space. {quote} Shawn, that is not a use-case for RELOAD. The idea behind it is to reload an existing core's index with updated configuration changes and swap it with the existing core without causing downtime. It seems like your use-case is handled well with the stock CREATE, SWAP and UNLOAD+deleteIndex?
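The UNLOAD+deleteIndex semantics under discussion can be sketched as follows. This is a hypothetical helper, not Solr's actual CoreAdminHandler code; it also makes visible the asymmetry raised above: only the index directory is reclaimed, while config files are never touched.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch of UNLOAD+deleteIndex: given a core's file listing, decide which
 *  paths should be removed on unload. Hypothetical helper for illustration. */
public class CoreUnload {
    static List<String> pathsToDelete(List<String> coreFiles, boolean deleteIndex) {
        List<String> doomed = new ArrayList<>();
        if (deleteIndex) {
            for (String f : coreFiles) {
                if (f.startsWith("data/index/")) {
                    doomed.add(f); // index files are removed to reclaim disk
                }
            }
        }
        return doomed; // conf/ is never touched -- the asymmetry noted above
    }
}
```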
[jira] [Commented] (LUCENE-3234) Provide limit on phrase analysis in FastVectorHighlighter
[ https://issues.apache.org/jira/browse/LUCENE-3234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054326#comment-13054326 ] Robert Muir commented on LUCENE-3234: - Oh I see, I think I'm nervous about testRepeatedTerms too. Maybe we can comment it out and just mention it's more of a benchmark? The problem could be that the test is timing-based... in general a machine could suddenly get busy at any time, especially since we run many tests in parallel, so I'm worried it could intermittently fail. Provide limit on phrase analysis in FastVectorHighlighter - Key: LUCENE-3234 URL: https://issues.apache.org/jira/browse/LUCENE-3234 Project: Lucene - Java Issue Type: Improvement Affects Versions: 2.9.4, 3.0.3, 3.1, 3.2, 3.3 Reporter: Mike Sokolov Assignee: Koji Sekiguchi Fix For: 3.4, 4.0 Attachments: LUCENE-3234.patch, LUCENE-3234.patch With larger documents, FVH can spend a lot of time trying to find the best-scoring snippet as it examines every possible phrase formed from matching terms in the document. If one is willing to accept less-than-perfect scoring by limiting the number of phrases that are examined, substantial speedups are possible. This is analogous to the Highlighter limit on the number of characters to analyze. The patch includes an artificial test case that shows a 1000x speedup. In a more normal test environment, with English documents and random queries, I am seeing speedups of around 3-10x when setting phraseLimit=1, which has the effect of selecting the first possible snippet in the document. Most of our sites operate in this way (just show the first snippet), so this would be a big win for us. With phraseLimit = -1, you get the existing FVH behavior. At larger values of phraseLimit, you may not get substantial speedup in the normal case, but you do get the benefit of protection against blow-up in pathological cases.
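The phraseLimit idea described in LUCENE-3234 above boils down to capping the candidate-phrase loop: stop scoring once a limit is hit, trading the best-scoring snippet for bounded work, with -1 keeping the exhaustive behavior. A sketch of that control flow (hypothetical code, not the actual FastVectorHighlighter patch):

```java
import java.util.List;

/** Sketch of phraseLimit: bound how many candidate phrases get examined.
 *  phraseLimit = -1 means unlimited (the pre-patch, exhaustive behavior). */
public class PhraseLimiter {
    static int countExamined(List<String> candidatePhrases, int phraseLimit) {
        int examined = 0;
        for (String phrase : candidatePhrases) {
            if (phraseLimit >= 0 && examined >= phraseLimit) {
                break; // accept a possibly worse snippet; cost stays bounded
            }
            examined++; // score this phrase here (scoring omitted)
        }
        return examined;
    }
}
```

With phraseLimit=1 this degenerates to taking the first possible snippet, which matches the 3-10x speedups reported above.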
Re: [VOTE] release 3.3
+1 I built PyLucene from the Lucene 3.3 sources, fixed a bug due to FieldComparator becoming generic, and all tests passed. Andi..
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054336#comment-13054336 ] Robert Muir commented on LUCENE-3235: - Mike, I installed 1.5.0_22 (amd64) on my linux machine, and I can't reproduce there either (I ran like 500 iterations). Maybe my hardware isn't concurrent enough? Or maybe you should un-overclock? :) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug Key: LUCENE-3235 URL: https://issues.apache.org/jira/browse/LUCENE-3235 Project: Lucene - Java Issue Type: Bug Reporter: Michael McCandless Not sure what's going on yet... but under Java 1.6 it seems not to hang, while under Java 1.5 it hangs fairly easily, on Linux. Java is 1.5.0_22. I suspect this is relevant: http://stackoverflow.com/questions/3292577/is-it-possible-for-concurrenthashmap-to-deadlock which refers to this JVM bug http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6865591 which then refers to this one http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6822370 It looks like that last bug was fixed in Java 1.6 but not 1.5.
[jira] [Commented] (LUCENE-3230) Make FSDirectory.fsync() public and static
[ https://issues.apache.org/jira/browse/LUCENE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054348#comment-13054348 ] Michael McCandless commented on LUCENE-3230: OK, I think we should close this issue and open another (to switch to syncing the actual IOs we opened)... this will be a challenge for Lucene though. Make FSDirectory.fsync() public and static -- Key: LUCENE-3230 URL: https://issues.apache.org/jira/browse/LUCENE-3230 Project: Lucene - Java Issue Type: New Feature Components: core/store Reporter: Shai Erera Assignee: Shai Erera Priority: Minor Fix For: 3.3, 4.0 I find FSDirectory.fsync() (today a protected instance method) very useful as a utility to sync() files. I'd like to create a FSDirectory.sync() utility which contains the exact same impl as FSDir.fsync(), and have the latter call it. We can have it part of IOUtils too, as it's a completely standalone utility. I would get rid of FSDir.fsync() if it wasn't protected (as if encouraging people to override it). I doubt anyone really overrides it (our core Directories don't). Also, while reviewing the code, I noticed that if an IOE occurs, the code sleeps for 5 msec. If an InterruptedException occurs then, it immediately throws ThreadIE, completely ignoring the fact that it slept due to the IOE. Shouldn't we at least pass IOE.getMessage() on to ThreadIE? The patch is trivial, so I'd like to get some feedback before I post it.
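The standalone sync utility proposed in the issue, including the point about not discarding the IOE's message on interruption, might look roughly like this. This is a sketch under assumptions (retry count, back-off) and not FSDirectory's actual implementation:

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Sketch of a standalone fsync utility: retry briefly on IOException, and
 *  if interrupted while backing off, preserve the IOE context rather than
 *  silently dropping it. Hypothetical code, not FSDirectory.fsync(). */
public class SyncUtil {
    public static void sync(Path file) throws IOException {
        IOException last = null;
        for (int attempt = 0; attempt < 5; attempt++) {
            try (FileChannel ch = FileChannel.open(file, StandardOpenOption.WRITE)) {
                ch.force(true); // flush file data and metadata to stable storage
                return;
            } catch (IOException ioe) {
                last = ioe;
                try {
                    Thread.sleep(5); // brief back-off before retrying
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    // keep the original IOE's message instead of discarding it
                    throw new IOException("interrupted while retrying sync: "
                            + ioe.getMessage(), ioe);
                }
            }
        }
        throw last; // all attempts failed
    }
}
```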
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054349#comment-13054349 ] Michael McCandless commented on LUCENE-3235: VERY interesting! Is anyone able to repro this hang besides me...?
[jira] [Commented] (LUCENE-3232) Move MutableValues to Common Module
[ https://issues.apache.org/jira/browse/LUCENE-3232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054350#comment-13054350 ] Michael McCandless commented on LUCENE-3232: OK, this sounds like a good plan... if we can get FQs factored out soonish then we can simply fix the grouping module to use that (ie, we don't need the common module to hold the ValueSource, etc.). I guess we keep the name common for now. Maybe as we slurp in more stuff from Solr I'll like the name better :) Move MutableValues to Common Module --- Key: LUCENE-3232 URL: https://issues.apache.org/jira/browse/LUCENE-3232 Project: Lucene - Java Issue Type: Sub-task Components: core/search Reporter: Chris Male Fix For: 4.0 Attachments: LUCENE-3232.patch, LUCENE-3232.patch Solr makes use of the MutableValue* series of classes to improve performance of grouping by FunctionQuery (I think). As such they are used in ValueSource implementations. Consequently we need to move these classes in order to move the ValueSources. As Yonik pointed out, these classes have use beyond just FunctionQuerys and might be used by both Solr and other modules. However I don't think they belong in Lucene core, since they aren't really related to search functionality. Therefore I think we should put them into a Common module, which can serve as a dependency to Solr and any module.
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054353#comment-13054353 ] Dawid Weiss commented on LUCENE-3235: - I don't think you can force -client if it's a 64-bit release and you have tons of memory, can you? You can check by running java -client -version -- this is what it tells me, for example:
{noformat}
dweiss@dweiss-linux:~/work/lucene/lucene-trunk$ java -client -version
java version "1.6.0_16"
Java(TM) SE Runtime Environment (build 1.6.0_16-b01)
Java HotSpot(TM) 64-Bit Server VM (build 14.2-b01, mixed mode)
{noformat}
Can you get a remote stack dump of the whole VM (or run it from the console and send it a signal to dump all threads)?
[jira] [Updated] (LUCENE-3225) Optimize TermsEnum.seek when caller doesn't need next term
[ https://issues.apache.org/jira/browse/LUCENE-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-3225: --- Attachment: LUCENE-3225.patch OK, new patch: I added a new seekExact method (instead of a new boolean to seek); renamed the existing seek methods to either seekCeil or seekExact; changed seekExact(long ord) to not return a value (it's an error to pass an out-of-bounds ord to this method). I think it's ready! Optimize TermsEnum.seek when caller doesn't need next term -- Key: LUCENE-3225 URL: https://issues.apache.org/jira/browse/LUCENE-3225 Project: Lucene - Java Issue Type: Improvement Reporter: Michael McCandless Assignee: Michael McCandless Fix For: 4.0 Attachments: LUCENE-3225.patch, LUCENE-3225.patch Some codecs are able to save CPU if the caller is only interested in exact matches. EG, the Memory codec and SimpleText can do a more efficient FSTEnum lookup if they know the caller doesn't need to know the term following the seek term. We have cases like this in Lucene, eg when IW deletes documents by Term: if the term is not found in a given segment then it doesn't need to know the ceiling term. Likewise when TermQuery looks up the term in each segment. I had done this change as part of LUCENE-3030, which is a new terms index that's able to save seeking for exact-only lookups, but now that we have the Memory codec that can also save CPU I think we should commit this today. The change adds a boolean onlyExact param to seek(BytesRef).
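The seekCeil/seekExact split above can be illustrated with a toy sorted term dictionary. This is a hypothetical stand-in for TermsEnum, not Lucene's API: the point is that an exact-only lookup answers found/not-found and can skip computing the following (ceiling) term when the target is absent.

```java
import java.util.Arrays;

/** Toy "term dictionary" over a sorted String[], illustrating why
 *  seekExact can be cheaper than seekCeil: no ceiling term is located. */
public class TermSeek {
    final String[] sortedTerms;

    TermSeek(String[] sortedTerms) {
        this.sortedTerms = sortedTerms;
    }

    /** Exact-only lookup: just found / not found. */
    boolean seekExact(String target) {
        return Arrays.binarySearch(sortedTerms, target) >= 0;
    }

    /** Ceiling lookup: the smallest term >= target, or null if none. */
    String seekCeil(String target) {
        int idx = Arrays.binarySearch(sortedTerms, target);
        if (idx < 0) idx = -idx - 1; // insertion point = ceiling position
        return idx < sortedTerms.length ? sortedTerms[idx] : null;
    }
}
```

Callers like deletes-by-Term, which only care whether the term exists in a segment, would use the exact-only path.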
[jira] [Updated] (LUCENE-3233) HuperDuperSynonymsFilter™
[ https://issues.apache.org/jira/browse/LUCENE-3233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-3233: --- Attachment: LUCENE-3223.patch Dumping my current state on FSTSynonymFilter -- it compiles but it's got tons of bugs I'm sure! I added a trivial initial test. HuperDuperSynonymsFilter™ - Key: LUCENE-3233 URL: https://issues.apache.org/jira/browse/LUCENE-3233 Project: Lucene - Java Issue Type: Improvement Reporter: Robert Muir Attachments: LUCENE-3223.patch, LUCENE-3233.patch The current synonymsfilter uses a lot of ram and cpu, especially at build time. I think yesterday I heard about huge synonyms files three times. So, I think we should use an FST-based structure, sharing the inputs and outputs. And we should be more efficient with the tokenStream api, e.g. using save/restoreState instead of cloneAttributes()
[jira] [Updated] (LUCENE-2793) Directory createOutput and openInput should take an IOContext
[ https://issues.apache.org/jira/browse/LUCENE-2793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2793: --- Attachment: LUCENE-2793-nrt.patch Patch, fixing NRTCachingDir to no longer have anything to do with the merge scheduler (yay!). Directory createOutput and openInput should take an IOContext - Key: LUCENE-2793 URL: https://issues.apache.org/jira/browse/LUCENE-2793 Project: Lucene - Java Issue Type: Improvement Components: core/store Reporter: Michael McCandless Assignee: Varun Thacker Labels: gsoc2011, lucene-gsoc-11, mentor Attachments: LUCENE-2793-nrt.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch, LUCENE-2793.patch Today for merging we pass down a larger readBufferSize than for searching because we get better performance. I think we should generalize this to a class (IOContext), which would hold the buffer size, but then could hold other flags like DIRECT (bypass OS's buffer cache), SEQUENTIAL, etc. Then, we can make the DirectIOLinuxDirectory fully usable because we would only use DIRECT/SEQUENTIAL during merging. This will require fixing how IW pools readers, so that a reader opened for merging is not then used for searching, and vice/versa. Really, it's only all the open file handles that need to be different -- we could in theory share del docs, norms, etc, if that were somehow possible.
[jira] [Commented] (LUCENE-2793) Directory createOutput and openInput should take an IOContext
[ https://issues.apache.org/jira/browse/LUCENE-2793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054359#comment-13054359 ] Michael McCandless commented on LUCENE-2793: I took a quick look @ the branch -- it's looking good! Some small stuff:
* Should IOContext and MergeInfo be in oal.store, not .index?
* I think SegmentMerger should receive an IOCtx from its caller, and then pass that to all the IO ops it invokes? But the code has a nocommit about tripping an assert -- which one?
* I think on flush the IOContext should include num docs and estimated segment size (we can roughly pull this from RAM used for the segment), but we should include a comment that this is only approx.
* Somehow, lucene/contrib/demo/data is deleted on the branch. We should check if anything else is missing!
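The IOContext being shaped on the branch can be pictured as a small value class: a hint passed to createOutput/openInput describing why the I/O is happening, so a Directory can pick buffer sizes (or direct I/O) per use. Purely illustrative; the names and buffer sizes below are assumptions, not the branch's actual API:

```java
/** Sketch of an IOContext-like hint for Directory.createOutput/openInput:
 *  merges read sequentially and benefit from big buffers, searches do small
 *  random reads. Illustrative values, not Lucene's real IOContext. */
public class IOHint {
    enum Context { MERGE, FLUSH, READ }

    final Context context;
    final int bufferSize;

    IOHint(Context context, int bufferSize) {
        this.context = context;
        this.bufferSize = bufferSize;
    }

    /** Merges stream through whole files; a large buffer pays off. */
    static IOHint forMerge() { return new IOHint(Context.MERGE, 1 << 16); }

    /** Searches do small random reads; keep the buffer modest. */
    static IOHint forRead() { return new IOHint(Context.READ, 1 << 10); }
}
```

The reader-pooling fix mentioned in the description follows from this: a reader opened with a merge hint must not be handed out for searching, since the underlying file handles are configured differently.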
[jira] [Commented] (LUCENE-3225) Optimize TermsEnum.seek when caller doesn't need next term
[ https://issues.apache.org/jira/browse/LUCENE-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054360#comment-13054360 ] Dawid Weiss commented on LUCENE-3225: - I like this one better. boolean args are cryptic (even if I do use them from time to time).
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054363#comment-13054363 ] Michael McCandless commented on LUCENE-3235: Indeed, java -client -version shows it's still using the server VM -- you're right!
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054362#comment-13054362 ] Michael McCandless commented on LUCENE-3235: Yes, the stack looks just like the stack overflow link I posted -- several threads stuck in sun.misc.Unsafe.park ;) java -Xint definitely does not hang... it ran for like 4200 iterations.
[jira] [Commented] (LUCENE-3225) Optimize TermsEnum.seek when caller doesn't need next term
[ https://issues.apache.org/jira/browse/LUCENE-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054364#comment-13054364 ] Simon Willnauer commented on LUCENE-3225: - Looks good, +1 to commit! Thanks for working on that.
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054366#comment-13054366 ] Dawid Weiss commented on LUCENE-3235: I'm same as Robert: +1 to drop 1.5...
[jira] [Resolved] (LUCENE-3230) Make FSDirectory.fsync() public and static
[ https://issues.apache.org/jira/browse/LUCENE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera resolved LUCENE-3230. Resolution: Won't Fix

I opened LUCENE-3237 to improve how fsync works. After we move sync() to IndexOutput, a public static sync() API won't make much sense.

Make FSDirectory.fsync() public and static
Key: LUCENE-3230 URL: https://issues.apache.org/jira/browse/LUCENE-3230 Project: Lucene - Java Issue Type: New Feature Components: core/store Reporter: Shai Erera Assignee: Shai Erera Priority: Minor Fix For: 3.3, 4.0

I find FSDirectory.fsync() (today a protected instance method) very useful as a utility to sync() files. I'd like to create an FSDirectory.sync() utility which contains the exact same impl as FSDir.fsync(), and have the latter call it. We could make it part of IOUtils too, as it's a completely standalone utility. I would get rid of FSDir.fsync() if it weren't protected (as if encouraging people to override it). I doubt anyone really overrides it (our core Directories don't). Also, while reviewing the code, I noticed that if an IOE occurs, the code sleeps for 5 msec. If an InterruptedException occurs then, it immediately throws ThreadIE, completely ignoring the fact that it slept due to the IOE. Shouldn't we at least pass IOE.getMessage() on to the ThreadIE? The patch is trivial, so I'd like to get some feedback before I post it.
[jira] [Created] (LUCENE-3237) FSDirectory.fsync() may not work properly
FSDirectory.fsync() may not work properly
Key: LUCENE-3237 URL: https://issues.apache.org/jira/browse/LUCENE-3237 Project: Lucene - Java Issue Type: Bug Components: core/store Reporter: Shai Erera Fix For: 3.4, 4.0

Spinoff from LUCENE-3230. FSDirectory.fsync() opens a new RAF, syncs its FileDescriptor, and closes the RAF. It is not clear that this syncs whatever was written to the file by other FileDescriptors. It would be better if we performed this operation on the actual RAF/FileOS which wrote the data. We can add sync() to IndexOutput, and FSIndexOutput will do that. Directory-wise, we should stop syncing on file names and instead sync on the IOs that performed the write operations.
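The proposed direction can be sketched as follows -- a hypothetical IndexOutput-like wrapper for illustration, not the actual patch: keep the descriptor that performed the writes and fsync through it, rather than reopening the file by name and syncing a fresh descriptor.

```java
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;

/** Sketch: an output that can durably sync its own writes, because sync()
 *  goes through the same FileDescriptor that wrote the bytes. */
class SyncingOutput implements AutoCloseable {
    private final FileOutputStream out;

    SyncingOutput(File f) throws IOException { out = new FileOutputStream(f); }

    void write(byte[] b) throws IOException { out.write(b); }

    void sync() throws IOException {
        out.flush();          // push any buffered bytes down to the OS
        out.getFD().sync();   // fsync the descriptor that performed the writes
    }

    @Override public void close() throws IOException { out.close(); }
}
```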
[JENKINS-MAVEN] Lucene-Solr-Maven-3.x #161: POMs out of sync
Build: https://builds.apache.org/job/Lucene-Solr-Maven-3.x/161/

1 test failed. REGRESSION: org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion

Error Message: Invalid version: 3.3-SNAPSHOT

Stack Trace:
java.lang.AssertionError: Invalid version: 3.3-SNAPSHOT
    at org.junit.Assert.fail(Assert.java:91)
    at org.junit.Assert.assertTrue(Assert.java:43)
    at org.apache.lucene.index.TestCheckIndex.testLuceneConstantVersion(TestCheckIndex.java:98)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:616)
    at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:44)
    at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
    at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:41)
    at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
    at org.junit.rules.TestWatchman$1.evaluate(TestWatchman.java:48)
    at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
    at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
    at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:76)
    at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1272)
    at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1190)
    at org.junit.runners.ParentRunner$3.run(ParentRunner.java:193)
    at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:52)
    at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:191)
    at org.junit.runners.ParentRunner.access$000(ParentRunner.java:42)
    at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:184)
    at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:28)
    at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:31)
    at org.junit.runners.ParentRunner.run(ParentRunner.java:236)
    at org.apache.maven.surefire.junit4.JUnit4TestSet.execute(JUnit4TestSet.java:35)
    at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:146)
    at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:97)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:616)
    at org.apache.maven.surefire.booter.ProviderFactory$ClassLoaderProxy.invoke(ProviderFactory.java:103)
    at $Proxy0.invoke(Unknown Source)
    at org.apache.maven.surefire.booter.SurefireStarter.invokeProvider(SurefireStarter.java:145)
    at org.apache.maven.surefire.booter.SurefireStarter.runSuitesInProcess(SurefireStarter.java:87)
    at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:69)

Build Log (for compile errors): [...truncated 15489 lines...]
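For context, this failure is expected under the Maven build: judging by the test name and message, the test asserts that the Lucene version constant looks like a release version, and Maven injects a "-SNAPSHOT" suffix. A hedged sketch of such a check (the actual pattern in TestCheckIndex may differ):

```java
import java.util.regex.Pattern;

/** Sketch of a release-version check: "X.Y" or "X.Y.Z" passes,
 *  "3.3-SNAPSHOT" does not. */
class VersionCheck {
    private static final Pattern RELEASE = Pattern.compile("\\d+\\.\\d+(\\.\\d+)?");

    static boolean isReleaseVersion(String v) {
        return RELEASE.matcher(v).matches();
    }
}
```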
[jira] [Updated] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3226: Attachment: LUCENE-3226.patch

The patch improves the CheckIndex output (it includes information about the oldest/newest segments). Along the way I fixed a bug in StringHelper.versionComparator (it could overflow if Integer.MIN/MAX_VALUE were used). The changes to TestDemo won't be committed; I just included them here so you can run the test and check the output.

rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
Key: LUCENE-3226 URL: https://issues.apache.org/jira/browse/LUCENE-3226 Project: Lucene - Java Issue Type: Improvement Affects Versions: 3.1, 3.2 Reporter: Hoss Man Fix For: 3.3, 4.0 Attachments: LUCENE-3226.patch, LUCENE-3226.patch

A 3.2 user recently asked if something was wrong because CheckIndex was reporting his (newly built) index version as...

{noformat} Segments file=segments_or numSegments=1 version=FORMAT_3_1 [Lucene 3.1] {noformat}

It seems like there are two very confusing pieces of information here...

1) the variable name SegmentInfos.FORMAT_3_1 seems like a poor choice. All other FORMAT_* constants in SegmentInfos are descriptive of the actual change made, and not specific to the version in which they were introduced.

2) whatever the name of the FORMAT_* variable, CheckIndex is labeling it "Lucene 3.1", which is misleading since that format is always used in 3.2 (and probably 3.3, etc...).

I suggest:

a) rename FORMAT_3_1 to something like FORMAT_SEGMENT_RECORDS_VERSION

b) change CheckIndex so that the label for the newest format always ends with "and later" (ie: "Lucene 3.1 and later"), so when we release versions w/o a format change we don't have to remember to manually list them in CheckIndex. When we *do* make format changes and update CheckIndex, "and later" can be replaced with "to X.Y" and the new format can be added.
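The versionComparator overflow mentioned in the patch note is the classic subtraction idiom: returning `a - b` from a comparator overflows when the operands are near Integer.MIN/MAX_VALUE. A sketch of the safe form -- the parsing details here are assumed, not taken from the patch:

```java
import java.util.Comparator;

/** Compares dotted version strings ("3.1" vs "3.10") numerically per part.
 *  Uses Integer.compare instead of subtraction, which cannot overflow. */
class SafeVersionComparator implements Comparator<String> {
    @Override public int compare(String a, String b) {
        String[] pa = a.split("\\."), pb = b.split("\\.");
        for (int i = 0; i < Math.max(pa.length, pb.length); i++) {
            int va = i < pa.length ? Integer.parseInt(pa[i]) : 0;
            int vb = i < pb.length ? Integer.parseInt(pb[i]) : 0;
            int cmp = Integer.compare(va, vb); // safe even for MIN/MAX_VALUE
            if (cmp != 0) return cmp;
        }
        return 0;
    }
}
```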
[jira] [Commented] (LUCENE-3232) Move MutableValues to Common Module
[ https://issues.apache.org/jira/browse/LUCENE-3232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054395#comment-13054395 ] Chris Male commented on LUCENE-3232:

bq. if we can get FQs factored out soonish

This is the last issue preventing me from doing just that :)

bq. I guess we keep the name common for now.

Awesome. I find it fairly common (ha) in projects to have a common module. If it doesn't pan out, then we can either rename it or slurp it into another module.

Move MutableValues to Common Module
Key: LUCENE-3232 URL: https://issues.apache.org/jira/browse/LUCENE-3232 Project: Lucene - Java Issue Type: Sub-task Components: core/search Reporter: Chris Male Fix For: 4.0 Attachments: LUCENE-3232.patch, LUCENE-3232.patch

Solr makes use of the MutableValue* series of classes to improve the performance of grouping by FunctionQuery (I think). As such they are used in ValueSource implementations. Consequently we need to move these classes in order to move the ValueSources. As Yonik pointed out, these classes have use beyond just FunctionQuerys and might be used by both Solr and other modules. However I don't think they belong in Lucene core, since they aren't really related to search functionality. Therefore I think we should put them into a Common module, which can serve as a dependency for Solr and any module.
[jira] [Commented] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054397#comment-13054397 ] Michael McCandless commented on LUCENE-3226: Patch looks good -- I like how CheckIndex now tells you version range of your segments.
[jira] [Updated] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3226: Attachment: LUCENE-3226.patch Changed the message format a bit (Thanks Robert for the feedback). Now it prints 'version=x.y' if all segments are on the same version, or 'versions=[a.b .. c.d]' if there is more than one version. I plan to commit this.
[jira] [Commented] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054411#comment-13054411 ] Robert Muir commented on LUCENE-3226: +1
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054415#comment-13054415 ] Mark Miller commented on SOLR-2610:

bq. I was approaching this particular issue more from the angle of making it useful for SolrCloud.

Where do you mention how this helps with SolrCloud?

bq. I can see how deleting configs can be useful to some people but is it worth introducing such an inconsistency i.e. you can delete config but cannot add it back? Anyways, it is best handled via a separate issue.

Why are you deleting cores only to add them back again with the same config? Do you really think it's inconsistent to actually be able to delete something? Does it really seem like a weird use case to say, "I want to delete a SolrCore I no longer have an interest in"? Looks like a few people have an interest in this issue, so I'm not sure why you rammed it in so quickly.

Add an option to delete index through CoreAdmin UNLOAD action
Key: SOLR-2610 URL: https://issues.apache.org/jira/browse/SOLR-2610 Project: Solr Issue Type: Improvement Components: multicore Reporter: Shalin Shekhar Mangar Assignee: Shalin Shekhar Mangar Priority: Minor Fix For: 3.3, 4.0 Attachments: SOLR-2610-branch3x.patch, SOLR-2610.patch

Right now, one can unload a Solr Core but the index files are left behind and consume disk space. We should have an option to delete the index when unloading a core.
[jira] [Commented] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054413#comment-13054413 ] Robert Muir commented on LUCENE-3226: Let's backport this to 3.3? A few issues have been found/fixed already, so I don't mind respinning with this one too, since I think it will eliminate confusion.
[jira] [Updated] (LUCENE-2308) Separately specify a field's type
[ https://issues.apache.org/jira/browse/LUCENE-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikola Tankovic updated LUCENE-2308: Attachment: LUCENE-2308-3.patch

Patch: copied oal.doc to oal.doc2 with the new FieldType changes and modified the TestDocument unit test, but I still have some failures. I think it's because references to oal.doc should change to oal.doc2, but I don't know exactly where, so I need some help. I changed the imports in IndexSearcher and IndexReader only.

Separately specify a field's type
Key: LUCENE-2308 URL: https://issues.apache.org/jira/browse/LUCENE-2308 Project: Lucene - Java Issue Type: Improvement Components: core/index Reporter: Michael McCandless Assignee: Michael McCandless Labels: gsoc2011, lucene-gsoc-11, mentor Fix For: 4.0 Attachments: LUCENE-2308-2.patch, LUCENE-2308-3.patch, LUCENE-2308.patch, LUCENE-2308.patch

This came up from discussions on IRC. I'm summarizing here... Today when you make a Field to add to a document you can set things like indexed or not, stored or not, analyzed or not, and details like omitTfAP, omitNorms, index term vectors (separately controlling offsets/positions), etc. I think we should factor these out into a new class (FieldType?). Then you could re-use this FieldType instance across multiple fields. The Field instance would still hold the actual value. We could then do per-field analyzers by adding a setAnalyzer on the FieldType, instead of the separate PerFieldAnalyzerWrapper (likewise for per-field codecs (with flex), where we now have PerFieldCodecWrapper). This would NOT be a schema! It's just refactoring what we already specify today. EG it's not serialized into the index. This has been discussed before, and I know Michael Busch opened a more ambitious (I think?) issue. I think this is a good first baby step. We could consider a hierarchy of FieldType (NumericFieldType, etc.) but maybe hold off on that for starters...
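The refactoring described above can be sketched in a few lines. The names are assumed for illustration -- this is the shape of the proposal, not the attached patch: indexing options move into an immutable, reusable FieldType, and each Field holds only its name, value, and a reference to a shared type.

```java
/** Immutable bundle of per-field indexing options, shareable across fields. */
final class FieldType {
    final boolean indexed, stored, tokenized;

    FieldType(boolean indexed, boolean stored, boolean tokenized) {
        this.indexed = indexed;
        this.stored = stored;
        this.tokenized = tokenized;
    }
}

/** A field keeps only its own name and value; the options live in the type. */
final class Field {
    final String name, value;
    final FieldType type;

    Field(String name, String value, FieldType type) {
        this.name = name;
        this.value = value;
        this.type = type;
    }
}
```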
[jira] [Updated] (LUCENE-3203) Rate-limit IO used by merging
[ https://issues.apache.org/jira/browse/LUCENE-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-3203: --- Attachment: LUCENE-3203.patch New patch, applies to the IOContext branch. I think it's committable! It adds set/getMaxMergeWriteMBPerSec methods to FSDirectory. Rate-limit IO used by merging - Key: LUCENE-3203 URL: https://issues.apache.org/jira/browse/LUCENE-3203 Project: Lucene - Java Issue Type: Improvement Components: core/store Reporter: Michael McCandless Assignee: Michael McCandless Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3203.patch, LUCENE-3203.patch Large merges can mess up searches and increase NRT reopen time (see http://blog.mikemccandless.com/2011/06/lucenes-near-real-time-search-is-fast.html). A simple rate limiter improves the spikey NRT reopen times during big merges, so I think we should somehow make this possible. Likely this would reduce impact on searches as well. Typically apps that do indexing and searching on same box are in no rush to see the merges complete so this is a good tradeoff. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
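The mechanism can be sketched as a simple pacing calculation. This is an assumed design for illustration, not the attached patch: after each write, compute how far ahead of the target rate the writer is and pause for that long. Making the clock a parameter keeps the sketch deterministic.

```java
/** Paces a writer to a target MB/sec. The caller passes the current time so
 *  the logic is testable without sleeping; a real limiter would call
 *  System.nanoTime() and Thread.sleep() around this calculation. */
class SimpleRateLimiter {
    private final double bytesPerNs;
    private long startNs = -1;
    private long bytesWritten;

    SimpleRateLimiter(double mbPerSec) {
        bytesPerNs = mbPerSec * 1024 * 1024 / 1e9;
    }

    /** Returns how many ns to pause after writing `bytes` at time `nowNs`. */
    long pauseNs(long bytes, long nowNs) {
        if (startNs == -1) startNs = nowNs;
        bytesWritten += bytes;
        long targetNs = startNs + (long) (bytesWritten / bytesPerNs);
        return Math.max(0, targetNs - nowNs);
    }
}
```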
[jira] [Created] (SOLR-2621) Consider adding an option to remove all remnants of a SolrCore when unloading it.
Consider adding an option to remove all remnants of a SolrCore when unloading it. - Key: SOLR-2621 URL: https://issues.apache.org/jira/browse/SOLR-2621 Project: Solr Issue Type: New Feature Components: multicore Reporter: Mark Miller Priority: Minor Fix For: 4.0 We can use the new postClose hook from SOLR-2610 -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3235) TestDoubleBarrelLRUCache hangs under Java 1.5, 3.x and trunk, likely JVM bug
[ https://issues.apache.org/jira/browse/LUCENE-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054433#comment-13054433 ] Mark Miller commented on LUCENE-3235: bq. +1 to drop 1.5... +1.
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054443#comment-13054443 ] Shalin Shekhar Mangar commented on SOLR-2610:

bq. where do you mention how this helps with SolrCloud?

I didn't, and I'm sorry about that. I was just trying to tell you my perspective. These are small pieces that need to be fixed before tackling larger problems in SolrCloud, and this one seemed generally useful and simple enough by itself that I opened the issue without giving the bigger picture. Some of the other pieces are captured in SOLR-2595.

bq. Why are you deleting cores only to add them back again with the same config?

Hopefully SOLR-2595 will give you a better idea of what I was thinking. The use-case is to split and migrate pieces of an index, and this issue will help in deleting the leftover temporary cores.

bq. Do you really think it's inconsistent to actually be able to delete something?

The inconsistency is to be able to delete a configuration file when there is no way to add it back, but I'm not against the feature in general.

bq. Does it really seem like a weird use case to say, I want to delete a SolrCore I no longer have an interest in?

Absolutely not. If you want that feature, that's fine. You don't need permissions to put up a patch and commit it :)

bq. Looks like a few people have an interest in this issue, so I'm not sure why you rammed it in so quickly.

The issue clearly talks about deleting the index on unload, and that's what it does. And I got a +1 from you and Jason on the topic of the issue (or at least, that's what I assumed). I waited a day to commit -- would you like me to wait longer for future issues, or leave a comment to that effect? If the patch is not what you intended, go ahead and reopen/extend the scope of the issue, or open another issue.
[jira] [Resolved] (LUCENE-3226) rename SegmentInfos.FORMAT_3_1 and improve description in CheckIndex
[ https://issues.apache.org/jira/browse/LUCENE-3226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera resolved LUCENE-3226. Resolution: Fixed Fix Version/s: 3.4 Assignee: Shai Erera Lucene Fields: [New, Patch Available] (was: [New]) Committed revision 1139284 (trunk). Committed revision 1139286 (3x). Committed revision 1139300 (3.3). Thanks Robert and Mike for the review!
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054450#comment-13054450 ] Mark Miller commented on SOLR-2610: --- bq. And I got a +1 from you and Jason on the topic of the issue (or at least, that's what I assumed). I waited a day to commit - would you like me to wait longer for future issues or leave a comment to that effect? No, I think a day is fine - just a warning, perhaps? Both Jason and I liked the idea, but it just seemed like we were discussing some of the details and you committed kind of without warning. I'm not that concerned about it, just mentioning it. bq. If the patch is not what you intended, go ahead and reopen/extend the scope of the issue or open another issue. I think the patch is fine - I've tweaked a couple of little things on the changes entry, but the patch itself looks good so far. I opened SOLR-2621 to continue the other 'delete options' discussion. Add an option to delete index through CoreAdmin UNLOAD action - Key: SOLR-2610 URL: https://issues.apache.org/jira/browse/SOLR-2610 Project: Solr Issue Type: Improvement Components: multicore Reporter: Shalin Shekhar Mangar Assignee: Shalin Shekhar Mangar Priority: Minor Fix For: 3.3, 4.0 Attachments: SOLR-2610-branch3x.patch, SOLR-2610.patch Right now, one can unload a Solr Core but the index files are left behind and consume disk space. We should have an option to delete the index when unloading a core. -- This message is automatically generated by JIRA.
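As a usage sketch of the feature SOLR-2610 adds, a CoreAdmin UNLOAD call that also removes the index might look like this (the exact parameter name, `deleteIndex`, is assumed from the patch discussion, and the host, port, and core name are placeholders):

```shell
# Hypothetical example: unload core "core0" and delete its index files in
# one request, instead of leaving them behind on disk.
curl 'http://localhost:8983/solr/admin/cores?action=UNLOAD&core=core0&deleteIndex=true'
```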
[jira] [Created] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
SpanMultiTermQueryWrapper with Prefix Query issue - Key: LUCENE-3238 URL: https://issues.apache.org/jira/browse/LUCENE-3238 Project: Lucene - Java Issue Type: Bug Components: core/search Affects Versions: 3.3 Environment: Windows 7, JDK 1.6 Reporter: ludovic Boutros If we try to do a search with a SpanQuery and a PrefixQuery, this message is returned: "You can only use SpanMultiTermQueryWrapper with a suitable SpanRewriteMethod." The problem is in the WildcardQuery rewrite function. If the wildcard query is a prefix, a new prefix query is created, the rewrite method is set with the SpanRewriteMethod, and the prefix query is returned. But it's the rewritten prefix query which should be returned: - return rewritten; + return rewritten.rewrite(reader); I will attach a patch with a unit test included. -- This message is automatically generated by JIRA.
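The one-line fix in the report can be illustrated with a self-contained analogue (hypothetical class names, not the actual Lucene sources): returning the freshly built query loses the effect of the span rewrite method that was just configured on it, while returning the result of rewriting it yields the span-compatible query the wrapper expects.

```java
// Minimal toy model of the bug: WildcardQuery.rewrite() builds a
// PrefixQuery, sets the span rewrite method on it, but the buggy version
// returned the PrefixQuery itself instead of its rewritten form.
abstract class Query {
    abstract Query rewrite(String reader);
}

class SpanQuery extends Query {
    Query rewrite(String reader) { return this; }
}

class PrefixQuery extends Query {
    String rewriteMethod = "CONSTANT_SCORE";
    Query rewrite(String reader) {
        // With the span rewrite method set, rewriting produces a SpanQuery,
        // which is what SpanMultiTermQueryWrapper requires.
        return "SPAN".equals(rewriteMethod) ? new SpanQuery() : this;
    }
}

class WildcardQuery extends Query {
    Query rewrite(String reader) {
        PrefixQuery rewritten = new PrefixQuery();
        rewritten.rewriteMethod = "SPAN";
        // Buggy version: `return rewritten;` -- still a PrefixQuery.
        // The fix applies the configured rewrite method before returning:
        return rewritten.rewrite(reader);
    }
}

public class RewriteFix {
    public static void main(String[] args) {
        Query q = new WildcardQuery().rewrite("reader");
        System.out.println(q instanceof SpanQuery); // true after the fix
    }
}
```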
[jira] [Updated] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ludovic Boutros updated LUCENE-3238: Attachment: LUCENE-3238.patch Here is the patch for the branch 3x. -- This message is automatically generated by JIRA.
[jira] [Assigned] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned LUCENE-3238: --- Assignee: Robert Muir -- This message is automatically generated by JIRA.
[jira] [Updated] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3238: Attachment: LUCENE-3238.patch Hi, definitely a bug, thank you! In my opinion, WildcardQuery should not try to override MultiTermQuery's rewrite here; it causes too many problems. Instead, in this case it should just return a PrefixTermEnum... this is the way we handle these things in trunk, and I think we should fix it here the same way. -- This message is automatically generated by JIRA.
[jira] [Commented] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054465#comment-13054465 ] Uwe Schindler commented on LUCENE-3238: --- The fix is fine, but in my opinion the problem should be solved differently. I would like to make the rewrite method in MultiTermQuery final to prevent overriding. To correctly fix the issue, WildcardQuery only needs to return a PrefixTermEnum in its getEnum method. This is already fixed in Lucene 4.0. From looking at the code, SpanMultiTermQueryWrapper would not work correctly in all cases if the underlying query overrides rewrite(), as the rewritten query would again have the wrong type. -- This message is automatically generated by JIRA.
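The design Uwe describes, keep rewrite() final in the base class and let subclasses customize only the term enumeration, can be sketched in a standalone toy model (hypothetical names and string stand-ins, not the real Lucene API): a wrapper that swaps the rewrite method then works for every subclass, because no subclass can bypass it.

```java
// Toy sketch: rewrite() is final, so the configured rewrite method is
// always honored; subclasses only choose which term enum to enumerate.
abstract class MultiTermQuery {
    String rewriteMethod = "CONSTANT_SCORE";
    // Final: subclasses cannot override rewriting and return the wrong type.
    final String rewrite() { return rewriteMethod + "(" + getEnum() + ")"; }
    abstract String getEnum();
}

class PrefixQuery extends MultiTermQuery {
    String getEnum() { return "PrefixTermEnum"; }
}

class WildcardQuery extends MultiTermQuery {
    boolean isPrefix = true;
    // Instead of overriding rewrite(), just return the cheaper enum when
    // the pattern is a plain prefix.
    String getEnum() { return isPrefix ? "PrefixTermEnum" : "WildcardTermEnum"; }
}

public class FinalRewrite {
    public static void main(String[] args) {
        WildcardQuery q = new WildcardQuery();
        q.rewriteMethod = "SPAN"; // as a span wrapper would set it
        System.out.println(q.rewrite()); // SPAN(PrefixTermEnum)
    }
}
```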
[jira] [Commented] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054466#comment-13054466 ] Uwe Schindler commented on LUCENE-3238: --- Patch is fine! Funny overlap, we both responded with the same answer :-) -- This message is automatically generated by JIRA.
[jira] [Updated] (SOLR-2618) Indexing and search on more then one type (Mapping)
[ https://issues.apache.org/jira/browse/SOLR-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Monica Storfjord updated SOLR-2618: --- Description: It would be very beneficial for a project that I am currently working on to have the ability to index and search on various subclasses of an object and map the objects directly to the actual domain object. This functionality exists in Hibernate Search, for instance. Is this something that future releases have in mind? I would think this is something that will make Solr more valuable to a lot of users. We are testing SolrJ 3.2 with the use of the SolrJ client and the web interface to index, change and search. It should be possible to make a solution that maps against a special type field (like field name=classtype type=class) in schema.xml that is indexed every time and uses reflection against the actual class? - Monica was: It would be very beneficial for a project that I am currently working on to have the ability to index and search on various subclasses of an object and map the objects directly to the actual domain object. We are planning to do an implementation of this feature, but if there is a Solr plugin or something that introduces this feature already, it will reduce the development time for us greatly! We are using SolrJ against an Apache Solr 3.2 instance to index, change and search. It should be possible to make a solution that maps against a special type field (field name=classtype type=class) in schema.xml that is indexed every time and uses reflection against the actual class? 
- Monica Summary: Indexing and search on more then one type (Mapping) (was: Indexing and search on more then one object) Indexing and search on more then one type (Mapping) --- Key: SOLR-2618 URL: https://issues.apache.org/jira/browse/SOLR-2618 Project: Solr Issue Type: Improvement Components: clients - java Affects Versions: 3.2 Reporter: Monica Storfjord Priority: Minor -- This message is automatically generated by JIRA.
Re: [VOTE] release 3.3
This might be the cause of the test failures in the skiplist (I will investigate!). In general, not all tests are guaranteed to work correctly with tests.iter > 1; some tests have bugs! On Fri, Jun 24, 2011 at 10:45 AM, Uwe Schindler u...@thetaphi.de wrote: I forgot to mention, all tests of core were running 95 minutes using -Dtests.multiplier=100 and -Dtests.iter=100 ! - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Uwe Schindler [mailto:u...@thetaphi.de] Sent: Friday, June 24, 2011 4:44 PM To: dev@lucene.apache.org Subject: RE: [VOTE] release 3.3 iter Hi, I ran some smoke tests yesterday on the Lucene 2.9/3.0 release machine I used. The machine runs Java 1.5.0_22, Solaris x64, Opteron 16 cores. One of the tests was already fixed by Robert (I told him yesterday because it was always failing). The others are maybe serious, maybe not: [junit] Testsuite: org.apache.lucene.index.TestMultiLevelSkipList [junit] Testcase: testSimpleSkip(org.apache.lucene.index.TestMultiLevelSkipList): FAILED [junit] Wrong payload for the target 14: -106 expected:14 but was:-106 [junit] junit.framework.AssertionFailedError: Wrong payload for the target 14: -106 expected:14 but was:-106 [junit] at org.apache.lucene.index.TestMultiLevelSkipList.checkSkipTo(TestMultiLevelSkipList.java:87) [junit] at org.apache.lucene.index.TestMultiLevelSkipList.testSimpleSkip(TestMultiLevelSkipList.java:66) [junit] at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1272) [junit] at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1190) ... (100 repetitions of the same stack trace) and then about 100 times with different seeds: [junit] NOTE: reproduce with: ant test -Dtestcase=TestMultiLevelSkipList -Dtestmethod=testSimpleSkip -Dtests.seed=2861480580591035682:880958701285368932 But it's not reproducible, so maybe it's only the repetition causing this! 
This one is very serious and easy to reproduce with every printed seed: [junit] Testsuite: org.apache.lucene.util.TestOpenBitSet [junit] Testcase: testSmall(org.apache.lucene.util.TestOpenBitSet): Caused an ERROR [junit] -1 [junit] java.lang.ArrayIndexOutOfBoundsException: -1 [junit] at org.apache.lucene.util.OpenBitSet.prevSetBit(OpenBitSet.java:671) [junit] at org.apache.lucene.util.TestOpenBitSet.doPrevSetBit(TestOpenBitSet.java:53) [junit] at org.apache.lucene.util.TestOpenBitSet.doRandomSets(TestOpenBitSet.java:148) [junit] at org.apache.lucene.util.TestOpenBitSet.testSmall(TestOpenBitSet.java:192) [junit] at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1272) [junit] at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:1190) [junit] [junit] [junit] Testcase: testSmall(org.apache.lucene.util.TestOpenBitSet): Caused an ERROR [junit] (null) [junit] java.lang.ArrayIndexOutOfBoundsException [junit] [junit] ... followed many more times by the (null) AIOOBE message... [junit] NOTE: reproduce with: ant test -Dtestcase=TestOpenBitSet -Dtestmethod=testSmall -Dtests.seed=-4526826707499307278:4139930264431857886 Again with different seeds. Not all 100 repetitions fail, but the seeds mentioned fail reproducibly. The good news: the PANGAEA index works fine, no readVInt hotspot problems with Java 6! Thanks Robert for fixing this in 3.1; after the changes in MMap they could have reappeared. So this release candidate is in my opinion broken! -1 to release. 
Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Thursday, June 23, 2011 10:18 PM To: dev@lucene.apache.org Subject: [VOTE] release 3.3 Artifacts here: http://s.apache.org/lusolr33rc0 working release notes here: http://wiki.apache.org/lucene-java/ReleaseNote33 http://wiki.apache.org/solr/ReleaseNote33 I ran the automated release test script in trunk/dev-tools/scripts/smokeTestRelease.py, and ran 'ant test' at the top level 50 times on windows. Here is my +1
[jira] [Updated] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3238: Attachment: LUCENE-3238.patch Same patch, except I made MultiTermQuery's rewrite() final. In my opinion, this is a good backwards break; it will only fix bugs in someone's code if they have a custom MultiTermQuery: it's very tricky to override this (e.g. you must pass along boost, rewriteMethod, ...), and when you do, it still might cause problems (like this Span issue). It's also much easier to just return a simpler enum. -- This message is automatically generated by JIRA.
RE: [VOTE] release 3.3
The OpenBitSet test is in all cases serious (vs. the skiplist test, which is a test bug, that's true). The AIOOBE is caused inside OpenBitSet and that should never ever happen, even if you use it incorrectly! - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de
[jira] [Commented] (SOLR-2610) Add an option to delete index through CoreAdmin UNLOAD action
[ https://issues.apache.org/jira/browse/SOLR-2610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054489#comment-13054489 ] Shawn Heisey commented on SOLR-2610: bq. Shawn, that is not a use-case for RELOAD. The idea behind it is to reload an existing core's index with updated configuration changes and swap it with the existing core without causing downtime. It seems like your use-case is handled well with the stock CREATE, SWAP and UNLOAD+deleteIndex? CREATE requires that the caller be aware of internal server filesystem structures. For the typical use of CREATE, this is not really a problem, but if what you're trying to do is unload a core, delete its index, and then immediately recreate it with the same config, it would be very nice to not have to specify (or even know) the solr.xml configuration bits. In this particular case, the person who writes the scripts is the same person who maintains the Solr infrastructure (me) ... but that might not always be the case. Currently the build scripts don't know anything about the internal structure other than core names, and I'd like to keep it that way. Adding an option like deleteIndex to RELOAD seemed a logical way to handle this, since currently (1.4.1) I have to completely restart Solr when I wipe out an index directory. If this is not a logical progression, I would argue that CoreAdmin needs an entirely new action. Either way, if it's deemed desirable, it needs its own Jira issue. I brought it up here because it's at least tangentially related. 
Add an option to delete index through CoreAdmin UNLOAD action - Key: SOLR-2610 URL: https://issues.apache.org/jira/browse/SOLR-2610 Project: Solr Issue Type: Improvement Components: multicore Reporter: Shalin Shekhar Mangar Assignee: Shalin Shekhar Mangar Priority: Minor Fix For: 3.3, 4.0 Attachments: SOLR-2610-branch3x.patch, SOLR-2610.patch -- This message is automatically generated by JIRA.
Re: [VOTE] release 3.3
On Fri, Jun 24, 2011 at 10:54 AM, Uwe Schindler u...@thetaphi.de wrote: The OpenBitSet test is in all cases serious (vs. the skiplist test is a test bug, that true). The AIOOBE is caused inside OpenBitSet and that should never ever happen, even if you use it incorrectly! It's not clear that it's that serious, it only fails with java 5 for me (not java 6) :) Looks like a bug in java 5...
Re: [VOTE] release 3.3
Just some more info: I took away the seed and used -Dtests.iter=100 on this test: JAVA5: [junit] Testsuite: org.apache.lucene.util.TestOpenBitSet [junit] Tests run: 400, Failures: 0, Errors: 23, Time elapsed: 21.793 sec JAVA6: junit-sequential: [junit] Testsuite: org.apache.lucene.util.TestOpenBitSet [junit] Tests run: 400, Failures: 0, Errors: 0, Time elapsed: 19.719 sec So this test fails 23% of the time on java5. The reason we never caught it is that java5 is unmaintained and we cannot even test it in hudson... aka we cannot support this monster anymore.
RE: [VOTE] release 3.3
I assume the problem is the intrinsic; I will replace it with our own Hacker's Delight impl (like we do everywhere else in OpenBitSet; why did we use the platform method here?) and try again Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de
RE: [VOTE] release 3.3
The bug is *not* fixed by replacing Long.numberOfLeadingZeros(word) with BitUtils.nlz(word). So this is really strange. Also happens with -Xbatch. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de
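For context on the swap Uwe describes, a pure-Java "number of leading zeros" in the Hacker's Delight style might look like the sketch below. This is an assumption of what such an nlz helper could be, not the actual Lucene BitUtils source; whatever the implementation, it must agree with the JDK intrinsic Long.numberOfLeadingZeros for every input, which is exactly why the failure persisting after the swap pointed away from the intrinsic.

```java
// Hypothetical Hacker's Delight-style nlz: binary-search the highest set
// bit, halving the window at each step. No JVM intrinsic involved.
public final class Nlz {
    public static int nlz(long x) {
        if (x == 0) return 64;
        int n = 0;
        if (x >>> 32 == 0) { n += 32; x <<= 32; }
        if (x >>> 48 == 0) { n += 16; x <<= 16; }
        if (x >>> 56 == 0) { n += 8;  x <<= 8;  }
        if (x >>> 60 == 0) { n += 4;  x <<= 4;  }
        if (x >>> 62 == 0) { n += 2;  x <<= 2;  }
        if (x >>> 63 == 0) { n += 1; }
        return n;
    }

    public static void main(String[] args) {
        // Spot-check against the JDK intrinsic.
        long[] samples = {0L, 1L, 3L, 1L << 31, 1L << 62, -1L, 0x123456789abcdefL};
        for (long s : samples) {
            System.out.println(nlz(s) == Long.numberOfLeadingZeros(s));
        }
    }
}
```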
Re: [VOTE] release 3.3
And -Xint and -client.

On Fri, Jun 24, 2011 at 11:32 AM, Uwe Schindler u...@thetaphi.de wrote:
> The bug is *not* fixed by replacing Long.numberOfLeadingZeros(word) with BitUtils.nlz(word). So this is really strange. Also happens with -Xbatch.
[jira] [Updated] (LUCENE-2308) Separately specify a field's type
[ https://issues.apache.org/jira/browse/LUCENE-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nikola Tankovic updated LUCENE-2308:
Attachment: LUCENE-2308-4.patch

Separately specify a field's type
-
Key: LUCENE-2308
URL: https://issues.apache.org/jira/browse/LUCENE-2308
Project: Lucene - Java
Issue Type: Improvement
Components: core/index
Reporter: Michael McCandless
Assignee: Michael McCandless
Labels: gsoc2011, lucene-gsoc-11, mentor
Fix For: 4.0
Attachments: LUCENE-2308-2.patch, LUCENE-2308-3.patch, LUCENE-2308-4.patch, LUCENE-2308-4.patch, LUCENE-2308.patch, LUCENE-2308.patch

This came up from discussions on IRC. I'm summarizing here...

Today, when you make a Field to add to a document, you can set things like indexed or not, stored or not, analyzed or not, details like omitTfAP, omitNorms, index term vectors (separately controlling offsets/positions), etc.

I think we should factor these out into a new class (FieldType?). Then you could re-use this FieldType instance across multiple fields. The Field instance would still hold the actual value. We could then do per-field analyzers by adding a setAnalyzer on the FieldType, instead of the separate PerFieldAnalyzerWrapper (likewise for per-field codecs (with flex), where we now have PerFieldCodecWrapper).

This would NOT be a schema! It's just refactoring what we already specify today. E.g. it's not serialized into the index.

This has been discussed before, and I know Michael Busch opened a more ambitious (I think?) issue. I think this is a good first baby step. We could consider a hierarchy of FieldType (NumericFieldType, etc.) but maybe hold off on that for starters...

--
This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (LUCENE-2308) Separately specify a field's type
[ https://issues.apache.org/jira/browse/LUCENE-2308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nikola Tankovic updated LUCENE-2308:
Attachment: LUCENE-2308-4.patch

Patch No. 4: passing the TestDemo unit test.
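The refactoring the issue proposes can be sketched in a few lines. The class names below (SketchFieldType, SketchField) are invented for illustration and are not the API from the attached patches; the sketch just shows the reuse the issue describes, one type instance shared by many Field instances:

```java
// Toy sketch of the proposed split: per-field flags live in a reusable
// type object, while the Field keeps only its name and value.
class SketchFieldType {
    boolean indexed = true;
    boolean stored = false;
    boolean analyzed = true;
    boolean omitNorms = false;
    boolean storeTermVectors = false;
}

class SketchField {
    final SketchFieldType type;
    final String name;
    final String value;

    SketchField(String name, String value, SketchFieldType type) {
        this.name = name;
        this.value = value;
        this.type = type;
    }
}

public class FieldTypeSketch {
    public static void main(String[] args) {
        // One type instance, re-used across many fields: the flags are
        // configured once, not on every Field as today.
        SketchFieldType titleType = new SketchFieldType();
        titleType.stored = true;

        SketchField a = new SketchField("title", "Lucene in Action", titleType);
        SketchField b = new SketchField("title", "Managing Gigabytes", titleType);
        System.out.println(a.type == b.type); // true: shared type
    }
}
```

A per-field analyzer would then hang off the shared type object instead of a wrapper class, which is exactly the simplification the issue argues for.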
RE: [VOTE] release 3.3
OK, the bug is not a Java 5 bug; it's just a difference in the BitSet impl between Java 5 and Java 6. Java 5's BitSet impl always allocates at least one word for new BitSet(0), so its size() differs from OpenBitSet's. This makes the test fail. The fix is to code the test correctly by using Math.min(BitSet.length(), OpenBitSet.size()) as the upper limit and not assume that the allocation strategy of both bitsets is identical.

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
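The guard described above can be sketched as follows. This is not the actual TestOpenBitSet.patch; the OpenBitSet side is simulated by a plain capacity value, since only the Math.min bound matters here:

```java
import java.util.BitSet;

public class AllocationGuardSketch {
    // Bound a bit-by-bit comparison by the smaller of the two reported
    // extents instead of assuming both classes allocate identically
    // (Java 5's BitSet and OpenBitSet do not, per the message above).
    static int safeUpperBound(BitSet bs, long openBitSetNumBits) {
        return (int) Math.min(bs.length(), openBitSetNumBits);
    }

    public static void main(String[] args) {
        BitSet bs = new BitSet(0);
        bs.set(3);                    // length() is now 4 (highest set bit + 1)

        // Even if the OpenBitSet under test reports a larger capacity,
        // the loop below never indexes past what both sets can answer.
        int upper = safeUpperBound(bs, 128);
        for (int i = 0; i < upper; i++) {
            boolean bit = bs.get(i);  // compare against the OpenBitSet here
        }
        System.out.println(upper);    // prints 4
    }
}
```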
[jira] [Created] (LUCENE-3239) drop java 5 support
drop java 5 support
-
Key: LUCENE-3239
URL: https://issues.apache.org/jira/browse/LUCENE-3239
Project: Lucene - Java
Issue Type: Task
Reporter: Robert Muir

It's been discussed here and there, but I think we need to drop Java 5 support, for these reasons:

* It's totally untested by any continual build process. Testing Java 5 only when there is a release candidate ready is not enough. If we are to claim support, then we need a Hudson actually running the tests with Java 5.
* It's now unmaintained, so bugs have to either be hacked around, tests disabled, or warnings placed, but some things simply cannot be fixed... We cannot actually support something that is no longer maintained: we do find JRE bugs (http://wiki.apache.org/lucene-java/SunJavaBugs), and it's important that bugs actually get fixed; we cannot do everything with hacks.
* Because of its limitations, we do things like allow 20% slower grouping speed. I find it hard to believe we are sacrificing performance for this.

So, in summary: because we don't test it at all, because it's buggy and unmaintained, and because we are sacrificing performance, I think we need to cut over the build system for the next release to require Java 6.

--
This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
RE: [VOTE] release 3.3
Here is the patch to fix the test bug; it's really a test bug, found thanks to Java 5. It was even wrong to use length(): correct would be size() for both, and to only use the minimum of both, as the allocation strategy may not be equal.

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de

TestOpenBitSet.patch
Description: Binary data
Re: [VOTE] release 3.3
I just took a quick peek at prevSetBit, and the implementation looks buggy (provided that it's legal for a user to pass an index that may be greater than the largest bit ever set). Here is the current code, which will cause an exception when wlen==0:

public int prevSetBit(int index) {
  if (index < 0) {
    return -1;
  }
  int i = index >> 6;
  if (i >= wlen) {
    i = wlen - 1;
  }
  final int subIndex = index & 0x3f;        // index within the word
  long word = (bits[i] << (63 - subIndex)); // skip all the bits to the left of index

All that needs to be done is to move the negative index check to the bottom (the first index < 0 check is not needed, since we do a signed shift):

public int prevSetBit(int index) {
  int i = index >> 6;
  if (i >= wlen) {
    i = wlen - 1;
  }
  if (i < 0) return -1;
  final int subIndex = index & 0x3f;        // index within the word
  long word = (bits[i] << (63 - subIndex)); // skip all the bits to the left of index

-Yonik
http://www.lucidimagination.com

On Fri, Jun 24, 2011 at 11:33 AM, Robert Muir rcm...@gmail.com wrote:
> And -Xint and -client
[jira] [Reopened] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler reopened LUCENE-3179:
---
The testcase for prevSetBit has a bug that was found by testing with Java 5. It assumes that the allocation strategy for BitSet and OpenBitSet is identical, which it is not: e.g. Java 5's new BitSet(0) still allocates one word, while OpenBitSet does not. The attached patch fixes the issue.

OpenBitSet.prevSetBit()
---
Key: LUCENE-3179
URL: https://issues.apache.org/jira/browse/LUCENE-3179
Project: Lucene - Java
Issue Type: Improvement
Reporter: Paul Elschot
Priority: Minor
Fix For: 3.3, 4.0
Attachments: LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch

Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution (LUCENE-2454).

--
This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler updated LUCENE-3179:
--
Attachment: TestOpenBitSet.patch
[jira] [Commented] (LUCENE-3239) drop java 5 support
[ https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054525#comment-13054525 ]

Uwe Schindler commented on LUCENE-3239:
---
As said yesterday to you privately: I agree with making Lucene trunk Java 6 only (surprise!), but 3.x should stay with Java 5. Is this ok for you? I know Simon will not agree, because he made DocValues for Android *g*

About Hudson testing: we may donate a machine to Infra just for Lucene tests, running something nice like Ubuntu. Stay tuned (no details, I just say that).
Re: [VOTE] release 3.3
On Fri, Jun 24, 2011 at 12:14 PM, Yonik Seeley yo...@lucidimagination.com wrote:
> All that needs to be done is to move the negative index check to the bottom (the first index < 0 check is not needed, since we do a signed shift).

And a further minor optimization, if we assume that negative indexes are not legal, is to move the (i < 0) check inside the if (i >= wlen) block (and just let a negative index passed by the user cause a natural AIOOBE).

-Yonik
http://www.lucidimagination.com
[jira] [Commented] (LUCENE-3238) SpanMultiTermQueryWrapper with Prefix Query issue
[ https://issues.apache.org/jira/browse/LUCENE-3238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054530#comment-13054530 ]

ludovic Boutros commented on LUCENE-3238:
---
I understand the patch, that's better indeed. :) Thanks.

SpanMultiTermQueryWrapper with Prefix Query issue
-
Key: LUCENE-3238
URL: https://issues.apache.org/jira/browse/LUCENE-3238
Project: Lucene - Java
Issue Type: Bug
Components: core/search
Affects Versions: 3.3
Environment: Windows 7, JDK 1.6
Reporter: ludovic Boutros
Assignee: Robert Muir
Attachments: LUCENE-3238.patch, LUCENE-3238.patch, LUCENE-3238.patch

If we try to do a search with a SpanQuery and a PrefixQuery, this message is returned: "You can only use SpanMultiTermQueryWrapper with a suitable SpanRewriteMethod." The problem is in the WildcardQuery rewrite function. If the wildcard query is a prefix, a new prefix query is created, the rewrite method is set with the SpanRewriteMethod, and the prefix query is returned. But it's the rewritten prefix query which should be returned:

- return rewritten;
+ return rewritten.rewrite(reader);

I will attach a patch with a unit test included.

--
This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
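The one-line fix reflects a general property of the rewrite contract: rewrite() may return a query that itself still needs rewriting, so the result must be rewritten again rather than returned directly. The interfaces below are simplified stand-ins invented for illustration, not Lucene's actual Query API, but they show the same fixpoint behavior:

```java
// Minimal stand-in for the rewrite contract: rewriting repeats until the
// query is primitive, mirroring why "return rewritten" (one step short)
// was wrong and "return rewritten.rewrite(reader)" is needed.
interface MiniQuery {
    MiniQuery rewrite();
}

class Primitive implements MiniQuery {
    public MiniQuery rewrite() { return this; }   // already primitive
    public String toString() { return "Primitive"; }
}

class PrefixLike implements MiniQuery {
    public MiniQuery rewrite() { return new Primitive(); }
}

class WildcardLike implements MiniQuery {
    // The bug pattern: a wildcard that is really a prefix delegates to a
    // new prefix query. Returning that query unrewritten would leave a
    // non-primitive query in the tree; rewriting the result fixes it.
    public MiniQuery rewrite() {
        MiniQuery rewritten = new PrefixLike();
        return rewritten.rewrite();               // the fix from the patch
    }
}

public class RewriteSketch {
    public static void main(String[] args) {
        System.out.println(new WildcardLike().rewrite()); // Primitive
    }
}
```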
RE: [VOTE] release 3.3
Yonik, you are the best! If you look at the test, it's also broken somehow, because it uses length() vs. size() wrongly (I already reopened the https://issues.apache.org/jira/browse/LUCENE-3179 issue).

And please stop ranting about Java 5: it helped to find a bug in this impl. It's really broken, as OpenBitSet always allows indexes >= size (except the fast* methods).

Uwe
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
[jira] [Commented] (LUCENE-3239) drop java 5 support
[ https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054542#comment-13054542 ]

Robert Muir commented on LUCENE-3239:
---
bq. About Hudson testing: We may donate a machine to Infra just for Lucene tests running something nice like Ubuntu, stay tuned (no details, I just say that).

And when this time comes, we could consider supporting Java 5. But right now, we don't have a way to test it.
[jira] [Updated] (SOLR-2382) DIH Cache Improvements
[ https://issues.apache.org/jira/browse/SOLR-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Dyer updated SOLR-2382: - Attachment: SOLR-2382.patch Here is a version that passes parameters via the Context object, rather than by building maps. DIH Cache Improvements -- Key: SOLR-2382 URL: https://issues.apache.org/jira/browse/SOLR-2382 Project: Solr Issue Type: New Feature Components: contrib - DataImportHandler Reporter: James Dyer Priority: Minor Attachments: SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch Functionality: 1. Provide a pluggable caching framework for DIH so that users can choose a cache implementation that best suits their data and application. 2. Provide a means to temporarily cache a child Entity's data without needing to create a special cached implementation of the Entity Processor (such as CachedSqlEntityProcessor). 3. Provide a means to write the final (root entity) DIH output to a cache rather than to Solr. Then provide a way for a subsequent DIH call to use the cache as an Entity input. Also provide the ability to do delta updates on such persistent caches. 4. Provide the ability to partition data across multiple caches that can then be fed back into DIH and indexed either to varying Solr Shards, or to the same Core in parallel. Use Cases: 1. We needed a flexible scalable way to temporarily cache child-entity data prior to joining to parent entities. - Using SqlEntityProcessor with Child Entities can cause an n+1 select problem. - CachedSqlEntityProcessor only supports an in-memory HashMap as a Caching mechanism and does not scale. - There is no way to cache non-SQL inputs (ex: flat files, xml, etc). 2. We needed the ability to gather data from long-running entities by a process that runs separate from our main indexing process. 3. We wanted the ability to do a delta import of only the entities that changed. 
- Lucene/Solr requires entire documents to be re-indexed, even if only a few fields changed. - Our data comes from 50+ complex sql queries and/or flat files. - We do not want to incur overhead re-gathering all of this data if only 1 entity's data changed. - Persistent DIH caches solve this problem. 4. We want the ability to index several documents in parallel (using 1.4.1, which did not have the threads parameter). 5. In the future, we may need to use Shards, creating a need to easily partition our source data into Shards. Implementation Details: 1. De-couple EntityProcessorBase from caching. - Created a new interface, DIHCache two implementations: - SortedMapBackedCache - An in-memory cache, used as default with CachedSqlEntityProcessor (now deprecated). - BerkleyBackedCache - A disk-backed cache, dependent on bdb-je, tested with je-4.1.6.jar - NOTE: the existing Lucene Contrib db project uses je-3.3.93.jar. I believe this may be incompatible due to Generic Usage. - NOTE: I did not modify the ant script to automatically get this jar, so to use or evaluate this patch, download bdb-je from http://www.oracle.com/technetwork/database/berkeleydb/downloads/index.html 2. Allow Entity Processors to take a cacheImpl parameter to cause the entity data to be cached (see EntityProcessorBase DIHCacheProperties). 3. Partially De-couple SolrWriter from DocBuilder - Created a new interface DIHWriter, two implementations: - SolrWriter (refactored) - DIHCacheWriter (allows DIH to write ultimately to a Cache). 4. Create a new Entity Processor, DIHCacheProcessor, which reads a persistent Cache as DIH Entity Input. 5. Support a partition parameter with both DIHCacheWriter and DIHCacheProcessor to allow for easy partitioning of source entity data. 6. Change the semantics of entity.destroy() - Previously, it was being called on each iteration of DocBuilder.buildDocument(). 
- Now it does one-time cleanup tasks (like closing or deleting a disk-backed cache) once the entity processor has completed. - The only out-of-the-box entity processor that previously implemented destroy() was LineEntityProcessor, so this is not a very invasive change. General Notes: We are near completion in converting our search functionality from a legacy search engine to Solr. However, I found that DIH did not support caching to the level of our prior product's data import utility. In order to get our data into Solr, I created these caching enhancements. Because I believe this has broad application, and because we would like this feature to be supported by the Community, I have front-ported this, enhanced, to Trunk.
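The pluggable-cache design described in the implementation details above can be sketched roughly as follows. This is a hypothetical analogue, not the SOLR-2382 patch itself: the interface and class names (RowCache, SortedMapRowCache, add, lookup) are illustrative stand-ins for the patch's DIHCache and SortedMapBackedCache.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical minimal analogue of a pluggable DIH cache: entity rows
// (field-name -> value maps) are stored under a key and looked up later
// when joining child data to parent entities.
interface RowCache {
    void add(Object key, Map<String, Object> row);
    List<Map<String, Object>> lookup(Object key); // all rows cached for key
    void destroy();                               // one-time cleanup, per the new destroy() semantics
}

// In-memory implementation in the spirit of SortedMapBackedCache; a
// disk-backed implementation (like BerkleyBackedCache) would persist the
// same key -> rows mapping to disk instead.
class SortedMapRowCache implements RowCache {
    private final SortedMap<Object, List<Map<String, Object>>> data = new TreeMap<>();

    public void add(Object key, Map<String, Object> row) {
        data.computeIfAbsent(key, k -> new ArrayList<>()).add(row);
    }

    public List<Map<String, Object>> lookup(Object key) {
        return data.getOrDefault(key, Collections.emptyList());
    }

    public void destroy() {
        data.clear();
    }
}
```

With a cache like this populated once per import, a child entity's rows can be joined to each parent without re-running the child query, which is how the n+1 select problem from the use cases above gets avoided.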
RE: [VOTE] release 3.3
Hi Yonik, I wrote a test case that checks how prevSetBit behaves if I add your patch with the optimization. It still had a bug if the index is beyond the last word but not at a multiple of bitsPerWord. The following code is correct:

public int prevSetBit(int index) {
  int i = index >> 6;
  final int subIndex;
  if (i >= wlen) {
    i = wlen - 1;
    if (i < 0) return -1;
    subIndex = 0x3f;          // last possible bit
  } else {
    if (i < 0) return -1;
    subIndex = index & 0x3f;  // index within the word
  }
  long word = (bits[i] << (63-subIndex));  // skip all the bits to the left of index
  if (word != 0) {
    return (i << 6) + subIndex - Long.numberOfLeadingZeros(word); // See LUCENE-3197
  }
  while (--i >= 0) {
    word = bits[i];
    if (word != 0) {
      return (i << 6) + 63 - Long.numberOfLeadingZeros(word);
    }
  }
  return -1;
}

Your additional optimization with negative indexes is invalid, because on negative indexes prevSetBit() must return a negative value. If we don't do this, a typical loop like the following would throw an AIOOBE:

for (int i = bs.prevSetBit(0); i >= 0; i = bs.prevSetBit(i-1)) {
  // operate on index i here
}

Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de

-----Original Message----- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Friday, June 24, 2011 6:29 PM To: dev@lucene.apache.org Subject: Re: [VOTE] release 3.3

On Fri, Jun 24, 2011 at 12:14 PM, Yonik Seeley yo...@lucidimagination.com wrote: All that needs to be done is to move the negative index check to the bottom (the first index < 0 check is not needed since we do a signed shift).

public int prevSetBit(int index) {
  int i = index >> 6;
  if (i >= wlen) {
    i = wlen - 1;
  }
  if (i < 0) return -1;
  final int subIndex = index & 0x3f;  // index within the word
  long word = (bits[i] << (63-subIndex));  // skip all the bits to the left of index

And a further minor optimization, if we assume that negative indexes are not legal, is to move the (i < 0) check inside the if (i >= wlen) block (and just let a negative index passed by the user cause a natural AIOOBE). 
-Yonik http://www.lucidimagination.com - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
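For readers without the Lucene source at hand: java.util.BitSet has offered the analogous previousSetBit(int) since Java 7, with exactly the contract debated above (it returns -1 when no earlier bit is set, and tolerates a fromIndex of -1). The backwards-iteration idiom from the thread can therefore be tried with the standard library; a small sketch:

```java
import java.util.Arrays;
import java.util.BitSet;

class PrevSetBitDemo {
    // Collects all set bits at or below startBit, scanning backwards with
    // the loop idiom from the thread. When bit 0 is set, the final step
    // calls previousSetBit(-1), which returns -1 and ends the loop -- the
    // behavior Uwe argues OpenBitSet.prevSetBit must also provide.
    static int[] setBitsDescending(BitSet bs, int startBit) {
        int[] out = new int[bs.cardinality()];
        int n = 0;
        for (int i = bs.previousSetBit(startBit); i >= 0; i = bs.previousSetBit(i - 1)) {
            out[n++] = i;
        }
        return Arrays.copyOf(out, n);
    }
}
```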
[jira] [Updated] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3179: -- Attachment: LUCENE-3179-fix.patch Yonik mentioned on the mailing list that prevSetBit is broken for size==0 and also for indexes >= size. In those cases you always get an AIOOBE or even wrong results. In the case of an index >= the length of the bitset, the scanning must start at the last possible bit, so subIndex must be 0x3f and not simply the anded bits. This is my naive fix. Tests pass (I added an extra check to the test that starts beyond the end of the bitset to check prevSetBit). OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution (LUCENE-2454). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
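The beyond-the-end case being fixed here can be illustrated with java.util.BitSet, whose previousSetBit documents the same clamping behavior: a fromIndex past the last word simply starts the scan at the highest set bit. This is an analogy to show the expected contract, not the OpenBitSet code itself.

```java
import java.util.BitSet;

class BeyondEndDemo {
    // previousSetBit(fromIndex) with fromIndex far past the end of the
    // bitset must clamp to the last word and scan from its last possible
    // bit, rather than indexing out of bounds or mis-computing subIndex.
    // For an empty set (the size==0 case from the issue) it returns -1.
    static int highestSetBitAtOrBelow(BitSet bs, int fromIndex) {
        return bs.previousSetBit(fromIndex);
    }
}
```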
[jira] [Commented] (SOLR-2382) DIH Cache Improvements
[ https://issues.apache.org/jira/browse/SOLR-2382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054575#comment-13054575 ] James Dyer commented on SOLR-2382: -- {quote} Why should the DocBuilder be even aware of DIHCache? Should it not be kept local to the EntityProcessor? {quote} You're right that when the cache is owned by an EntityProcessor, DocBuilder has no knowledge of it. But there is another way these caches can be used, described in the functionality section of this issue's description: {quote} 3. Provide a means to write the final (root entity) DIH output to a cache rather than to Solr ... Also provide the ability to do delta updates on such persistent caches. {quote} In this case, DocBuilder is outputting not to SolrWriter, but to DIHCacheWriter. It is arguable that DIHCacheWriter should not be instantiated in DocBuilder in this instance, as I currently have it. Perhaps it should happen up the stack in DataImporter, etc. But in any case, whenever DIHCacheWriter gets instantiated, it needs to know which CacheImpl to create and also pass on any parameters that CacheImpl needs. DIH Cache Improvements -- Key: SOLR-2382 URL: https://issues.apache.org/jira/browse/SOLR-2382 Project: Solr Issue Type: New Feature Components: contrib - DataImportHandler Reporter: James Dyer Priority: Minor Attachments: SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch, SOLR-2382.patch
[jira] [Commented] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054578#comment-13054578 ] Uwe Schindler commented on LUCENE-3179: --- The check for negative indexes must be done to make the following loop work (which is the standard way to iterate backwards over all set bits starting from startBit):

{code:java}
for (int i = bs.prevSetBit(startBit); i >= 0; i = bs.prevSetBit(i-1)) {
  // operate on index i here
}
{code}

This would fail with an AIOOBE when i == 0 on the last iteration (which happens if the 0th bit is set), because bs.prevSetBit(i-1) then has a negative parameter. The exit condition is checked later, so -1 must be allowed. OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution (LUCENE-2454). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
Re: [VOTE] release 3.3
On Fri, Jun 24, 2011 at 1:35 PM, Uwe Schindler u...@thetaphi.de wrote: Hi Yonik, I wrote a test case that checks how prevSetBit behaves if I add your patch with the optimization. It still had a bug if the index is beyond the last word but not at a multiple of bitsPerWord. Ahh, right, good catch! You want to start at the last bit rather than calculate the bit via MOD in that case. Your additional optimization with negative indexes is invalid, Well, invalid if negative indexes are valid. because on negative indexes prevSetBit() must return a negative value. If we don't do this, a typical loop like the following would throw an AIOOBE:

for (int i = bs.prevSetBit(0); i >= 0; i = bs.prevSetBit(i-1)) {
  // operate on index i here
}

Yep, that makes sense to allow. -Yonik http://www.lucidimagination.com - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3179: -- Attachment: LUCENE-3179-fix.patch Modified patch. I moved the assignment of the word variable inside the if/else branch as well, since in the beyond-last-bit case we can optimize away the shift entirely. OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution (LUCENE-2454). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054606#comment-13054606 ] Paul Elschot commented on LUCENE-3179: -- The 3179-fix patch looks good to me. I remember I had some doubts about which bit was actually the last one, and stopped worrying about it when the tests passed. This patch makes it very clear what the last bit is. OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution LUCENE-2454 . -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054607#comment-13054607 ] Yonik Seeley commented on LUCENE-3179: -- +1, patch looks good Uwe. OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution LUCENE-2454 . -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3239) drop java 5 support
[ https://issues.apache.org/jira/browse/LUCENE-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054629#comment-13054629 ] Mark Miller commented on LUCENE-3239: - That seems reasonable to me - officially we endorse/support Java 6, but we can keep the 3.x line on Java 5 language features. I can live with that myself. drop java 5 support - Key: LUCENE-3239 URL: https://issues.apache.org/jira/browse/LUCENE-3239 Project: Lucene - Java Issue Type: Task Reporter: Robert Muir it's been discussed here and there, but I think we need to drop java 5 support, for these reasons: * it's totally untested by any continual build process. Testing java5 only when there is a release candidate ready is not enough. If we are to claim support then we need a hudson actually running the tests with java 5. * it's now unmaintained, so bugs have to either be hacked around, tests disabled, or warnings placed, but some things simply cannot be fixed... we cannot actually support something that is no longer maintained: we do find JRE bugs (http://wiki.apache.org/lucene-java/SunJavaBugs) and it's important that bugs actually get fixed: we cannot do everything with hacks. * because of its limitations, we do things like allow 20% slower grouping speed. I find it hard to believe we are sacrificing performance for this. So, in summary: because we don't test it at all, because it's buggy and unmaintained, and because we are sacrificing performance, I think we need to cut over the build system for the next release to require java 6. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (SOLR-2613) DIH Cache backed w/bdb-je
[ https://issues.apache.org/jira/browse/SOLR-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Dyer updated SOLR-2613: - Attachment: SOLR-2613.patch This version keeps BerkleyBackedCache in sync with the last version from SOLR-2382. It takes parameters from a Context object rather than from a Map. DIH Cache backed w/bdb-je - Key: SOLR-2613 URL: https://issues.apache.org/jira/browse/SOLR-2613 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Affects Versions: 4.0 Reporter: James Dyer Priority: Minor Attachments: SOLR-2613.patch, SOLR-2613.patch This is spun out of SOLR-2382, which provides a framework for multiple caching implementations with DIH. This cache implementation is fast and flexible, supporting persistence and delta updates. However, it depends on Berkeley Database Java Edition, so in order to evaluate and use it you must download bdb-je from Oracle and accept the license requirements. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13054634#comment-13054634 ] Stefan Matheis (steffkes) commented on SOLR-2399: - Young, bq. ..., and I keep on hitting a build failed. Why did your build fail? Like Ryan already said, the UI is for trunk .. but the code should still build even on 3.x branches? Stefan Solr Admin Interface, reworked -- Key: SOLR-2399 URL: https://issues.apache.org/jira/browse/SOLR-2399 Project: Solr Issue Type: Improvement Components: web gui Reporter: Stefan Matheis (steffkes) Assignee: Ryan McKinley Priority: Minor Fix For: 4.0 Attachments: SOLR-2399-110603-2.patch, SOLR-2399-110603.patch, SOLR-2399-110606.patch, SOLR-2399-110622.patch, SOLR-2399-admin-interface.patch, SOLR-2399-analysis-stopwords.patch, SOLR-2399-fluid-width.patch, SOLR-2399-sorting-fields.patch, SOLR-2399-wip-notice.patch, SOLR-2399.patch *The idea was to create a new, fresh (and hopefully clean) Solr Admin Interface.* [Based on this [ML-Thread|http://www.lucidimagination.com/search/document/ae35e236d29d225e/solr_admin_interface_reworked_go_on_go_away]] *Features:* * [Dashboard|http://files.mathe.is/solr-admin/01_dashboard.png] * [Query-Form|http://files.mathe.is/solr-admin/02_query.png] * [Plugins|http://files.mathe.is/solr-admin/05_plugins.png] * [Analysis|http://files.mathe.is/solr-admin/04_analysis.png] (SOLR-2476, SOLR-2400) * [Schema-Browser|http://files.mathe.is/solr-admin/06_schema-browser.png] * [Dataimport|http://files.mathe.is/solr-admin/08_dataimport.png] (SOLR-2482) * [Core-Admin|http://files.mathe.is/solr-admin/09_coreadmin.png] * [Replication|http://files.mathe.is/solr-admin/10_replication.png] * [Zookeeper|http://files.mathe.is/solr-admin/11_cloud.png] * [Logging|http://files.mathe.is/solr-admin/07_logging.png] (SOLR-2459) ** Stub (using static data) Newly created Wiki-Page: http://wiki.apache.org/solr/ReworkedSolrAdminGUI I've quickly created a Github-Repository 
(Just for me, to keep track of the changes) » https://github.com/steffkes/solr-admin -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-2619) two sfields in geospatial search
[ https://issues.apache.org/jira/browse/SOLR-2619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054663#comment-13054663 ] David Smiley commented on SOLR-2619: I am confused trying to understand this Wish. Is the situation two indexed points (each in a separate field), and you want to find documents that are within a particular radius from either point? That's already supported. two sfields in geospatial search Key: SOLR-2619 URL: https://issues.apache.org/jira/browse/SOLR-2619 Project: Solr Issue Type: Wish Components: clients - php Affects Versions: 3.2 Environment: Using with drupal Reporter: jose rodriguez Fix For: 3.2 Is it possible to create a query with two sfields (geospatial search)? I mean two different pt and d values, one for each field. If I need a from and a to, then I need fields around the from coordinate and around the to coordinate. Thanks. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
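The two-field filtering David refers to can be written by combining two geofilt filter queries; this is a sketch, with field names and coordinates made up for illustration (substitute your schema's LatLonType fields):

```text
# one filter per spatial field (documents must be near BOTH points):
fq={!geofilt sfield=from_coord pt=45.15,-93.85 d=5}
fq={!geofilt sfield=to_coord pt=44.97,-93.26 d=5}

# documents near EITHER point, via the _query_ nested-query hook:
fq=_query_:"{!geofilt sfield=from_coord pt=45.15,-93.85 d=5}" OR _query_:"{!geofilt sfield=to_coord pt=44.97,-93.26 d=5}"
```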
[jira] [Resolved] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-3179. --- Resolution: Fixed Assignee: Paul Elschot Committed 3.x branch revision: 1139430 Committed trunk revision: 1139431 Committed 3.3 branch revision: 1139433 Thanks Yonik! OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Assignee: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution LUCENE-2454 . -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3228) build should allow you (especially hudson) to refer to a local javadocs installation instead of downloading
[ https://issues.apache.org/jira/browse/LUCENE-3228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054679#comment-13054679 ] Hoss Man commented on LUCENE-3228: -- +1 bq. I think we should allow you to optionally set a sysprop using linkoffline hell, why bother with the sysprop? .. let's just commit the package-list files for all third party libs we use into dev-tools and completely eliminate the need for network access when building javadocs. build should allow you (especially hudson) to refer to a local javadocs installation instead of downloading --- Key: LUCENE-3228 URL: https://issues.apache.org/jira/browse/LUCENE-3228 Project: Lucene - Java Issue Type: Task Reporter: Robert Muir Assignee: Robert Muir Currently, we fail on all javadocs warnings. However, you get a warning if it cannot download the package-list from sun.com. So I think we should allow you to optionally set a sysprop using linkoffline. Then we would get far fewer fake Hudson failures. I feel like Mike opened an issue for this already but I cannot find it. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-3179) OpenBitSet.prevSetBit()
[ https://issues.apache.org/jira/browse/LUCENE-3179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13054681#comment-13054681 ] Uwe Schindler commented on LUCENE-3179: --- One more comment: when working on the code, I noticed that the symmetry all other methods have between long and int variants is broken here. For consistency we should add the long method, too. I just don't like the missing consistency. Also: OpenBitSet.nextSetBit() does not use Long.numberOfTrailingZeros(), but the new prevSetBit() does use Long.numberOfLeadingZeros(). As both methods have intrinsics, why only use one of them? Yonik? Any comments? OpenBitSet.prevSetBit() --- Key: LUCENE-3179 URL: https://issues.apache.org/jira/browse/LUCENE-3179 Project: Lucene - Java Issue Type: Improvement Reporter: Paul Elschot Assignee: Paul Elschot Priority: Minor Fix For: 3.3, 4.0 Attachments: LUCENE-3179-fix.patch, LUCENE-3179-fix.patch, LUCENE-3179.patch, LUCENE-3179.patch, LUCENE-3179.patch, TestBitUtil.java, TestOpenBitSet.patch Find a previous set bit in an OpenBitSet. Useful for parent testing in nested document query execution (LUCENE-2454). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
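The symmetry Uwe points out rests on two JIT-intrinsic helpers: within a single 64-bit word, the lowest set bit falls out of Long.numberOfTrailingZeros and the highest out of Long.numberOfLeadingZeros. A minimal sketch of the per-word step of each scan (the surrounding word-array loop from OpenBitSet is omitted):

```java
class WordScan {
    // Index of the lowest set bit in a word, or -1 if none: the per-word
    // step a nextSetBit-style forward scan could use via the
    // Long.numberOfTrailingZeros intrinsic.
    static int lowestSetBit(long word) {
        return word == 0 ? -1 : Long.numberOfTrailingZeros(word);
    }

    // Index of the highest set bit in a word, or -1 if none: the per-word
    // step prevSetBit uses via the Long.numberOfLeadingZeros intrinsic.
    static int highestSetBit(long word) {
        return word == 0 ? -1 : 63 - Long.numberOfLeadingZeros(word);
    }
}
```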
Re: svn commit: r1137601 - /lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/Query Utils.java
: URL: http://svn.apache.org/viewvc?rev=1137601&view=rev
: Log: revert speedup, the wrapping causes fc insanity. the only reason
: this works today is that a new index is created in every setup/teardown,
: which also makes these tests slow...

there are some other tests where a single field is used in multiple ways that would normally cause insanity over the life of a single test (Solr's TestSort comes to mind). the solution used there was to have the test method directly call assertSaneFieldCache and purgeFieldCache after each chunk of work to ensure no expected insanity exists the next time assertSaneFieldCache is called

: Modified:
:     lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/QueryUtils.java
:
: Modified: lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/QueryUtils.java
: URL: http://svn.apache.org/viewvc/lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/QueryUtils.java?rev=1137601&r1=1137600&r2=1137601&view=diff
: ==============================================================================
: --- lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/QueryUtils.java (original)
: +++ lucene/dev/trunk/lucene/src/test-framework/org/apache/lucene/search/QueryUtils.java Mon Jun 20 12:00:12 2011
: @@ -148,35 +148,23 @@ public class QueryUtils {
:      // we can't put deleted docs before the nested reader, because
:      // it will throw off the docIds
:      IndexReader[] readers = new IndexReader[] {
: -      edge < 0 ? r : emptyReaders[0],
: -      emptyReaders[0],
: -      new MultiReader(edge < 0 ? emptyReaders[4] : emptyReaders[0],
: -          emptyReaders[0],
: -          0 == edge ? r : emptyReaders[0]),
: -      0 < edge ? emptyReaders[0] : emptyReaders[7],
: -      emptyReaders[0],
: -      new MultiReader(0 < edge ? emptyReaders[0] : emptyReaders[5],
: -          emptyReaders[0],
: -          0 < edge ? r : emptyReaders[0])
: +      edge < 0 ? r : IndexReader.open(makeEmptyIndex(random, 0), true),
: +      IndexReader.open(makeEmptyIndex(random, 0), true),
: +      new MultiReader(IndexReader.open(makeEmptyIndex(random, edge < 0 ? 4 : 0), true),
: +          IndexReader.open(makeEmptyIndex(random, 0), true),
: +          0 == edge ? r : IndexReader.open(makeEmptyIndex(random, 0), true)),
: +      IndexReader.open(makeEmptyIndex(random, 0 < edge ? 0 : 7), true),
: +      IndexReader.open(makeEmptyIndex(random, 0), true),
: +      new MultiReader(IndexReader.open(makeEmptyIndex(random, 0 < edge ? 0 : 5), true),
: +          IndexReader.open(makeEmptyIndex(random, 0), true),
: +          0 < edge ? r : IndexReader.open(makeEmptyIndex(random, 0), true))
:      };
:      IndexSearcher out = LuceneTestCase.newSearcher(new MultiReader(readers));
:      out.setSimilarityProvider(s.getSimilarityProvider());
:      return out;
:    }
: -
: -  static final IndexReader[] emptyReaders = new IndexReader[8];
: -  static {
: -    try {
: -      emptyReaders[0] = makeEmptyIndex(new Random(0), 0);
: -      emptyReaders[4] = makeEmptyIndex(new Random(0), 4);
: -      emptyReaders[5] = makeEmptyIndex(new Random(0), 5);
: -      emptyReaders[7] = makeEmptyIndex(new Random(0), 7);
: -    } catch (IOException ex) {
: -      throw new RuntimeException(ex);
: -    }
: -  }
:
: -  private static IndexReader makeEmptyIndex(Random random, final int numDeletedDocs)
: +  private static Directory makeEmptyIndex(Random random, final int numDeletedDocs)
:      throws IOException {
:      Directory d = new MockDirectoryWrapper(random, new RAMDirectory());
:      IndexWriter w = new IndexWriter(d, new IndexWriterConfig(
: @@ -200,7 +188,8 @@ public class QueryUtils {
:      IndexReader r = IndexReader.open(d, true);
:      Assert.assertEquals("reader has wrong number of deleted docs",
:          numDeletedDocs, r.numDeletedDocs());
: -    return r;
: +    r.close();
: +    return d;
:    }
:
:    /** alternate scorer skipTo(),skipTo(),next(),next(),skipTo(),skipTo(), etc

-Hoss

- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org