[jira] [Resolved] (LUCENE-10673) Spatial3d fails constructing a legal bounding box

2022-08-03 Thread Ignacio Vera (Jira)


 [ 
https://issues.apache.org/jira/browse/LUCENE-10673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ignacio Vera resolved LUCENE-10673.
---
Fix Version/s: 9.4
 Assignee: Ignacio Vera
   Resolution: Fixed

> Spatial3d fails constructing a legal bounding box
> -
>
> Key: LUCENE-10673
> URL: https://issues.apache.org/jira/browse/LUCENE-10673
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/spatial3d
>Reporter: Ignacio Vera
>Assignee: Ignacio Vera
>Priority: Major
> Fix For: 9.4
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> The issue can be reproduced with the following test:
> {code}
>   @Test
>   public void testBBoxLatDegenerate() {
>   double minX = Geo3DUtil.fromDegrees(-180.0);
>   double maxX = Geo3DUtil.fromDegrees(-174.3758381903);
>   double minY = Geo3DUtil.fromDegrees(89.9765306711);
>   double maxY = Geo3DUtil.fromDegrees(89.9794643372);
>   assertNotNull(GeoAreaFactory.makeGeoArea(PlanetModel.SPHERE, maxY, 
> minY, minX, maxX));
>   }
> {code}
> this currently fails with the following error:
> {code}
> Cannot determine sidedness because check point is on plane.
> java.lang.IllegalArgumentException: Cannot determine sidedness because check 
> point is on plane.
>   at 
> __randomizedtesting.SeedInfo.seed([F8095E23398C1BA6:396A213B6362092D]:0)
>   at 
> org.apache.lucene.spatial3d.geom.SidedPlane.(SidedPlane.java:137)
>   at 
> org.apache.lucene.spatial3d.geom.GeoRectangle.(GeoRectangle.java:149)
>   at 
> org.apache.lucene.spatial3d.geom.GeoBBoxFactory.makeGeoBBox(GeoBBoxFactory.java:134)
>   at 
> org.apache.lucene.spatial3d.geom.GeoAreaFactory.makeGeoArea(GeoAreaFactory.java:43)
>   at 
> org.apache.lucene.spatial3d.geom.TestGeoBBox.testBBoxLonDegenerate(TestGeoBBox.java:538)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>   at 
> java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.base/java.lang.reflect.Method.invoke(Method.java:568)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Commented] (LUCENE-10673) Spatial3d fails constructing a legal bounding box

2022-08-03 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/LUCENE-10673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575016#comment-17575016
 ] 

ASF subversion and git services commented on LUCENE-10673:
--

Commit 17e8a42e1aa5c5c8328b2e675605a731adeea201 in lucene's branch 
refs/heads/branch_9x from Ignacio Vera
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=17e8a42e1aa ]

LUCENE-10673: Improve check of equality for latitudes for spatial3d 
GeoBoundingBox (#1056)



> Spatial3d fails constructing a legal bounding box
> -
>
> Key: LUCENE-10673
> URL: https://issues.apache.org/jira/browse/LUCENE-10673
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/spatial3d
>Reporter: Ignacio Vera
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> The issue can be reproduced with the following test:
> {code}
>   @Test
>   public void testBBoxLatDegenerate() {
>   double minX = Geo3DUtil.fromDegrees(-180.0);
>   double maxX = Geo3DUtil.fromDegrees(-174.3758381903);
>   double minY = Geo3DUtil.fromDegrees(89.9765306711);
>   double maxY = Geo3DUtil.fromDegrees(89.9794643372);
>   assertNotNull(GeoAreaFactory.makeGeoArea(PlanetModel.SPHERE, maxY, 
> minY, minX, maxX));
>   }
> {code}
> this currently fails with the following error:
> {code}
> Cannot determine sidedness because check point is on plane.
> java.lang.IllegalArgumentException: Cannot determine sidedness because check 
> point is on plane.
>   at 
> __randomizedtesting.SeedInfo.seed([F8095E23398C1BA6:396A213B6362092D]:0)
>   at 
> org.apache.lucene.spatial3d.geom.SidedPlane.(SidedPlane.java:137)
>   at 
> org.apache.lucene.spatial3d.geom.GeoRectangle.(GeoRectangle.java:149)
>   at 
> org.apache.lucene.spatial3d.geom.GeoBBoxFactory.makeGeoBBox(GeoBBoxFactory.java:134)
>   at 
> org.apache.lucene.spatial3d.geom.GeoAreaFactory.makeGeoArea(GeoAreaFactory.java:43)
>   at 
> org.apache.lucene.spatial3d.geom.TestGeoBBox.testBBoxLonDegenerate(TestGeoBBox.java:538)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>   at 
> java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.base/java.lang.reflect.Method.invoke(Method.java:568)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Commented] (LUCENE-10673) Spatial3d fails constructing a legal bounding box

2022-08-03 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/LUCENE-10673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17575015#comment-17575015
 ] 

ASF subversion and git services commented on LUCENE-10673:
--

Commit bd0718f0716ad3f30e0d79b352cd678df249f550 in lucene's branch 
refs/heads/main from Ignacio Vera
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=bd0718f0716 ]

LUCENE-10673: Improve check of equality for latitudes for spatial3d 
GeoBoundingBox (#1056)



> Spatial3d fails constructing a legal bounding box
> -
>
> Key: LUCENE-10673
> URL: https://issues.apache.org/jira/browse/LUCENE-10673
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/spatial3d
>Reporter: Ignacio Vera
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> The issue can be reproduced with the following test:
> {code}
>   @Test
>   public void testBBoxLatDegenerate() {
>   double minX = Geo3DUtil.fromDegrees(-180.0);
>   double maxX = Geo3DUtil.fromDegrees(-174.3758381903);
>   double minY = Geo3DUtil.fromDegrees(89.9765306711);
>   double maxY = Geo3DUtil.fromDegrees(89.9794643372);
>   assertNotNull(GeoAreaFactory.makeGeoArea(PlanetModel.SPHERE, maxY, 
> minY, minX, maxX));
>   }
> {code}
> this currently fails with the following error:
> {code}
> Cannot determine sidedness because check point is on plane.
> java.lang.IllegalArgumentException: Cannot determine sidedness because check 
> point is on plane.
>   at 
> __randomizedtesting.SeedInfo.seed([F8095E23398C1BA6:396A213B6362092D]:0)
>   at 
> org.apache.lucene.spatial3d.geom.SidedPlane.(SidedPlane.java:137)
>   at 
> org.apache.lucene.spatial3d.geom.GeoRectangle.(GeoRectangle.java:149)
>   at 
> org.apache.lucene.spatial3d.geom.GeoBBoxFactory.makeGeoBBox(GeoBBoxFactory.java:134)
>   at 
> org.apache.lucene.spatial3d.geom.GeoAreaFactory.makeGeoArea(GeoAreaFactory.java:43)
>   at 
> org.apache.lucene.spatial3d.geom.TestGeoBBox.testBBoxLonDegenerate(TestGeoBBox.java:538)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>   at 
> java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.base/java.lang.reflect.Method.invoke(Method.java:568)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] iverase merged pull request #1056: LUCENE-10673: Improve check of equality for latitudes for spatial3d GeoBoundingBox

2022-08-03 Thread GitBox


iverase merged PR #1056:
URL: https://github.com/apache/lucene/pull/1056


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Updated] (LUCENE-10670) Add a Codec class to track merge time of each index part

2022-08-03 Thread Yuan Xiao (Jira)


 [ 
https://issues.apache.org/jira/browse/LUCENE-10670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuan Xiao updated LUCENE-10670:
---
Description: 
Lucene has applied filter pattern to *Path*, *Codec*, *Reader* etc(FilterPath, 
FilterCodec, FilterReader) , we should do the same for 
Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
stored fields, live docs, term vectors) under Codec.

One use case is enabling probe the merge time of different file format by 
making a FilterCodec that overrides the merge method of each Codec format 
writer through these filters.

It will allow downstream to add any additional functionality to those classes. 

I have opened up a PR for this use case.  

Any suggestions of different ways to track index parts merge time is welcome.

  was:
Lucene has applied filter pattern to *Path*, *Codec*, *Reader* etc(FilterPath, 
FilterCodec, FilterReader) , we should do the same for 
Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
stored fields, live docs, term vectors) under Codec.

One use case is enabling probe the merge time of different file format by 
making a FilterCodec that overrides the merge method of each Codec format 
writer through these filters.

It will allow downstream to add any additional functionality to those classes. 

I have opened up a PR

Any suggestions is welcome. 

Summary: Add a Codec class to track merge time of each index part  
(was: Add Filter class to Codec Writer/Producer/Reader/Consumer/Format)

> Add a Codec class to track merge time of each index part
> 
>
> Key: LUCENE-10670
> URL: https://issues.apache.org/jira/browse/LUCENE-10670
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Yuan Xiao
>Priority: Major
>
> Lucene has applied filter pattern to *Path*, *Codec*, *Reader* 
> etc(FilterPath, FilterCodec, FilterReader) , we should do the same for 
> Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
> stored fields, live docs, term vectors) under Codec.
> One use case is enabling probe the merge time of different file format by 
> making a FilterCodec that overrides the merge method of each Codec format 
> writer through these filters.
> It will allow downstream to add any additional functionality to those 
> classes. 
> I have opened up a PR for this use case.  
> Any suggestions of different ways to track index parts merge time is welcome.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] yugushihuang opened a new pull request, #1057: Add a codec class to track merge time of each index part

2022-08-03 Thread GitBox


yugushihuang opened a new pull request, #1057:
URL: https://github.com/apache/lucene/pull/1057

   This PR add filter class for each index format classes and their 
writer/producer/reader/consumer and add a tracking codec class to track merge 
time of each index part.
   [Jira Issue 
Link](https://issues.apache.org/jira/projects/LUCENE/issues/LUCENE-10670)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Updated] (LUCENE-10670) Add Filter class to Codec Writer/Producer/Reader/Consumer/Format

2022-08-03 Thread Yuan Xiao (Jira)


 [ 
https://issues.apache.org/jira/browse/LUCENE-10670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuan Xiao updated LUCENE-10670:
---
Description: 
Lucene has applied filter pattern to *Path*, *Codec*, *Reader* etc(FilterPath, 
FilterCodec, FilterReader) , we should do the same for 
Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
stored fields, live docs, term vectors) under Codec.

One use case is enabling probe the merge time of different file format by 
making a FilterCodec that overrides the merge method of each Codec format 
writer through these filters.

It will allow downstream to add any additional functionality to those classes. 

I have opened up a PR

Any suggestions is welcome. 

  was:
Lucene has applied filter pattern to *Path*, *Codec*, *Reader* etc(FilterPath, 
FilterCodec, FilterReader) , we should do the same for 
Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
stored fields, live docs, term vectors) under Codec.

One use case is enabling probe the merge time of different file format by 
making a FilterCodec that overrides the merge method of each Codec format 
writer through these filters.

It will allow downstream to add any additional functionality to those classes. 


> Add Filter class to Codec Writer/Producer/Reader/Consumer/Format
> 
>
> Key: LUCENE-10670
> URL: https://issues.apache.org/jira/browse/LUCENE-10670
> Project: Lucene - Core
>  Issue Type: Improvement
>Reporter: Yuan Xiao
>Priority: Major
>
> Lucene has applied filter pattern to *Path*, *Codec*, *Reader* 
> etc(FilterPath, FilterCodec, FilterReader) , we should do the same for 
> Writer/Producer/Reader/Consumer/Format(postings, points, doc values, vectors, 
> stored fields, live docs, term vectors) under Codec.
> One use case is enabling probe the merge time of different file format by 
> making a FilterCodec that overrides the merge method of each Codec format 
> writer through these filters.
> It will allow downstream to add any additional functionality to those 
> classes. 
> I have opened up a PR
> Any suggestions is welcome. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mikemccand merged pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mikemccand merged PR #108:
URL: https://github.com/apache/lucene-jira-archive/pull/108


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on issue #89: Excess strike-through on conversion

2022-08-03 Thread GitBox


mocobeta commented on issue #89:
URL: 
https://github.com/apache/lucene-jira-archive/issues/89#issuecomment-1204100796

   No, in fact, it should be safely done - #108.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on a diff in pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta commented on code in PR #108:
URL: 
https://github.com/apache/lucene-jira-archive/pull/108#discussion_r936795884


##
migration/src/jira_util.py:
##
@@ -287,9 +276,23 @@ def escape_gh_issue_link(m: re.Match):
 disp_name = jira_users.get(jira_id)
 gh_m = account_map.get(jira_id)
 # replace Jira name with GitHub account or Jira display name if it 
is available, othewise show Jira name with ``
-mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`~{jira_id}`"
+mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`@{jira_id}`"
 text = text.replace(m, mention())
 
+# escape tilde to avoid unintentional strike-throughs in Markdown
+text = text.replace("~", "\~")

Review Comment:
   Escape `~` **before** Markdown conversion and **after** mention conversion.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on a diff in pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta commented on code in PR #108:
URL: 
https://github.com/apache/lucene-jira-archive/pull/108#discussion_r936796559


##
migration/src/jira_util.py:
##
@@ -287,9 +276,23 @@ def escape_gh_issue_link(m: re.Match):
 disp_name = jira_users.get(jira_id)
 gh_m = account_map.get(jira_id)
 # replace Jira name with GitHub account or Jira display name if it 
is available, othewise show Jira name with ``
-mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`~{jira_id}`"
+mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`@{jira_id}`"
 text = text.replace(m, mention())
 
+# escape tilde to avoid unintentional strike-throughs in Markdown
+text = text.replace("~", "\~")
+
+# convert Jira markup into Markdown with customization
+elements = MarkupElements()
+elements.replace(UnorderedList, UnorderedTweakedList)
+elements.replace(OrderedList, OrderedTweakedList)
+elements.replace(BlockQuote, TweakedBlockQuote)
+elements.replace(Quote, TweakedQuote)
+elements.replace(Monospaced, TweakedMonospaced)
+elements.insert_after(Ruler, LongRuler)
+elements.append(EscapeHtmlTag)
+text = jira2markdown.convert(text, elements=elements)

Review Comment:
   Do syntax conversion after escaping `~`.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on a diff in pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta commented on code in PR #108:
URL: 
https://github.com/apache/lucene-jira-archive/pull/108#discussion_r936795884


##
migration/src/jira_util.py:
##
@@ -287,9 +276,23 @@ def escape_gh_issue_link(m: re.Match):
 disp_name = jira_users.get(jira_id)
 gh_m = account_map.get(jira_id)
 # replace Jira name with GitHub account or Jira display name if it 
is available, othewise show Jira name with ``
-mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`~{jira_id}`"
+mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`@{jira_id}`"
 text = text.replace(m, mention())
 
+# escape tilde to avoid unintentional strike-throughs in Markdown
+text = text.replace("~", "\~")

Review Comment:
   Escape `~` before Markdown conversion and after mention conversion.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on a diff in pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta commented on code in PR #108:
URL: 
https://github.com/apache/lucene-jira-archive/pull/108#discussion_r936795269


##
migration/src/jira_util.py:
##
@@ -287,9 +276,23 @@ def escape_gh_issue_link(m: re.Match):
 disp_name = jira_users.get(jira_id)
 gh_m = account_map.get(jira_id)
 # replace Jira name with GitHub account or Jira display name if it 
is available, othewise show Jira name with ``
-mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`~{jira_id}`"
+mention = lambda: f"@{gh_m}" if gh_m else disp_name if disp_name 
else f"`@{jira_id}`"

Review Comment:
   Use `@` instead of `~` for mentions.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on pull request #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta commented on PR #108:
URL: 
https://github.com/apache/lucene-jira-archive/pull/108#issuecomment-1204091313

   The result seems fine: 
   https://github.com/mocobeta/migration-test-3/issues/522
   
   ![Screenshot from 2022-08-04 
00-17-21](https://user-images.githubusercontent.com/1825333/182645095-a27a92d9-71a0-42a1-8b82-b21c842c7e39.png)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta opened a new pull request, #108: Reorder text processing, tweak mention format, escape tilde

2022-08-03 Thread GitBox


mocobeta opened a new pull request, #108:
URL: https://github.com/apache/lucene-jira-archive/pull/108

   #89 
   
   I think this should be safe.
   
   1. Convert Jira's `[~username]` mention to GitHub mention.
   2. Escape all `~` in Jira data
   3. Convert Jira syntax to Markdown


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on issue #89: Excess strike-through on conversion

2022-08-03 Thread GitBox


mocobeta commented on issue #89:
URL: 
https://github.com/apache/lucene-jira-archive/issues/89#issuecomment-1203992440

   The difficulty here is that `~` actually has special meanings in both Jira 
syntax and Markdown.
   
   The conversion process is done as follows and the order is important.
   
   1. Convert Jira syntax to Markdown
   2. Convert Jira mentions to GitHub mentions
   
   Escaping `~` should be done _before_ 1) syntax conversion; otherwise valid 
strike-throughs in Markdown are affected. However, it also should be done 
_after_ 2) mention conversion; otherwise `[~username]` mentions are affected. 
   
   We could find a tricky workaround, but I'm afraid any workaround has 
unpredictable side effects - I would leave this issue.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene] iverase opened a new pull request, #1056: LUCENE-10673: Improve check of equality for latitudes for GeoBoundingBox

2022-08-03 Thread GitBox


iverase opened a new pull request, #1056:
URL: https://github.com/apache/lucene/pull/1056

   The PR makes two latitudes to be equal if the value of the sine is lower 
than the vector resolution.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mocobeta commented on issue #106: Inlined patches don't migrate correctly

2022-08-03 Thread GitBox


mocobeta commented on issue #106:
URL: 
https://github.com/apache/lucene-jira-archive/issues/106#issuecomment-1203827169

   Thanks, I added this to the migration plan (or to-do list) in #7.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Commented] (LUCENE-10672) Re-evaluate different ways to encode postings

2022-08-03 Thread Michael McCandless (Jira)


[ 
https://issues.apache.org/jira/browse/LUCENE-10672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17574668#comment-17574668
 ] 

Michael McCandless commented on LUCENE-10672:
-

+1

Maybe we can also peek at how Vespa and Tantivy do their encoding ... any 
inspiration there?

The world needs more popular open-search source engines.

> Re-evaluate different ways to encode postings
> -
>
> Key: LUCENE-10672
> URL: https://issues.apache.org/jira/browse/LUCENE-10672
> Project: Lucene - Core
>  Issue Type: Task
>Reporter: Adrien Grand
>Priority: Minor
>
> In Lucene 4, we moved to FOR to encode postings because it woud give better 
> throughput compared to VInts that we had been using until then. This was a 
> time when Lucene would often need to evaluate entire postings lists, and 
> optimizations like BS1 were very important for good performance.
> Nowadays, Lucene performs more dynamic pruning and it's less frequent that 
> Lucene needs to evaluate all hits that match a query. So the performance of 
> {{nextDoc()}} has become a bit less relevant while the performance of 
> {{advance(target)}} has become more relevant.
> I wonder if we should re-evaluate other ways to encode postings that are 
> theoretically better at skipping, such as Elias-Fano coding, since they 
> support skipping directly on the encoded representation instead of requiring 
> decoding a full block of integers where only a couple of them would be 
> relevant.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Updated] (LUCENE-10673) Spatial3d fails constructing a legal bounding box

2022-08-03 Thread Ignacio Vera (Jira)


 [ 
https://issues.apache.org/jira/browse/LUCENE-10673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ignacio Vera updated LUCENE-10673:
--
Component/s: modules/spatial3d

> Spatial3d fails constructing a legal bounding box
> -
>
> Key: LUCENE-10673
> URL: https://issues.apache.org/jira/browse/LUCENE-10673
> Project: Lucene - Core
>  Issue Type: Bug
>  Components: modules/spatial3d
>Reporter: Ignacio Vera
>Priority: Major
>
> The issue can be reproduced with the following test:
> {code}
>   @Test
>   public void testBBoxLatDegenerate() {
>   double minX = Geo3DUtil.fromDegrees(-180.0);
>   double maxX = Geo3DUtil.fromDegrees(-174.3758381903);
>   double minY = Geo3DUtil.fromDegrees(89.9765306711);
>   double maxY = Geo3DUtil.fromDegrees(89.9794643372);
>   assertNotNull(GeoAreaFactory.makeGeoArea(PlanetModel.SPHERE, maxY, 
> minY, minX, maxX));
>   }
> {code}
> this currently fails with the following error:
> {code}
> Cannot determine sidedness because check point is on plane.
> java.lang.IllegalArgumentException: Cannot determine sidedness because check 
> point is on plane.
>   at 
> __randomizedtesting.SeedInfo.seed([F8095E23398C1BA6:396A213B6362092D]:0)
>   at 
> org.apache.lucene.spatial3d.geom.SidedPlane.(SidedPlane.java:137)
>   at 
> org.apache.lucene.spatial3d.geom.GeoRectangle.(GeoRectangle.java:149)
>   at 
> org.apache.lucene.spatial3d.geom.GeoBBoxFactory.makeGeoBBox(GeoBBoxFactory.java:134)
>   at 
> org.apache.lucene.spatial3d.geom.GeoAreaFactory.makeGeoArea(GeoAreaFactory.java:43)
>   at 
> org.apache.lucene.spatial3d.geom.TestGeoBBox.testBBoxLonDegenerate(TestGeoBBox.java:538)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at 
> java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
>   at 
> java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>   at java.base/java.lang.reflect.Method.invoke(Method.java:568)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Created] (LUCENE-10673) Spatial3d fails constructing a legal bounding box

2022-08-03 Thread Ignacio Vera (Jira)
Ignacio Vera created LUCENE-10673:
-

 Summary: Spatial3d fails constructing a legal bounding box
 Key: LUCENE-10673
 URL: https://issues.apache.org/jira/browse/LUCENE-10673
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Ignacio Vera


The issue can be reproduced with the following test:

{code}
  @Test
  public void testBBoxLatDegenerate() {
  double minX = Geo3DUtil.fromDegrees(-180.0);
  double maxX = Geo3DUtil.fromDegrees(-174.3758381903);
  double minY = Geo3DUtil.fromDegrees(89.9765306711);
  double maxY = Geo3DUtil.fromDegrees(89.9794643372);
  assertNotNull(GeoAreaFactory.makeGeoArea(PlanetModel.SPHERE, maxY, minY, 
minX, maxX));
  }
{code}

this currently fails with the following error:

{code}
Cannot determine sidedness because check point is on plane.
java.lang.IllegalArgumentException: Cannot determine sidedness because check 
point is on plane.
at 
__randomizedtesting.SeedInfo.seed([F8095E23398C1BA6:396A213B6362092D]:0)
at 
org.apache.lucene.spatial3d.geom.SidedPlane.(SidedPlane.java:137)
at 
org.apache.lucene.spatial3d.geom.GeoRectangle.(GeoRectangle.java:149)
at 
org.apache.lucene.spatial3d.geom.GeoBBoxFactory.makeGeoBBox(GeoBBoxFactory.java:134)
at 
org.apache.lucene.spatial3d.geom.GeoAreaFactory.makeGeoArea(GeoAreaFactory.java:43)
at 
org.apache.lucene.spatial3d.geom.TestGeoBBox.testBBoxLonDegenerate(TestGeoBBox.java:538)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:77)
at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.base/java.lang.reflect.Method.invoke(Method.java:568)


{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mikemccand commented on issue #106: Inlined patches don't migrate correctly

2022-08-03 Thread GitBox


mikemccand commented on issue #106:
URL: 
https://github.com/apache/lucene-jira-archive/issues/106#issuecomment-1203800580

   Some of those are properly marked up with Jira's markup, and should migrate 
well, but others are not.
   
   Honestly, I still feel this is part of the "long tail" of Jira authors 
writing comments in invalid Jira markup, and we should not aim to correct their 
mistakes here.  Maybe a best effort when we see the errors at most ...


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[GitHub] [lucene-jira-archive] mikemccand commented on issue #106: Inlined patches don't migrate correctly

2022-08-03 Thread GitBox


mikemccand commented on issue #106:
URL: 
https://github.com/apache/lucene-jira-archive/issues/106#issuecomment-1203799408

   OK I iterated some more on my silly "attempt to detected diff/patch/quoted 
code" to this VERY scratchy tool:
   
   ```
   import os
   import re
   import glob
   import json
   
   reDiffLines = re.compile('^[\d,]+[cd][\d,]+\s*$', re.MULTILINE)
   reDiffCommand = re.compile('\s+diff\s+[^\s]+')
   reLeadingLessGreaterThan = re.compile(r'^[<>] .*?\);.*?', re.MULTILINE)
   
   results = []
   
   for file_name in glob.glob('jira-dump/*.json'):
   s = open(file_name).read()
   d = json.loads(s)
   jira_id = d['key']
   if not jira_id.startswith('LUCENE'):
   print(f'SKIPPING: {jira_id}')
   continue
   fields = d['fields']
   #print(fields.keys())


   desc = fields['description']
   #if desc is not None and ('\n---\n' in desc or 
reDiffLines.search(desc)):  

  
   if desc is not None:
   count = len(reLeadingLessGreaterThan.findall(desc))
   if reDiffLines.search(desc) or count >= 2:
   #print(f'MATCH: {d["key"]}\n  {desc}')   


   #print(f'MATCH: {d["key"]}\n')   


   results.append((jira_id, f'*** Diff in desc?: 
[{jira_id}](https://issues.apache.org/jira/browse/{jira_id})'))
   #print(repr(desc))   


   if jira_id == 'LUCENE-825':
   print(reDiffCommand.match(desc))
   print(repr(desc))
   for comment in fields['comment']['comments']:
   comment_text = comment['body']
   comment_id = comment['id']
   count = len(reLeadingLessGreaterThan.findall(comment_text))
   #if '\n---\n' in comment_text or reDiffLines.search(comment_text):   


   if reDiffLines.search(comment_text) or count > 2:
   #print(f'MATCH: {d["key"]}\n  {comment_text}')   


   jira_comment_link = 
f'https://issues.apache.org/jira/browse/{jira_id}?focusedCommentId={comment_id}=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-{comment_id}'
   results.append((jira_id, f'*** Diff in comment?: 
[{jira_id}]({jira_comment_link})'))
   if jira_id == 'LUCENE-5400':
   print(reDiffCommand.search(comment_text))
   print(repr(comment_text))
   #print(repr(comment_text))   


   
   print(f'\n{len(results)} possible diffs:\n')
   for jira_id, text in sorted(results, key=lambda x: int(x[0][7:])):
   print(text)
   ```
   
   It produces these results:
   
   > 9 possible diffs:
   
   *** Diff in desc?: 
[LUCENE-108](https://issues.apache.org/jira/browse/LUCENE-108)
   *** Diff in desc?: 
[LUCENE-162](https://issues.apache.org/jira/browse/LUCENE-162)
   *** Diff in desc?: 
[LUCENE-327](https://issues.apache.org/jira/browse/LUCENE-327)
   *** Diff in comment?: 
[LUCENE-584](https://issues.apache.org/jira/browse/LUCENE-584?focusedCommentId=12487106=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-12487106)
   *** Diff in comment?: 
[LUCENE-743](https://issues.apache.org/jira/browse/LUCENE-743?focusedCommentId=12520476=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-12520476)
   *** Diff in desc?: 
[LUCENE-825](https://issues.apache.org/jira/browse/LUCENE-825)
   *** Diff in desc?: 
[LUCENE-5110](https://issues.apache.org/jira/browse/LUCENE-5110)
   *** Diff in comment?: 

[GitHub] [lucene-jira-archive] mikemccand commented on issue #106: Inlined patches don't migrate correctly

2022-08-03 Thread GitBox


mikemccand commented on issue #106:
URL: 
https://github.com/apache/lucene-jira-archive/issues/106#issuecomment-1203731966

   > I can fix 19 issues by hand after the migration (there would be no hurry), 
could you tell me the issue numbers or commands to identify the issues with 
inline patches?
   
   OK will do -- I'll post my findings here.  Thanks @mocobeta.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org



[jira] [Created] (LUCENE-10672) Re-evaluate different ways to encode postings

2022-08-03 Thread Adrien Grand (Jira)
Adrien Grand created LUCENE-10672:
-

 Summary: Re-evaluate different ways to encode postings
 Key: LUCENE-10672
 URL: https://issues.apache.org/jira/browse/LUCENE-10672
 Project: Lucene - Core
  Issue Type: Task
Reporter: Adrien Grand


In Lucene 4, we moved to FOR to encode postings because it woud give better 
throughput compared to VInts that we had been using until then. This was a time 
when Lucene would often need to evaluate entire postings lists, and 
optimizations like BS1 were very important for good performance.

Nowadays, Lucene performs more dynamic pruning and it's less frequent that 
Lucene needs to evaluate all hits that match a query. So the performance of 
{{nextDoc()}} has become a bit less relevant while the performance of 
{{advance(target)}} has become more relevant.

I wonder if we should re-evaluate other ways to encode postings that are 
theoretically better at skipping, such as Elias-Fano coding, since they support 
skipping directly on the encoded representation instead of requiring decoding a 
full block of integers where only a couple of them would be relevant.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org