[jira] [Commented] (JENA-1553) Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFFFFFB1

2018-11-06 Thread Jean-Marc Vanel (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677797#comment-16677797
 ] 

Jean-Marc Vanel commented on JENA-1553:
---

I got the same kind of problem on my other site (dedicated to nature, botany, 
etc.).

I made a dump with {{tdb.tdbdump}} and got the same stack trace already reported in 
this issue:

on the main TDB: 
{noformat}
org.apache.jena.tdb.TDBException: Failed to tokenise: 
at 
org.apache.jena.tdb.store.nodetable.NodecSSE.decode(NodecSSE.java:127)
at org.apache.jena.tdb.lib.NodeLib.decode(NodeLib.java:120)
at org.apache.jena.tdb.lib.NodeLib.fetchDecode(NodeLib.java:97)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative.readNodeFromTable(NodeTableNative.java:182)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative._retrieveNodeByNodeId(NodeTableNative.java:108)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative.getNodeForNodeId(NodeTableNative.java:67)
at 
org.apache.jena.tdb.store.nodetable.NodeTableCache._retrieveNodeByNodeId(NodeTableCache.java:128)
at 
org.apache.jena.tdb.store.nodetable.NodeTableCache.getNodeForNodeId(NodeTableCache.java:82)
at 
org.apache.jena.tdb.store.nodetable.NodeTableWrapper.getNodeForNodeId(NodeTableWrapper.java:50)
at 
org.apache.jena.tdb.store.nodetable.NodeTableInline.getNodeForNodeId(NodeTableInline.java:67)
at org.apache.jena.tdb.lib.TupleLib.quad(TupleLib.java:128)
at org.apache.jena.tdb.lib.TupleLib.quad(TupleLib.java:120)
at 
org.apache.jena.tdb.lib.TupleLib.lambda$convertToQuads$3(TupleLib.java:59)
at org.apache.jena.atlas.iterator.Iter$2.next(Iter.java:270)
at 
org.apache.jena.riot.system.StreamOps.sendQuadsToStream(StreamOps.java:140)
at org.apache.jena.riot.writer.NQuadsWriter.write$(NQuadsWriter.java:62)
 {noformat}
 

on the history TDB:
{noformat}
org.apache.jena.tdb.TDBException: Not a node: urce/Sillans-la-Cascade>
at 
org.apache.jena.tdb.store.nodetable.NodecSSE.decode(NodecSSE.java:132)
at org.apache.jena.tdb.lib.NodeLib.decode(NodeLib.java:120)
at org.apache.jena.tdb.lib.NodeLib.fetchDecode(NodeLib.java:97)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative.readNodeFromTable(NodeTableNative.java:182)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative._retrieveNodeByNodeId(NodeTableNative.java:108)
at 
org.apache.jena.tdb.store.nodetable.NodeTableNative.getNodeForNodeId(NodeTableNative.java:67)
at 
org.apache.jena.tdb.store.nodetable.NodeTableCache._retrieveNodeByNodeId(NodeTableCache.java:128)
at 
org.apache.jena.tdb.store.nodetable.NodeTableCache.getNodeForNodeId(NodeTableCache.java:82)
at 
org.apache.jena.tdb.store.nodetable.NodeTableWrapper.getNodeForNodeId(NodeTableWrapper.java:50)
at 
org.apache.jena.tdb.store.nodetable.NodeTableInline.getNodeForNodeId(NodeTableInline.java:67)
at org.apache.jena.tdb.lib.TupleLib.quad(TupleLib.java:128)
at org.apache.jena.tdb.lib.TupleLib.quad(TupleLib.java:120)
at 
org.apache.jena.tdb.lib.TupleLib.lambda$convertToQuads$3(TupleLib.java:59)
at org.apache.jena.atlas.iterator.Iter$2.next(Iter.java:270)
at 
org.apache.jena.riot.system.StreamOps.sendQuadsToStream(StreamOps.java:140)
at org.apache.jena.riot.writer.NQuadsWriter.write$(NQuadsWriter.java:62)
 {noformat}
Nothing bad has been reported on the web application yet.


 QUESTIONS
 * What should I do?
 * Is it feasible to have a (possibly partial) recovery program?
 It would catch TDBException somewhere, remove the bad binary data, and continue.
 That could even be the default mode of tdbdump.
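
As a rough illustration of the "catch, skip, continue" idea, here is a minimal sketch. It is not how tdbdump works and it does not use the Jena API: the class LenientDecode, its Result type and the decodeAll method are hypothetical names made up for this example. It assumes each item can be decoded independently of the others; the stack traces above show the exception being thrown from inside an iterator, and whether that iterator can safely continue after a failure is not something this sketch settles.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

/** Hypothetical helper, not part of Jena: decode what can be decoded, record the rest. */
final class LenientDecode {

    /** Outcome of one lenient pass: the decoded values plus the inputs that failed. */
    static final class Result<I, O> {
        final List<O> decoded = new ArrayList<>();
        final List<I> failed = new ArrayList<>();
    }

    /**
     * Applies the decoder to every id. A RuntimeException thrown for one id
     * (TDBException is a RuntimeException) is recorded and the loop continues.
     */
    static <I, O> Result<I, O> decodeAll(Iterable<I> ids, Function<I, O> decoder) {
        Result<I, O> result = new Result<>();
        for (I id : ids) {
            try {
                result.decoded.add(decoder.apply(id));
            } catch (RuntimeException e) {
                // Assumption: a failure only affects this one id, so it is safe to go on.
                result.failed.add(id);
            }
        }
        return result;
    }
}
```

The list of failed ids is kept rather than discarded so that the extent of the damage can be reported after the pass; silently dropping data would hide how partial the recovery is.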

 

> Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFB1
> --
>
> Key: JENA-1553
> URL: https://issues.apache.org/jira/browse/JENA-1553
> Project: Apache Jena
>  Issue Type: Bug
>  Components: Jena
> Environment: Ubuntu 16.04 running Docker.  Running stain/jena-fuseki 
> from the official Docker Hub.
>Reporter: Brian Mullen
>Assignee: Andy Seaborne
>Priority: Major
> Fix For: Jena 3.9.0
>
>
> Attempting to backup through Fuseki, TDB 500M+ triples, breaking with error:  
>  
> {code:java}
> [2018-06-01 13:25:46] Log4jLoggerAdapter WARN  Exception in backup
> org.apache.jena.atlas.RuntimeIOException: java.io.IOException: Illegal UTF-8: 
> 0xFFB1
>     at org.apache.jena.atlas.io.IO.exception(IO.java:233)
>     at org.apache.jena.atlas.io.BlockUTF8.exception(BlockUTF8.java:275)
>     at 
> org.apache.jena.atlas.io.BlockUTF8.toCharsBuffer(BlockUTF8.java:150)
>     at org.apache.jena.atlas.io.BlockUTF8.toChars(BlockUTF8.java:73)
>     at 

[jira] [Comment Edited] (JENA-1553) Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFFFFFB1

2018-11-06 Thread Jean-Marc Vanel (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677744#comment-16677744
 ] 

Jean-Marc Vanel edited comment on JENA-1553 at 11/7/18 7:05 AM:


Many thanks, Andy!

I suppose that you used a generic binary editor (which one, out of curiosity?); 
otherwise you would have committed a "mending" Java program somewhere.

 I had hoped for a fixed TDB directory or a fixed N-Triples file, but it is just a list 
of nodes, including my precious text nodes. I will have to rebuild the triples from 
that.

 

NOTE:

The database you fixed is the historical data of user inputs in UI pages.

There is also the main database, containing the current state of user inputs, 
plus a large sample of LOD internet documents that are duplicated with 
timestamps (the so-called semantic cache). So it is much larger :( . One day I 
should put that in a separate database.

 


was (Author: jmvanel):
Many thanks, Andy!

I suppose that you used a generic binary editor (which one, out of curiosity?); 
otherwise you would have committed a "mending" Java program somewhere.

 

The database you fixed is the historical data of user inputs in UI pages.

There is also the main database, containing the current state of user inputs, 
plus a large sample of LOD internet documents that are duplicated with 
timestamps (the so-called semantic cache). So it is much larger :( . One day I 
should put that in a separate database. Anyway, I will try my application 
with the provided fixed database, plus an empty database for the main database. 
I may have to update my application for this use case, and certainly reload the 
semantic cache.

 


[jira] [Commented] (JENA-1553) Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFFFFFB1

2018-11-06 Thread Jean-Marc Vanel (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677744#comment-16677744
 ] 

Jean-Marc Vanel commented on JENA-1553:
---

Many thanks, Andy!

I suppose that you used a generic binary editor (which one, out of curiosity?); 
otherwise you would have committed a "mending" Java program somewhere.

 

The database you fixed is the historical data of user inputs in UI pages.

There is also the main database, containing the current state of user inputs, 
plus a large sample of LOD internet documents that are duplicated with 
timestamps (the so-called semantic cache). So it is much larger :( . One day I 
should put that in a separate database. Anyway, I will try my application 
with the provided fixed database, plus an empty database for the main database. 
I may have to update my application for this use case, and certainly reload the 
semantic cache.

 


[jira] [Commented] (JENA-1553) Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFFFFFB1

2018-11-06 Thread Andy Seaborne (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677241#comment-16677241
 ] 

Andy Seaborne commented on JENA-1553:
-

I have just emailed you as much recovered data as I can get out of the file. 
There are some holes in the list of recovered nodes where I couldn't determine 
a resync point.
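
To make the notion of a "resync point" concrete, here is a minimal sketch of one way to scan damaged data. Everything in it is an assumption made for illustration: the record layout (a 4-byte big-endian length followed by that many UTF-8 bytes), the class name ResyncScan and the maxLen sanity bound are hypothetical, and nothing here describes TDB's actual files or how the recovery mentioned above was done. When a record fails validation, the scanner moves forward one byte at a time until something parses again.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical scanner over length-prefixed UTF-8 records; not Jena code. */
final class ResyncScan {

    /** Returns every record that validates, skipping damaged stretches byte by byte. */
    static List<String> recover(byte[] data, int maxLen) {
        List<String> out = new ArrayList<>();
        int pos = 0;
        while (pos + 4 <= data.length) {
            int len = ByteBuffer.wrap(data, pos, 4).getInt(); // big-endian length prefix
            String s = null;
            if (len > 0 && len <= maxLen && len <= data.length - pos - 4) {
                s = decodeStrict(data, pos + 4, len);
            }
            if (s != null) {
                out.add(s);
                pos += 4 + len; // step over the whole record
            } else {
                pos++;          // no valid record here: look for a resync point
            }
        }
        return out;
    }

    /** Strict UTF-8 decode: returns null instead of substituting replacement characters. */
    static String decodeStrict(byte[] data, int off, int len) {
        CharsetDecoder dec = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            return dec.decode(ByteBuffer.wrap(data, off, len)).toString();
        } catch (CharacterCodingException e) {
            return null;
        }
    }
}
```

A byte-by-byte search like this can accept a false record when damaged bytes happen to look like a plausible length followed by valid UTF-8, which is one reason a resync point cannot always be determined with confidence.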

> Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFB1
> --
>
> Key: JENA-1553
> URL: https://issues.apache.org/jira/browse/JENA-1553
> Project: Apache Jena
>  Issue Type: Bug
>  Components: Jena
> Environment: Ubuntu 16.04 running Docker.  Running stain/jena-fuseki 
> from the official Docker Hub.
>Reporter: Brian Mullen
>Assignee: Andy Seaborne
>Priority: Major
> Fix For: Jena 3.9.0
>
>
> Attempting to backup through Fuseki, TDB 500M+ triples, breaking with error:  
>  
> {code:java}
> [2018-06-01 13:25:46] Log4jLoggerAdapter WARN  Exception in backup
> org.apache.jena.atlas.RuntimeIOException: java.io.IOException: Illegal UTF-8: 
> 0xFFB1
>     at org.apache.jena.atlas.io.IO.exception(IO.java:233)
>     at org.apache.jena.atlas.io.BlockUTF8.exception(BlockUTF8.java:275)
>     at 
> org.apache.jena.atlas.io.BlockUTF8.toCharsBuffer(BlockUTF8.java:150)
>     at org.apache.jena.atlas.io.BlockUTF8.toChars(BlockUTF8.java:73)
>     at org.apache.jena.atlas.io.BlockUTF8.toString(BlockUTF8.java:95)
>     at 
> org.apache.jena.tdb.store.nodetable.NodecSSE.decode(NodecSSE.java:101)
>     at org.apache.jena.tdb.lib.NodeLib.decode(NodeLib.java:105)
>     at org.apache.jena.tdb.lib.NodeLib.fetchDecode(NodeLib.java:81)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableNative.readNodeFromTable(NodeTableNative.java:186)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableNative._retrieveNodeByNodeId(NodeTableNative.java:111)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableNative.getNodeForNodeId(NodeTableNative.java:70)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableCache._retrieveNodeByNodeId(NodeTableCache.java:128)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableCache.getNodeForNodeId(NodeTableCache.java:82)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableWrapper.getNodeForNodeId(NodeTableWrapper.java:50)
>     at 
> org.apache.jena.tdb.store.nodetable.NodeTableInline.getNodeForNodeId(NodeTableInline.java:67)
>     at org.apache.jena.tdb.lib.TupleLib.triple(TupleLib.java:107)
>     at org.apache.jena.tdb.lib.TupleLib.triple(TupleLib.java:84)
>     at 
> org.apache.jena.tdb.lib.TupleLib.lambda$convertToTriples$2(TupleLib.java:54)
>     at org.apache.jena.atlas.iterator.Iter$2.next(Iter.java:270)
>     at org.apache.jena.atlas.iterator.Iter$2.next(Iter.java:270)
>     at org.apache.jena.atlas.iterator.Iter.next(Iter.java:891)
>     at 
> org.apache.jena.riot.system.StreamOps.sendQuadsToStream(StreamOps.java:140)
>     at 
> org.apache.jena.riot.writer.NQuadsWriter.write$(NQuadsWriter.java:62)
>     at 
> org.apache.jena.riot.writer.NQuadsWriter.write(NQuadsWriter.java:45)
>     at 
> org.apache.jena.riot.writer.NQuadsWriter.write(NQuadsWriter.java:91)
>     at org.apache.jena.riot.RDFWriter.write$(RDFWriter.java:208)
>     at org.apache.jena.riot.RDFWriter.output(RDFWriter.java:165)
>     at org.apache.jena.riot.RDFWriter.output(RDFWriter.java:112)
>     at 
> org.apache.jena.riot.RDFWriterBuilder.output(RDFWriterBuilder.java:149)
>     at org.apache.jena.riot.RDFDataMgr.write$(RDFDataMgr.java:1269)
>     at org.apache.jena.riot.RDFDataMgr.write(RDFDataMgr.java:1162)
>     at org.apache.jena.riot.RDFDataMgr.write(RDFDataMgr.java:1153)
>     at org.apache.jena.fuseki.mgt.Backup.backup(Backup.java:115)
>     at org.apache.jena.fuseki.mgt.Backup.backup(Backup.java:75)
>     at 
> org.apache.jena.fuseki.mgt.ActionBackup$BackupTask.run(ActionBackup.java:58)
>     at 
> org.apache.jena.fuseki.async.AsyncPool.lambda$submit$0(AsyncPool.java:55)
>     at org.apache.jena.fuseki.async.AsyncTask.call(AsyncTask.java:100)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>     at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>     at java.lang.Thread.run(Thread.java:745)
> Caused by: java.io.IOException: Illegal UTF-8: 0xFFB1
>     ... 40 more
> [2018-06-01 13:25:46] Log4jLoggerAdapter INFO  
> Backup(/fuseki/backups/PDE_PROD_2018-06-01_13-24-00):2{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread Code Ferret (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677232#comment-16677232
 ] 

Code Ferret commented on JENA-1621:
---

It should be noted that the version of Lucene is 7.4.0

> Upgrade Lucene to 7.5.x
> ---
>
> Key: JENA-1621
> URL: https://issues.apache.org/jira/browse/JENA-1621
> Project: Apache Jena
>  Issue Type: Improvement
>  Components: Text
>Affects Versions: Jena 3.9.0
>Reporter: Andy Seaborne
>Assignee: Code Ferret
>Priority: Major
> Fix For: Jena 3.10.0
>
>






[jira] [Resolved] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread Code Ferret (JIRA)


 [ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Code Ferret resolved JENA-1621.
---
   Resolution: Fixed
 Assignee: Code Ferret
Fix Version/s: Jena 3.10.0

Resolved via PR #488



[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677222#comment-16677222
 ] 

ASF GitHub Bot commented on JENA-1621:
--

Github user ajs6f commented on the issue:

https://github.com/apache/jena/pull/488
  
I've seen that happen when I've forgotten to rebase over master before 
merging, but I don't know if that's what's happening here...




[GitHub] jena issue #488: Merged Lucene upgrade. This resolves JENA-1621

2018-11-06 Thread ajs6f
Github user ajs6f commented on the issue:

https://github.com/apache/jena/pull/488
  
I've seen that happen when I've forgotten to rebase over master before 
merging, but I don't know if that's what's happening here...


---


[GitHub] jena issue #488: Merged Lucene upgrade. This resolves JENA-1621

2018-11-06 Thread xristy
Github user xristy commented on the issue:

https://github.com/apache/jena/pull/488
  
I'm not sure how I ended up with _unmerged commits_. All of the changes 
appear in both the Apache Jena git repo and the GitHub Jena git repo, so everything 
looks consistent to me other than the message.


---


[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677200#comment-16677200
 ] 

ASF GitHub Bot commented on JENA-1621:
--

Github user asfgit closed the pull request at:

https://github.com/apache/jena/pull/488




[GitHub] jena pull request #488: Merged Lucene upgrade. This resolves JENA-1621

2018-11-06 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/jena/pull/488


---


[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread ASF subversion and git services (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677199#comment-16677199
 ] 

ASF subversion and git services commented on JENA-1621:
---

Commit 63525085be7927567a8d963586b15269416e1b90 in jena's branch 
refs/heads/master from [~code-ferret]
[ https://git-wip-us.apache.org/repos/asf?p=jena.git;h=6352508 ]

Merged Lucene upgrade. This resolves JENA-1621




[jira] [Commented] (JENA-1553) Can't Backup data - java.io.IOException: Illegal UTF-8: 0xFFFFFFB1

2018-11-06 Thread Andy Seaborne (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16677100#comment-16677100
 ] 

Andy Seaborne commented on JENA-1553:
-

bq. Another related question is : how to change calls to Jena API to prevent 
data corruption in the future?

The problem is an internal bug (now fixed) to do with overlapping requests, not 
an API issue. As long as you are using transactions, it should have worked.  If 
you use a transaction once, the system checks for transaction usage from then on.

bq. What I understood is that post-mortem analysis of the data base is of no 
use. 

Correct.

bq. I have data, especially text nodes, that I want to recover.

It _may_ be possible to recover some text nodes - I will investigate.


[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread Code Ferret (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16676995#comment-16676995
 ] 

Code Ferret commented on JENA-1621:
---

Thanks [~andy.seaborne]. I also added {{}} in 
jena/pom.xml to keep the two versions close to one another since they must be 
compatible.



[jira] [Commented] (JENA-1621) Upgrade Lucene to 7.5.x

2018-11-06 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/JENA-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16676650#comment-16676650
 ] 

ASF GitHub Bot commented on JENA-1621:
--

Github user afs commented on a diff in the pull request:

https://github.com/apache/jena/pull/488#discussion_r231081048
  
--- Diff: jena-text-es/pom.xml ---
@@ -140,12 +140,13 @@
 com.github.alexcojocaru
 elasticsearch-maven-plugin
 
-5.2
+6.8
 
   elasticsearch
   9500
   9400
   ${es.startup.timeout}
+  6.4.2
--- End diff --

Can this be the property ver.elasticsearch?




[GitHub] jena pull request #488: Merged Lucene upgrade. This resolves JENA-1621

2018-11-06 Thread afs
Github user afs commented on a diff in the pull request:

https://github.com/apache/jena/pull/488#discussion_r231081048
  
--- Diff: jena-text-es/pom.xml ---
@@ -140,12 +140,13 @@
 com.github.alexcojocaru
 elasticsearch-maven-plugin
 
-5.2
+6.8
 
   elasticsearch
   9500
   9400
   ${es.startup.timeout}
+  6.4.2
--- End diff --

Can this be the property ver.elasticsearch?


---