"not to bind" to be read as "just bind once" On Thu, Sep 23, 2021 at 4:25 PM Marco Neumann <marco.neum...@gmail.com> wrote:
> set -Xmx to 8G and try not to bind the variable and to see if this > alleviates the issue. > > On Thu, Sep 23, 2021 at 12:41 PM Harri Kiiskinen <harri.kiiski...@utu.fi> > wrote: > >> Hi! >> >> I'm trying to run a simple update query that reads strings from one >> graph, processes them, and stores to another: >> >> >> ------------------------------------------------------------------------------ >> insert { >> graph vice:pageocrdata_clean { >> ?page vice:ocrtext ?ocr7 . >> } >> } >> where { >> graph vice:pageocrdata { >> ?page vice:ocrtext ?ocr . >> } >> bind (replace(str(?ocr),'ſ','s') as ?ocr1) >> bind (replace(?ocr1,'uͤ','ü') as ?ocr2) >> bind (replace(?ocr2,'aͤ','ä') as ?ocr3) >> bind (replace(?ocr3,'oͤ','ö') as ?ocr4) >> bind (replace(?ocr4,"[⸗—]\n",'') as ?ocr5) >> bind (replace(?ocr5,"\n",' ') as ?ocr6) >> bind (replace(?ocr6,"[ ]+",' ') as ?ocr7) >> } >> >> ------------------------------------------------------------------------------- >> The source graph has some 250,000 triples that fill the WHERE criterium. >> The strings are one to two thousand characters in length. >> >> I'm running the query using the Fuseki web UI, and it ends each time with >> "Bad Request (#400) Java heap space". The fuseki log does not show any >> error except for the Bad Request #400. I'm quite surprised by this problem, >> because the update operation is a simple and straightforward data >> processing, with no ordering etc. >> >> I started with -Xmx2G, but even increasing the heap to -Xmx12G only >> increases the time it takes for Fuseki to return the same error. >> >> Is there something wrong with the SPARQL above? Is there something that >> increases the memory use unnecessarily? >> >> Best, >> >> Harri Kiiskinen >> > > > -- > > > --- > Marco Neumann > KONA > > -- --- Marco Neumann KONA