bug with lucene version 3.0.1

2010-03-03 Thread thomas . boy
Hello developers, at first, i want to thank you for this project ! I like it to use this project in combination with lucene. But i think i've found a little bug in the pdf box version 1.0.0 The class org.apache.pdfbox.searchengine.lucene.IndexFiles have to be changed, because the Constructor of

Relation between COS and PD model

2010-03-03 Thread Johannes Koch
Hi, I wonder what the intended relation between the COS and the PD model is. Is the COS model the actual data model and the PD classes are only views on this data? Is the PD layer supposed to cache data (for performance issues)? If data is changed via methods in the PD layer, how are these

bug with lucene version 3.0.1

2010-03-03 Thread thomas . boy
Hello developers, please read my first mail xD there are some more changes necessary, i think but u will see Greetz Thomas Boy

Re: Relation between COS and PD model

2010-03-03 Thread Andreas Lehmkühler
Hi, Gesendet: Mi, 03. Mrz 2010 Von: Johannes Kochjohannes.k...@fit.fraunhofer.de Hi, I wonder what the intended relation between the COS and the PD model is. Is the COS model the actual data model and the PD classes are only views on this data? AFAIU yes Is the PD layer supposed to

Re: Re: Relation between COS and PD model

2010-03-03 Thread Andreas Lehmkühler
Hi, Gesendet: Mi, 03. Mrz 2010 Von: Johannes Kochjohannes.k...@fit.fraunhofer.de Hi Andreas, Andreas Lehmkühler schrieb: Gesendet: Mi, 03. Mrz 2010 Von: Johannes Kochjohannes.k...@fit.fraunhofer.de Is the PD layer supposed to cache data (for performance issues)? There may be some

Re: Re: Relation between COS and PD model

2010-03-03 Thread nisen
可以使用COSDocument 2010/3/3 Andreas Lehmkühler andr...@lehmi.de: Hi, Gesendet: Mi, 03. Mrz 2010 Von: Johannes Kochjohannes.k...@fit.fraunhofer.de Hi Andreas, Andreas Lehmkühler schrieb: Gesendet: Mi, 03. Mrz 2010 Von: Johannes Kochjohannes.k...@fit.fraunhofer.de Is the PD layer

Re: Re: Relation between COS and PD model

2010-03-03 Thread Jukka Zitting
Hi, 2010/3/3 Andreas Lehmkühler andr...@lehmi.de: Von: Johannes Kochjohannes.k...@fit.fraunhofer.de How will caching PD objects synchronize their cached PD objects with underlying COS data changed by other PD objects? I don't remember a concrete example, but I'm sure that there are a few. But

Re: Re: Relation between COS and PD model

2010-03-03 Thread nisen
Oh,my God.I use in Chinese.Sorry. I think you can use cos.COSDocument for cache。 cache in memory: new COSDocument(new pdfbox.io.RandomAccessBuffer()) if you use PD,you can use PDDocument.load( new RandomAccessBuffer()) default is filesystem, at you temp dir or you set dir. called scratchFile

[jira] Commented: (PDFBOX-595) extracted text contains character names instead of the characters themselves

2010-03-03 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12840824#action_12840824 ] Andreas Lehmkühler commented on PDFBOX-595: --- After applying PDFBOX-592 it works

Re: [gsoc2010] PDF GUI discuss frame

2010-03-03 Thread nisen
vote result: * PackageNameChoices: org.apache.pdflens (3) nisen,Mel,Adam * ProjectNameChoices :PDFLens(3) ,nisen,Mel,Adam * PluginSystemChoices: no comment * GUI_API_Choices : Pivot (3) , nisen,Mel,Adam we will use like this. 2010/2/27 Martinez, Mel - 1004 - MITLL

Re: Reopen PDFBOX-483?

2010-03-03 Thread steve poling
Hello again, I got distracted by other work from this issue and I've returned to it today. Here are the experiments I've performed: 1) view PDF with PdfReader (it renders correctly) 2) print PDF to HP LaserJet 4P (it renders with many lines and text omitted) 3) comment-out W/W* in

[jira] Commented: (PDFBOX-595) extracted text contains character names instead of the characters themselves

2010-03-03 Thread Godmar Back (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12841028#action_12841028 ] Godmar Back commented on PDFBOX-595: My report was against 0.8.0-incubator. If the

[jira] Commented: (PDFBOX-595) extracted text contains character names instead of the characters themselves

2010-03-03 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12841087#action_12841087 ] Andreas Lehmkühler commented on PDFBOX-595: --- This issue probably depends on the