> On 14 Mar 2022, at 11:13, Christiaan Hofman <cmhof...@gmail.com> wrote: > > > >> On 14 Mar 2022, at 10:56, Christiaan Hofman <cmhof...@gmail.com >> <mailto:cmhof...@gmail.com>> wrote: >> >> >> >>> On 14 Mar 2022, at 10:50, Christiaan Hofman <cmhof...@gmail.com >>> <mailto:cmhof...@gmail.com>> wrote: >>> >>> >>> >>>> On 14 Mar 2022, at 04:49, Mark Roberts <mroberts1...@gmail.com >>>> <mailto:mroberts1...@gmail.com>> wrote: >>>> >>>> Is there some way to get more detailed information about skim notes, i.e., >>>> other than the code framework? >>>> >>>> I have tried the skimnotes command line tool (e.g., the 'get' and 'format' >>>> commands), but it seems to only output the basic information about notes, >>>> such as the note type, page number, and note text. >>>> >>>> Perhaps(?) there's another mode for the skimnotes tool, but I couldn't >>>> find it from reading the documentation. >>>> >>>> I'd like to get more complete data on each note, such as a timestamp, the >>>> coordinates of the boxes that are highlighted in the PDF file, the >>>> highlight color, and the text contained in each box. >>>> >>>> I assume(?) this data is in the notes file, but the skimnotes app ignores >>>> it for now. >>>> >>>> I'm wondering about this because if possible I'd like to make a script >>>> that gathers my notes for a PDF file, and tries to fix words that were >>>> broken by hyphenation in the original PDF. If I can get the highlight >>>> boxes in the notes file, and the text in each box, then it should be >>>> possible to check for a hyphen character at the end of each line, and then >>>> stitch together the words that were split across lines. >>>> >>>> Any suggestions? >>>> >>>> Thanks in advance, >>>> >>>> M. >>> >>> The skimnotes tool is not a tool that can interpret the data. It only >>> copies the data around to various locations that are supported (such as >>> between extended attributes, .skim files, or within a .pdfd bundle). There >>> is no tool to interpret he data. The Wiki has information about how the >>> data is formatted. You could try to build your own tool to unarchive the >>> data from that, but that would be quite a bit of work. >>> >>> Christiaan >>> >> >> >> I can also note that in the near future the skim notes will be saved in a >> plist format, which can be read by various tools and apps, including >> AppleScript. You can already have Skim do that by activating a hidden >> preference, see the Wiki for details. >> >> Christiaan >> > > > I just remembered that the skimnotes tool *can* convert to the plist format, > which you may be able to read, using the ’skimnotes format’ command.' > skimnotes format plist SKIM_FILE' can do that. The help for skimnotes does > not say so, but you can immediately also get the skim notes plist format from > the skimnotes tool as follows: > > skimnotes get plist PDF_FILE SKIM_FILE > > This will get you a plist file in SKIM_FILE. Perhaps for other tools to read > it you have to change the extension to .plist. You could also then pass it > through plutil to convert the binary plist to xml plist (plutil -convert xml1 > PLIST_FILE), which would even be human readable. You could combine that to > get the skimnotes in xml format as follows: > > skimnotes get plist PDF_FILE - | plutil -format xml1 -o PLIST_FILE - > > Christiaan >
Small correction, I messed up ‘-format’ arguments to the commands. It should be added in skimnotes, and in plutil it is -convert: skimnotes get -format plist PDF_FILE SKIM_FILE plutil -convert xml1 PLIST_FILE skimnotes get -format plist PDF_FILE - | plutil -convert xml1 -o PLIST_FILE - If you want to go to the reverse, and write the xml plist data as skim notes, you could do: plutil -convert binary1 -o - PLIST_FILE | skimnotes set PDF_FILE - Christiaan
_______________________________________________ Skim-app-users mailing list Skim-app-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/skim-app-users