Got it working! Key nugget to know, you have to have the CFITSIO library installed before building GDAL.
CFITSIO library info: https://heasarc.gsfc.nasa.gov/docs/software/fitsio/fitsio.html Download GDAL from here: http://download.osgeo.org/gdal/CURRENT/ gunzip tar xvf ./configure --with-cfitsio make make install Now it works like a champ! On 2018/07/03 14:01:29, Susan Borda <[email protected]> wrote: > Hi- > Here is more information: > This is what I'm getting from Tika from *.fits file: > Content-Length: 40968000 Content-Type: application/fits X-Parsed-By: > org.apache.tika.parser.DefaultParser X-Parsed-By: > org.apache.tika.parser.gdal.GDALParser X-TIKA:digest:MD5: > cce03f62a68c09ec562f9e8e05b54b40 X-TIKA:digest:SHA256: > b3f0c61409cbd7f2c9aeb8bdfa0798d529383db699c1055b8a12a68267b948dd > resourceName: mirc0000.fits > > I'm hoping to see something more like this (from the header extracted with > astropy.py): > SIMPLE = T / file does conform to FITS standard BITPIX = 16 / number of > bits per data pixel NAXIS = 3 / number of data axes NAXIS1 = 1280 / length > of data axis 1 NAXIS2 = 16 / length of data axis 2 NAXIS3 = 1000 / length > of data axis 3 EXTEND = T / FITS dataset may contain extensions COMMENT > FITS (Flexible Image Transport System) format is defined in > 'AstronomyCOMMENT and Astrophysics', volume 376, page 359; bibcode: > 2001A&A...376..359H BZERO = 32768 / offset data range to that of unsigned > short BSCALE = 1 / default scaling factor DATE = '2006-09-01T04:01:02' / > File creation date (YYYY-MM-DDThh:mm:ss UTC) TELESCOP= 'CHARA array 330m > max baseline, 6dishes' / Telescope > > Thanks, > susan > > On 2018/06/26 15:28:24, Susan Borda <[email protected]> wrote: > > Hi- > > I'm working with NetCDF and FITS files and I have Tika working for > > extracting the header text in NetCDF files but I can only get basic file > > metadata for FITS files. Does header text extraction not work on FITS > > files? > > > > Followed this for FITS: > > https://wiki.apache.org/tika/TikaGDAL > > And am only seeing the basic file metadata not the actual text from the > > header. > > > > This is what I'm using for NetCDF files (also used tika --gui to see the > > header text): > > curl -X -PUT --data-binary @age4_timeseries.nc http://localhost:9998/tika > > --header "Content-type: text/-t" > > curl -T age4_timeseries.nc http://localhost:9998/tika --header "Accept: > > text/plain" > > > > I've looked through the Tika Jira and found a reference from 2012: > > https://issues.apache.org/jira/browse/TIKA-874 > > > > Any advice would be appreciated. > > > > Thanks, > > susan > > >
