I'm a bit confused by the parsed output from an .eml file. Below is the metadata Tika returned for an embedded image.
Questions:
- How is it useful for "File Modified Date" to be today's date? This
email is from 2010, so I'd expect any dates to correspond to that.
- In the same vein, why print out the name of the .tmp file Tika extracted
the embedded image to?
- Why is there an "X-TIKA:embedded_resource_path" but no corresponding
"resourceName"? Is it because no name exists in the embedded context so
Tika generates one? If so, then why have an X-TIKA:embedded_resource_path
metadata item at all?
{
"Component 1": "Y component: Quantization table 0, Sampling factors 2 horiz/2 vert",
"Component 2": "Cb component: Quantization table 1, Sampling factors 1 horiz/1 vert",
"Component 3": "Cr component: Quantization table 1, Sampling factors 1 horiz/1 vert",
"Compression Type": "Baseline",
"Content-Type": "image/jpeg",
"Data Precision": "8 bits",
"File Modified Date": "Fri Jan 06 14:23:00 -05:00 2023",
"File Name": "apache-tika-2005517581185548340.tmp",
"File Size": "45006 bytes",
"Image Height": "125 pixels",
"Image Width": "894 pixels",
"Multipart-Boundary": "=_related 00719D7F852576AB_=",
"Multipart-Subtype": "related",
"Number of Components": "3",
"Number of Tables": "4 Huffman tables",
"Resolution Units": "none",
"Thumbnail Height Pixels": "0",
"Thumbnail Width Pixels": "0",
"Version": "1.1",
"X Resolution": "1 dot",
"X-TIKA:Parsed-By": [
"org.apache.tika.parser.DefaultParser",
"org.apache.tika.parser.image.JpegParser"
],
"X-TIKA:embedded_depth": "1",
"X-TIKA:embedded_resource_path": "/embedded-1",
"X-TIKA:parse_time_millis": "16",
"Y Resolution": "1 dot",
"embeddedResourceType": "ATTACHMENT",
"tiff:BitsPerSample": "8",
"tiff:ImageLength": "125",
"tiff:ImageWidth": "894"
}
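
To make the first two questions concrete, here is a minimal sketch of what filtering these fields out downstream might look like. It assumes that "File Modified Date", "File Name" and "File Size" all describe the temporary file rather than the original attachment, which is only my reading of the output above. The names TEMP_FILE_KEYS and drop_temp_file_fields are hypothetical and not part of Tika.

```python
import json

# Assumption: these keys describe the .tmp file Tika wrote the image to,
# not the embedded image as it existed in the 2010 email.
TEMP_FILE_KEYS = frozenset({"File Modified Date", "File Name", "File Size"})


def drop_temp_file_fields(metadata, keys=TEMP_FILE_KEYS):
    """Return a copy of one metadata object without the temp-file fields."""
    return {k: v for k, v in metadata.items() if k not in keys}


# A subset of the metadata object shown above.
sample = json.loads("""
{
  "Content-Type": "image/jpeg",
  "File Modified Date": "Fri Jan 06 14:23:00 -05:00 2023",
  "File Name": "apache-tika-2005517581185548340.tmp",
  "File Size": "45006 bytes",
  "X-TIKA:embedded_resource_path": "/embedded-1",
  "embeddedResourceType": "ATTACHMENT"
}
""")

cleaned = drop_temp_file_fields(sample)
print(json.dumps(cleaned, indent=2))
```

Having to do this for every embedded resource is what prompts the questions: if the fields only reflect the temp file, it is unclear why they are emitted at all.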
