I'm sure this is documented somewhere, and I apologize in advance for not being able to find it.
How do I determine the model or schema of the document passed to the search engine by a given job? For instance, I'm running a job that crawls a directory on my local file system and passes to to Elastic Search. Interrogating Elastic Search, I can determine that the document has three fields, "file", "type" and "uri", all strings. How would I have known that in advance? Thanks for any help.
