David Severski created DRILL-7676:
-------------------------------------

             Summary: Improve documentation for httpd format plugin
                 Key: DRILL-7676
                 URL: https://issues.apache.org/jira/browse/DRILL-7676
             Project: Apache Drill
          Issue Type: Improvement
          Components: Documentation
    Affects Versions: 1.17.0
            Reporter: David Severski


The httpd format plugin documentation
(https://drill.apache.org/docs/httpd-format-plugin/) covers the
configuration required to set up the plugin, but has no information on the
fields returned by the parser. As a casual Drill user, after configuring the
plugin, I was confused to receive over 100 fields in my first `SELECT * FROM`
query, all of which had complex names that were hard to interpret.

Conversation on the Drill Slack revealed that the underlying parser is
https://github.com/nielsbasjes/logparser, and digging into that project
eventually showed that the parser has a complicated name/type/dissector
structure. The logparser project itself doesn't describe what the hierarchy
looks like, which is unfortunate.

I'd really like an end-user-understandable enhancement to the Drill docs that
walks users through what they will get back from this parser, touching on
the various parts of the tree returned by this plugin. In particular, the
fields containing `firstline` and `lastline` were especially confusing
when I first encountered them.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
