avro support
Project: http://git-wip-us.apache.org/repos/asf/drill/repo Commit: http://git-wip-us.apache.org/repos/asf/drill/commit/313dbd6d Tree: http://git-wip-us.apache.org/repos/asf/drill/tree/313dbd6d Diff: http://git-wip-us.apache.org/repos/asf/drill/diff/313dbd6d Branch: refs/heads/gh-pages Commit: 313dbd6d82a9b5f90544afae94d0a95bc0d5fa07 Parents: c240e01 Author: Kristine Hahn <[email protected]> Authored: Sun May 17 12:26:42 2015 -0700 Committer: Kristine Hahn <[email protected]> Committed: Sun May 17 12:26:42 2015 -0700 ---------------------------------------------------------------------- .../035-plugin-configuration-introduction.md | 4 +++- .../connect-a-data-source/080-drill-default-input-format.md | 8 +++++--- .../010-data-sources-and-file-formats-introduction.md | 5 ++++- _docs/getting-started/010-drill-introduction.md | 1 + .../005-querying-a-file-system-introduction.md | 1 + 5 files changed, 14 insertions(+), 5 deletions(-) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/drill/blob/313dbd6d/_docs/connect-a-data-source/035-plugin-configuration-introduction.md ---------------------------------------------------------------------- diff --git a/_docs/connect-a-data-source/035-plugin-configuration-introduction.md b/_docs/connect-a-data-source/035-plugin-configuration-introduction.md index 332966c..0cfc206 100644 --- a/_docs/connect-a-data-source/035-plugin-configuration-introduction.md +++ b/_docs/connect-a-data-source/035-plugin-configuration-introduction.md @@ -78,7 +78,7 @@ The following table describes the attributes you configure for storage plugins i </tr> <tr> <td>"formats"</td> - <td>"psv"<br>"csv"<br>"tsv"<br>"parquet"<br>"json"<br>"maprdb"</td> + <td>"psv"<br>"csv"<br>"tsv"<br>"parquet"<br>"json"<br>"avro"<br>"maprdb"*</td> <td>yes</td> <td>One or more file formats of data Drill can read. Drill can implicitly detect some file formats based on the file extension or the first few bits of data within the file, but you need to configure an option for others.</td> </tr> @@ -102,6 +102,8 @@ The following table describes the attributes you configure for storage plugins i </tr> </table> +\* Only appears when you install Drill on a cluster using the mapr-drill package. + The configuration of other attributes, such as `size.calculator.enabled` in the hbase plugin and `configProps` in the hive plugin, are implementation-dependent and beyond the scope of this document. Although Drill can work with different file types in the same directory, restricting a Drill workspace to one file type prevents confusion. http://git-wip-us.apache.org/repos/asf/drill/blob/313dbd6d/_docs/connect-a-data-source/080-drill-default-input-format.md ---------------------------------------------------------------------- diff --git a/_docs/connect-a-data-source/080-drill-default-input-format.md b/_docs/connect-a-data-source/080-drill-default-input-format.md index 25a065b..7c41e91 100644 --- a/_docs/connect-a-data-source/080-drill-default-input-format.md +++ b/_docs/connect-a-data-source/080-drill-default-input-format.md @@ -21,11 +21,13 @@ default input format, and Drill cannot detect the file format, the query fails. You can define a default input format for any of the file types that Drill supports. Currently, Drill supports the following types: - * CSV - * TSV - * PSV + * Avro + * CSV, TSV, or PSV * Parquet * JSON + * MapR-DB* + +\* Only available when you install Drill on a cluster using the mapr-drill package. ## Defining a Default Input Format http://git-wip-us.apache.org/repos/asf/drill/blob/313dbd6d/_docs/data-sources-and-file-formats/010-data-sources-and-file-formats-introduction.md ---------------------------------------------------------------------- diff --git a/_docs/data-sources-and-file-formats/010-data-sources-and-file-formats-introduction.md b/_docs/data-sources-and-file-formats/010-data-sources-and-file-formats-introduction.md index 8ec8cee..d758a50 100644 --- a/_docs/data-sources-and-file-formats/010-data-sources-and-file-formats-introduction.md +++ b/_docs/data-sources-and-file-formats/010-data-sources-and-file-formats-introduction.md @@ -11,11 +11,14 @@ Included in the data sources that Drill supports are these key data sources: Drill supports the following input formats for data: +* [Avro](http://avro.apache.org/docs/current/spec.html) * CSV (Comma-Separated-Values) * TSV (Tab-Separated-Values) * PSV (Pipe-Separated-Values) * Parquet -* JSON +* MapR-DB* + +\* Only available when you install Drill on a cluster using the mapr-drill package. You set the input format for data coming from data sources to Drill in the workspace portion of the [storage plugin]({{ site.baseurl }}/docs/storage-plugin-registration) definition. The default input format in Drill is Parquet. http://git-wip-us.apache.org/repos/asf/drill/blob/313dbd6d/_docs/getting-started/010-drill-introduction.md ---------------------------------------------------------------------- diff --git a/_docs/getting-started/010-drill-introduction.md b/_docs/getting-started/010-drill-introduction.md index 0586738..100d9da 100644 --- a/_docs/getting-started/010-drill-introduction.md +++ b/_docs/getting-started/010-drill-introduction.md @@ -16,6 +16,7 @@ Apache Drill 1.0 offers the following new features: * [Query audit logging]({{site.baseurl}}/docs/getting-query-information/) for getting the query history on a Drillbit. * Improved connection handling. * New Errors tab in the Query Profiles UI that facilitates troubleshooting and distributed storing of profiles. +* Support for new storage plugin format: [Avro](http://avro.apache.org/docs/current/spec.html) Key features of Apache Drill are: http://git-wip-us.apache.org/repos/asf/drill/blob/313dbd6d/_docs/query-data/query-a-file-system/005-querying-a-file-system-introduction.md ---------------------------------------------------------------------- diff --git a/_docs/query-data/query-a-file-system/005-querying-a-file-system-introduction.md b/_docs/query-data/query-a-file-system/005-querying-a-file-system-introduction.md index 1238eba..6d204ca 100644 --- a/_docs/query-data/query-a-file-system/005-querying-a-file-system-introduction.md +++ b/_docs/query-data/query-a-file-system/005-querying-a-file-system-introduction.md @@ -26,6 +26,7 @@ Drill supports the following file types: * Tab-separated values (TSV, type: text) * Pipe-separated values (PSV, type: text) * Structured data files: + * Avro(type: avro) * JSON (type: json) * Parquet (type: parquet)
