Github user takuti commented on a diff in the pull request: https://github.com/apache/incubator-hivemall/pull/158#discussion_r213890928 --- Diff: docs/gitbook/getting_started/tutorial.md --- @@ -0,0 +1,493 @@ +<!-- + Licensed to the Apache Software Foundation (ASF) under one + or more contributor license agreements. See the NOTICE file + distributed with this work for additional information + regarding copyright ownership. The ASF licenses this file + to you under the Apache License, Version 2.0 (the + "License"); you may not use this file except in compliance + with the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, + software distributed under the License is distributed on an + "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY + KIND, either express or implied. See the License for the + specific language governing permissions and limitations + under the License. +--> + +# Step-by-Step Tutorial on Supervised Learning with Apache Hivemall + +<!-- toc --> + +## What is Hivemall? + +[Apache Hive](https://hive.apache.org/) is a data warehousing solution that enables us to process large-scale data in the form of SQL easily. Assume that you have a table named `purchase_history` which can be artificially created as: + +```sql +create table if not exists purchase_history +(id bigint, day_of_week string, price int, category string, label int) --- End diff -- `gender string` is missing.
---