Paul Rogers created DRILL-4740: ---------------------------------- Summary: Awkward wording in "Analyzing the Yelp Academic Dataset" Key: DRILL-4740 URL: https://issues.apache.org/jira/browse/DRILL-4740 Project: Apache Drill Issue Type: Improvement Components: Documentation Affects Versions: 1.6.0 Reporter: Paul Rogers Priority: Minor
Consider the topic paragraph for the Yelp sample data page: http://drill.apache.org/docs/analyzing-the-yelp-academic-dataset/ It could use a bit of TLC. For example: "Apache Drill is one of the fastest growing open source projects, with the community making rapid progress with monthly releases The key difference is Drill’s agility and flexibility." This is a non-sequiter. The speed and agility of the software does not drive the monthly releases. Can we reword it to say that Drill’s speed and agility makes it a popular project? And that many people work hard to make it better with monthly releases? Something like that... (Although, at present, releases have dropped to bi-monthly or quarterly...) And: "Along with meeting the table stakes for SQL-on-Hadoop, which is to achieve low latency performance at scale, …" Seems two problems. 1. What does it mean “meeting the table stakes”? Very unclear. 2. This is a run-on sentence that tries to say multiple thoughts in a single sentence and should be rewritten. Then, there is redundancy: "...Drill allows users to analyze the data without any ETL or up-front schema definitions. … Drill, has a “no schema” approach…" I’m sure this paragraph was written quickly early on, but it could certainly be improved a bit… -- This message was sent by Atlassian JIRA (v6.3.4#6332)