We came across this issue as well, Zeppelin csv export is using the data
URI scheme which is base64 encoding all the rows into a single string,
Chrome seems to crash with over a few thousand rows, but Firefox has been
able to handle over 100k for me. However, the Zeppelin notebook itself
becomes slow at that point. I would also like better support for the
ability to export a large set of rows, perhaps another tool is more
preferred?

On Tue, May 2, 2017 at 10:00 AM, Ruslan Dautkhanov <dautkha...@gmail.com>
wrote:

> Good idea to introduce in Zeppelin a way to download full datasets without
> actually visualizing them.
>
> Not sure if this helps, we taught our users to use %sh hadoop fs -getmerge
> /hadoop/path/dir/ /some/nfs/mount/
> for large files (they sometimes have to download datasets with millions of
> records).
> They run Zeppelin on edge nodes that have NFS mounts to a drop zone.
>
> ps. Hue has a limit too, by default 100k rows
> https://github.com/cloudera/hue/blob/release-3.12.0/
> desktop/conf.dist/hue.ini#L905
> Not sure how much it scales up.
>
>
>
> --
> Ruslan Dautkhanov
>
> On Tue, May 2, 2017 at 10:41 AM, Paul Brenner <pbren...@placeiq.com>
> wrote:
>
>> There are limits to how much data the download to csv button will
>> download (1.5MB? 3500 rows?) which limit zeppelin’s usefulness for our BI
>> teams. This limit comes up far before we run into issues with showing too
>> many rows of data in zeppelin.
>>
>> Unfortunately (fortunately?) Hue is the other tool the BI team has been
>> using and there they have no problem downloading much larger datasets to
>> csv. This is definitely not a requirement I’ve ever run into in the way I
>> use zeppelin since I would just use spark to write the data out. However,
>> the BI team is not allowed to run spark jobs (they use hive via jdbc) so
>> that download to csv button is pretty important to them.
>>
>> Would it be possible to significantly increase the limit? Even better
>> would it be possible to download more data than is shown? I assume this is
>> the type of thing I would need to open a ticket for, but I wanted to ask
>> here first.
>>
>> <http://www.placeiq.com/> <http://www.placeiq.com/>
>> <http://www.placeiq.com/> Paul Brenner <https://twitter.com/placeiq>
>> <https://twitter.com/placeiq> <https://twitter.com/placeiq>
>> <https://www.facebook.com/PlaceIQ> <https://www.facebook.com/PlaceIQ>
>> <https://www.linkedin.com/company/placeiq>
>> <https://www.linkedin.com/company/placeiq>
>> DATA SCIENTIST
>> *(217) 390-3033 <(217)%20390-3033> *
>>
>> <http://www.placeiq.com/2015/05/26/placeiq-named-winner-of-prestigious-2015-oracle-data-cloud-activate-award/>
>> <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/>
>> <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/>
>> <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/>
>> <http://placeiq.com/2015/12/18/accuracy-vs-precision-in-location-data-mma-webinar/>
>> <http://placeiq.com/2016/03/08/measuring-addressable-tv-campaigns-is-now-possible/>
>> <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/>
>> <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/>
>> <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/>
>> <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/>
>> <http://placeiq.com/2016/04/13/placeiq-joins-the-network-advertising-initiative-nai-as-100th-member/>
>> <http://pages.placeiq.com/Location-Data-Accuracy-Whitepaper-Download.html?utm_source=Signature&utm_medium=Email&utm_campaign=AccuracyWP>
>> <http://placeiq.com/2016/08/03/placeiq-bolsters-location-intelligence-platform-with-mastercard-insights/>
>> <http://placeiq.com/2016/10/26/the-making-of-a-location-data-industry-milestone/>[image:
>> PlaceIQ:Location Data Accuracy]
>> <http://placeiq.com/2016/12/07/placeiq-introduces-landmark-a-groundbreaking-offering-that-delivers-access-to-the-highest-quality-location-data-for-insights-that-fuel-limitless-business-decisions/>
>>
>
>

Reply via email to