Hive has a data sampling feature that lets a query read only a sample of a table instead of the entire table. I suppose these parameters apply to those queries.
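As a rough sketch of how these settings might be used: the descriptions mention "simple LIMIT", so my understanding is that they only matter when the LIMIT optimization is switched on with hive.limit.optimize.enable (a setting not in your mail, off by default). The table name page_views below is hypothetical, and the values are just the ones from your config.

```sql
-- Assumption: the two settings only take effect once the LIMIT
-- optimization is enabled (hive.limit.optimize.enable defaults to false).
SET hive.limit.optimize.enable=true;

-- Assumed minimum size in bytes of each row, used to estimate how much
-- input is needed to satisfy the LIMIT.
SET hive.limit.row.max.size=100000;

-- Maximum number of files Hive may sample for a simple LIMIT query.
SET hive.limit.optimize.limit.file=10;

-- A "simple LIMIT" query: with the settings above, Hive can first try
-- a smaller subset of the input instead of reading the whole table.
-- page_views is a hypothetical table name.
SELECT * FROM page_views LIMIT 100;
```

I have not measured how much this helps; treat it as a starting point to experiment with, not a recommended configuration.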
You can read more at https://cwiki.apache.org/Hive/languagemanual-sampling.html

On Fri, Jan 25, 2013 at 4:42 AM, Wu, James C. <james.c...@disney.com> wrote:

> Hi,
>
> Does anyone know the meaning of these Hive settings? Their descriptions
> are not clear to me. If someone can give me an example of how they should
> be used, it would be great!
>
> <property>
>   <name>hive.limit.row.max.size</name>
>   <value>100000</value>
>   <description>When trying a smaller subset of data for simple LIMIT, how
>   much size we need to guarantee each row to have at least.</description>
> </property>
>
> <property>
>   <name>hive.limit.optimize.limit.file</name>
>   <value>10</value>
>   <description>When trying a smaller subset of data for simple LIMIT,
>   maximum number of files we can sample.</description>
> </property>
>
> Regards,
>
> James

--
Nitin Pawar