Sampling
Hive抽样
select * from my_table
limit 10000;select * from my_table
order by rand()
limit 10000;select * from my_table
sort by rand()
limit 10000;select * from my_table
distribute by rand()
sort by rand()
limit 10000;Reference
Last updated