

HiveWarehouseConnector fetching only 1000 rows through hive.execute() #279

Open
shesadri opened this issue Dec 3, 2019 · 2 comments

shesadri commented Dec 3, 2019

We are using Spark 2.3.0 with Hadoop 3 to fetch records from a Hive table. While using the Hive Warehouse Connector library, we are facing an issue where only 1000 records are fetched, even though millions of records match the query we pass. Is there any possibility of overriding this limit so that we can fetch more records?

justinleet commented

I saw this issue a while ago. execute() appears to run only through the driver (and raising that limit can cause the driver to OOM), so it should be used primarily for catalog operations. The solution was to use executeQuery() instead.
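A minimal sketch of the split described above, assuming the HWC Python API (`pyspark_llap.HiveWarehouseSession`); the session-builder lines are shown as comments so the helpers themselves stay cluster-independent, and the helper names are hypothetical:

```python
# Assumed setup on a real cluster (requires Spark + HWC on the classpath):
#
#   from pyspark_llap import HiveWarehouseSession
#   hive = HiveWarehouseSession.session(spark).build()

def list_tables(hive):
    """Catalog operation: fine to run through the driver with execute(),
    whose results are capped (1000 rows by default)."""
    return hive.execute("SHOW TABLES")

def read_full_table(hive, table):
    """Full data read: use executeQuery() so rows stream through the
    executors instead of hitting execute()'s driver-side cap."""
    return hive.executeQuery("SELECT * FROM {}".format(table))
```

The point of the split is that execute() materializes results on the driver (hence the cap and the OOM risk), while executeQuery() returns a regular Spark DataFrame backed by the executors.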

guerinclement commented

> I saw this issue a bit ago. execute looks like it only runs through the driver (and increasing that limit can OOM) and should be used primarily for catalog operations. The solution was to use executeQuery instead.

Yes, but a LIMIT clause passed through executeQuery() does not return the requested number of rows (off by a factor of up to 260x in my tests).
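One workaround worth trying (an assumption on my part, not confirmed in this thread): since executeQuery() returns an ordinary Spark DataFrame, apply the row cap on the Spark side with DataFrame.limit() instead of embedding a HiveQL LIMIT in the query text. The helper name below is hypothetical:

```python
def read_first_n(hive, table, n):
    """Read a table through executeQuery(), then cap the row count on the
    Spark side rather than via a HiveQL LIMIT clause."""
    df = hive.executeQuery("SELECT * FROM {}".format(table))
    return df.limit(n)  # Spark-side limit, applied after the executor-side read
```

This sidesteps whatever the connector does with a pushed-down LIMIT, at the cost of letting Spark decide how much of the scan to short-circuit.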


3 participants