
easy to use interface for hive yielding pandas DataFrames


wabu/pyhive


The pyhive package provides an easy-to-use interface to Hive that returns query results as pandas DataFrames. It connects directly to a HiveServer2 instance using Thrift/aio-hs2.

Installation

You can use pip to install the package:

pip install git+https://github.com/wabu/pyhive.git

Alternatively, download or clone the repository with git and run the setup script:

./setup.py develop

Usage

from hive import Hive
hive = Hive('hiveserver')
hive.execute('use foobar')  # command without getting results
hive.fetch('show tables')   # get results from command as dataframe
# iterate over long result, getting chunks of data
for chunk in hive.iter('select * from baz limit 1000000'):
    print(chunk.tail())
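
Because hive.iter yields the result in chunks, a long result can be processed piece by piece without keeping all of it in memory. The following is a minimal sketch of that pattern, not part of pyhive: the helper name summarize_chunks is hypothetical, and it only assumes that each chunk supports len(), which for a pandas DataFrame gives the number of rows.

```python
def summarize_chunks(chunks):
    """Count chunks and rows of a chunked result without storing the chunks.

    `chunks` is any iterable of sized objects; with pyhive it would be the
    iterator returned by hive.iter(...), which yields pandas DataFrames.
    This helper is a hypothetical illustration, not a pyhive function.
    """
    n_chunks = 0
    n_rows = 0
    for chunk in chunks:
        n_chunks += 1
        n_rows += len(chunk)  # len() of a DataFrame is its row count
    return n_chunks, n_rows


# Intended use against a live server (not run here):
#   n_chunks, n_rows = summarize_chunks(hive.iter('select * from baz limit 1000000'))
```

Each chunk goes out of scope once it has been counted, so memory use stays bounded by the size of a single chunk rather than the full result.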
