This repository has been archived by the owner on Mar 1, 2023. It is now read-only.

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark, designed to be compatible with Apache Hive. It can answer HiveQL queries up to 100 times faster than Hive without modification to the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.2 requires Scala 2.9.2, Hive 0.9, and Spark 0.6.

For current documentation, see the Shark Project Wiki.