-
Notifications
You must be signed in to change notification settings - Fork 14
Update README, licensing, build scripts #44
Conversation
0cd3c03
to
259a127
Compare
README.md
Outdated
@@ -1,21 +1,105 @@ | |||
# Heterogeneous Data Kernels | |||
# oneAPI Heterogeneous Data Kernels | |||
oneHDK is a low-level execution library for analytic data processing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
data analytics processing? Swap the adjective.
README.md
Outdated
Cloning a project with submodules. Either use `git clone --recurse-submodules` to clone the repo, or clone as normal and then run: | ||
### Storage | ||
|
||
`ArrowStorage` is the default (and only available) HDK storage layer. `ArrowStorage` provides storage support for [Apache Arrow](https://github.com/apache/arrow) format data. The storage layer must be explicitly initialized: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
currently?
README.md
Outdated
HDK is used as a fast execution backend in [Modin](https://github.com/intel-ai/modin). The HDK library provides a set of components for federating analytic queries to an execution backend based on [OmniSciDB](https://github.com/intel-ai/omniscidb). Currently, HDK targets OLAP-style queries expressed as relational algebra or SQL. Major and immediate project priorities include: | ||
- Introducing a HDK-specific IR and set of optimizations to reduce reliance on RelAlg and improve extensibility of the query API. | ||
- Supporting heterogeneous device execution, where a query is split across a set of hardware devices (e.g. CPU and GPU) for best performance. We have developed an initial cost model for heterogeneous execution. | ||
- Improving performance of the CPU backend on Modin-specific queries and current-generation data science workstations and servers. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
by >2x
A low-level execution library for analytic data processing. | ||
HDK is used as a fast execution backend in [Modin](https://github.com/intel-ai/modin). The HDK library provides a set of components for federating analytic queries to an execution backend based on [OmniSciDB](https://github.com/intel-ai/omniscidb). Currently, HDK targets OLAP-style queries expressed as relational algebra or SQL. Major and immediate project priorities include: | ||
- Introducing a HDK-specific IR and set of optimizations to reduce reliance on RelAlg and improve extensibility of the query API. | ||
- Supporting heterogeneous device execution, where a query is split across a set of hardware devices (e.g. CPU and GPU) for best performance. We have developed an initial cost model for heterogeneous execution. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Supporting both Intel and Nvidia GPU
README.md
Outdated
|
||
A low-level execution library for analytic data processing. | ||
HDK is used as a fast execution backend in [Modin](https://github.com/intel-ai/modin). The HDK library provides a set of components for federating analytic queries to an execution backend based on [OmniSciDB](https://github.com/intel-ai/omniscidb). Currently, HDK targets OLAP-style queries expressed as relational algebra or SQL. Major and immediate project priorities include: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also can be used standalone through pyHDK
Comments addressed w/ latest push. |
No description provided.