PECOS v1.0.0
Highlights
- Enable C++ interface of HNSW with 4bit PQ for faster inference of ANN
- Release AWS Batch CDK to automate AWS infra construction for distributed training
- Accelerate PECOS Hierarchical Clustering
- Memory-mapped Utility of XR-Linear and HNSW for faster loading large models
Enhancements
C++ Interface of HNSW with 4bit Product Quantization
- Add HNSW-QG AVX512f version by @Patrick-H-Chen in #175
- Add HNSWPQ4Bits example by @Patrick-H-Chen in #176
CDK for Automated Distributed XR-Linear Training
- AWS multi-node Batch CDK for distributed PECOS by @weiliw-amz in #178
- Fix AWS multi-node batch cdk docker file bug by @weiliw-amz in #197
- AWS CDK build image from local PECOS source by @weiliw-amz in #198
- Add make clean in CDK Dockerfile by @weiliw-amz in #201
- Add JSON API for distributed XR-Linear training by @jiong-zhang in #208
Accelerated Hierarchical Clustering
- Accelerate clustering with sparse-dense vector and parallel sorting by @yaushian in #183
- Implement sampling in clustering.hpp by @yaushian in #187
- Python interface for sampling of hierarchical clustering by @yaushian in #190
- Switch between sdvec and dense vec by @yaushian in #203
- Python code for sampling of distributed clustering by @yaushian in #204
Memory-mapped Utility for XR-Linear and HNSW
- Refactor chunked matrix for accommodating memory-mapped utility module by @weiliw-amz in #192
- Add memory-mapped utilility module by @weiliw-amz in #189
- Memory-mapped XLinear Model by @weiliw-amz in #195
- Enabled C++ memory-map usage in PECOS-HNSW by @OctoberChang in #209
- Enable Python API of memory-map IO for PECOS-HNSW by @OctoberChang in #212
Bug Fixes
- Temporarily Pin setuptools<=60.0 in Wheel Build by @weiliw-amz in #173
- Remove PretrainedTransformer Vectorizer to avoid Pytest Error by @OctoberChang in #179
- Bump joblib from 1.0.1 to 1.2.0 in /examples/xr-transformer-neurips21 by @dependabot in #181
- Pin build-pypi-wheel runner image to ubuntu-20.04 by @weiliw-amz in #188
- Remove deprecated hierarchical_kmeans and related unittest by @weiliw-amz in #193
- Remove Ubuntu-18.04, Python3.6 and add Ubuntu-22.04, Python3.10 in Actions by @weiliw-amz in #194
- Remove sphinx_rtd_theme from requirements by @weiliw-amz in #200
- Upgrade codeql action to v2 by @weiliw-amz in #199
- Bump Python and Ubuntu version in README.md by @weiliw-amz in #213
Documentations
- Refactor PECOS KDD2022 tutorial instructions and materials by @hallogameboy, @jiong-zhang, and @OctoberChang in #164 #165 #167 #168 #170
- Update PECOS-related paper citations by @jiong-zhang and @OctoberChang in #169 #180
- Add the example folder of PINA by @jiong-zhang in #182
Full Changelog: v0.4.0...v1.0.0