Skip to content

Releases: AI-Hypercomputer/JetStream

v0.2.2

31 May 18:44
ec26ec2
Compare
Choose a tag to compare

Key Changes

  • Enable observability in JetStream Server (prometheus metrics)
  • Enable JAX profiler support on single-host JetStream Server
  • Support both text and token ids I/O for JetStream Decode API
  • Add health check API
  • Support MLPerf evaluation
  • Enable JetStream Server E2E tests
  • Increase unit test coverage (>=96%)

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.2.2

v0.2.1

03 May 21:23
dabded4
Compare
Choose a tag to compare

Key Changes

  • Support Llama3 tokenizer
  • JetStream Tokenizer refactor
  • Disaggregation preparation work

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1

v0.2.0

05 Apr 20:42
ae2ca8c
Compare
Choose a tag to compare

Major Changes

  • Support JetStream MaxText inference on Cloud TPU VM
  • Support JetStream Pytorch inference on Cloud TPU VM
  • Support Continuous Batching with interleaved mode in JetStream
  • Support online serving benchmarking

What's Changed

New Contributors

Full Changelog: https://github.com/google/JetStream/commits/v0.2.0