
Support loading ONNX models directly #141

Open
robertknight opened this issue May 4, 2024 · 1 comment

robertknight commented May 4, 2024

ONNX models currently have to be converted into the FlatBuffers-based .rten format before they can be used.

The .rten format is intended to support efficient loading and to have a small code footprint (it serves a similar role to ORT). However, the need to convert models is a barrier to adopting this library, and it is inconvenient for projects that want to trial or mix different runtimes. Being able to load .onnx models directly would reduce that friction.

robertknight commented:

I had a look at Rust Protocol Buffers runtimes:

  • Prost has the widest adoption, but it has downsides:
  • quick-protobuf has less adoption and hasn't been released in a while, but seems much more aligned with this project's priorities:
    • Minimizes allocations when deserializing
    • Does not require protoc to be installed
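To make the allocation point concrete: any protobuf runtime ultimately walks the same wire format, where each field is a varint key followed by its payload, and length-delimited fields can be borrowed as slices of the input buffer rather than copied. Below is an illustrative Python sketch (not rten or quick-protobuf code) of that decoding loop; the field numbers in the example (1 and 7) happen to match `ir_version` and `graph` in ONNX's `ModelProto`, but the parser itself is schema-agnostic.

```python
def read_varint(buf: bytes, pos: int) -> tuple[int, int]:
    """Decode a base-128 varint starting at `pos`; return (value, new_pos)."""
    result = shift = 0
    while True:
        b = buf[pos]
        pos += 1
        result |= (b & 0x7F) << shift
        if not b & 0x80:
            return result, pos
        shift += 7


def scan_fields(buf: bytes):
    """Yield (field_number, wire_type, payload) for each top-level field.

    Each field starts with a varint key encoding (field_number << 3) | wire_type.
    Wire type 0 is a varint scalar; wire type 2 is length-delimited (strings,
    bytes, submessages) and is yielded as a slice of the input, without copying.
    """
    pos = 0
    while pos < len(buf):
        key, pos = read_varint(buf, pos)
        field, wire = key >> 3, key & 7
        if wire == 0:  # varint scalar
            value, pos = read_varint(buf, pos)
            yield field, wire, value
        elif wire == 2:  # length-delimited: borrow a slice, no allocation of a copy
            length, pos = read_varint(buf, pos)
            yield field, wire, buf[pos:pos + length]
            pos += length
        else:
            raise NotImplementedError(f"wire type {wire} not handled in this sketch")


# Example message: field 1 = varint 7, field 7 = 3 length-delimited bytes.
msg = bytes([0x08, 0x07, 0x3A, 0x03, 0x61, 0x62, 0x63])
print(list(scan_fields(msg)))  # → [(1, 0, 7), (7, 2, b'abc')]
```

A zero-copy runtime applies the same idea at scale: the large tensor-weight byte fields in an ONNX file can be exposed as borrowed slices of the mapped file, which is why this property matters for model loading.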
