Dependency-free safetensors loader/writer in C/C++

Secure, dependency-free safetensors loader/writer in portable C/C++. Code is tested with fuzzer.

Features

Endianness

Little-endian.

Requirements

C++11 and C11 compiler

Fuzz testing

See fuzz directory.

Usage

// define only in one *.cc
#define SAFETENSORS_CPP_IMPLEMENTATION
#include "safetensors.hh"

std::string warn, err;
bool ret = safetensors::load_from_file(filename, &st, &warn, &err);

if (warn.size()) {
  std::cout << "WARN: " << warn << "\n";
}

if (!ret) {
  std::cerr << "Failed to load: " << filename << "\n";
  std::cerr << "  ERR: " << err << "\n";

  return false;
}

// Check if data_offsets are valid.
if (!safetensors::validate_data_offsets(st, err)) {
  std::cerr << "Invalid data_offsets\n";
  std::cerr << err << "\n";

  return false;
}

for (size_t i = 0; i < st.tensors.size(); i++) {
  // do something with tensor
}

for (size_t i = 0; i < st.metadata.size(); i++) {
  // do something with __metadata__
}

Please see example.cc for more details.

Compile

Windows

> vcsetup.bat

Then open solution file in build folder.

Linux and macOS

Run makefile

$ make

or

$ ./bootstrap-cmake-linux.sh
$ cd build
$ make

C API

W.I.P.

C API will be provided in safetensors-c.h for other language bindings.

Limitation

JSON part(header) is up to 100MB.
ndim is up to 8.

TODO

Strict shape size check.
Remove internal::from_chars(parse number(floating point value) from string)
- We only need int number parser
mmap load.
Save safetensors.
Do more tests.
validate dict key is valid UTF-8 string.
CI build script.
C++ STL free?
- To load safetensor in GPU or Accelerator directly
- Use nanostl? https://github.com/lighttransport/nanostl

License

MIT license

Third-party licenses

minijson(included in safetensors.hh) : MIT license
- Grisu2(parse floating point value in minijson) : MIT license
- internal::from_chars : Apache 2.0 license.
llama.cpp(mmap feature) : MIT license.
MIOpen(bf16 conversion) : MIT license.
fp16 conversion: CC0 license. https://gist.github.com/rygorous/2156668

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
cmake		cmake
fuzzer		fuzzer
gen		gen
test		test
.clang-format		.clang-format
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
bootstrap-cmake-cross-llvm-mingw.sh		bootstrap-cmake-cross-llvm-mingw.sh
bootstrap-cmake-linux.sh		bootstrap-cmake-linux.sh
example-c.c		example-c.c
example.cc		example.cc
safetensors-c.cc		safetensors-c.cc
safetensors-c.h		safetensors-c.h
safetensors.cc		safetensors.cc
safetensors.hh		safetensors.hh
serialize-example.cc		serialize-example.cc
vcsetup.bat		vcsetup.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dependency-free safetensors loader/writer in C/C++

Features

Endianness

Requirements

Fuzz testing

Usage

Compile

Windows

Linux and macOS

C API

Limitation

TODO

License

Third-party licenses

About

Releases

Packages

Languages

License

syoyo/safetensors-cpp

Folders and files

Latest commit

History

Repository files navigation

Dependency-free safetensors loader/writer in C/C++

Features

Endianness

Requirements

Fuzz testing

Usage

Compile

Windows

Linux and macOS

C API

Limitation

TODO

License

Third-party licenses

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages