-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rustify Snapshot Module #4523
Comments
Hi, can I work on this issue? |
Hello @beamandala 👋 Yes, feel free to work on this issue! Thank you so much for your interest in this issue! |
@zulinx86 I've refactored the snapshot module code based on the pattern provided and wanted to check in with you to make sure I'm heading in the right direction. Let me know if there's anything I need to change or if everything looks good. /// Firecracker snapshot header
#[derive(Debug, Serialize, Deserialize)]
struct SnapshotHdr {
/// magic value
magic: u64,
/// Snapshot data version
version: Version,
}
impl SnapshotHdr {
fn new(version: Version) -> Self {
Self {
magic: SNAPSHOT_MAGIC_ID,
version,
}
}
fn load<R: Read>(reader: &mut R) -> Result<Self, SnapshotError> {
let hdr: SnapshotHdr = deserialize(reader)?;
Ok(hdr)
}
fn store<W: Write>(&self, writer: &mut W) -> Result<(), SnapshotError> {
serialize(writer, self)
}
}
#[derive(Debug, Serialize, Deserialize)]
pub struct Snapshot<Data> {
// The snapshot version we can handle
version: Version,
data: Data,
}
/// Helper function to serialize an object to a writer
pub fn serialize<T, O>(writer: &mut T, data: &O) -> Result<(), SnapshotError>
where
T: Write,
O: Serialize + Debug,
{
bincode::serialize_into(writer, data).map_err(|err| SnapshotError::Serde(err.to_string()))
}
/// Helper function to deserialize an object from a reader
pub fn deserialize<T, O>(reader: &mut T) -> Result<O, SnapshotError>
where
T: Read,
O: DeserializeOwned + Debug,
{
// flags below are those used by default by bincode::deserialize_from, plus `with_limit`.
bincode::DefaultOptions::new()
.with_limit(VM_STATE_DESERIALIZE_LIMIT)
.with_fixint_encoding()
.allow_trailing_bytes() // need this because we deserialize header and snapshot from the same file, so after
// reading the header, there will be trailing bytes.
.deserialize_from(reader)
.map_err(|err| SnapshotError::Serde(err.to_string()))
}
impl<Data: for<'a> Deserialize<'a>> Snapshot<Data> {
pub fn load_unchecked<R: Read>(reader: &mut R) -> Result<Self, SnapshotError>
where
Data: DeserializeOwned + Debug,
{
let hdr: SnapshotHdr = deserialize(reader)?;
if hdr.magic != SNAPSHOT_MAGIC_ID {
return Err(SnapshotError::InvalidMagic(hdr.magic));
}
let data: Data = deserialize(reader)?;
Ok(Self {
version: hdr.version,
data,
})
}
pub fn load<R: Read>(reader: &mut R, snapshot_len: usize) -> Result<Self, SnapshotError>
where
Data: DeserializeOwned + Debug,
{
let mut crc_reader = CRC64Reader::new(reader);
// Fail-fast if the snapshot length is too small
let raw_snapshot_len = snapshot_len
.checked_sub(std::mem::size_of::<u64>())
.ok_or(SnapshotError::InvalidSnapshotSize)?;
// Read everything apart from the CRC.
let mut snapshot = vec![0u8; raw_snapshot_len];
crc_reader
.read_exact(&mut snapshot)
.map_err(|ref err| SnapshotError::Io(err.raw_os_error().unwrap_or(libc::EINVAL)))?;
// Since the reader updates the checksum as bytes ar being read from it, the order of these
// 2 statements is important, we first get the checksum computed on the read bytes
// then read the stored checksum.
let computed_checksum = crc_reader.checksum();
let stored_checksum: u64 = deserialize(&mut crc_reader)?;
if computed_checksum != stored_checksum {
return Err(SnapshotError::Crc64(computed_checksum));
}
let mut snapshot_slice: &[u8] = snapshot.as_mut_slice();
Snapshot::load_unchecked::<_>(&mut snapshot_slice)
}
}
impl<Data: Serialize + Debug> Snapshot<Data> {
pub fn save<W: Write>(&self, mut writer: &mut W) -> Result<(), SnapshotError> {
// Write magic value and snapshot version
serialize(&mut writer, &SnapshotHdr::new(self.version.clone()))?;
// Write data
serialize(&mut writer, &self.data)
}
pub fn save_with_crc<W: Write>(&self, writer: &mut W) -> Result<(), SnapshotError> {
let mut crc_writer = CRC64Writer::new(writer);
self.save(&mut crc_writer)?;
// Now write CRC value
let checksum = crc_writer.checksum();
serialize(&mut crc_writer, &checksum)
}
} |
I have been looking through our usage of snapshots and I think we can remove |
Hi @beamandala, I also think @ShadowCurse's comment about specializing the generics to |
@roypat The
The way we can keep it within Rust safety is to create a wrapper that can be created from struct FileBytes<'a> {
slice: &'a[u8],
}
impl<'a> FileBytes<'a> {
fn new(file: &'a File) -> Self {
let length = file.metadata().size();
// Safe as file fd and length are valid
let ptr = usafe { libc::mmap(0, length, ..., file.as_raw_fd(), 0 ) };
// Safe as we just created a mapping. This just converts it to
// more convenient Rust type
let slice = unsafe { std::slice::from_raw_parts(ptr, length) };
Self { slice }
}
}
// For convenient usage
impl<'a> Deref for FileBytes<'a> {
type Target = &[u8];
...
} |
Hey @zulinx86 @roypat @ShadowCurse, I opened up #4691 as a draft PR. Any feedback on it would be great. I apologize for the inactivity but I'll actively work on it now. Thanks! |
Hi @beamandala, |
Hi @beamandala, |
Sorry, I missed your previous comment. My two questions are whether the snapshot module I rewrote matches what you were expecting. I also refactored some of the existing snapshot code and wanted to make sure those changes looked good so I can continue doing it for the other places where snapshot is used. |
No worries :) I've left some comments on the PR. I think the only big comment is about losing the |
As part of the work done to remove support for versionize, we realized that we could improve the API of the snapshot module.
The pattern that would like to follow is as follows:
Checklist:
The text was updated successfully, but these errors were encountered: