Skip to content

Version 24.09.0 (September 27, 2024)

Latest
Compare
Choose a tag to compare
@elliottslaughter elliottslaughter released this 26 Sep 16:57
  • Legion
    • Bug fixes for control replication and multi-node configurations
  • Regent
    • Fixes for ROCm 6.0 code generation
  • Tools
    • Legion Prof now uses subcommands (e.g., legion_prof view) to clarify which options apply to which actions
    • Legion Prof now tracks backtraces at the points where blocking wait calls are performed by the application
    • Legion Prof reports more detailed timing information for tasks
    • Legion Prof calculates clock skew between nodes and reports it when relevant
    • Commonly used features of Legion Prof are now enabled by default
    • The old Python Legion Prof implementation is no longer supported
  • Realm
    • Point fields x, y, z and w have been replaced by methods
    • Support for launching CUDA tasks onto a CUDA stream asynchronously via cuCtxRecordEvent without the need of CUDA hijack
    • Support for CUDA fabric sharing
    • Support for host-to-host copies via CUDA DMA
    • Support for querying number of NUMA nodes from the NumaModuleConfig
    • Added reference counting for preimage operations
    • Make std::atomic as the default atomic implementation
    • Remove REALM_CXX_STANDARD, and bump the minimal requirement to C++17
    • Implemented an ABI stable wrapper for GASNetEX
    • Additional unit tests including CircularQueue, ReplicatedHeap, find_fastest_path, DynaamicTableAllocator, generate_gather_paths, TransferIteratorIndexSpace
    • Dead code cleanups and bug fixes