Releases: devitocodes/devito
v4.2.3
Synopsis
- Performance optimizations in the symbolic layer and generated code for x86, GPU and MPI.
- Various minor correctness and performance bug fixes.
- Improvements to application developer API.
- Added new tutorial notebooks.
- Increased test coverage - particularly for MPI and GPU's.
Backwards compatibility breaks and deprecations
None
Changes
- Restrict pytest to < 6.0 @rhodrin (#1411)
- pip prod(deps): update distributed requirement from <2.20 to <2.22 @dependabot (#1402)
- MPI minor bug fixes @rhodrin (#1408)
- mac notebook fix @rhodrin (#1407)
- Run docker test @mloubout (#1398)
- docker: fix typos @mloubout (#1397)
- compiler: version checks avoid unreliable exception @dbowman-ion (#1386)
- Fix cross-loop blocking with imperfect nests @FabioLuporini (#1381)
- Cancel previous CI at new commit @mloubout (#1378)
- Heapification of temporaries @FabioLuporini (#1349)
- Fix missing halo exchange over invariants fields @FabioLuporini (#1359)
- tutorials: 00_index fix broken link @georgebisbas (#1356)
- CI: add autolog of PRs @mloubout (#1338)
🐛 Bug Fixes
- Fix issue #1298 @FabioLuporini (#1421)
- ir: Fix lowering of ConditionalDimension @FabioLuporini (#1419)
- mpi: Patch compute_loc_indices + test @FabioLuporini (#1416)
- Fix estimate_cost @FabioLuporini (#1405)
- Fix issue 1332 (fablup version) @FabioLuporini (#1389)
- checkpointing: set right size_ckp @ofmla (#1361)
Data
tweaks @rhodrin (#1369)- Fix min storage @FabioLuporini (#1372)
- Avoid dererencing dangling pointers on GC'd objects @tjb900 (#1316)
- Replace subdimension in expressions @mloubout (#1345)
- Indexify with non-symbolic numeric indices @mloubout (#1343)
- Fix numeric indices @mloubout (#1341)
API
- Buffer: remove unused from args to allow multi buffers @mloubout (#1420)
- operator: Expose more opt knobs to the user level @FabioLuporini (#1387)
- Function/Expr shift utility @mloubout (#1377)
- Prevent sympy cache blowup @mloubout (#1366)
- Symbolic speedup @mloubout (#1362)
Examples
- examples: fix issue #1393 @ofmla (#1400)
- tutorials: add missing ignore tag in dask tutorial @mloubout (#1403)
- Dask tutorial with operator created only once @ofmla (#945)
- tests: Fix args and test viscoacoustic @georgebisbas (#1365)
- Add norm asserts for ssa iso notebooks @jkwashbourne (#1373)
- Make dt a solve property to use latest model parameters @mloubout (#1371)
- examples: Drop unnecessary NBVALs @FabioLuporini (#1367)
- Refine CFL condition in seismic @mloubout (#1348)
- Freesurface with subdomain (iso-acoustic) @mloubout (#1344)
- Improve solve speed for complicated equations @mloubout (#1342)
- elastic-fix @mloubout (#1339)
Documentation
- tutos: Inception of performance modes notebook @georgebisbas (#1358)
- Fix issue 1185 @dabiged (#1360)
ConditionalDimension
s notebook @georgebisbas (#1269)- Add binder directory with requirements.txt including matplotlib @jkwashbourne (#1351)
MPI
- ci: Test all critical seismic examples with MPI @FabioLuporini (#1418)
- Single precision interpolation @mloubout (#1413)
- Update halo warning @rhodrin (#1370)
- Fix MPI benchmark json results @jaimesouza (#1295)
- Improve detection of redundant halo exchanges @FabioLuporini (#1352)
GPU
- Add MPI+GPU runs to CI @georgebisbas (#1251)
- Tweak CIRE for GPUs @FabioLuporini (#1409)
- Add docker publish @mloubout (#1388)
- Nvidia dockerfile @FabioLuporini (#1392)
- Tool get_gpu_info (using lshw, lspci) with test @jack-lascala (#1383)
Contributors
Many thanks to all the contributors to this release (last surname alphabetical order):
- George Bisbas (Imperial College London)
- David Bowman (ION)
- Tim Burgess (DUG)
- Jaime Freire de Souza
- Chris Dinneen
- Ken Hester (NVidia)
- Navjot Kukreja (Imperial College London)
- Giacomo La Scala
- Mathias Louboutin (Georgia Institute of Technology)
- Fabio Luporini (Devito Codes)
- Oscar Mojica (SENAI CIMATEC)
- Rhodri Nelson (Imperial College London)
- John Washbourne (Chevron)
Devito-v4.2.2
Compiler:
- Improve CIRE
- Conditionals improvement
- Improve aliases detection
Misc:
- Updated minimum SymPy version requirement
- Refreshed docs
- Various bug fixes.
- Added more tests for CI and performance regression.
Benchmark:
- Improved asv
- Added support for adjoint/jacobian/jacobian_adjoint
- Improved JIT support for AMD, added AOMP compiler
Example:
- New Skew Self Adjoint operator
- Homogenize seismic examples and Model.
Many thanks to all the contributors to this release (last surname alphabetical order):
George Bisbas (Imperial College London)
Gerard Gorman (Imperial College London)
Mathias Louboutin (Georgia Institute of Technology)
Fabio Luporini (Devito Codes)
Oscar Mojica (SENAI CIMATEC)
Rhodri Nelson (Imperial College London)
Peterson Nogueira (SENAI CIMATEC)
João Henrique Speglich (SENAI CIMATEC)
Lauê Rami Souza Costa de Jesus (SENAI CIMATEC)
John Washbourne (Chevron)
Devito-4.2.1
pip
- Removed direct dependencies to enable PyPI
Air-Speed Velocity (ASV)
- Added performance regression testing via ASV
CI/CD
- Automatically start and stop Azure VM's for testing.
Benchmarking
- Additional performance metrics
Documentation
- New examples for TTI adjoint and viscoacoustic.
- Updated project logo and documentation version.
- Various improvements to examples/notebooks.
Bugfixes
- Loop fusion.
- MPI/SubDomain.
Many thanks to all the contributors to this release (last surname alphabetical order):
George Bisbas (Imperial College London)
Gerard Gorman (Imperial College London)
Mathias Louboutin (Georgia Institute of Technology)
Fabio Luporini (Devito Codes)
Oscar Mojica (SENAI CIMATEC)
Rhodri Nelson (Imperial College London)
Peterson Nogueira (SENAI CIMATEC)
João Henrique Speglich (SENAI CIMATEC)
Lauê Rami Souza Costa de Jesus (SENAI CIMATEC)
John Washbourne (Chevron)
Devito-4.2
GPU support
- Added OpenACC 2.6 code generation support.
- Added domain decomposition support for multi-node-multi-GPU via MPI and OpenACC.
Testing, CI/CD
- Added testing with PGI compiler suite v19.10 on NVidia K80 GPU's.
- Enhanced CI testing for OpenMP 5 offloading onto GPU's.
- Enhanced CI testing for domain decomposition with MPI.
Performance optimizations
- Wide range of performance optimizations to the Devito compiler to speed-up compile time (~2-5x speedup in compilation time).
- Various low-level performance and memory optimizations (e.g. identifying and removing redundant computations).
DSL extensions
- tools.Evaluable -> finite_differences.Evaluable
- Dynamic FD shortcuts (i.e.
u.dx(fd_order=2)
). Supports FD order,x0
andside
.
YASK and OPS backends
- After a great deal of consideration, we have decided to drop support for YASK (https://01.org/yask) and OPS (https://github.com/OP-DSL/OPS) as Devito backends. While YASK and OPS have many desirable features, the software engineering overhead (person-effort) to maintain feature-complete support proved unsustainable. We would like to take this opportunity to thank Chuck Yount (YASK) and Istvan Reguly (OPS) for their collaboration on both software development and algorithm development, which has been of great benefit to Devito.
Tutorials and documentation
- Revised FAQ - https://github.com/devitocodes/devito/wiki/FAQ
- New Jupyter notebooks:
- GPU tutorial.
- PML boundary condition tutorial.
- Normal Moveout (NMO) tutorial.
- Revised viscoelastic example.
General updates
- Added Developer Certificate of Origin to Devito contribution guidelines.
- General improvements to documentation, examples and tutorials.
- Several minor bug fixes.
- Increased test coverage.
- Improvements to test code coverage reporting.
- Added PGI compiler support.
- Added MPI support for the SubDomainSet feature.
- Additional runtime warnings for inadequate problem specifications, e.g. stencil radius greater than rank-domain width.
- Deprecate DEVITO_DLE, replaced by DEVITO_OPT.
- Deprecate DEVITO_OPENMP, replaced by DEVITO_LANGUAGE.
Many thanks to all the contributors to this release (last surname alphabetical order):
- George Bisbas (Imperial College London)
- Tim Burgess (DownUnder Geosolutions)
- Edward Caunt (Imperial College London)
- Gerard Gorman (Imperial College London)
- Mathias Louboutin (Georgia Institute of Technology)
- Fabio Luporini (Devito Codes)
- Rhodri Nelson (Imperial College London)
- Peterson Nogueira (SENAI CIMATEC and UFBA)
- Joao Speglich (SENAI CIMATEC)
- John Washbourne (Chevron)
- Philipp Witte (Georgia Institute of Technology)
Special thanks and congratulations to Ken Hester (NVidia), winner of the Devito hackathon at Rice OGHPC 2020, who used the Devito JIT backdoor to develop the prototype OpenACC code which informed the implementation of OpenACC code generation support within this release.
Devito-4.1
GPUs
- Support for more Operators -- isotropic acoustic, TTI, elastic, viscoelastic, ...
- Support for reductions
- Fixed several bugs concerning code generation
DSL
- Fixed minor bugs related to derivatives and finite-differences
MPI
- Fixed several bugs arising in presence of SubDomains and/or SubDimensions
- Fixed code generation in full mode (compute-communication overlap), when the same symbol (e.g.
comm
) was appearing twice in the signature of elemental functions
Documentation
- Updated user API
- Additional tutorials in the form of Jupyter notebooks (SymPy, DSL, SubDomains)
Benchmarking
- A new mode exploiting the JIT_BACKDOOR mechanism was added
- DevitoBench package merged into Devito
CI/CD:
- Dropped Azure Pipelines and Travis; testing infrastructure moved to GitHub Actions
- Added GPU testing
- Added testing for osx
- Auto-deployment on Pypi upon release
Compiler:
- Improved architecture auto-detection
- Refactored DLE as a sequence of "standard" compilation passes
- Moved data dependence analysis to the Cluster level
- Fixed data dependence analysis in some corner cases arising with SubDomains and/or SubDimensions
- Moved interpolation to separate sub-package
- Integrated profiling for the compilation pipeline
Many thanks to all the contributors to this release (last surname alphabetical order):
- Tarek Allam (The Alan Turing Institute)
- George Bisbas (Imperial College London)
- Maelso Bruno (Federal University of Paraíba)
- Tim Burgess (DownUnder Geosolutions)
- Gerard Gorman (Imperial College London)
- Navjot Kukreja (Imperial College London)
- Mathias Louboutin (Georgia Institute of Technology)
- Fabio Luporini (Devito Codes)
- Rhodri Nelson (Imperial College London)
- BlockSprintZIf (Independent)
Devito-4.0
Tensor algebra support (#873):
VectorFunction
andVectorTimeFunction
- (2nd order)
TensorFunction
andTensorTimeFunction
- Full support for FD and related operations (derivatives, shortcuts, solve, ...)
- Differential operators such as
div
,grad
andcurl
FD extensions:
- Custom FD with user-supplied coefficients as
Function
(#964)
Extended and more rigorous support for staggered grids (#873):
- True half-grid staggering (
u(x + h_x/2)
) - Automatic evaluation at half-nodes (averaging only)
- Automatic staggered FD of any order
MPI:
- MPI data slicing fixes(#1008)
- SparseFunction fixes (gathering with correct coordinates) (#922)
- MPI parameters correctly passed to the
Operator
(#1016, #1019) - Gathering bug fix (#916)
- Domain decomposition improvements (#915, #904)
- Effective computation/communication overlap
- Fixed argument processing to acquire the correct MPI communicator
GPU:
- More progress on the OPS backend
- OpenMP-5.x offloading experimental support (#1004, #1012)
(note: compiler installation instructions available on the Devito wiki)
Performance:
Misc:
- More robust inheritance from
sympy
and support for latest version (#1005) Model
bug fixes and cleanup (#1000, #1015)Model
now initializes/pads all object via Operators (#911)CustomCompiler
now uses the system/user provided parameters correctly (#925)- Devito based Gaussian smoother (#931)
- ARM support (#940)
- Better caching infrastructure (#956) removing the majority of memory leaks
- Improved benchmarking infrastructure
Many thanks to all the contributors to this release (last surname alphabetical order):
- George Bisbas (Imperial College London)
- Maelso Bruno (Federal University of Paraíba)
- Lucas Cavalcante (Federal University of Rio Grande do Norte)
- Gabriel Sebastian von Conta (Universidade Federal do Rio Grande do Norte)
- Arthur Giannotta (University of São Paulo)
- Gerard Gorman (Imperial College London)
- Navjot Kukreja (Imperial College London)
- Mathias Louboutin (Georgia Institute of Technology)
- Fabio Luporini (Imperial College London)
- Vitor Mickus (Federal University of Rio Grande do Norte)
- Rhodri Nelson (Imperial College London)
- Vincenzo Pandolfo (G-Research)
- Elias Gabriel Amaral da Silva (Federal University of Rio Grande do Norte)
- Jaime Freire de Souza (University of São Paulo)
Devito-3.5
Release notes
- MPI support:
- Python-level: MPI-distributed NumPy arrays.
- C-level: code generation for sub-domains, staggered grids, operators with coupled PDEs.
- C-level: performance optimizations (e.g., computation-communication overlap).
- Lazy evaluation of derivatives.
- Revisited staggered grids API (now Dimension-based, previously mask-based).
- Re-engineered clustering (which means smarter loop fusion/fission).
- DSE: Improved aliases detection.
- DLE: OpenMP nested parallelism; hierarchical loop blocking.
- Auto-padding for Functions/TimeFunctions.
- Improved data dependency analysis.
- Smarter Operator auto-tuning.
- New tutorials: Operator application, MPI, new propagators, custom stencils, and more.
- Revisited benchmarking scripts.
- Revisited examples, new models and propagators (e.g., visco-elastic).
- Smarter continuous integration: now Travis sided by Azure Pipelines; dropped Jenkins.
- Misc bug fixes.
- Hundreds of tests added.
- More sophisticated platform auto-detection.
- Update in checkpointing (via PyRevolve) to significantly reduce memory consumption and overall runtimes. Not backward compatible.
- Experimental support for lossy compression of checkpoints through PyRevolve + ZFP.
Devito-3.4
Release notes
- Preliminary support for MPI (no changes to user code requested)
- Support for staggered grids
- Improved compilation technology
- Improved Operator autotuning
- More powerful DSL (e.g., take derivatives of entire expressions such as
(u+v).dx
) - More efficient pickling
- Misc bug fixes
- New modeling examples based on the elastic wave equation
- New examples describing aspects of the compilation technology
Devito-3.3
This release ships the following features, besides a large number of misc improvements and bug fixes
- Support for Docker-based installation
- Introduction of PrecomputedSparseFunction for SparseFunction with precomputed coefficients
- Smarter JIT engine, including autodetection of the backend-compiler version for ad-hoc optimisations
- Operator pickling
- Introduction of new IR level: the ScheduleTree level
- Smarter loop generation
- Better, more efficient data dependence analysis
- Caching of JIT-compiled code
- More accurate profiling reports
- Extended YASK support (e.g., boundary conditions)
DFT paper
dft-paper not parallel on default dimensions