[MLA-1762] reduce memory allocations from DiscreteActionOutputApplier #4922

chriselion · 2021-02-08T19:29:07Z

Proposed change(s)

There were several intermediate tensors and float arrays (1d and 2d) allocated on each call. This reuses a single float[] for the CDF, and eliminates the other intermediate allocations. Now for each action, we compute the CDF and sample the result directly to the output ActionBuffers.DiscreteActions.

Before:

After:

Types of change(s)

Optimization
Code refactor

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)

Other comments

chriselion · 2021-02-08T19:38:59Z

com.unity.ml-agents/Runtime/Inference/Utils/Multinomial.cs

-        public int Sample(float[] cmf)
+        /// <param name="branchSize">The number of possible branches, i.e. the effective size of the cmf array.</param>
+        /// <returns>A sampled index from the CMF ranging from 0 to branchSize-1.</returns>
+        public int Sample(float[] cmf, int branchSize)


Because the float[] might be larger than we need now, we also pass the effective size of the array.

We could instead (or in additionally) repeat the final sumProb value in ComputeCdf()

…ction-GC

vincentpierre · 2021-02-08T22:10:01Z

com.unity.ml-agents/Runtime/Inference/ApplierImpl.cs

        readonly ActionSpec m_ActionSpec;
+        readonly int[] m_StartActionIndices;
+        readonly float[] m_cdfBuffer;


m_CdfBuffer ?

vincentpierre · 2021-02-08T22:13:03Z

com.unity.ml-agents/Runtime/Inference/ApplierImpl.cs

-
-            if (src.data.batch != dst.data.batch)
-            {
-                throw new ArgumentException("Batch size for input and output data is different!");


Are these exceptions no longer needed ?

Not sure they were ever necessary in the first place. The temporary tensors they were referencing are no longer needed.

Chris Elion added 4 commits February 5, 2021 18:31

remove cdf and output tensor allocations

757b05d

remove temporary tensors

99e8883

remove all allocations in DiscreteActionOutputApplier.Apply()

42d2f23

cleaup and changelog

255bfa7

chriselion commented Feb 8, 2021

View reviewed changes

chriselion requested review from vincentpierre and surfnerd February 8, 2021 19:40

Merge remote-tracking branch 'origin/master' into MLA-1762-discrete-a…

f2bd60c

…ction-GC

vincentpierre approved these changes Feb 8, 2021

View reviewed changes

m_CdfBuffer

f23c0b3

surfnerd approved these changes Feb 8, 2021

View reviewed changes

chriselion merged commit 41bbd45 into master Feb 9, 2021

delete-merged-branch bot deleted the MLA-1762-discrete-action-GC branch February 9, 2021 00:52

github-actions bot locked as resolved and limited conversation to collaborators Feb 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MLA-1762] reduce memory allocations from DiscreteActionOutputApplier #4922

[MLA-1762] reduce memory allocations from DiscreteActionOutputApplier #4922

chriselion commented Feb 8, 2021 •

edited

Loading

chriselion Feb 8, 2021

vincentpierre Feb 8, 2021

chriselion Feb 8, 2021

vincentpierre Feb 8, 2021

chriselion Feb 8, 2021

[MLA-1762] reduce memory allocations from DiscreteActionOutputApplier #4922

[MLA-1762] reduce memory allocations from DiscreteActionOutputApplier #4922

Conversation

chriselion commented Feb 8, 2021 • edited Loading

Proposed change(s)

Types of change(s)

Checklist

Other comments

chriselion Feb 8, 2021

Choose a reason for hiding this comment

vincentpierre Feb 8, 2021

Choose a reason for hiding this comment

chriselion Feb 8, 2021

Choose a reason for hiding this comment

vincentpierre Feb 8, 2021

Choose a reason for hiding this comment

chriselion Feb 8, 2021

Choose a reason for hiding this comment

chriselion commented Feb 8, 2021 •

edited

Loading