If there are many instances of the task running on the accelerators and the amount of computation per work block is small, the ALF runtime can become overwhelmed with moving work blocks and associated data in and out of accelerator memory. In this case, multi-use work blocks can be used in conjunction with accelerator data partitioning to further improve performance for an ALF application.
For an example of how to use multi-use work blocks, refer to Implementation 2: Making use of multi-use work blocks together with task context or work block parameter/context buffers.