summaryrefslogtreecommitdiff
path: root/lib/debugobjects.c
diff options
context:
space:
mode:
authorJiwei.Fu <jiwei.fu@nxp.com>2024-11-13 15:05:46 +0800
committerJason Liu <jason.hui.liu@nxp.com>2024-11-19 16:15:19 +0800
commit91817fe5b657734f85eb23d366854e2d235d705b (patch)
treee96d7c7e6f163c578c3fbe4ec07471e126fde365 /lib/debugobjects.c
parentda02bb4fef0bd7f249b43c41c18b78b77c16d3b3 (diff)
AIR-11882: neutron: Enable neutron memory cache to improve npu benchmark performance
To improve performance, we have implemented zero-copy optimization for neutron NPU, and third-party inference engines such as tflite can use neutron memory directly, thus avoiding to perform memcpy between neutron ddr memory and application context as follows: - Avoid copying input data from application to neutron memory. - Avoid copying output data from neutron memory back to the application. This patch enables the memory cache and let the driver maintain the memory and cache coherency.The main changes are: - Flush the input buffer cache for device before starting inference. - Invalidate the output buffer cache for cpu after inference is complete. - Flush other constant data for device via IOCTL in preparation. Signed-off-by: Jiwei.Fu <jiwei.fu@nxp.com> Reviewed-by: Forrest Shi <xuelin.shi@nxp.com> Acked-by: Jason Liu <jason.hui.liu@nxp.com>
Diffstat (limited to 'lib/debugobjects.c')
0 files changed, 0 insertions, 0 deletions