diff options
author | Jiwei.Fu <jiwei.fu@nxp.com> | 2024-11-13 15:05:46 +0800 |
---|---|---|
committer | Jason Liu <jason.hui.liu@nxp.com> | 2024-11-19 16:15:19 +0800 |
commit | 91817fe5b657734f85eb23d366854e2d235d705b (patch) | |
tree | e96d7c7e6f163c578c3fbe4ec07471e126fde365 /lib/debugobjects.c | |
parent | da02bb4fef0bd7f249b43c41c18b78b77c16d3b3 (diff) |
AIR-11882: neutron: Enable neutron memory cache to improve npu benchmark performance
To improve performance, we have implemented zero-copy optimization for
neutron NPU, and third-party inference engines such as tflite can use
neutron memory directly, thus avoiding to perform memcpy between neutron
ddr memory and application context as follows:
- Avoid copying input data from application to neutron memory.
- Avoid copying output data from neutron memory back to the application.
This patch enables the memory cache and let the driver maintain the
memory and cache coherency.The main changes are:
- Flush the input buffer cache for device before starting inference.
- Invalidate the output buffer cache for cpu after inference is complete.
- Flush other constant data for device via IOCTL in preparation.
Signed-off-by: Jiwei.Fu <jiwei.fu@nxp.com>
Reviewed-by: Forrest Shi <xuelin.shi@nxp.com>
Acked-by: Jason Liu <jason.hui.liu@nxp.com>
Diffstat (limited to 'lib/debugobjects.c')
0 files changed, 0 insertions, 0 deletions