How to use Rockchip 6TOPs NPU to accelerate the processing of network packets, do you have any good ideas?
SoC: Rockchip RK3588
CPU: Quad-core ARM Cortex-A76(up to 2.4GHz) and quad-core Cortex-A55 CPU (up to 1.8GHz)
GPU: Mali-G610 MP4, compatible with OpenGLES 1.1, 2.0, and 3.2, OpenCL up to 2.2 and Vulkan1.2
VPU: 8K@60fps H.265 and VP9 decoder, 8K@30fps H.264 decoder, 4K@60fps AV1 decoder, 8K@30fps H.264 and H.265 encoder
NPU: 6TOPs, supports INT4/INT8/INT16/FP16
That being said, you don't really need network accelerators for this class of CPU. Raspberry Pi 4 can process and route packets at gigabit speeds with its Cortex-A72 using only software. So the RK3588's Cortex-A76 should be up to the task by itself.
What part of packet processing you can offload to a dsp? Like linear determinant of packet checksums or gaussian blur of video data?
Even crypto co-processors usually add latency as data needs to be moved between (pci) buses using cpu and ram.