fpga-based host cpu offload for nma