The system programming languages (C / C++ / Rust) will always have more overhead than FPGAs.
A microsecond is fast for C++. It’s like an eternity for an FPGA.