By using high - performance FPGAs, the Project Brainwave team was able to
serve Deep Neural Networks (DNNs) as hardware microservices, which reduced latency by removing the need of processing of incoming requests by the CPU, and allowed very high throughput, because the FPGA could process requests as fast as the network could stream them.