We can remove the serialization overhead of requests in a few different cases:
- When the submitter is the coordinator
- When the consumer is the coordinator
- When the consumer is the same as the submitter
With these changes, performance on a single-node cluster should be much closer to that of a normal ExecutorService.
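
As a rough illustration of the three cases, here is a minimal sketch, not the code in this change. All names (`LocalShortCircuit`, `Task`, `transfer`, `route`) are hypothetical, nodes are identified by plain strings, and a "remote" hop is simulated with a Java serialization round trip. It also assumes that when the consumer is the submitter, the submitter still holds the original task object and can run it directly.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.concurrent.Callable;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical sketch: skip serialization when both ends of a hop are the same node. */
final class LocalShortCircuit {
    /** A task that can be shipped between nodes (hypothetical type). */
    public interface Task<T> extends Callable<T>, Serializable {
        @Override
        T call();
    }

    /** Trivial task used for demonstration. */
    public static final class Answer implements Task<Integer> {
        private static final long serialVersionUID = 1L;

        @Override
        public Integer call() {
            return 42;
        }
    }

    /** Counts how many times a request was actually serialized. */
    public static final AtomicInteger serializations = new AtomicInteger();

    /** Hands a task from one node to another, copying it only if the nodes differ. */
    @SuppressWarnings("unchecked")
    public static <T> Task<T> transfer(String fromNode, String toNode, Task<T> task) {
        if (fromNode.equals(toNode)) {
            return task; // same node: pass the reference, no serialization
        }
        serializations.incrementAndGet();
        try {
            // Simulate a network hop with a serialize/deserialize round trip.
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(task);
            }
            try (ObjectInputStream in =
                    new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                return (Task<T>) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("could not copy task between nodes", e);
        }
    }

    /** Routes a task submitter -> coordinator -> consumer, skipping copies where possible. */
    public static <T> Task<T> route(String submitter, String coordinator, String consumer,
                                    Task<T> task) {
        if (consumer.equals(submitter)) {
            // Assumption: the submitter kept the original object, so the consumer reuses it.
            return task;
        }
        Task<T> atCoordinator = transfer(submitter, coordinator, task);
        return transfer(coordinator, consumer, atCoordinator);
    }
}
```

In this sketch, a single-node cluster has the submitter, coordinator, and consumer all on the same node, so `route` returns the original object and the counter stays at zero.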