Latency and bandwidth efficient communication through system customization for embedded multiprocessors