Understanding Performance Gains of Accelerator-rich Architectures