Systolic Arrays (contd.)
- Practical realizations (e.g. iWARP) use quite general processors
- Enable variety of algorithms on same hardware
- But dedicated interconnect channels
- Data transfer directly from register to register across channel
- Specialized, and same problems as SIMD
- General purpose systems work well for same algorithms (locality etc.)
Example: Systolic array for 1-D convolution