Question 1 12 pts
Performance. MLPerf Inference: Tiny benchmark latencies are given for two vendors. Determine which vendor has better performance and by how much.
| Benchmark use case | Dataset | Model | Vendor A latency | Vendor B latency |
|---|---|---|---|---|
| Keyword spotting | Google speech commands | DS-CNN | 5 ms | 6 ms |
| Visual wake words | Visual wake words dataset | MobileNetV1 0.25x | 42 ms | 56 ms |
| Image classification | CIFAR10 | ResNet-8 | 90 ms | 81 ms |
| Anomaly detection | ToyADMOS | Deep AutoEncoder | 165 ms | 150 ms |
Use performance as inverse latency. For the speedup of A over B:
speedup_A_over_B = latency_B / latency_A| Benchmark | A-over-B speedup |
|---|---|
| Keyword spotting | 6 / 5 = 1.20 |
| Visual wake words | 56 / 42 = 1.33 |
| Image classification | 81 / 90 = 0.90 |
| Anomaly detection | 150 / 165 = 0.91 |
Use the geometric mean:
GM = (1.20 * 1.33 * 0.90 * 0.91)^(1/4) ~= 1.07Answer: Vendor A achieves better performance by about 1.07x.
This applies the iron-law performance equation reframed as latency ratios, summarized by a geometric mean across heterogeneous benchmarks.