Announcement_gmorph
Our paper, “GMorph: Accelerating Multi-DNN Inference via Model Fusion”, has been accepted at EuroSys’24. The paper proposes “model fusion”, a new approach that fuses multiple task-specific, pre-trained, heterogeneous DNNs into a single multi-task model to reduce inference latency.