Journal article
Planter: rapid prototyping of in-network machine learning inference
- Abstract:
- In-network machine learning inference provides high throughput and low latency. It is ideally located within the network, power efficient, and improves applications' performance. Despite its advantages, the bar to in-network machine learning research is high, requiring significant expertise in programmable data planes, in addition to knowledge of machine learning and the application area. Existing solutions are mostly one-time efforts, hard to reproduce, change, or port across platforms. In this paper, we present Planter: a modular and efficient open-source framework for rapid prototyping of in-network machine learning models across a range of platforms and pipeline architectures. By identifying general mapping methodologies for machine learning algorithms, Planter introduces new machine learning mappings and improves existing ones. It provides users with several example use cases and supports different datasets, and was already extended by users to new fields and applications. Our evaluation shows that Planter improves machine learning performance compared with previous model-tailored works, while significantly reducing resource consumption and co-existing with network functionality. Planter-supported algorithms run at line rate on unmodified commodity hardware, providing billions of inference decisions per second.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 2.6MB, Terms of use)
-
- Publisher copy:
- 10.1145/3687230.3687232
Authors
- Publisher:
- Association for Computing Machinery
- Journal:
- ACM SIGCOMM Computer Communication Review More from this journal
- Volume:
- 54
- Issue:
- 1
- Pages:
- 2 - 21
- Publication date:
- 2024-08-06
- Acceptance date:
- 2024-04-02
- DOI:
- ISSN:
-
0146-4833
- Language:
-
English
- Keywords:
- Pubs id:
-
1989056
- Local pid:
-
pubs:1989056
- Deposit date:
-
2024-04-10
- ARK identifier:
Terms of use
- Copyright holder:
- Zheng et al
- Copyright date:
- 2024
- Rights statement:
- © 2024 Copyright is held by the owner/author(s). This work was partly funded by VMware and the EU Horizon SMARTEDGE (101092908, UKRI 10056403). We acknowledge support from Intel and NVIDIA. For the purpose of Open Access, the author has applied a CC BY public copyright license to any Author Accepted Manuscript (AAM) version arising from this submission.
- Notes:
- This is the accepted manuscript version of the article. The final version is available from the publisher.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record