Replies: 3 comments 9 replies
-
The first milestone is be to come up with a working API which can:
Some questions in bulk regarding the communication semantics and what we want to support:
|
Beta Was this translation helpful? Give feedback.
-
A word of caution here. You need to implement tag matching, ordering, and type conversion for send/recv on the same process (on potentially different streams / threads) and reductions on all supported operators/types. It sounds like a low hanging fruit to provide empty operations and I understand the appeal but that doesn't meet the semantics we already have. IMO, having an MPI and NCCL backend will be sufficient. Either one of them will be available or can be made available easily. |
Beta Was this translation helpful? Give feedback.
-
Hi folks!
KokkosComm is still a very young project, and the main target for the immediate future is to get down a first meaningful implementation based on MPI to test, refine and validate the core ideas. However, as already hinted in other threads, MPI will probably not be the one and only communication model that we support in the long run, similarly to what KokkosCore does.
IMO the first alternative that we should consider implementing, is the dummy serial mode, which just does... well, nothing. It may seem useless at first sight, but being able to gracefully disable communications is very useful in practice, because the client code can run on a small laptop as well as on a massive supercomputer out of the box, without changing a single line of code. This would be a good first step towards a multi-backend architecture because implementing this is trivial, and it would let us focus on the architectural aspects of the question. I opened this thread to initiate a discussion on this topic of supporting different backends, gather ideas on how to approach the problem, what other backends would we like to support, the semantic implications etc.
Please feel free to suggest any idea that could help achieve this!
Best regards,
Alex
Beta Was this translation helpful? Give feedback.
All reactions