Initial import of OpenSHMEM support #4571

ct-clmsn · 2023-12-21T18:39:32Z

This PR provides llama.cpp support for OpenSHMEM as per #4570 . The implementation provides cmake scripts to detect OpenSHMEM and PMI, makefile support, updates to the README.md, and the openshmem support implementation in C. The implementation is modeled after the existing MPI support.

…ble name

…mem calls

…ontext struct

AutonomicPerfectionist · 2023-12-22T18:57:06Z

Be warned that the current MPI implementation is broken, see #3334 for a mostly working implementation. There is still an issue on that branch related to the KV cache operations. There are further, smaller bugs here and there that I've fixed in a different branch of mine, so if you have any problems with your implementation you should be able to look at my branches for possible solutions

ct-clmsn · 2023-12-22T20:10:17Z

@AutonomicPerfectionist cool, will review #3334 and make modifications as necessary. thank you for the heads up.

ct-clmsn · 2023-12-23T02:13:48Z

@AutonomicPerfectionist is your mpi example program a sound representative sample of the main program driver? (int main)

AutonomicPerfectionist · 2023-12-23T02:31:41Z

That was mainly a playground example I was using to verify my fixes worked, I plan on removing it as soon as I have time. The regular main example on that branch should work with MPI

ct-clmsn · 2023-12-23T15:57:02Z

@AutonomicPerfectionist other question, I read there's a bug in the MPI code related to the 0 rank and the other ranks. Something about an end of stream token, could you expand on that issue?

AutonomicPerfectionist · 2023-12-23T16:27:19Z

The original design didn't have a way to cleanly terminate the worker nodes when the head node exited. It's not really a bug to do with tokens, just that there was no way to signal that it's time to exit. The code on my PR doesn't fix that as it's out of scope for it, but I did fix it on a different branch of mine by using tags and MPI_Probe to determine the type of message being received

ct-clmsn · 2023-12-23T22:26:22Z

@AutonomicPerfectionist interesting, so the compute ranks (ranks != 0) are in a loop, and for the compute ranks to break from the loop, they need a "break message". Then they break the loop and call the MPI finalization function wrapper. Which of your branches has a fix? I'd like to look it over, much appreciated!

AutonomicPerfectionist · 2023-12-25T21:01:23Z

Yep, that's exactly right. The branch that has a fix is mpi-speculative, but be warned that this branch is extremely volatile and is not designed to be merged upstream. It's my research branch for my master's degree, and many of the design decisions were made in light of significant time constraints. There's also a significant number of changes on that branch, most of which are undocumented

ct-clmsn added 10 commits December 20, 2023 22:44

initial import

fcfe07f

added baseline makefile support; fixed several compilation warnings

9604114

updated README.md and Makefile

79be614

improved README.md

6aad7af

added comment

ea1331a

updated README.md, fixed small documentation issues; modified a varia…

0de3b02

…ble name

fixed small segmentation bug; switched to using type-sensitive opensh…

fa49c15

…mem calls

added explicit casting; fixed small memcpy issue

6d08bac

did some formatting

ecf9c79

fixed formatting

46bcbf3

ct-clmsn mentioned this pull request Dec 21, 2023

OpenSHMEM support #4570

Closed

ct-clmsn added 4 commits December 21, 2023 18:25

added correct use of shmem_free

3f2769b

reduced the number of shmem_calloc calls

c8d6770

cleaned up pointer arithmetic; rm'd a member variable of the oshmem c…

d05fcad

…ontext struct

cleaned up pointer arithmetic

eb0f775

added oshmem backend to llama.cpp

71f4c96

updated thread support

d3f1557

cebtenzzre mentioned this pull request Jan 23, 2024

Split workload to multiple computer for better performance nomic-ai/gpt4all#1869

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial import of OpenSHMEM support #4571

Initial import of OpenSHMEM support #4571

ct-clmsn commented Dec 21, 2023 •

edited

Loading

AutonomicPerfectionist commented Dec 22, 2023

ct-clmsn commented Dec 22, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 23, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 23, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 25, 2023

Initial import of OpenSHMEM support #4571

Are you sure you want to change the base?

Initial import of OpenSHMEM support #4571

Conversation

ct-clmsn commented Dec 21, 2023 • edited Loading

AutonomicPerfectionist commented Dec 22, 2023

ct-clmsn commented Dec 22, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 23, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 23, 2023

ct-clmsn commented Dec 23, 2023

AutonomicPerfectionist commented Dec 25, 2023

ct-clmsn commented Dec 21, 2023 •

edited

Loading