
Relationship to llama.cpp #10

Open · dokterbob opened this issue Oct 18, 2024 · 7 comments

@dokterbob commented Oct 18, 2024

First of all: CONGRATS ON YOUR AMAZING RESEARCH WORK.

Considering that this is using GGML and seems to be based directly on llama.cpp:

Why is this a separate project from llama.cpp, given that llama.cpp already supports BitNet ternary quants (ggerganov/llama.cpp#8151)?

- Are these simply more optimised kernels?
- If so, how do they compare to llama.cpp's implementation?
- Can/should they be contributed back to llama.cpp?
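For context, ggerganov/llama.cpp#8151 added the TQ1_0 (~1.69 bpw) and TQ2_0 (~2.06 bpw) ternary packing types, so a BitNet-style GGUF can already be quantized and run with stock llama.cpp tooling. A rough sketch of what that path looks like (model paths are placeholders, and the checkpoint must be an actual ternary model):

```bash
# Convert a ternary (BitNet-style) checkpoint to GGUF; paths are placeholders
python convert_hf_to_gguf.py ./bitnet-model --outfile model-f16.gguf

# Pack to the ternary types added in #8151
./llama-quantize model-f16.gguf model-tq1_0.gguf TQ1_0   # ~1.69 bits per weight
./llama-quantize model-f16.gguf model-tq2_0.gguf TQ2_0   # ~2.06 bits per weight

# Run with the stock llama.cpp CLI
./llama-cli -m model-tq2_0.gguf -p "Hello" -n 32
```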

@sd983527 (Collaborator)

Great question! Yes, this is inherited from llama.cpp, as noted in the Acknowledgements section; we had pushed our model support code into llama.cpp via ggerganov/llama.cpp#7931. However, some framework refinements in bitnet.cpp have hard conflicts with the original llama.cpp code, so a new repo was needed.
Compared to llama.cpp, bitnet.cpp's inference result is exactly the same as the "ground truth" one. We will add more explanation in the README file later, thanks.
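One informal way to sanity-check the "exactly the same" claim is to run both runtimes greedily (temperature 0, so decoding is deterministic) on an identical prompt and model, then diff the transcripts. A sketch, assuming each repo's documented entry point (the exact run_inference.py flags are an assumption; check the BitNet README):

```bash
# llama.cpp side: greedy decoding for a deterministic output
./llama-cli -m model.gguf -p "The capital of France is" -n 16 --temp 0 > llama_out.txt

# bitnet.cpp side: run_inference.py is the repo's entry point;
# these flags mirror the README examples but are not guaranteed here
python run_inference.py -m model.gguf -p "The capital of France is" -n 16 -temp 0 > bitnet_out.txt

diff llama_out.txt bitnet_out.txt && echo "outputs match"
```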

@eabase commented Oct 20, 2024

@sd983527

> ...there are some framework refinements in bitnet.cpp that have hard conflicts...

Sorry, but this doesn't sound like a very credible reason, especially given the MS history of taking over other (often FOSS) code and making it their own. It should be stated clearly up front, not as a footnote, that this code is a fork of llama.cpp, and exactly why the fork was needed.

Still waiting for someone to address the questions from @dokterbob.

@ZipingL commented Oct 21, 2024

Can someone just open a pull request of what's been done here to llama.cpp? Thanks, this would be better practice for me.

@ExtReMLapin

> Can someone just open a pull request of what's been done here to llama.cpp? Thanks, this would be better practice for me.

Maybe try reading the contributor's answer next time.

@ozbillwang

I share the same concerns.

After checking the submodule in this repository (I personally dislike using submodules in Git), I found that it relies on an outdated fork of the original llama.cpp project; it is 320 commits behind:

https://github.com/Eddie-Wang1120/llama.cpp.git

https://github.com/microsoft/BitNet/blob/main/.gitmodules#L3

[submodule "3rdparty/llama.cpp"]
	path = 3rdparty/llama.cpp
	url = https://github.com/Eddie-Wang1120/llama.cpp.git
	branch = merge-dev
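For anyone who wants to reproduce the "320 commits behind" figure, one way is to add the original repo as a second remote inside the submodule and count the commits the fork is missing. A sketch (the `upstream` remote name is arbitrary; `master` is llama.cpp's default branch):

```bash
# Clone with the pinned submodule and enter it
git clone --recurse-submodules https://github.com/microsoft/BitNet.git
cd BitNet/3rdparty/llama.cpp

# Add the original llama.cpp as a remote and fetch its history
git remote add upstream https://github.com/ggerganov/llama.cpp.git
git fetch upstream

# Count upstream commits that the pinned fork is missing
git rev-list --count HEAD..upstream/master
```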

Will Microsoft seriously support this project? This repository appears more like a personal project.

@eabase commented Oct 22, 2024

@ozbillwang

Thanks for investigating! 💯

> Will Microsoft seriously support this project? This repository appears more like a personal project.

Indeed, very suspicious; it seems more like some kind of clickbait project. They racked up 10,000 stars in no time, and nearly no commits or useful feedback since. I hate to sound negative, but I hate even more to get involved in these kinds of unethical/corporate side hustles. In addition, I also hate submodules! :( Avoiding a huge number of various external files may be the very reason why llama.cpp was so successful:

1 screen, 1 editor, 1 page, 1 tab and 1 file! 🥇

@eugenehp commented Nov 8, 2024

It would be great to see the upstream PR from their fork of llama.cpp.
