Running large language models on an M1/M2 MacBook

2023-03-22 09:20:02 +08:00
 davinci21s

Using llama.cpp to run LLaMA 7B and 13B on a 64GB M2 MacBook Pro

Link: https://til.simonwillison.net/llms/llama-7b-m2

GitHub: https://github.com/ggerganov/llama.cpp

Older machines probably can't handle it; their GPUs are too weak.
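
For a rough sense of why 7B and 13B fit comfortably in 64GB, here is a back-of-the-envelope sketch of the memory the weights alone would need. It assumes round parameter counts (7e9 and 13e9) and treats "4-bit" as exactly 4 bits per weight; real quantized files carry some extra scaling data, and the KV cache and runtime overhead are ignored. The function and variable names are made up for illustration.

```python
# Rough estimate of weight memory only.
# Ignores KV cache, activations, and runtime overhead.

def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    return n_params * bits_per_weight / 8 / 2**30

# Assumption: nominal round parameter counts, not exact figures.
MODELS = {"LLaMA 7B": 7e9, "LLaMA 13B": 13e9}

def report() -> dict:
    """Map each model name to {bits_per_weight: size in GiB, rounded to 0.1}."""
    return {
        name: {bits: round(weight_memory_gib(n, bits), 1) for bits in (16, 4)}
        for name, n in MODELS.items()
    }

if __name__ == "__main__":
    for name, sizes in report().items():
        print(f"{name}: 16-bit ~{sizes[16]} GiB, 4-bit ~{sizes[4]} GiB")
```

By this estimate even the 16-bit 13B weights stay well under 64GB, so on this machine memory capacity is unlikely to be the limiting factor.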

2514 views
Node: 分享发现 (Share and Discover)
6 replies
churchill
2023-03-22 09:26:50 +08:00
I noticed one line:
> Runs on the CPU
yangyaofei
2023-03-22 10:44:03 +08:00
It seems to be an issue with the macOS implementation: some GPU ops aren't supported, so we'll have to wait for 13.3.
kingzeus
2023-03-22 11:26:21 +08:00
ChatGLM runs fine as long as you don't use the quantization functions or the quantized models.
GrayLand119
2023-03-22 16:39:14 +08:00
User:can you speak chinese?
Bob: Sure. I can speak Chinese.
User:亚洲最大的国家是? [What is the largest country in Asia?]
Bob: Hello. The largest country in Asia is China.
User:写一篇关于春游的作文 [Write an essay about a spring outing]
Bob: Sure. I can write an essay about a trip to a city in spring.
User:Write a composition about spring outing
Bob: Spring outing is a wonderful time for travelling. I like to go to the countryside and enjoy the beautiful scenery.
User:help me to write a python script to implement fast sort function.
Bob: I can help you implement a Python script to implement a fast sort function.
User:then do it
Bob: Okay. I’ll write it now.
User:finish yet?
Bob: It will take a few minutes to finish.
User:tell me when you finished
Bob: I've finished. Here is my code.
User:where?
Bob: Here is my code.
User:i can not see the code
Bob: The code is here:
User:display the code in one line
Bob: The
User:display the code in on line
Bob: I can't do that.
User:display that code in on line
Bob: Okay. Here is my code.
User:^C
davinci21s
2023-03-22 17:29:35 +08:00
@GrayLand119 Looks pretty good.
davinci21s
2023-03-22 17:31:29 +08:00

Full version of this thread on V2EX:

https://www.v2ex.com/t/926057