JerningChan's recent timeline updates

JerningChan

V2EX member #60529, joined on 2014-04-17 00:40:31 +08:00

Today's activity rank 6088

JerningChan 提问技术话题好玩工作信息交易信息城市相关

Per JerningChan's settings, the topics list is hidden

Deals info, including closed deals, is not hidden

JerningChan's recent replies

7h 9m ago

Replied to a topic by ccai › Claude › 平时开发用 Claude Opus 4.6，有必要换 Opus 4.7 吗？

目前来看，要用就用 4.6 ，不要去碰 4.7 ，恶心死你

19h 37m ago

Replied to a topic by zhwguest › OpenAI › 续 claude 降智严重后， codex 也在步其后尘

@iorilu 你自己试试就知道了，问题是即便你想 27b 用的爽，不想折腾，单卡跑，成本是多少？随便起码一张 4090 吧，反正我自己用 2080ti 22g 跑 27b 模型，都只有 20t/s 左右，要用得爽，起码 60t/s 才有一点点效率

19h 39m ago

Replied to a topic by zhwguest › OpenAI › 续 claude 降智严重后， codex 也在步其后尘

@commoccoom 不太行喔，除非你把大部分的层卸载到 cpu+内存，这样稠密型的模型就会比较慢了，moe 的话，我就不太清楚，我之前是 1660s 6g ，跑那种 9b 模型，设大了上下文，就得卸载到 cpu ，就会很慢

23h 22m ago

Replied to a topic by zhwguest › OpenAI › 续 claude 降智严重后， codex 也在步其后尘

@diudiuu 就用 unsloth 的 Qwen3.6-27B-UD-Q4_K_XL.gguf 就很不错，我家里自己用的 35b-a3b 也很不错

1 day ago

Replied to a topic by zhwguest › OpenAI › 续 claude 降智严重后， codex 也在步其后尘

感觉本地部署的 qwen3.6-27b 都比 opus 4.7 要用，写代码的话

今天消耗了 21%的 token opus4.7 居然对话了 3 次都处理不好一个 bug ，还来来回回 3 次都说他已经修改了代码，看输出他是改了，但是完全不知道他改的什么东西，反正就是没效果

然后，换回本地 qwen3.6-27b 一次解决问题

3 days ago

Replied to a topic by libasten › 程序员 › 几个国产大模型，用来对接用在 cc 上编程，哪家性价比高？

@ptstone 这样吗？我还在使用 llama.cpp

4 days ago

Replied to a topic by libasten › 程序员 › 几个国产大模型，用来对接用在 cc 上编程，哪家性价比高？

@ptstone 无，我纠结了下，还是用 llama.cpp 算了

4 days ago

Replied to a topic by libasten › 程序员 › 几个国产大模型，用来对接用在 cc 上编程，哪家性价比高？

@hutng 这个是 2080ti 22g 的设置，公司 5090d 就可以调大上下文到 256k ，然后换 27b 模型

4 days ago

Replied to a topic by libasten › 程序员 › 几个国产大模型，用来对接用在 cc 上编程，哪家性价比高？

@hutng 开啊
-m /data/qwen3.6/Qwen3.6-35B-A3B-UD-IQ4_NL_XL.gguf \
--mmproj /data/qwen3.6/35b-mmproj-F16.gguf \
--ctx-size 73728 \
-ngl 99 \
--host 0.0.0.0 \
--port 8080 \
--temp 0.6 \
--top-p 0.95 \
--top-k 20 \
--min-p 0.00 \
--flash-attn on \
--cont-batching \
--cache-type-k q4_0 \
--cache-type-v q4_0 \
--threads 15 \
--parallel -1 \
-b 1024 \
-ub 1024 \
--chat-template-kwargs '{"enable_thinking": true, "preserve_thinking": true}' \
--metrics \
--jinja \
--alias Qwen3.6

4 days ago

Replied to a topic by CloudSen › 程序员 › 别买阿里 Token Plan [降智严重]

@CloudSen 哈哈哈，有 2080ti ，当然是魔改一下 22g 啊，我用来跑 qwen3.6-35b-a3b 很不错啊，我平常是 hermes 调用，需要改点严谨些的代码就换 cc 调用

» More replies by JerningChan