7747139suy
V2EX  ›  Local LLM

求佬推荐一个本地可部署的音频转文字模型

  •  
  •   7747139suy · Dec 30, 2024 · 2205 views
    This topic created in 506 days ago, the information mentioned may be changed or developed.

    8-16g 显卡可跑,中文能力强

    7 replies    2024-12-30 17:21:58 +08:00
    lpf0309
        1
    lpf0309  
       Dec 30, 2024
    百度 paddlespeech ,阿里的 funasr ,cpu 都能跑
    isSamle
        2
    isSamle  
       Dec 30, 2024
    Whisper/SeamlessM4T/vosk
    mumbler
        3
    mumbler  
       Dec 30, 2024   ❤️ 1
    donaldturinglee
        4
    donaldturinglee  
       Dec 30, 2024
    cuda Whisper
    n
        5
    n  
       Dec 30, 2024
    试了下,有些 cpu 都能跑,有趣。借楼请教这些方案哪个支持识别 speaker 呢?
    n
        6
    n  
       Dec 30, 2024
    哦,不好意思,仔细看了下,几乎都支持。
    hellojay
        7
    hellojay  
       Dec 30, 2024
    @n 好像都没有支持的。哎
    About   ·   Help   ·   Advertise   ·   Blog   ·   API   ·   FAQ   ·   Solana   ·   1187 Online   Highest 6679   ·     Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 · 37ms · UTC 17:43 · PVG 01:43 · LAX 10:43 · JFK 13:43
    ♥ Do have faith in what you're doing.