
Which compression format works best in a big-data environment?

    ysn2233 · Mar 12, 2020 · 3826 views

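    File sizes may vary, so I need a format that is splittable. So far the candidates I have found seem to be Bzip2 and LZO (which requires building an index). Which of the two is easier to work with?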
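
For readers unfamiliar with the term, here is a minimal sketch of what "splittable with an index" means. It uses Python's standard `bz2` module as a stand-in and is not how Hadoop or hadoop-lzo actually implement it. The helper names `compress_in_blocks` and `read_block`, the block size, and the sample data are all hypothetical, chosen only for illustration. The idea is that if the data is compressed as independent blocks and the byte offset of each block is recorded, a worker can decompress just its own block without reading everything before it.

```python
import bz2


def compress_in_blocks(data: bytes, block_size: int):
    """Compress data as independent bz2 streams and record where each one starts.

    Returns the concatenated compressed bytes plus an index of
    (offset, length) pairs, one per compressed block.
    """
    out = bytearray()
    index = []
    for start in range(0, len(data), block_size):
        block = bz2.compress(data[start:start + block_size])
        index.append((len(out), len(block)))
        out += block
    return bytes(out), index


def read_block(blob: bytes, index, i: int) -> bytes:
    """Decompress only block i, using the index to locate it."""
    offset, length = index[i]
    return bz2.decompress(blob[offset:offset + length])


# Hypothetical sample data: 10,000 small text records.
data = b"".join(b"record %d\n" % n for n in range(10_000))
blob, index = compress_in_blocks(data, block_size=16_384)

# A "worker" assigned block 2 reads only that slice of the compressed file.
third_block = read_block(blob, index, 2)
```

Without such an index, or without block boundaries that a reader can locate on its own, a reader has to decompress from the start of the file, which is why a single large file in a non-splittable format ends up processed by one task.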

    1 reply    2020-03-12 14:46:44 +08:00
    #1 · alya · Mar 12, 2020
    Snappy, LZ4, or zstd.