字节流文件读取问题。6962 7470 6462 7773 0001 0000 df67 0132 这种类型的数据如何原样读出？？？？

2018-10-31 16:07:12 +08:00

dapengzhao

用 python 读取后 b'ibtpdbws\x00\x01\x00\x00\xdfg\x012\xd3U\x00\x1e\x00\x00\x00\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x01\x00\x00\x00\x01\x00\x00\x00\x02\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x03P\x00\x00%\x18\x00\x01\x00\x00\x00\x00%\x18\x00\x01\x00\x00
是类似这种。

1105 次点击

所在节点

问与答

7 条回复

ooh

2018-10-31 16:34:50 +08:00

你读的是 Unicode 文本吗？把代码贴出来吧

hundan

2018-10-31 16:37:23 +08:00

这个看起来就不是可见字符吧，啥原样

dapengzhao

2018-10-31 16:43:44 +08:00

@ooh 读的是百度词库的文本，具体模样可以下载一个看看 https://shurufa.baidu.com/dict_list?cid=217。我的需求是将这个文本的字符解析成汉字。由于我发现
result = '1105'
content = binascii.a2b_hex(result).decode('utf-16')
可以解析出汉字。
word= open(地址,'rb')
读出来的是上面那种形式。无法使用。
我还在测试就写了几行代码看看这种行不行得通。

dapengzhao

2018-10-31 16:44:42 +08:00

@hundan https://shurufa.baidu.com/dict_list?cid=217 原文本就是百度词库的文本

Shy07

2018-10-31 17:00:54 +08:00

为啥不去全球最大同性交友网站看看？

https://github.com/buaahsh/bdictDecoder/blob/master/bDictdecoder.py

dapengzhao

2018-10-31 17:10:38 +08:00

@Shy07 我就是从那过来的，知道思路，代码有点不好用

dapengzhao

2018-10-31 17:11:54 +08:00

知道了很简单
fileText = open(filename,'rb')
return fileText.read().hex()（手动狗头）

第 1 页／共 1 页

这是一个专为移动设备优化的页面（即为了让你能够在 Google 搜索结果里秒开这个页面），如果你希望参与 V2EX 社区的讨论，你可以继续到 V2EX 上打开本讨论主题的完整版本。

https://www.v2ex.com/t/503060

V2EX 是创意工作者们的社区，是一个分享自己正在做的有趣事物、交流想法，可以遇见新朋友甚至新机会的地方。

V2EX is a community of developers, designers and creative people.