逐行读取文本时哪种方法更好？

推荐学习书目

› Learn Python the Hard Way

Python Sites

› PyPI - Python Package Index

› http://diveintopython.org/toc/index.html

› Pocoo

值得关注的项目

› PyPy

› Celery

› Jinja2

› Read the Docs

› gevent

› pyenv

› virtualenv

› Stackless Python

› Beautiful Soup

› 结巴中文分词

› Green Unicorn

› Sentry

› Shovel

› Pyflakes

› pytest

Python 编程

› pep8 Checker

Styles

› PEP 8

› Google Python Style Guide

› Code Style from The Hitchhiker's Guide

This topic created in 3705 days ago, the information mentioned may be changed or developed.

我知道逐行读取文本有两种方法： for line in f 以及 for line in f.readlines()。那么这两个方法哪个更好呢？为什么要设计 readlines()这个方法？

逐行

line

方法

读取

10 replies • 2016-06-06 09:02:04 +08:00

congeec

Jun 5, 2016

用 for line in f, readlines()会把文件里的内容都读进内存
https://stackoverflow.com/questions/17246260/python-readlines-usage-and-efficient-practice-for-reading

21grams

Jun 5, 2016 via Android

大文件用第一个，原因显而易见

luofeiyu

Jun 5, 2016

内存受不了。

lukertty

Jun 5, 2016

后者快一点

loading

Jun 5, 2016 via Android

读一个大文件，自己对一下内存就知道了

billlee

Jun 5, 2016

方法一叫做逐行读取文本文件
方法二叫做逐个读取列表元素

OnTheRoad

Jun 5, 2016

第二种方法读取大文件时内存遭不住。
还是第一种方法最 Pythonic 。
参见(stackoverflow)[http://stackoverflow.com/questions/8009882/how-to-read-large-file-line-by-line-in-python]

lll9p

Jun 5, 2016

用 linecache 怎么样？

jamesfjx

Jun 6, 2016 via iPad

还可以配合 yield 命令

https://www.ibm.com/developerworks/cn/opensource/os-cn-python-yield/

最后一个例子

practicer

Jun 6, 2016

for line in f 语句将 file 对象转换成 iterable object ，既然是可迭代对象，一次只加载一个 item ，解释器不会将所有 items 放进内存，因此节省了资源。在 python 2.3 以前，要用 f.xreadlines()方法读大文件，它和 xrange 的作用一样，都是处理 iter(object)，但在 2.3 后，官方明确用 for line in f 读取大文件。

for line in f.readlines() 语句和第一个类似，但是它先执行 f.readlines()，直接把 file 对象中所有的 line items 列表存进内存，在它们之上进行循环读取。

因此，读取大文件时用第一个语句，一般小文件这两个都可以。