How to read a random line in Python

I have a txt file with several million lines. How can I read one of its lines at random with Python?


Recommended answer (recommended on 2017-09-11)

#!/usr/bin/env python
# coding: utf-8

import os
import random
import sys


def getfilelines(filename, eol=b'\n', buffsize=4096):
    """Count the lines in a file, reading it in fixed-size blocks."""
    linenum = 0
    last = eol  # eol is assumed to be a single byte
    with open(filename, 'rb') as handle:
        buffer = handle.read(buffsize)
        while buffer:
            linenum += buffer.count(eol)
            last = buffer[-1:]
            buffer = handle.read(buffsize)
    # A final line without a trailing newline still counts as a line.
    if last != eol:
        linenum += 1
    return linenum


def readtline(filename, lineno, eol=b'\n', buffsize=4096):
    """Return line number lineno (0-based) of a file, without its newline."""
    if lineno < 0:
        raise IndexError(lineno)
    with open(filename, 'rb') as handle:
        readedlines = 0  # complete lines consumed so far
        pending = b''    # start of a line that continues in the next block
        buffer = handle.read(buffsize)
        while buffer:
            parts = (pending + buffer).split(eol)
            pending = parts.pop()  # the last piece may be an unfinished line
            if lineno < readedlines + len(parts):
                # the line is in this block; assumes the file is UTF-8
                return parts[lineno - readedlines].decode('utf-8')
            readedlines += len(parts)
            buffer = handle.read(buffsize)
        if pending and lineno == readedlines:
            return pending.decode('utf-8')  # final line without a newline
    raise IndexError(lineno)


def getrandomline(filename):
    """Return a random line of the file."""
    # randrange excludes the upper bound, so the index is always valid.
    return readtline(filename, random.randrange(getfilelines(filename)))


if __name__ == "__main__":
    if len(sys.argv) == 1:
        print(getrandomline("/home/tim/documents/users.csv"))
    else:
        for f in filter(os.path.isfile, sys.argv[1:]):
            print(getrandomline(f))

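For very large files, process the data either line by line or block by block. Line-by-line processing may be somewhat slower, but the code is simpler and clearer. The code above uses the block approach.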
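For comparison, here is a minimal sketch of a line-by-line variant. It is not the answer's method: it swaps in reservoir sampling, which picks a uniformly random line in a single pass without counting the lines first. The function name `reservoir_random_line` and the UTF-8 default are assumptions made for illustration.

```python
import random


def reservoir_random_line(path, encoding="utf-8"):
    """Return one uniformly random line from path in a single pass.

    Hypothetical helper; the encoding default is an assumption.
    """
    chosen = None
    with open(path, "r", encoding=encoding) as handle:
        for index, line in enumerate(handle):
            # Replace the current pick with probability 1 / (index + 1).
            if random.randrange(index + 1) == 0:
                chosen = line
    if chosen is None:
        raise ValueError("file is empty")
    return chosen.rstrip("\n")
```

Only one line is held in memory at a time, so the cost is one full read of the file per sample. If many samples are needed from the same file, counting or indexing the lines once is likely the better trade.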



Other answers

Answer 1 (recommended on 2017-09-15)

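#xx.py
#use:python xx.py file

import random
import sys

def random_read(read_file):
    # First pass: count the lines without keeping them in memory.
    # Note: pass the variable read_file, not the string 'read_file'.
    with open(read_file, 'r') as f:
        count = sum(1 for _ in f)

    # Pick a random 0-based line index (randint includes both ends).
    target = random.randint(0, count - 1)

    # Second pass: stop as soon as the chosen line is reached.
    with open(read_file, 'r') as f:
        for index, line in enumerate(f):
            if index == target:
                print(line.rstrip('\n'))
                return

def main():
    if len(sys.argv) != 2:
        print('use: %s filename' % sys.argv[0])
        sys.exit(1)
    read_file = sys.argv[1]
    random_read(read_file)

if __name__ == "__main__":
    main()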

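Follow-up question

Sorry, I need this for Python 3. Also, can it work without the `python xx.py file` command-line form, and simply call open('test.txt') directly? I'll award you extra points once it's changed.

Follow-up answer

Just change the end of the script:

if __name__ == "__main__":
    random_read('test.txt')  # better to include the full path

Add #!/usr/bin/env python as the first line and, once the file is executable, you can run it directly as ./xx.py.
Python 3 differs from Python 2 in parts of its library and syntax; for example, print is a function in Python 3.
The logic is simple, so you can try converting it to Python 3 yourself :)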
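One way the Python 3 conversion requested above might look is sketched below. It is only an illustration: the name `random_line` is hypothetical, the file is assumed to be UTF-8, and the function returns the line instead of printing it, so the caller decides what to do with it.

```python
import random


def random_line(path):
    """Return a random line from path (Python 3, two passes over the file)."""
    # First pass: count the lines without holding them in memory.
    with open(path, encoding="utf-8") as handle:  # UTF-8 is an assumption
        count = sum(1 for _ in handle)
    if count == 0:
        raise ValueError("file is empty")

    # Second pass: stop at a randomly chosen 0-based index.
    target = random.randrange(count)
    with open(path, encoding="utf-8") as handle:
        for index, line in enumerate(handle):
            if index == target:
                return line.rstrip("\n")
```

With no command-line handling, the script can simply end with `print(random_line('test.txt'))`, using the file name from the follow-up question (preferably as a full path).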

This answer was accepted by the asker.

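Answer 2 (2015-06-24)

A short piece of Python that reads a random line from a file. It is only suitable for short files; for large files, the code that counts the lines needs to be changed.

#! /usr/bin/python
# coding=utf-8

import random
import linecache

def hello():
    # Count the lines (this reads the whole file into memory)
    with open('hello.txt') as f:
        count = len(f.readlines())
    # Pick a random 1-based line number; randint includes both ends
    hellonum = random.randint(1, count)
    # linecache.getline uses 1-based line numbers
    return linecache.getline('hello.txt', hellonum)

if __name__ == "__main__":
    print(hello())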

Answer 3 (2015-07-11)

import sys

f = open('a.txt')
randomLineNumber = int(input('Please input a random line no.:').strip())
for i in range(randomLineNumber):
    x = f.tell()  # get the current position of the file
    f.readline()
    if x == f.tell():
        # the position did not move, so the end of the file was reached
        print("Error: The file only has %d lines, so, " % i +
              "it's impossible to get the line you want!!!")
        sys.exit(1)
print(f.readline().rstrip('\n'))
f.close()
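The position tracking used above can be taken one step further when many random lines are needed from the same large file: record where every line starts once, then seek straight to a randomly chosen offset. This is a rough sketch under stated assumptions, not part of the answer. The helper names `build_line_index`, `read_line_at` and `random_line_indexed` are hypothetical, and the file is assumed to be UTF-8 with `\n` or `\r\n` line endings.

```python
import random


def build_line_index(path):
    """Return the byte offset at which each line of path starts."""
    offsets = []
    with open(path, "rb") as handle:
        position = 0
        for line in handle:
            offsets.append(position)
            position += len(line)
    return offsets


def read_line_at(path, offset, encoding="utf-8"):
    """Seek to a recorded offset and return the line that starts there."""
    with open(path, "rb") as handle:
        handle.seek(offset)
        return handle.readline().decode(encoding).rstrip("\r\n")


def random_line_indexed(path, offsets):
    """Pick a random line using a prebuilt offset index."""
    return read_line_at(path, random.choice(offsets))
```

Building the index costs one full pass and one integer per line, but every later lookup is a single seek and a single readline, regardless of where the line sits in the file.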

Answer 4 (2014-06-19)

Try this method: http://computer.uoh.edu.cn/python/634.html
