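I have two fairly large txt files, about 10 MB, each containing many lines. I want to compare the two files, find the lines they have in common, and delete those lines from the second file. The second file is UTF-8 encoded. How can I do this? Any help would be appreciated; I have recently started learning Python.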
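# -*- coding: utf-8 -*-
#
# python 2.7
# The file paths are left blank here; fill in your own.
fp1 = file('', 'r')
fp2 = file('', 'r')
fp3 = file('', 'w')
d1 = {}
d2 = {}
# Index the lines of the first file by hash, skipping its first line.
isFirst = True
for line in fp1:
    if not isFirst:
        d1[hash(line)] = line
    else:
        isFirst = False
fp1.close()
# Do the same for the second file.
isFirst = True
for line in fp2:
    if not isFirst:
        d2[hash(line)] = line
    else:
        isFirst = False
fp2.close()
# Hashes that occur in the first file but not in the second.
diff = set(d1.keys()) - set(d2.keys())
# Write those lines out; a set is unordered, so the original line order is lost.
for key in diff:
    fp3.write(d1[key])
fp3.close()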
Will this work? The order of the lines is not the same.
A single file does contain a very small number of duplicate lines; not many, but they are there.
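Follow-up answer: As I read it, your requirement is to compare the two files for identical lines and delete those lines from the second file.
In other words, if two lines are at different positions but have the same content, they should not be deleted either, right?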
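For reference, here is a minimal Python 3 sketch of one reading of the question: keep the second file's lines in their original order and drop every line whose content also appears anywhere in the first file. The function name `remove_common_lines`, its parameters, and the choice to write the result to a separate output file are assumptions made for illustration, not part of the original answer. It also assumes both files can be decoded as UTF-8 (`utf-8-sig` additionally tolerates a leading BOM); pass a different `first_encoding` if the first file uses another encoding.

```python
import io


def remove_common_lines(first_path, second_path, out_path,
                        first_encoding="utf-8-sig",
                        second_encoding="utf-8-sig"):
    """Copy second_path to out_path, skipping lines that also occur in first_path.

    Returns the number of lines that were skipped.
    """
    # Collect the lines of the first file, without line endings, in a set
    # so that each membership test below is a constant-time lookup.
    with io.open(first_path, "r", encoding=first_encoding) as f1:
        seen = {line.rstrip("\r\n") for line in f1}

    removed = 0
    with io.open(second_path, "r", encoding=second_encoding) as f2, \
            io.open(out_path, "w", encoding="utf-8") as out:
        # Iterating over the file keeps the second file's original order.
        for line in f2:
            if line.rstrip("\r\n") in seen:
                removed += 1
            else:
                out.write(line)
    return removed
```

A call such as `remove_common_lines("first.txt", "second.txt", "second_cleaned.txt")` (hypothetical file names) leaves both inputs untouched. Comparing the line text itself, rather than `hash(line)`, avoids treating two different lines as equal when their hashes collide, and for files of about 10 MB the set should fit in memory on most machines.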