import re
# Read the whole file; the with-block closes it automatically
with open('word.txt', 'r') as f:
    word = f.read()
# ------- 1 -------
# Method 1: split on a single known separator (a comma here)
wList1 = []
word1 = word.split(',')
for i in word1:
    wList1.append(i.lower())
wLen1 = len(wList1)
print 'number of words:', wLen1, '\n', wList1
# ------- 2 -------
# Method 2: pull out every run of letters, whatever separates them
wList2 = []
word2 = re.findall(r'[a-zA-Z]+', word)
for i in word2:
    wList2.append(i.lower())
wLen2 = len(wList2)
print '\nnumber of words:', wLen2, '\n', wList2
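I did it two ways. If the words are all separated by the same character, split('<separator>') is enough. If the separators vary, use the re library as in the second method.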
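To make the difference concrete, here is a minimal sketch that runs both approaches on in-memory strings instead of word.txt. The function names and sample strings are made up for illustration, and the trimming of whitespace and dropping of empty pieces in the first function is an extra assumption that the snippet above does not make.

```python
import re

def words_by_delimiter(text, sep=','):
    """Split on one known separator, then trim and lowercase each piece."""
    pieces = (p.strip().lower() for p in text.split(sep))
    return [p for p in pieces if p]  # drop empty pieces, e.g. after a trailing separator

def words_by_regex(text):
    """Collect every run of ASCII letters, regardless of what separates them."""
    return [w.lower() for w in re.findall(r'[a-zA-Z]+', text)]

uniform = 'Apple,Banana,Cherry'          # hypothetical input, one separator
mixed = 'Apple, Banana;Cherry  date!'    # hypothetical input, mixed separators

print(words_by_delimiter(uniform))  # ['apple', 'banana', 'cherry']
print(words_by_regex(mixed))        # ['apple', 'banana', 'cherry', 'date']
print(words_by_delimiter(mixed))    # ['apple', 'banana;cherry  date!']
```

The last call shows why split alone falls short on mixed separators: everything after the comma stays glued together. The regex version has its own limit, since [a-zA-Z]+ also breaks words at apostrophes and digits, so "don't" would come out as "don" and "t".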
