Code:

    # CalHamlet.py
    def getText():
        # Read the whole file and close it when done
        with open("hamlet.txt", "r") as f:
            txt = f.read()
        txt = txt.lower()
        for ch in '!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~':
            txt = txt.replace(ch, " ")  # replace special characters in the text with spaces
        return txt

    hamletTxt = getText()
    words = hamletTxt.split()
    counts = {}
    for word in words:
        counts[word] = counts.get(word, 0) + 1
    items = list(counts.items())
    items.sort(key=lambda x: x[1], reverse=True)
    # Slice instead of indexing so short texts with fewer than 10 words do not fail
    for word, count in items[:10]:
        print("{0:<10}{1:>5}".format(word, count))

Result:

    the        1138
    and         965
    to          754
    of          669
    you         550
    i           542
    a           542
    my          514
    hamlet      462
    in          436
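
For comparison, the same counting can be sketched with the standard library's `collections.Counter`. This is only an alternative sketch, not the code that produced the result above: the helper name `top_words` and the sample sentence are made up for illustration, and it uses `string.punctuation`, which also strips the apostrophe, so the counts on `hamlet.txt` may differ slightly from the listing.

```python
import string
from collections import Counter


def top_words(text, n=10):
    """Return the n most frequent words in text as (word, count) pairs.

    top_words is a hypothetical helper name, not part of the original post.
    """
    text = text.lower()
    # Map every ASCII punctuation character to a space in a single pass.
    # Note: string.punctuation includes the apostrophe, unlike the original list.
    table = str.maketrans(string.punctuation, " " * len(string.punctuation))
    words = text.translate(table).split()
    return Counter(words).most_common(n)


if __name__ == "__main__":
    # Made-up sample text; pass open("hamlet.txt").read() for the real file.
    sample = "To be, or not to be: that is the question."
    for word, count in top_words(sample, 3):
        print("{0:<10}{1:>5}".format(word, count))
```

`Counter.most_common(n)` replaces the manual dictionary, the `list(counts.items())` conversion, and the sort, and it simply returns fewer pairs when the text has fewer than `n` distinct words.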