Python: Counting Word Frequency in an English Novel

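```python
import re
from collections import Counter

# Read the whole novel and lowercase it so "The" and "the" count as one word.
with open("/home/baba/txt/1.txt", "r", encoding="gbk") as f:
    words = f.read().lower()

# \w+ matches runs of letters, digits, and underscores.
rule = re.compile(r"\w+")
words = rule.findall(words)

# Count every word, then take the 10 most frequent ones.
counter_words = Counter(words)
common_words = counter_words.most_common(10)
print(common_words)
```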
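
The same steps can be wrapped in a small reusable function. The following is only a sketch: the name `top_words`, its `stop_words` parameter, and the sample sentence are assumptions made for illustration and are not part of the original script. It also swaps `\w+` for a pattern that keeps contractions such as "didn't" as a single token, which may or may not be what you want for your text.

```python
import re
from collections import Counter


def top_words(text, n=10, stop_words=frozenset()):
    """Return the n most common words in text as (word, count) pairs.

    Hypothetical helper for illustration; not part of the original script.
    """
    # Letters only, with optional inner apostrophes, so "didn't" stays whole.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    # Skip any words the caller wants to ignore (for example "the", "a").
    counts = Counter(t for t in tokens if t not in stop_words)
    return counts.most_common(n)


sample = "The cat saw the dog. The dog didn't see the cat!"
print(top_words(sample, n=1))  # [('the', 4)]
print(top_words(sample, n=2, stop_words={"the"}))
```

Passing a set of stop words is one simple way to keep very common words such as "the" from dominating the top of the list.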
