|
发表于 2020-9-2 00:34:41
|
显示全部楼层
笨笨地试用了一下正则表达,应该还好:
& K$ d: l4 j, x& o5 n @export('The_Little_Dict')
[ A. J3 A, C$ }$ [8 E! _9 N def The_Little_Dict(self):& N ]8 Y% E, @& l1 Z) a1 u/ g
def_distribution = ''
0 M) b5 h4 P/ p! \$ s- M) u m = re.findall(r'(<link.*?<hr/></div>)|(</div><div\s+class="word-frequency">.*?<div\s+class="coca">)|(<span\s+class="pos">.*?<div\s+class="total">\d+</div>)|(</div><div\s+class="coca2">.*?</span></div></div>)', self.get_html_all())
$ z' y: z! }3 v' q1 b7 g( Q if m:
& ], q6 K6 d# H9 m' ^ for i in range(len(m)):
7 {2 k- y( b- ^ for j in range(len(m)):2 a5 U7 L: A( M2 d
if not (m[j] == ''):* n1 S6 W; k) x+ I$ E" z% O
def_distribution += m[j]0 N# @% e- D% {2 C, n; U) w
return def_distribution) ~! g& Y/ a! j* @
return ''
5 d, J5 Z3 T& p1 D3 z' S$ h% x4 c9 D8 L2 s/ `
# {: w+ ^7 m* ?: G |
本帖子中包含更多资源
您需要 登录 才可以下载或查看,没有账号?免费注册
x
|