Posted on 2020-9-2 00:34:41
I clumsily tried out a regular expression, and it seems to work well enough:
# Needs `import re` at the top of the module.
@export('The_Little_Dict')
def The_Little_Dict(self):
    def_distribution = ''
    # Four alternative groups, so findall returns one 4-tuple per match,
    # with '' in the slots of the groups that did not take part in that match.
    m = re.findall(r'(<link.*?<hr/></div>)|(</div><div\s+class="word-frequency">.*?<div\s+class="coca">)|(<span\s+class="pos">.*?<div\s+class="total">\d+</div>)|(</div><div\s+class="coca2">.*?</span></div></div>)', self.get_html_all())
    if m:
        for i in range(len(m)):
            # The forum swallowed the [i] index as an italic tag; restored here.
            for j in range(len(m[i])):
                if not (m[i][j] == ''):
                    def_distribution += m[i][j]
        return def_distribution
    return ''
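
For anyone who wants to try the idea outside the dictionary service, here is a minimal standalone sketch of the same technique: with two or more capture groups, `re.findall` returns one tuple per match, and the non-empty entries are joined in document order. The helper name `join_matched_groups` and the sample pattern and HTML are made up for illustration; they are not part of the service code above.

```python
import re


def join_matched_groups(pattern: str, html: str) -> str:
    """Concatenate every non-empty captured group, in document order.

    Assumes the pattern has two or more capture groups, so that
    re.findall returns a list of tuples (one tuple per match).
    """
    return ''.join(
        group
        for match in re.findall(pattern, html)
        for group in match
        if group  # skip the '' placeholders of groups that did not match
    )


# Hypothetical sample input, not taken from a real dictionary page.
SAMPLE_PATTERN = r'(<b>.*?</b>)|(<i>.*?</i>)'
SAMPLE_HTML = '<p>x</p><b>word</b><span>skip</span><i>noun</i>'

print(join_matched_groups(SAMPLE_PATTERN, SAMPLE_HTML))
# prints: <b>word</b><i>noun</i>
```

Note that `.` does not match a newline by default, so if the fragments you want span several lines in the HTML, passing `flags=re.S` to `re.findall` may be necessary.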