VBA-SZ第3节|文本内容的处理技巧
最近更新:'2019-05-09'
1.Format函数(数字/日期/文本格式化)
2.正则表达式经验技巧
3.正则表达式-环视功能
1.Format函数(数字/日期/文本格式化)
1.1Format:数字格式化
1.1.1Format函数常用格式方法
data:image/s3,"s3://crabby-images/59189/59189cf56fa4bc78b0643322985df33297b49b74" alt=""
data:image/s3,"s3://crabby-images/64505/645053dc94bc27b30b3ff284d671aa5d386e3a99" alt=""
data:image/s3,"s3://crabby-images/8ec3b/8ec3ba1fd24b928abe0e903ffd9173501b58df02" alt=""
Option Explicit
Sub a()
Dim i, s
i = 532.6542
s = Format(i, "currency")
MsgBox s
End Sub
1.1.2Format函数自定义格式
#:代表一位数字
".##":小数点后保留两位
data:image/s3,"s3://crabby-images/d354c/d354c47d51826df21baaf657fe06d8f45e802741" alt=""
Sub a()
Dim i, s
i = 234230.65487
s = Format(i, ".###")
MsgBox s
End Sub
以上案例,如果又想数字增加货币符号,又应该怎么操作呢?
data:image/s3,"s3://crabby-images/82435/824351560a6a39cf4d01067de4944d6479398f42" alt=""
Sub a()
Dim i, s
i = 234230.65487
s = Format(i, "¥.###")
MsgBox s
End Sub
".###"这里是格式控制符,代表小数点后三位,而¥代表字符本身的意义,并非是格式控制符.
data:image/s3,"s3://crabby-images/67dd9/67dd94212a68c08cfd895005197ca60a5276cbbf" alt=""
".###"这里不足三位的小数,还是会保留之前的小数位数,不会补0.
Sub a()
Dim i, s
For i = 3 To 21
s = Format(Cells(i, 2), "¥.###")
Cells(i, 3) = s
Next i
End Sub
data:image/s3,"s3://crabby-images/7c419/7c419a075c18348083f9b5b01b2393c120a3e902" alt=""
使用".000",则不足位数会自动补0
data:image/s3,"s3://crabby-images/a4cec/a4cec65533485c9748674cf6b659c071264b06ce" alt=""
Sub a()
Dim i, s
For i = 3 To 21
s = Format(Cells(i, 2), "¥.000")
Cells(i, 3) = s
Next i
End Sub
1.1.3Format函数常用格式符
data:image/s3,"s3://crabby-images/04b24/04b245c3e863fb30fd4efc9621e75fbca19a9bc8" alt=""
data:image/s3,"s3://crabby-images/0a850/0a8503ca6f64103f5dd70c12bc58bde89123e828" alt=""
以下案例就是使用了分号,正数的格式是¥.000,负数的格式是(¥.000),0格式是零,空值的格式是-
Sub a()
Dim i, s
For i = 3 To 21
s = Format(Cells(i, 2), "¥.000;(¥.000);零;-")
Cells(i, 3) = s
Next i
End Sub
data:image/s3,"s3://crabby-images/a0009/a00092d234f84220b55db69e9da326a0613986f9" alt=""
以下是经常犯错的内容,需要格外注意:
data:image/s3,"s3://crabby-images/f9552/f9552d06388fe1689d13795ae03fc194476e390b" alt=""
1.2Format:日期格式化
1.2.1VBA日期系统自带格式
VBA有很多自带的日期格式,具体如下截图:
data:image/s3,"s3://crabby-images/0d669/0d669482587266ce638193d5d644430da6c917f9" alt=""
Sub dateFormat()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "long date")
MsgBox s
End Sub
代码显示的最终结果为
data:image/s3,"s3://crabby-images/c1137/c1137e604cb046729b0ec78b051fbe9f3bcd683e" alt=""
需要注意事项如下:
在不同版本、语言的操作系统中,日期显示格式可能不同
1.2.2VBA日期自定义格式
1.2.2.1常用格式符:d
data:image/s3,"s3://crabby-images/c8ce9/c8ce9f3e217754ba7c1493b3f6c8b5852c6fc14c" alt=""
如果把2016年8月16日,自定义为8月第16天(公元2016年), Tuesday,需要怎么操作呢?
Sub dateFormat()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "m月d天(公元yyyy年),dddd")
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/eee25/eee2552a72580e9586c2548a40fcc49d0278e5b9" alt=""
1.2.2.2常用格式符:d,m,yyyy,aaaa
如果把8月第16天(公元2016年), Tuesday更改为8月第16天(公元2016年), 星期一,需要怎么操作呢?
Sub dateFormat()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "m月d天(公元yyyy年),aaaa")
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/f39cb/f39cb96ab0b39e9e87e864df08c1385c9ca4ba88" alt=""
1.2.2.3常用格式符:w
常用格式符w又代表什么意思呢?
data:image/s3,"s3://crabby-images/07713/07713c575d00d5b36ed43fd16bb8d36701a80cb1" alt=""
Sub dateFormat1()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "当周第w天,aaaa")
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/efb01/efb016a181afd56defac2184979ffb677f35aeef" alt=""
常用格式符w,使星期一为当周的第一天,又是怎么自定义呢?
data:image/s3,"s3://crabby-images/1f90f/1f90fdde2c8b2003308afc099a79a1f08748c687" alt=""
Sub dateFormat1()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "当周第w天,aaaa", vbMonday)
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/a17d8/a17d810a907b0429f1143b5e51882229bd4f1b36" alt=""
常用格式符w,使星期五为当周的第一天,又是怎么自定义呢?
Sub dateFormat1()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "当周第w天,aaaa", vbFriday)
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/408aa/408aa9a7c72bdece5dfb736618d021ffb823de7e" alt=""
1.2.2.4常用格式符:ww
常用格式符ww代表的是原日期所在的星期,是一年中的第几周.
data:image/s3,"s3://crabby-images/da2a9/da2a96a33d263124e98adc2bed55993e159bf1b2" alt=""
Sub dateFormat1()
Dim s As String, d As Date
d = Range("b3").Value
s = Format(d, "本年度第ww周")
MsgBox s
End Sub
data:image/s3,"s3://crabby-images/1d574/1d574290f71c6d982d7d6d525c672c68faed3ea4" alt=""
常用格式符:ww在第几周存在有争议的地方.因此就有了firstWeekofYear,以此规定全年的第一周.
data:image/s3,"s3://crabby-images/6111c/6111c0b4c36edee990bf6696535da4154bea27d1" alt=""
firstWeekofYear这个参数有四个取值,具体如下:
data:image/s3,"s3://crabby-images/b4404/b44046371098b1b82f540c96de20b3c34a427602" alt=""
data:image/s3,"s3://crabby-images/5bf1c/5bf1cbf7b54bab577fb035fc0155ca2168ce6b1e" alt=""
data:image/s3,"s3://crabby-images/68445/68445cd0eb26880dfc4f531d723ccb2e48d6a2cd" alt=""
data:image/s3,"s3://crabby-images/822fe/822fe68066d911f0653010464c89062e2125811f" alt=""
firstWeekofYear与firstDayofWeek也相互关联,具体如下:
data:image/s3,"s3://crabby-images/0e6ce/0e6ce6743f7a41820d1397166eb6656919ce11b7" alt=""
a,m,d等这些字母在使用过程中会被认为是格式控制符,如下截图的m会被认为是月份数字.
data:image/s3,"s3://crabby-images/c525d/c525dec55f1667427f67bdf2f72ae320e4b015b5" alt=""
如果想正常显示这些字母应该在字母前面加上\
data:image/s3,"s3://crabby-images/cdbe7/cdbe7631d67a86985ca42b7aab4f4b436acdeb64" alt=""
1.3Format:文本格式化
data:image/s3,"s3://crabby-images/2e179/2e17905a03c0dee2f07d7ef8d699bcd0777a8b7d" alt=""
1.3.1Format:@符合的使用
@符合的使用及其复杂,网上说清楚的也比较少,这一模块需要反复学习.
1.3.1.1@少于原文本字符数
Sub demoStr()
Dim s As String, r As String
s = "张宏义"
r = Format(s, "@, ")
MsgBox r
End Sub
data:image/s3,"s3://crabby-images/c84ea/c84ea3beb449125944b0fef9ba7514e0e019f52d" alt=""
data:image/s3,"s3://crabby-images/cabb3/cabb35614fe44cba7729bc10e108ca5c5514748d" alt=""
data:image/s3,"s3://crabby-images/11b52/11b52e0ff606e6078aff2014ded852bc36541a7a" alt=""
1.3.1.2@超出原文本字符数
Sub demoStr()
Dim s As String, r As String
s = "张宏义"
r = Format(s, "@@, @@@@")
MsgBox r
End Sub
data:image/s3,"s3://crabby-images/4c026/4c026ae224417fc0ed0c6807e4c0fa5ca0897110" alt=""
1.3.2Format:!符合的使用
1.3.2.1 !@配合使用,@少于原文本字符数
Sub demoStr()
Dim s As String, r As String
s = "张宏义"
r = Format(s, "!@, @")
MsgBox r
End Sub
data:image/s3,"s3://crabby-images/6a4c1/6a4c1d25b357b3d8cdb5fe0623cd1c0d95f40a4a" alt=""
1.3.2.2 !@配合使用,@超出原文本字符数
Sub demoStr()
Dim s As String, r As String
s = "张宏义"
r = Format(s, "!@, @@@@")
MsgBox r
End Sub
data:image/s3,"s3://crabby-images/80d30/80d3086eb0e192e2b0731bfacb679a6f9ebee7cf" alt=""
1.4 Format其他参考用法
data:image/s3,"s3://crabby-images/7ad37/7ad37af057353705f2e7447baa1599a7c81c30e5" alt=""
2.正则表达式经验技巧
2.1正则表达式:捕获组
捕获组:使用圆括号,将匹配结果中的局部内容单独抽取出来。
2.1.1案例1:将客户的区号和本地号码提取出来.
data:image/s3,"s3://crabby-images/5f00e/5f00e9ee6000bc175d85e6744c2d079cdd0f4c16" alt=""
1.首先我们写出一个正则表达式.以上截图的数字特征是:多位数字-多位数字,正则表达式可以写成\d+-\d
data:image/s3,"s3://crabby-images/bcd25/bcd256002c9845af07129af477e1c52c50b5dddd" alt=""
2.需要将区号和本地号区分开,就用().
data:image/s3,"s3://crabby-images/d5cea/d5ceae980331fd6333c139ccaa50bf82818cbc49" alt=""
3.在VBA上写相关的正则表达代码
Option Explicit
Sub matchDemo()
Dim i As Long, s As String, myReg As Object
Dim myMatches As Object, myMatch As Object
s = Range("B2").Value
Set myReg = CreateObject("vbscript.regexp")
myReg.Global = True
myReg.Pattern = "(\d+)-(\d+)"
Set myMatches = myReg.Execute(s)
i = 8
For Each myMatch In myMatches
Cells(i, 2) = myMatch.submatches(0)
Cells(i, 3) = myMatch.submatches(1)
i = i + 1
Next myMatch
End Sub
data:image/s3,"s3://crabby-images/b1439/b143948f0ea5cc5e9e5ea901b8d636a01df7cace" alt=""
2.1.2案例2:多级菜单按-用()分解开
data:image/s3,"s3://crabby-images/475f9/475f96057e9bd2b5929383bfd08956baeb646a4f" alt=""
按照案例1的方法,发现查找的结果不符合要求,使用多个嵌套的捕获组比较复杂,并非达到我们所要求的结果.
data:image/s3,"s3://crabby-images/586a0/586a045a8e7d1c8a22f031f69b01178d9762d938" alt=""
错误的原因是,量词对分组有效,对捕获组却是没有任何意义的.分组以及捕获组都是用()表示.在查找的过程,正则表达式先对分组有效,查找之后,再对捕获组的数据进行分割.如果捕获组多个有效时,则返回最后一个结果.
data:image/s3,"s3://crabby-images/38369/38369929fdc51973c2efc6605d2c639cdec53388" alt=""
data:image/s3,"s3://crabby-images/68beb/68beb3c4cc4ecacbe32555eb688f1211b5a2fc28" alt=""
data:image/s3,"s3://crabby-images/f6f65/f6f659e6f1a0bb77174bdd04b7931985cf50c5ab" alt=""
为了查找结果的准确性,编写两个正则对象.第一个正则表达式是查找所有完整的电话号码.第二个正则表达式则查找一个电话号码中的关键数字.
data:image/s3,"s3://crabby-images/037c2/037c20f070fbdc5fb8a43477e746b93941ab72e0" alt=""
Sub matchDemo()
Dim i As Long, s As String, j As Long
Dim myReg1 As Object, matches1 As Object
Dim myReg2 As Object, matches2 As Object
s = Range("B2").Value
Set myReg1 = CreateObject("vbscript.regexp")
myReg1.Global = True
myReg1.Pattern = "(\d+)(-\d+)+"
Set myReg2 = CreateObject("vbscript.regexp")
myReg2.Global = True
myReg2.Pattern = "\d+"
Set matches1 = myReg1.Execute(s)
For i = 0 To matches1.Count - 1
Set matches2 = myReg2.Execute(matches1(i).Value)
For j = 0 To matches2.Count - 1
Cells(i + 7, j + 2) = matches2(j).Value
Next j
Next i
End Sub
data:image/s3,"s3://crabby-images/bdba4/bdba4a60a2d8f3003b176b032f6a8d0461560493" alt=""
2.2正则表达式:非捕获组
data:image/s3,"s3://crabby-images/7066b/7066bebacb3a09102f37099ce4f6051445c8aa73" alt=""
data:image/s3,"s3://crabby-images/59757/59757f92b8bd3472b872ecc31c99573c4652de87" alt=""
3.正则表达式-环视功能
3.1肯定顺序环视:(?=abc)右边是abc
data:image/s3,"s3://crabby-images/85326/853269f207ab01e6cd2d07abc310534fbe7af1a2" alt=""
要求将A1单元格的内容,对北京的关键字眼的词进行分割,并将分割的词分别放到A2之后的单元格,具体如截图.
data:image/s3,"s3://crabby-images/c6fd7/c6fd735a255e75d26c059cf5d95864688237164d" alt=""
1.对A1单元格的内容进行属性特点的分析:都是有北京开头,直到下一个北京结束.正则表达式是"北京\S+北京"
data:image/s3,"s3://crabby-images/06241/0624198eec97004309e55d20b828ecd77888e9db" alt=""
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "北京\S+北京"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
最终显示的结果为:
data:image/s3,"s3://crabby-images/99513/995137f446d38ca462e53d1388b3ee959476bd35" alt=""
从结果可以看出,正则表达式未达到切分的效果,只是少了最后一个东字.导致的原因是,正则表达式执行的是贪婪搜索.最长的结果更符合贪婪的搜索结果.
data:image/s3,"s3://crabby-images/e32ea/e32ea6a5757dc843ed4b0cd5d35a3139b7e0402f" alt=""
2.对A1单元格的内容进行属性特点的分析:都是有北京开头,直到下一个北京结束.避免贪婪搜索,实行懒惰搜索,正则表达式是"北京\S+?北京"
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "北京\S+?北京"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
最终显示的结果为:
data:image/s3,"s3://crabby-images/31bc5/31bc544c0356aebc7dc4d5d804a9a6330ea1663e" alt=""
从结果可以看出,正则表达式未达到切分的效果,得到的结果是北京西北京和北京北北京.导致的原因是,正则表达式执行的是贪婪搜索是不会回头的,对已经扫描过的文字不会再次扫描.
- 第一次搜索的结果是:北京西北京.下次扫描本应该是:站北京北北京南站北京东.因站是开头,非北京开头,下次扫描实际是:北京北北京南站北京东
- 第二次搜索的结果是:北京北北京,下次扫描是:南站北京东.因南站是开头,非北京开头,以下找不到正则的相关结果,故结束.
具体扫描结果可查看如下截图:
data:image/s3,"s3://crabby-images/4168e/4168e5cdda81916bbe84d915e4d43b5c2653d815" alt=""
3.对A1单元格的内容进行属性特点的分析:都是有北京开头,直到不是北字的结束.正则表达式是"北京[^北]"
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "北京[^北]"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
最终显示的结果为:
data:image/s3,"s3://crabby-images/c3907/c39070d530cbfeae5948f410928ddc7a190aea24" alt=""
从结果可以看出,正则表达式未达到切分的效果,得到的结果是少了北京北.主要是正则表达式是搜索北京开头以不出现北字为节点进行分割,因此对北京北排除在外.
是否可对正则表达式进行如下截图的更改呢?其实也是不可行的.
data:image/s3,"s3://crabby-images/2c170/2c1706ec838af387ffb7329b2bec2baf607b0ff5" alt=""
4.对A1单元格的内容进行属性特点的分析:都是有北京开头,直到不是北京的光标位置结束.正则表达式是"北京\S+?(?=北京)"
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "北京\S+?(?=北京)"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
最终显示的结果为:
data:image/s3,"s3://crabby-images/294ef/294ef1633ae6bd95e4d1d4fcae4aa5676f01dbdb" alt=""
出错导致的原因分析如下截图:
data:image/s3,"s3://crabby-images/16e71/16e71bad4ebe1051887d1780649ec6431e80c940" alt=""
data:image/s3,"s3://crabby-images/3e930/3e930d573a34f0a008ea1f2bd4fa2faacd0efd86" alt=""
从结果可以看出,正则表达式未达到切分的效果,得到的结果是少了北京东.原因是搜索到后面北京东的光标右侧没有北京,无法匹配导致.
5.对A1单元格的内容进行属性特点的分析:都是有北京开头,直到不是北京的光标位置或最后一行的末尾结束.正则表达式是"北京\S+?(?=北京|$)"
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "北京\S+?(?=北京|$)"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
最终显示的结果如下:
data:image/s3,"s3://crabby-images/74bd5/74bd57eccdfa9d8f84a6bd49818f221738395fd6" alt=""
6.其他方法
data:image/s3,"s3://crabby-images/03a93/03a93312d35cf6319f880801bbc9c21f1882f476" alt=""
data:image/s3,"s3://crabby-images/0b149/0b1498ec6b8ad10a7e2921156952a044a90ef71a" alt=""
3.2否定顺序环视:(?!abc)右边不是是abc
案例:对区号进行去除,只显示本地号码,并将结果返回在A2以下的单元格
data:image/s3,"s3://crabby-images/550ac/550acff7d9cc7747acdeec75a4629d8cb0f74cad" alt=""
正则表达式:"\d+(?!\d|-)"
Sub lookRoundDemo()
Dim reg As Object, matches As Object
Dim i As Long, s As String
s = Cells(1, 1)
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
'reg.MultiLine是多行模式
reg.MultiLine = True
reg.Pattern = "\d+(?!\d|-)"
Set matches = reg.Execute(s)
For i = 0 To matches.Count - 1
Cells(i + 2, 1) = matches(i).Value
Next i
End Sub
data:image/s3,"s3://crabby-images/803fb/803fb3d9ae50e7072902a1425adbdd86f92f38aa" alt=""
3.3肯定逆序环视&肯否定逆序环视:
VBA暂时不接受这两种环视
data:image/s3,"s3://crabby-images/eeb48/eeb485b5b3364b5a85c1c1a1a3a732290fe38f38" alt=""
4.正则表达式:环视实现位置
4.1案例1:将B列的原始金额转换为规范格式
data:image/s3,"s3://crabby-images/5d678/5d678b606c648ebd13fc23edbbe4b1e55f548be2" alt=""
很多方式都可以实现,比较简单的是正则表达式.用逗号分开,其实就是用光标找到合适的位置进行分开.
data:image/s3,"s3://crabby-images/3007c/3007c6963414d08bc92eb58fab38168ec7f43b26" alt=""
4.1.1 正则表达式是"(?=(\d\d\d\d)+元)"
data:image/s3,"s3://crabby-images/3740a/3740ac0297f53b7062ba2a8b470e9b1d4ec35b5c" alt=""
从测试结果可以看出4220元前面也有个逗号,出现的原因是光标在4220的时候,4220刚好有4个数字并且有个元,刚好符合要求.从而可以看出,这个正则表达式其实还不够严谨.
data:image/s3,"s3://crabby-images/aca93/aca9380e39a64460016d43a8ec0fca881a47b105" alt=""
4.1.2 正则表达式是"(?<=\d)(?=(\d\d\d\d)+元)"
符合要求的是光标的左边有个数字并且光标的右边有四个数字+元
data:image/s3,"s3://crabby-images/fc9e9/fc9e973b244f018a09252a5d4483c1d09547971b" alt=""
data:image/s3,"s3://crabby-images/926b9/926b935f56a77802e99274454a0e2ead5fa56088" alt=""
从下面结果可以看出符合要求,但是VBA不支持逆序环视
data:image/s3,"s3://crabby-images/50c2e/50c2edd1d4c5acf85a08de120072c59a2cd59cbf" alt=""
4.1.3 正则表达式是"(\d)(?=(\d\d\d\d)+元)"&捕获组
VBA不支持逆序环视,但是可以用捕获组代替.(\d)的用$1进行捕获,而(?=(\d\d\d\d)+元)这部分光标的内容用逗号代替.
data:image/s3,"s3://crabby-images/b5cc8/b5cc806d73054dc9fa0f2c5a05bbfa63e5932d98" alt=""
Sub demo()
Dim i As Long, s As String, reg As Object
Set reg = CreateObject("vbscript.regexp")
reg.Global = True
reg.Pattern = "(\d)(?=(\d\d\d\d)+元)"
For i = 3 To 11
s = Cells(i, 2)
Cells(i, 3) = reg.Replace(s, "$1,")
Next i
End Sub
代码显示的最终结果为:
data:image/s3,"s3://crabby-images/c6c06/c6c06b92dd3577f6b47c001843ddc9ea058645dd" alt=""
4.2案例2:网页信息的转换
将以下截图标黄的新闻标题进行提取.
data:image/s3,"s3://crabby-images/51f09/51f09052150a9656ab071730be1784e140d0e528" alt=""
4.2.1找到唯一标识
data:image/s3,"s3://crabby-images/a119b/a119b3a34f091c9ab3c48220677d4aa001ac1817" alt=""
4.2.2简化无关字符
使用正则表达式,对无关字符需要简化
data:image/s3,"s3://crabby-images/ac9f3/ac9f3101755a828249cc9769193540d4b08efe88" alt=""
data:image/s3,"s3://crabby-images/7bc13/7bc132e019e2e64e281c8287c82badfb3c47f474" alt=""
使用正则表达式,避免贪婪搜索和警惕换行问题,以下截图是常用的方法.
data:image/s3,"s3://crabby-images/3ab89/3ab89f8664da58daae3fc91db6fdf8aae41e27f2" alt=""
4.2.3酌情设计多层,正则表达式
以下案例就需要酌情设计的
data:image/s3,"s3://crabby-images/c1c59/c1c59174367aaa185da467dcc5f6699698dbdc91" alt=""
4.3:正则表达式的书籍推荐
data:image/s3,"s3://crabby-images/22b4c/22b4c27f2034ea05a6645ee09a32fac6c64d5ac5" alt=""