繁体中文
设为首页
加入收藏
当前位置:ASP技术首页 >> ASP应用 >> 抓取网页萃取网页内容的代码

抓取网页萃取网页内容的代码

2006-01-15 08:00:00  作者:  来源:互联网  浏览次数:0  文字大小:【】【】【
简介:  dim sUrl  sUrl="http://travel.state.gov/visa/frvi_bulletincurrent.html"  Function streamtochar(StrStream)  set stream=CreateObject("ADODB.Stream")  stream.type=1  stream.Mo...
关键字:网页 代码 内容

  dim sUrl

 sUrl="http://travel.state.gov/visa/frvi_bulletincurrent.html"

 Function streamtochar(StrStream)

 set stream=CreateObject("ADODB.Stream")

 stream.type=1

 stream.Mode=3

 stream.Open

 stream.Write Strstream

 stream.Position= 0

 stream.Type= 2

 stream.Charset="gb2312"

 streamtochar= stream.ReadText

 stream.Close

 set stream=nothing

 End Function

 i = i + 1

 function getContentByUrl(url)

 set XmlHttp = CreateObject("MSXML2.XMLHTTP")

 XmlHttp.open "GET",url,false

 XmlHttp.send

 getContentByUrl = streamtochar(oXmlHttp.responseBody)

 set XmlHttp=nothing

 end function



 function getRealContent(url)

 sContent = getContentByUrl(url)

 getRealContent=sContent

 end function



html= getContentByUrl(surl)

 url_start=inStr(html," " )

 url_end=inStr(html," ")

 url=Mid(html,url_start,url_end-url_start)

 url=replace(url,"“)



 Date_start=inStr(html,"Washington, D.C. ")+57

 Date_end=inStr(html," A. STATUTORY")-14

 Date_T=Mid(html,Date_start,Date_end-Date_start)

责任编辑:admin
相关文章