VB.net Same Data Splitting Differently Between Browser Control Scrape and Webrequest -
so have browser control in vb.net open, scrape table lines , parse data splitting on environment.newline.
i load same site webrequest , data there (not hidden behind js or anything) , row scrapes same data without line breaks in between time.
little puzzled one. i've searched around , related thing saw encoding webrequest defaults utf-8 top of page i'm trying read is:
<meta charset="utf-8">
so i'm not sure why there difference between browser seeing , webrequest seeing.
browser scrape:
each element htmlelement in me.bookie.document.getelementsbytagname("tr") if element.getattribute("class") = "row"
which works great.
the webrequest scrape:
for each node htmlnode in smarkdocument.documentnode.selectnodes("//tr[@class='row']")
reads same data seems ignore line breaks.
the string split same syntax not removing line breaks or , far i'm aware there's no option in webrequest include/exclude them.
i'd appreciate pointers on this.
Comments
Post a Comment