Does BeautifulSoup still work on two concatenated HTML files?

I have a script that concatenates two HTML files into one. It literally just inserts the 2nd HTML code right after the first.

htmlfile1 = urllib.urlopen(url1) htmlfile2 = urllib.urlopen(url2) htmltext1 = htmlfile1.read() htmltext2 = htmlfile2.read() name=symbolslist[i]+'.html' o=open(name, "w") o.write(htmltext1) o.write(htmltext2) o.close()

In my other thread I seem to be having trouble parsing information on the 2nd HTML part using bs4, when the solution is correct.

I have no issue parsing information on the first HTML.

thread: beautifulsoup parsing - dealing with superscript?

Therefore I was wondering if Beautiful Soup works or not on concatenated HTML.

Category:python Views:0 Time:2018-02-11

Related post

Copyright (C) dskims.com, All Rights Reserved.

processed in 0.444 (s). 11 q(s)